CN101993861A - 羧酸酯酶的重组表达 - Google Patents
羧酸酯酶的重组表达 Download PDFInfo
- Publication number
- CN101993861A CN101993861A CN2009101668391A CN200910166839A CN101993861A CN 101993861 A CN101993861 A CN 101993861A CN 2009101668391 A CN2009101668391 A CN 2009101668391A CN 200910166839 A CN200910166839 A CN 200910166839A CN 101993861 A CN101993861 A CN 101993861A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- gly
- val
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010051152 Carboxylesterase Proteins 0.000 title claims description 201
- 102000013392 Carboxylesterase Human genes 0.000 title claims description 191
- 238000003259 recombinant expression Methods 0.000 title description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims abstract description 59
- 238000000034 method Methods 0.000 claims abstract description 53
- 239000000203 mixture Substances 0.000 claims abstract description 38
- 239000013604 expression vector Substances 0.000 claims abstract description 35
- 108090000623 proteins and genes Proteins 0.000 claims description 99
- 210000004027 cell Anatomy 0.000 claims description 96
- 230000002538 fungal effect Effects 0.000 claims description 35
- 102000004169 proteins and genes Human genes 0.000 claims description 29
- 210000005253 yeast cell Anatomy 0.000 claims description 22
- 150000007942 carboxylates Chemical group 0.000 claims description 17
- 244000005700 microbiome Species 0.000 claims description 16
- 230000009466 transformation Effects 0.000 claims description 14
- 150000001875 compounds Chemical class 0.000 claims description 8
- 241000228212 Aspergillus Species 0.000 claims description 6
- 230000012447 hatching Effects 0.000 claims 1
- 230000014509 gene expression Effects 0.000 abstract description 44
- 102000004308 Carboxylic Ester Hydrolases Human genes 0.000 abstract description 37
- 108090000863 Carboxylic Ester Hydrolases Proteins 0.000 abstract description 37
- 239000013612 plasmid Substances 0.000 description 48
- 241000228245 Aspergillus niger Species 0.000 description 46
- 239000002773 nucleotide Substances 0.000 description 41
- 125000003729 nucleotide group Chemical group 0.000 description 41
- 238000003752 polymerase chain reaction Methods 0.000 description 34
- 108020004414 DNA Proteins 0.000 description 31
- 230000000694 effects Effects 0.000 description 30
- 125000003275 alpha amino acid group Chemical group 0.000 description 29
- 102000004190 Enzymes Human genes 0.000 description 27
- 108090000790 Enzymes Proteins 0.000 description 27
- 229940088598 enzyme Drugs 0.000 description 27
- 239000000047 product Substances 0.000 description 27
- 230000001105 regulatory effect Effects 0.000 description 25
- 108010076504 Protein Sorting Signals Proteins 0.000 description 23
- 238000011534 incubation Methods 0.000 description 22
- 239000002609 medium Substances 0.000 description 22
- 108090000765 processed proteins & peptides Proteins 0.000 description 21
- 235000018102 proteins Nutrition 0.000 description 21
- 239000013598 vector Substances 0.000 description 20
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 19
- 235000001014 amino acid Nutrition 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 17
- 238000001962 electrophoresis Methods 0.000 description 17
- 239000012634 fragment Substances 0.000 description 17
- 240000006439 Aspergillus oryzae Species 0.000 description 16
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 15
- 241001506991 Komagataella phaffii GS115 Species 0.000 description 15
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 15
- 102000004196 processed proteins & peptides Human genes 0.000 description 15
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 14
- 238000004519 manufacturing process Methods 0.000 description 14
- 239000000575 pesticide Substances 0.000 description 14
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 13
- 239000000872 buffer Substances 0.000 description 13
- 108010034529 leucyl-lysine Proteins 0.000 description 13
- 101150054232 pyrG gene Proteins 0.000 description 13
- 239000006228 supernatant Substances 0.000 description 13
- 241000320412 Ogataea angusta Species 0.000 description 12
- 108010047857 aspartylglycine Proteins 0.000 description 12
- 230000002255 enzymatic effect Effects 0.000 description 12
- 239000000499 gel Substances 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 244000286779 Hansenula anomala Species 0.000 description 11
- 241000235058 Komagataella pastoris Species 0.000 description 11
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 230000035772 mutation Effects 0.000 description 10
- 210000001938 protoplast Anatomy 0.000 description 10
- 229960000723 ampicillin Drugs 0.000 description 9
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- VGKONPUVOVVNSU-UHFFFAOYSA-N naphthalen-1-yl acetate Chemical compound C1=CC=C2C(OC(=O)C)=CC=CC2=C1 VGKONPUVOVVNSU-UHFFFAOYSA-N 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 241000196324 Embryophyta Species 0.000 description 8
- 241000626621 Geobacillus Species 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- 239000011543 agarose gel Substances 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 238000000605 extraction Methods 0.000 description 8
- 230000001939 inductive effect Effects 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 239000013642 negative control Substances 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 229940002612 prodrug Drugs 0.000 description 8
- 239000000651 prodrug Substances 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 241000894007 species Species 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 239000008103 glucose Substances 0.000 description 7
- 108010025306 histidylleucine Proteins 0.000 description 7
- 239000002917 insecticide Substances 0.000 description 7
- 239000003550 marker Substances 0.000 description 7
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 239000000523 sample Substances 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- 229920001817 Agar Polymers 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- 241000233866 Fungi Species 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 6
- 239000008272 agar Substances 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000004113 cell culture Methods 0.000 description 6
- 239000012228 culture supernatant Substances 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 230000000813 microbial effect Effects 0.000 description 6
- 229920002401 polyacrylamide Polymers 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 230000001737 promoting effect Effects 0.000 description 6
- 230000000644 propagated effect Effects 0.000 description 6
- 108010020532 tyrosyl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 241000193830 Bacillus <bacterium> Species 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 5
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 5
- 101100297639 Mus musculus Pigf gene Proteins 0.000 description 5
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 229930027917 kanamycin Natural products 0.000 description 5
- 229960000318 kanamycin Drugs 0.000 description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 5
- 229930182823 kanamycin A Natural products 0.000 description 5
- 230000000670 limiting effect Effects 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 229920001223 polyethylene glycol Polymers 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- YBADLXQNJCMBKR-UHFFFAOYSA-M (4-nitrophenyl)acetate Chemical compound [O-]C(=O)CC1=CC=C([N+]([O-])=O)C=C1 YBADLXQNJCMBKR-UHFFFAOYSA-M 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- KJCVRFUGPWSIIH-UHFFFAOYSA-N 1-naphthol Chemical compound C1=CC=C2C(O)=CC=CC2=C1 KJCVRFUGPWSIIH-UHFFFAOYSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 4
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 4
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 4
- 241001225321 Aspergillus fumigatus Species 0.000 description 4
- 241000351920 Aspergillus nidulans Species 0.000 description 4
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 4
- 102000048120 Galactokinases Human genes 0.000 description 4
- 108700023157 Galactokinases Proteins 0.000 description 4
- 241000287828 Gallus gallus Species 0.000 description 4
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 239000002033 PVDF binder Substances 0.000 description 4
- GRLJIIJNZJVMGP-UHFFFAOYSA-N S-Methyl butanethioate Chemical compound CCCC(=O)SC GRLJIIJNZJVMGP-UHFFFAOYSA-N 0.000 description 4
- 241001138501 Salmonella enterica Species 0.000 description 4
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 4
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 4
- 239000007983 Tris buffer Substances 0.000 description 4
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 4
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 4
- 241000269368 Xenopus laevis Species 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 101150008194 argB gene Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 229940091771 aspergillus fumigatus Drugs 0.000 description 4
- 229940041514 candida albicans extract Drugs 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 239000000411 inducer Substances 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 239000000600 sorbitol Substances 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 4
- 239000012137 tryptone Substances 0.000 description 4
- 239000012138 yeast extract Substances 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 3
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 3
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 3
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 3
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 3
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 3
- 241000255789 Bombyx mori Species 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 3
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 3
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 3
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 3
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 3
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 3
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 3
- 244000141359 Malus pumila Species 0.000 description 3
- 235000011430 Malus pumila Nutrition 0.000 description 3
- 235000015103 Malus silvestris Nutrition 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 3
- 241000235648 Pichia Species 0.000 description 3
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 3
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 3
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 3
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 3
- NJLQMKZSXYQRTO-FHWLQOOXSA-N Tyr-Glu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NJLQMKZSXYQRTO-FHWLQOOXSA-N 0.000 description 3
- 108010064997 VPY tripeptide Proteins 0.000 description 3
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 3
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 3
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 150000001733 carboxylic acid esters Chemical class 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 239000013024 dilution buffer Substances 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 230000003301 hydrolyzing effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 101150095344 niaD gene Proteins 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 108010079317 prolyl-tyrosine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 3
- 229940045145 uridine Drugs 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- 108010013043 Acetylesterase Proteins 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- 102100036826 Aldehyde oxidase Human genes 0.000 description 2
- 239000004382 Amylase Substances 0.000 description 2
- 108010065511 Amylases Proteins 0.000 description 2
- 102000013142 Amylases Human genes 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 2
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 2
- 240000001009 Aspergillus oryzae RIB40 Species 0.000 description 2
- 235000013023 Aspergillus oryzae RIB40 Nutrition 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 244000241235 Citrullus lanatus Species 0.000 description 2
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 2
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 241000255601 Drosophila melanogaster Species 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 241001468176 Geobacillus thermoleovorans Species 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 2
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 2
- 102100022624 Glucoamylase Human genes 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 235000014683 Hansenula anomala Nutrition 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- 101000928314 Homo sapiens Aldehyde oxidase Proteins 0.000 description 2
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 101150045458 KEX2 gene Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 2
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- NQSFIPWBPXNJII-PMVMPFDFSA-N Lys-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 NQSFIPWBPXNJII-PMVMPFDFSA-N 0.000 description 2
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 2
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 2
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 229910002651 NO3 Inorganic materials 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 2
- 108090000913 Nitrate Reductases Proteins 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 241000235403 Rhizomucor miehei Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 241000235060 Scheffersomyces stipitis Species 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 2
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 2
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- OWQKBXKXZFRRQL-XGEHTFHBSA-N Thr-Met-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N)O OWQKBXKXZFRRQL-XGEHTFHBSA-N 0.000 description 2
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 2
- WVAKXMOGMWLWHK-VJBMBRPKSA-N Trp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WVAKXMOGMWLWHK-VJBMBRPKSA-N 0.000 description 2
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 2
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 2
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 2
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 2
- 241000222126 [Candida] glabrata Species 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 108010048241 acetamidase Proteins 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- 235000019418 amylase Nutrition 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 210000004507 artificial chromosome Anatomy 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 235000011148 calcium chloride Nutrition 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 230000008034 disappearance Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 238000001952 enzyme assay Methods 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000003471 mutagenic agent Substances 0.000 description 2
- 231100000707 mutagenic chemical Toxicity 0.000 description 2
- 230000000869 mutational effect Effects 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- VSZGPKBBMSAYNT-RRFJBIMHSA-N oseltamivir Chemical compound CCOC(=O)C1=C[C@@H](OC(CC)CC)[C@H](NC(C)=O)[C@@H](N)C1 VSZGPKBBMSAYNT-RRFJBIMHSA-N 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 239000012266 salt solution Substances 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 230000037432 silent mutation Effects 0.000 description 2
- VWDWKYIASSYTQR-UHFFFAOYSA-N sodium nitrate Chemical compound [Na+].[O-][N+]([O-])=O VWDWKYIASSYTQR-UHFFFAOYSA-N 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 239000011573 trace mineral Substances 0.000 description 2
- 235000013619 trace mineral Nutrition 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 2
- 239000007222 ypd medium Substances 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- BDOYKFSQFYNPKF-UHFFFAOYSA-N 2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid;sodium Chemical compound [Na].[Na].OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O BDOYKFSQFYNPKF-UHFFFAOYSA-N 0.000 description 1
- ONEGZXHXCLCVRF-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(C(C)C)NC(=O)C1CCCN1C(=O)C(N)C(C)C ONEGZXHXCLCVRF-UHFFFAOYSA-N 0.000 description 1
- LHYQAEFVHIZFLR-UHFFFAOYSA-L 4-(4-diazonio-3-methoxyphenyl)-2-methoxybenzenediazonium;dichloride Chemical compound [Cl-].[Cl-].C1=C([N+]#N)C(OC)=CC(C=2C=C(OC)C([N+]#N)=CC=2)=C1 LHYQAEFVHIZFLR-UHFFFAOYSA-L 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- ALYNCZNDIQEVRV-UHFFFAOYSA-N 4-aminobenzoic acid Chemical compound NC1=CC=C(C(O)=O)C=C1 ALYNCZNDIQEVRV-UHFFFAOYSA-N 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- 108010011619 6-Phytase Proteins 0.000 description 1
- FJHBVJOVLFPMQE-QFIPXVFZSA-N 7-Ethyl-10-Hydroxy-Camptothecin Chemical compound C1=C(O)C=C2C(CC)=C(CN3C(C4=C([C@@](C(=O)OC4)(O)CC)C=C33)=O)C3=NC2=C1 FJHBVJOVLFPMQE-QFIPXVFZSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- HWPXGQCMZITGFN-XVYDVKMFSA-N Ala-Cys-His Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HWPXGQCMZITGFN-XVYDVKMFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- 241000235349 Ascomycota Species 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- HGGIYWURFPGLIU-FXQIFTODSA-N Asn-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(N)=O HGGIYWURFPGLIU-FXQIFTODSA-N 0.000 description 1
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 1
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- HTOZUYZQPICRAP-BPUTZDHNSA-N Asp-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N HTOZUYZQPICRAP-BPUTZDHNSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- RCGVPVZHKAXDPA-NYVOZVTQSA-N Asp-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CC(=O)O)N RCGVPVZHKAXDPA-NYVOZVTQSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 102000035101 Aspartic proteases Human genes 0.000 description 1
- 108091005502 Aspartic proteases Proteins 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241000892910 Aspergillus foetidus Species 0.000 description 1
- 241001480052 Aspergillus japonicus Species 0.000 description 1
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 239000005552 B01AC04 - Clopidogrel Substances 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000221198 Basidiomycota Species 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 101100129088 Caenorhabditis elegans lys-2 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000222128 Candida maltosa Species 0.000 description 1
- 241000191338 Candida methylica Species 0.000 description 1
- 241000222178 Candida tropicalis Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 241000235646 Cyberlindnera jadinii Species 0.000 description 1
- 241000878745 Cyberlindnera saturnus Species 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 1
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 1
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- PGBLJHDDKCVSTC-CIUDSAMLSA-N Cys-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O PGBLJHDDKCVSTC-CIUDSAMLSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- IRDBEBCCTCNXGZ-AVGNSLFASA-N Cys-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IRDBEBCCTCNXGZ-AVGNSLFASA-N 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 229940123780 DNA topoisomerase I inhibitor Drugs 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 240000007175 Datura inoxia Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 241000222175 Diutina rugosa Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 241000194032 Enterococcus faecalis Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241001524679 Escherichia virus M13 Species 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 241000193419 Geobacillus kaustophilus Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 1
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- BHXSLRDWXIFKTP-SRVKXCTJSA-N Glu-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BHXSLRDWXIFKTP-SRVKXCTJSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 1
- XWUIHCZETFNRPA-IHPCNDPISA-N His-His-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 XWUIHCZETFNRPA-IHPCNDPISA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 1
- YAJQKIBLYPFAET-NAZCDGGXSA-N His-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O YAJQKIBLYPFAET-NAZCDGGXSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001104570 Homo sapiens Proteasome assembly chaperone 2 Proteins 0.000 description 1
- 101000625842 Homo sapiens Tubulin-specific chaperone E Proteins 0.000 description 1
- 241000223198 Humicola Species 0.000 description 1
- 241001480714 Humicola insolens Species 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- XDVKZSJODLMNLJ-GGQYPGDFSA-N Ile-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 XDVKZSJODLMNLJ-GGQYPGDFSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- JERJIYYCOGBAIJ-OBAATPRFSA-N Ile-Tyr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JERJIYYCOGBAIJ-OBAATPRFSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 101150007280 LEU2 gene Proteins 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- YLMIDMSLKLRNHX-HSCHXYMDSA-N Leu-Trp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YLMIDMSLKLRNHX-HSCHXYMDSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- -1 Leu2 Proteins 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 229910009891 LiAc Inorganic materials 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- 102000013460 Malate Dehydrogenase Human genes 0.000 description 1
- 108010026217 Malate Dehydrogenase Proteins 0.000 description 1
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- IECZNARPMKQGJC-XIRDDKMYSA-N Met-Gln-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N IECZNARPMKQGJC-XIRDDKMYSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- FGAMAYQCWQCUNF-DCAQKATOSA-N Met-His-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FGAMAYQCWQCUNF-DCAQKATOSA-N 0.000 description 1
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 1
- VVWQHJUYBPJCNS-UMPQAUOISA-N Met-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 VVWQHJUYBPJCNS-UMPQAUOISA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- 241000235048 Meyerozyma guilliermondii Species 0.000 description 1
- 241000235575 Mortierella Species 0.000 description 1
- 241000235395 Mucor Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 102000014415 Muscarinic acetylcholine receptor Human genes 0.000 description 1
- 108050003473 Muscarinic acetylcholine receptor Proteins 0.000 description 1
- 241000226677 Myceliophthora Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 241001235463 Nakazawaea wickerhamii Species 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 101100285000 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) his-3 gene Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 241001452677 Ogataea methanolica Species 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 102000007981 Ornithine carbamoyltransferase Human genes 0.000 description 1
- 101710113020 Ornithine transcarbamylase, mitochondrial Proteins 0.000 description 1
- 102100037214 Orotidine 5'-phosphate decarboxylase Human genes 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 101100450270 Oryzias latipes hcea gene Proteins 0.000 description 1
- 239000008118 PEG 6000 Substances 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- QPQDWBAJWOGAMJ-IHPCNDPISA-N Phe-Asp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 QPQDWBAJWOGAMJ-IHPCNDPISA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 1
- NAOVYENZCWFBDG-BZSNNMDCSA-N Phe-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 NAOVYENZCWFBDG-BZSNNMDCSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- LKRUQZQZMXMKEQ-SFJXLCSZSA-N Phe-Trp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKRUQZQZMXMKEQ-SFJXLCSZSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 241000223960 Plasmodium falciparum Species 0.000 description 1
- 229920002584 Polyethylene Glycol 6000 Polymers 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 241000709992 Potato virus X Species 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 241000205156 Pyrococcus furiosus Species 0.000 description 1
- 238000004617 QSAR study Methods 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 241000235072 Saccharomyces bayanus Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 101100370749 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) trpC1 gene Proteins 0.000 description 1
- 102000004523 Sulfate Adenylyltransferase Human genes 0.000 description 1
- 108010022348 Sulfate adenylyltransferase Proteins 0.000 description 1
- 241000205101 Sulfolobus Species 0.000 description 1
- 241000205091 Sulfolobus solfataricus Species 0.000 description 1
- 241000223258 Thermomyces lanuginosus Species 0.000 description 1
- 241001494489 Thielavia Species 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- DXNUZQGVOMCGNS-SWRJLBSHSA-N Thr-Gln-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O DXNUZQGVOMCGNS-SWRJLBSHSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 241001149964 Tolypocladium Species 0.000 description 1
- 239000000365 Topoisomerase I Inhibitor Substances 0.000 description 1
- 241000223259 Trichoderma Species 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- HLDFBNPSURDYEN-VHWLVUOQSA-N Trp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HLDFBNPSURDYEN-VHWLVUOQSA-N 0.000 description 1
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- RRXPAFGTFQIEMD-IVJVFBROSA-N Trp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RRXPAFGTFQIEMD-IVJVFBROSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 102100024769 Tubulin-specific chaperone E Human genes 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- KZOZXAYPVKKDIO-UFYCRDLUSA-N Tyr-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KZOZXAYPVKKDIO-UFYCRDLUSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- YOTRXXBHTZHKLU-BVSLBCMMSA-N Tyr-Trp-Met Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C1=CC=C(O)C=C1 YOTRXXBHTZHKLU-BVSLBCMMSA-N 0.000 description 1
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241001452678 Wickerhamomyces canadensis Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 241000222124 [Candida] boidinii Species 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical group C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 102000004139 alpha-Amylases Human genes 0.000 description 1
- 108090000637 alpha-Amylases Proteins 0.000 description 1
- 229940024171 alpha-amylase Drugs 0.000 description 1
- 229960004050 aminobenzoic acid Drugs 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 229940127218 antiplatelet drug Drugs 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 239000004327 boric acid Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 208000032343 candida glabrata infection Diseases 0.000 description 1
- 239000000073 carbamate insecticide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 239000002962 chemical mutagen Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000012916 chromogenic reagent Substances 0.000 description 1
- GKTWGGQPFAXNFI-HNNXBMFYSA-N clopidogrel Chemical compound C1([C@H](N2CC=3C=CSC=3CC2)C(=O)OC)=CC=CC=C1Cl GKTWGGQPFAXNFI-HNNXBMFYSA-N 0.000 description 1
- 229960003009 clopidogrel Drugs 0.000 description 1
- GFHNAMRJFCEERV-UHFFFAOYSA-L cobalt chloride hexahydrate Chemical compound O.O.O.O.O.O.[Cl-].[Cl-].[Co+2] GFHNAMRJFCEERV-UHFFFAOYSA-L 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- JZCCFEFSEZPSOG-UHFFFAOYSA-L copper(II) sulfate pentahydrate Chemical compound O.O.O.O.O.[Cu+2].[O-]S([O-])(=O)=O JZCCFEFSEZPSOG-UHFFFAOYSA-L 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- MTHSVFCYNBDYFN-UHFFFAOYSA-N diethylene glycol Chemical compound OCCOCCO MTHSVFCYNBDYFN-UHFFFAOYSA-N 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 238000009585 enzyme analysis Methods 0.000 description 1
- 235000020774 essential nutrients Nutrition 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 210000005254 filamentous fungi cell Anatomy 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010010662 galactose epimerase Proteins 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N glycerol 1-phosphate Chemical compound OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 150000004688 heptahydrates Chemical class 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- UWKQSNNFCGGAFS-XIFFEERXSA-N irinotecan Chemical compound C1=C2C(CC)=C3CN(C(C4=C([C@@](C(=O)OC4)(O)CC)C=4)=O)C=4C3=NC2=CC=C1OC(=O)N(CC1)CCC1N1CCCCC1 UWKQSNNFCGGAFS-XIFFEERXSA-N 0.000 description 1
- 229960004768 irinotecan Drugs 0.000 description 1
- SURQXAFEQWPFPV-UHFFFAOYSA-L iron(2+) sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Fe+2].[O-]S([O-])(=O)=O SURQXAFEQWPFPV-UHFFFAOYSA-L 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 101150107787 kex1 gene Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000001985 kidney epithelial cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 229960003136 leucine Drugs 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 101150039489 lysZ gene Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- CNFDGXZLMLFIJV-UHFFFAOYSA-L manganese(II) chloride tetrahydrate Chemical compound O.O.O.O.[Cl-].[Cl-].[Mn+2] CNFDGXZLMLFIJV-UHFFFAOYSA-L 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 210000000274 microglia Anatomy 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000007797 non-conventional method Methods 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000003986 organophosphate insecticide Substances 0.000 description 1
- 229960003752 oseltamivir Drugs 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000447 pesticide residue Substances 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 244000000003 plant pathogen Species 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 231100000572 poisoning Toxicity 0.000 description 1
- 230000000607 poisoning effect Effects 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 239000008057 potassium phosphate buffer Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 238000002818 protein evolution Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000002728 pyrethroid Substances 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000012508 resin bead Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 229960002181 saccharomyces boulardii Drugs 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 235000015393 sodium molybdate Nutrition 0.000 description 1
- 239000011684 sodium molybdate Substances 0.000 description 1
- TVXXNOYZHKPKGW-UHFFFAOYSA-N sodium molybdate (anhydrous) Chemical compound [Na+].[Na+].[O-][Mo]([O-])(=O)=O TVXXNOYZHKPKGW-UHFFFAOYSA-N 0.000 description 1
- 239000004317 sodium nitrate Substances 0.000 description 1
- 235000010344 sodium nitrate Nutrition 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 238000010563 solid-state fermentation Methods 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000001526 topogenic effect Effects 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 101150016309 trpC gene Proteins 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 229960001763 zinc sulfate Drugs 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01001—Carboxylesterase (3.1.1.1)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Mycology (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本公开内容提供了在真核细胞中产生羧酸酯酶或其变体的方法。本公开内容还提供了用于高水平表达羧酸酯酶的表达载体、包含所述表达载体的真核细胞以及它们的用途。本公开内容还提供了包含通过本申请所述方法所产生的羧酸酯酶或其变体的组合物,以及该组合物的用途。
Description
技术领域
本发明涉及生物技术领域。
背景技术
羧酸酯酶能够水解羧酸酯,从而产生羧酸(盐)和醇。羧酸酯酶属于水解酶超家族。已经在从原核细胞到真核细胞的多种物种中鉴定出了羧酸酯酶。
发明内容
在一个方面中,本公开内容提供了一种用于产生蛋白质的方法,包括将经改造而表达编码羧酸酯酶或其变体之基因的真核细胞在适于表达该羧酸酯酶或其变体的条件下进行培养。
在另一个方面中,本公开内容提供了一种用于产生蛋白质的方法,包括将经改造而表达编码微生物羧酸酯酶或其变体之基因的真核细胞在适于表达该微生物羧酸酯酶或其变体的条件下进行培养。
在另一个方面中,本公开内容提供了一种用于产生蛋白质的方法,包括将经改造而表达编码羧酸酯酶或其变体之基因的丝状真菌细胞在适于表达该羧酸酯酶或其变体的条件下进行培养。
在另一个方面中,本公开内容提供了表达载体,其包含编码羧酸酯酶或其变体的基因以及能促进该羧酸酯酶或其变体在真核细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
在另一个方面中,本公开内容提供了包含表达载体的真核细胞,所述表达载体包含编码羧酸酯酶或其变体的基因以及能促进该羧酸酯酶或其变体在真核细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
在另一个方面中,本公开内容提供了组合物,其包含真核细胞以及该真核细胞所表达的羧酸酯酶或其变体。
在另一个方面中,本公开内容提供了组合物,其包含丝状真菌细胞以及该丝状真菌细胞所表达的羧酸酯酶或其变体。
在另一个方面中,本公开内容提供了组合物,其包含通过本公开内容的方法所产生的分离的羧酸酯酶或其变体。
在另一个方面中,本公开内容提供了使用本公开内容之组合物的方法。
以上概述仅用于说明,而并非旨在以任何方式进行限制。除了上述说明的各方面、实施方案和特征以外,其它方面、实施方案和特征在参考附图和下文的详细说明之后将是很明显的。
附图说明
图1显示质粒pIGF的示意图。
图2显示质粒pYG1.2的示意图。
图3显示来自以下菌株的培养基凝胶电泳结果:转化有pYG1.2-CarE-his的黑曲霉(Aspergillus niger)M54(L2);转化有pYG1.2的黑曲霉M54(L3);以及未转化的黑曲霉M54(L4)。蛋白分子量标志物示于L1。L2中所示的约29.0KD的条带是羧酸酯酶的条带。
图4显示以下菌株所表达的羧酸酯酶的Western印迹结果:转化有pPIC9K-CarE-His的巴斯德毕赤酵母(Pichia pastoris)GS115(L4);以及转化有pYG1.2-CarE-His的黑曲霉M54(L5)。在以下阴性对照中未检测到羧酸酯酶的表达:转化有pPIC9K的巴斯德毕赤酵母GS115(L3);未转化的黑曲霉M54(L6);转化有pYG1.2的黑曲霉M54(L7)。阳性对照(含有His标签的蛋白质)示于L1,蛋白标志物示于L2。
图5显示转化有pPIC9K-CarE-His的巴斯德毕赤酵母GS115所表达羧酸酯酶的凝胶电泳结果,其中在培养24小时(L3)、48小时(L4)和72小时(L5)后采样。蛋白标志物示于L1,转化有pPIC9K的巴斯德毕赤酵母GS115的培养基示于L2。
图6显示在培养的不同时间点从培养基中分离的羧酸酯酶的酶活性。
图7显示在不同pH值下测量的重组羧酸酯酶的相对酶活性。
图8显示在以不同温度将羧酸酯酶处理10或30分钟后在37℃下测量的重组羧酸酯酶的相对酶活性。
图9显示在不同温度下测量的重组羧酸酯酶的相对酶活性。
具体实施方案
在下文的详细描述中,将参考构成本文一部分的附图。在附图中,相似的符号一般指代相似的部分,除非其上下文有其他明确表述。在具体实施方案、附图和权利要求中描述的说明性实施方案并无限制之意。可以利用其他实施方案、进行其他改动,而不偏离本文之主题的构思或范围。
本公开内容涉及用于产生羧酸酯酶及其变体的重组方法、可用于重组产生羧酸酯酶及其变体的表达载体和宿主细胞。本公开内容还涉及包含所述重组产生的羧酸酯酶及其变体的组合物以及使用该组合物的方法。
在一个方面中,本公开内容提供了一种用于产生蛋白质的方法,所述方法包括将经改造而表达编码羧酸酯酶或其变体之基因的真核细胞在适于表达该羧酸酯酶或其变体的条件下进行培养。
在另一个方面中,本公开内容提供了一种用于产生蛋白质的方法,所述方法包括将经改造而表达编码羧酸酯酶或其变体之基因的丝状真菌细胞在适于表达该羧酸酯酶或其变体的条件下进行培养。
在另一个方面中,本公开内容提供了一种用于产生蛋白质的方法,所述方法包括将经改造而表达编码微生物羧酸酯酶或其变体之基因的真核细胞在适于表达该微生物羧酸酯酶或其变体的条件下进行培养。
在某些实施方案中,该方法还可包括从真核细胞培养物中分离所述羧酸酯酶或其变体。在某些实施方案中,该方法还可包括将含有编码羧酸酯酶或其变体之基因的表达载体引入所述真核细胞中。
真核细胞
真核细胞是组织成包裹在膜中的复杂结构(尤其包括含有遗传物质的与膜结合的核)的细胞。本公开内容的真核细胞包括但不限于真菌细胞、原生生物细胞、动物细胞和植物细胞。
真菌细胞可包括但不限于酵母细胞和丝状真菌细胞。
本公开内容的酵母细胞可根据其系统发生特征而归为子囊真菌亚门(Ascomycota)和担子真菌亚门(Basidiomycota)。酵母细胞的说明性实例包括但不仅限于毕赤酵母属物种如安格斯毕赤酵母(Pichia angusta)、巴斯德毕赤酵母(Pichia pastoris)、异常毕赤酵母(Pichia anomala)、树干毕赤酵母(Pichia stipitis)、甲醇毕赤酵母(Pichia methanolica)和季也蒙毕赤酵母(Pichia guilliermondii);汉逊酵母属(Hansenula)物种如异常汉逊酵母(Hansenula anomala)、多形汉逊酵母(Hansenulapolymorpha)、温奇汉逊酵母(Hansenula wingei)、杰丁汉逊酵母(Hansenula jadinii)和土星汉逊酵母(Hansenula saturnus);酵母属(Saccharomyces)如酿酒酵母(Saccharomyces cerevisiae)、贝酵母(Saccharomyces bayanus)、布拉酵母(Saccharomyces boulardii);假丝酵母属(Candida)物种如白假丝酵母(Candida albicans)、Candidamethylica、波氏假丝酵母(Candida boidinii)、热带假丝酵母(Candidatropicalis)、Candida wickerhamii、麦芽糖假丝酵母(Candida maltosa)和光滑假丝酵母(Candida glabrata)、光滑球拟酵母(Torulopsis glabrata);以及克鲁维酵母菌属(Kluyveromyces)物种和裂殖酵母属(Schizosaccharomyces)物种。
在一些实施方案中,所述酵母细胞为巴斯德毕赤酵母、多形汉逊酵母、酿酒酵母或光滑球拟酵母中的一种或多种。在一些实施方案中,所述酵母细胞为巴斯德毕赤酵母。在一些实施方案中,所述酵母细胞为巴斯德毕赤酵母菌株GS115细胞、巴斯德毕赤酵母菌株KM71细胞或巴斯德毕赤酵母菌株MC100-3细胞中的一种或多种。在一些实施方案中,所述酵母细胞为多形汉逊酵母菌株ATCC34438细胞。
丝状真菌可包括但不仅限于以多细胞丝的形式生长的任何微小真菌(microscopic fungi)物种。在一些实施方案中,所述丝状真菌细胞包括但不仅限于支顶孢属(Acremonium)、曲霉属(Aspergillus)、镰刀菌属(Fusarium)、腐质霉属(Humicola)、毛霉属(Mucor)、毁丝霉属(Myceliophthora)、链孢霉属(Neurospora)、青霉属(Penicillium)、梭孢壳菌属(Thielavia)、弯颈霉属(Tolypocladium)和木霉属(Trichoderma)中的多个物种。
在一些实施方案中,所述丝状真菌细胞为泡盛曲霉(Aspergillusawamori)、臭曲霉(Aspergillus foetidus)、日本曲霉(Aspergillusjaponicus)、构巢曲霉(Aspergillus nidulans)、黑曲霉(Aspergillus niger)或米曲霉(Aspergillus oryzae)细胞。在一个说明性实施方案中,所述丝状真菌细胞为黑曲霉ATCC 12049菌株细胞。在另一个说明性实施方案中,所述丝状真菌细胞为米曲霉RIB40菌株细胞。
原生生物细胞可包括但不仅限于原生动物细胞和藻类细胞。
动物细胞可包括但不仅限于哺乳动物细胞、鸟类细胞、两栖类细胞和昆虫细胞。动物细胞的说明性实例包括猪肝细胞、人胚肾293(HEK293)细胞、中国仓鼠卵巢细胞(CHO)、斑马鱼PAC2细胞、非洲爪蟾(Xenopuslaevis)A6肾上皮细胞、秀丽隐杆线虫(Caenorhabditis elegans)细胞和果蝇细胞。
植物细胞可包括但不限于薄壁组织细胞、厚角细胞和厚壁组织细胞。植物细胞的说明性实例为烟草BY-2细胞、毛曼陀罗(Datura innoxia)细胞系和SB-1细胞系。
在一些实施方案中,本公开内容的真核细胞可带有导致与野生型菌株相比发生表型改变的一种或多种突变。真核细胞中的突变可以是天然或非天然的。天然突变可在进化过程中自发形成。非天然突变可以使用本领域已知方法人工产生。在一个说明性实例中,可以通过使细胞接触物理诱变剂(如UV照射)或化学诱变剂(如羟胺和溴化乙锭)来产生突变(参阅如Hopwood,The Isolation of Mutants in Methods in Microbiology(J.R.Norris和D.W.Ribbons编辑)1970,363-433,Academic Press,New York)。在另一个说明性实例中,可通过基因缺失技术(如同源重组)来产生突变,以破坏一种或多种靶基因的表达(参阅如Alberts等,Chapter 5:DNAReplication,Repair,and Recombination,Molecular biology of the cell,2002,845,Garland Science.New York)。在另一个说明性实例中,可以通过基因修饰技术(如聚合酶链式反应(PCR))来产生突变(参阅如Botstein等,Strategies and applications of in vitro mutagenesis,Science 1985,vol 229,No.4719,1193-1201;Lo等,Specific amino acid substitutions in bacterioopsin:Replacement of a restriction fragment in the structural gene by syntheticDNA fragments containing altered codons,Proc.Natl.Acad.Sci.USA 1985,vol 81,No.8,2285-2289;Youngman等,Genetic transposition and insertionalmutagenesis in Bacillus subtilis with Streptococcus faecalis transposon Tn917,Proc.Natl.Acad.Sci.USA 1983,vol 80,No.8,2305-2309)。
在一些实施方案中,本公开内容的真核细胞可带有使其无法合成细胞生长所必需物质的一种或多种突变。突变可发生在参与氨基酸、核苷酸、糖、脂肪酸、维生素和其它必需物质的合成和/或代谢的基因中。
在一些实施方案中,所述真核细胞可以是突变体酵母细胞,其在分别参与尿苷、色氨酸、腺苷和亮氨酸合成的ura、trp、ade和leu基因中带有突变(Agaphonov等,Isolation and characterization of the LEU2 gene ofHansenula polymorpha,Yeast 1994,vol 10,509-513;Bogdanova等,Plasmideorganization during integrative transformation in Hansenula polymorpha,Yeast 1995,vol 11,343-353;Merckelbach等,Cloning and sequencing of theura3 locus of the methylotrophic yeast Hansenula polymorpha and its use forthe generation of a deletion by gene replacement,Appl.Microbiol.Biotechnol.1993,vol 40,361-364)。
在一些实施方案中,所述真核细胞可以是缺失pyrG基因功能的突变体丝状真菌细胞(例如黑曲霉菌株),它们无法合成尿苷,因此不能在无尿苷的培养基上生长(Liu等,Construction of pyrG auxotrophic Aspergillusniger strain,Journal of microbiology 2001,vol 21,No.3,15-16)。在一个说明性实施方案中,所述真核细胞是黑曲霉M54,该菌株在2009年6月14日按照《国际承认用于专利程序的微生物保存布达佩斯条约》(布达佩斯条约)的规定和条件保藏于中国武汉大学的中国典型培养物保藏中心(CCTCC),保藏号为CCTCC M 209121。
在另一个说明性实施方案中,所述真核细胞是营养缺陷型米曲霉突变体菌株,例如缺失argB基因的米曲霉M-2-3,该菌株无法合成精氨酸,因此不能在无精氨酸的培养基上生长(Gomi等,Integrative transformation ofAspergillus oryzae with a plasmid containing the Aspergillus nidulans argBgene.Agric Biol Chem.1987,vol 51,2549-2555),还例如缺失niaD基因的米曲霉,其缺少硝酸还原酶,因此无法在以硝酸盐作为唯一氮源的培养基上生长(Unkles等,The development of a homologous transformation systemfor Aspergillus oryzae based on the nitrate assimilation pathway:Aconvenient and general selection system for filamentous fungaltransformation,Molecular and General Genetics1989,vol 218,No.1,99-104)。
羧酸酯酶及其变体
本文使用的术语“羧酸酯酶”指能将羧酸酯水解成羧酸(盐)和醇的酶多肽。羧酸酯酶可以是野生型羧酸酯酶或其任何变体。野生型羧酸酯酶的变体与野生型羧酸酯酶的氨基酸序列和/或氨基酸修饰不同,但仍有将羧酸酯水解成羧酸(盐)和醇的能力。变体可具有野生型羧酸酯酶的一个或多个氨基酸替换、添加、缺失、插入、截短、修饰(如磷酸化、糖基化、标记等)或其任何组合。变体可包括野生型羧酸酯酶的天然变体以及人工多肽序列,例如通过化学合成或重组方法获得的多肽序列。变体可包括野生型羧酸酯酶的片段、突变体、杂合体、类似物和衍生物。变体可含有非天然氨基酸残基。
已经从大量物种(包括但不仅限于动物、昆虫、植物和微生物)中鉴定和分离了羧酸酯酶。已经鉴定了来自许多物种的羧酸酯酶的核苷酸序列和氨基酸序列。
在一个实施方案中,所述羧酸酯酶来自微生物。术语“微生物”指除人、动物和植物以外的任何有生命的生物。微生物可包括但不限于原核生物如细菌、原生动物、真菌、原生生物和古细菌。微生物的说明性实例为大肠杆菌(Escherichia coli)、嗜热脂肪地芽孢杆菌(Geobacillusstearothermophilus)、蜡样芽孢杆菌(Bacillus cereus)、皱褶假丝酵母(Candida rugosa)、恶性疟原虫(Plasmodium falciparum)、激烈火球菌(Pyrococcus furiosus)、肠道沙门氏菌(Salmonella enterica)和烟曲霉(Aspergillus fumigatus)。
已经从许多微生物中分离了羧酸酯酶,并获得了其相应的核苷酸序列和氨基酸序列。表1列出了微生物羧酸酯酶的说明性实例以及以GenBank登记号标明的其核苷酸及多肽序列。
表1.不同微生物的羧酸酯酶的说明性实例
| 物种 | GenBank氨基酸序列登记号 | GenBank核苷酸序列登记号 |
| 嗜热地芽孢杆菌(Geob acillus | BAD77330(SEQ ID NO:1) | BA000043,区域:3067043..3067783 |
| kaustophilus) | (SEQ ID NO:2) | |
| 高温烷烃芽孢杆菌(Geobacillusthermoleovorans) | AAG53982(SEQ ID NO:3) | AF327065(SEQ ID NO:4) |
| 肠道沙门氏菌(Salmonella enterica) | YP_002245400(SEQ ID NO:5) | NC_011294,区域:3548658..3549428(SEQ ID NO:6) |
| 烟曲霉 | XP_755184(SEQ ID NO:7) | XM_750091(SEQ ID NO:8) |
在一个实施方案中,所述羧酸酯酶来自细菌。细菌的说明性实例为大肠杆菌、嗜热脂肪地芽孢杆菌(Geobacillus stearothermophilus)、嗜热地芽孢杆菌、硫矿硫化叶菌(Sulfolobus solfataricus)和高温烷烃芽孢杆菌。在另一个实施方案中,所述羧酸酯酶来自嗜热菌。嗜热菌的说明性实例为嗜热脂肪地芽孢杆菌、嗜热地芽孢杆菌、硫矿硫化叶菌和高温烷烃芽孢杆菌。在另一个实施方案中,所述羧酸酯酶来自嗜热脂肪地芽孢杆菌。在一些实施方案中,所述羧酸酯酶或其变体与自嗜热脂肪地芽孢杆菌分离的羧酸酯酶的氨基酸序列具有至少70%的序列同一性。已经从嗜热脂肪地芽孢杆菌中鉴定了四种羧酸酯酶。这四种羧酸酯酶的氨基酸序列和核苷酸序列以SEQ ID NO:9-16示于下文表2中。
表2.嗜热脂肪地芽孢杆菌的羧酸酯酶的说明性实例
| 物种 | GenBank氨基酸序列登记号 | GenBank核苷酸序列登记号 |
| 嗜热脂肪地芽孢杆菌 | AAN81911(SEQ ID NO:9) | AY186197.1,区域:1742...2485(SEQ ID NO:10) |
此外,表3列出了来自动物、昆虫和植物的羧酸酯酶的一些说明性实例,及其相应核苷酸及氨基酸序列的GenBank登记号
表3.不同物种的羧酸酯酶的说明性实例
可以通过对野生型羧酸酯酶进行保守性替换来产生羧酸酯酶变体,其中替换氨基酸具有与天然氨基酸相似的结构或化学特性,例如在残基的极性、电荷、溶解度、疏水性、亲水性和/或两亲性方面具有相似性。还可以通过对野生型羧酸酯酶的氨基酸序列进行非保守性替换或其它改变来产生变体,只要该变体保留羧酸酯酶活性即可。关于有哪些以及有多少氨基酸残基可以进行替换、插入或缺失而不消除功能或生物活性方面的指导可以使用本领域所熟知的计算机软件来获得,例如STAR软件(参阅Bauer等,STAR:predicting recombination sites from aminoacid sequence,BMC Bioinformatics,2006,vol 7,437)。
在一些实施方案中,本公开内容提供了与SEQ ID NO:9、11、13和15所示羧酸酯酶的氨基酸序列中的一种或多种具有至少70%、80%、85%、90%、95%、96%、97%、98%或99%氨基酸序列同一性的羧酸酯酶及其变体。
就本文所述羧酸酯酶多肽序列而言,“百分比(%)氨基酸序列同一性”定义为候选序列中与特定羧酸酯酶多肽序列之氨基酸残基相同的氨基酸残基的百分比,这在比对序列并在必要时引入缺口以达到最高百分比序列同一性之后进行,并且不把任何保守性替换认为是序列同一性的一部分。为确定百分比氨基酸序列同一性而进行的比对可通过本领域中的多种方式来实现,例如使用公众可用的计算机软件如BLAST、BLAST-2、ALIGN或Megalign(DNASTAR)软件。本领域技术人员能够确定用于测量比对的合适参数,包括在所比较序列的全长上实现最高比对所需的任何算法。
在一些实施方案中,本公开内容提供了编码羧酸酯酶或其变体的核苷酸序列,其与SEQ ID NO:10、12、14和16所示羧酸酯酶核苷酸序列中的一种或多种具有至少70%、80%、85%、90%、95%、96%、97%、98%或99%核苷酸序列同一性。
就本文所述羧酸酯酶编码核苷酸序列而言,“核苷酸序列同一性百分比(%)”定义为候选序列中与目的羧酸酯酶核苷酸序列之核苷酸相同的核苷酸的百分比,这在比对序列并在必要时引入缺口以达到最高序列同一性百分比之后进行。为确定核苷酸序列同一性百分比而进行的比对可通过本领域中的多种方式来实现,例如使用公众可用的计算机软件如BLAST、BLAST-2、ALIGN或Megalign(DNASTAR)软件。
在一个实施方案中,可以使用序列比较程序NCBI-BLAST2来测定氨基酸序列同一性百分比和核苷酸序列同一性百分比(Altschul等,Nucleic Acids Res.25:3389-3402(1997))。NCBI-BLAST2序列比较程序可从http://www.ncbi.nlm.nih.gov下载。NCBI-BLAST2使用若干检索参数,其中所有这些检索参数均设置成默认值,包括如unmask=yes,strand=all,expected occurrences=10,minimum low complexitylength=15/5,multi-pass e-value=0.01,constant for multi-pass=25,dropoff for final gapped alignment=25以及scoringmatrix=BLOSUM62。在使用NCBI-BLAST2进行氨基酸(或核苷酸)序列比较的情况下,给定氨基酸序列A(或给定核苷酸序列A)相对于给定氨基酸序列B(或给定核苷酸序列B)(或者也可将其称为相对于给定氨基酸序列B(或给定核苷酸序列B)而言具有一定氨基酸(或核苷酸)序列同一性百分比的给定氨基酸序列A(或给定核苷酸序列A))的氨基酸(或核苷酸)序列同一性百分比按如下进行计算:100乘以分数X/Y,其中X是在A与B的程序比对中被序列比对程序NCBI-BLAST2评定为相同匹配物的氨基酸(或核苷酸)的数目,Y是B中氨基酸(或核苷酸)残基的总数。显然,当序列A的长度与序列B的长度不等时,A相对于B的序列同一性百分比并不等于B相对于A的序列同一性百分比。
在另一实施方案中,还可如下所述通过使用WU-BLAST-2计算机程序来获得氨基酸序列同一性百分比和核苷酸序列同一性百分比(Altschul等,Methods in Enzymology 266:460-480(1996))。所有WU-BLAST-2检索参数均设置成默认值。当使用WU-BLAST-2时,给定氨基酸序列A(或核苷酸序列A)相对于给定氨基酸序列B(或给定核苷酸序列B)的氨基酸(或核苷酸)序列同一性百分比按如下进行测定:用(a)通过WU-BLAST-2测得的序列A与序列B之间匹配的相同氨基酸(或核苷酸)残基数除以(b)序列B的总残基数,并(c)乘以100。
在一些实施方案中,本公开内容提供了热稳定的羧酸酯酶。本文使用的术语“热稳定的羧酸酯酶”指在与等于或高于约40℃的高温接触一段时间后能够保持水解羧酸酯基团的可检测酶活性的羧酸酯酶。羧酸酯酶的酶活性可使用本领域已知的任何方法进行检测,例如,通过测量在给定的一组反应条件下底物的消失或者产物的形成。检测羧酸酯酶酶活性的示例性方法为分光光度法、放射性测量法、比色法或基于高效液相色谱的方法。羧酸酯酶底物的说明性实例为乙酸萘酯(NA)、乙酸对硝基苯酯(p-NPA)、硫代丁酸甲酯(MtB)或14C标记的酯。可以使用相同的方法在相同条件下(但无羧酸酯酶)测量对照酶活性。如果羧酸酯酶酶活性的数值高于对照酶活性的,则认为它是可检测的。
在一些实施方案中,所述高温为约40℃至约100℃,或者约50℃至约90℃,或者约50℃至约70℃。在一些实施方案中,所述高温为约50℃、约55℃、约60℃、约65℃、约70℃、约75℃、约80℃、约85℃或约90℃。羧酸酯酶可接触所述高温的接触时间可由本领域技术人员来确定。在一些实施方案中,所述接触时间高达10分钟、30分钟、1小时、2小时、3小时、4小时、5小时、6小时、12小时、1天、2天、3天、4天、5天、6天、7天、8天、9天或10天。在一些实施方案中,所述接触时间为30分钟至10天,或者30分钟至5天,或者30分钟至1天,或者30分钟至6小时,或者30分钟至2小时、1小时至2小时或者10分钟至30分钟。
在一些实施方案中,本公开内容提供了嗜热脂肪地芽孢杆菌的热稳定羧酸酯酶,其具有SEQ ID NO:9、11、13和15所示的氨基酸序列。如图8所示,SEQ ID NO:9的热稳定羧酸酯酶的酶活性在该酶已与高温接触一段时间(例如但不限于与40℃、50℃、60℃、70℃和80℃分别接触10分钟或30分钟)后仍可检测到。还测量了接触37℃并在该温度下测试的羧酸酯酶的酶活性作为标准参照。样品和标准参照在其他方面均相同的条件下测试和测量。与60℃接触10分钟后,该羧酸酯酶的酶活性可以为标准参照的将近100%。与70℃接触30分钟后,该羧酸酯酶的酶活性可以为标准参照的60%以上。
表达载体和宿主细胞
在一个方面中,本公开内容提供了经改造而表达编码羧酸酯或其变体之基因的真核细胞。
本文使用的术语“表达”包括羧酸酯酶的产生中所涉及的一个或多个步骤,包括但不仅限于转录、转录后修饰、翻译、翻译后修饰和分泌。本文使用的术语“经改造而表达”指以宿主细胞中非天然的方式使该宿主细胞能够表达羧酸酯酶或其变体的一个或多个步骤。在一些实施方案中,术语“经改造而表达”包括将编码羧酸酯酶或其变体的外源基因引入宿主细胞以在该宿主细胞中表达羧酸酯酶或其变体的一个或多个步骤。在一些实施方案中,通过使宿主细胞接触诱变剂来改造该宿主细胞而使其表达突变的羧酸酯酶或者带有突变的调节序列的羧酸酯酶。
本文使用的术语“基因”指含有编码肽或多肽之信息的多聚核糖核苷酸或多聚脱氧核糖核苷酸或者混合的多聚核糖核苷酸-多聚脱氧核糖核苷酸。其包括单链和双链的分子,即DNA-DNA、DNA-RNA和RNA-RNA杂交体,以及通过将碱基缀合到氨基酸骨架上形成的“蛋白质核酸”(protein nucleic acid,PNA)。基因包括天然多核苷酸或者由天然碱基或修饰碱基形成的合成多核苷酸。术语“基因”还涵盖结构基因的编码区和位于5′和3′末端编码区附近的用于RNA或多肽的转录或翻译的序列,以及各个编码区段(外显子)之间的间插序列(内含子)。
术语“肽”或“多肽”指通过肽键或修饰的肽键彼此连接的氨基酸(即,在后一情况下为肽的等构物),并可含有除20种天然氨基酸以外的修饰氨基酸。术语“肽”或“多肽”还包括肽或多肽的片段、基序等,糖基化的肽或多肽,以及其他修饰的肽或多肽。
本文使用的术语“编码”指能够被转录成mRNA和/或翻译成肽或蛋白质。
在一些实施方案中,将编码羧酸酯酶或其变体的基因插入表达载体中,以由宿主细胞进行表达。
本文使用的术语“表达载体”指核苷酸运载体,其中可操纵地插入了编码肽或蛋白质的基因,从而可以表达所编码的肽或蛋白质。在一些实施方案中,通过将编码羧酸酯酶或其变体的基因插入到适于在真核细胞中表达的重组载体中来形成表达载体。可用于构建表达载体的核苷酸运载体的说明性实例包括但不限于质粒、噬菌粒、粘粒、人工染色体(如酵母人工染色体、细菌人工染色体或P1衍生的人工染色体)、噬菌体(如λ噬菌体或M13噬菌体)、动物病毒(如逆转录病毒、腺病毒或乳多空病毒)以及植物病毒(如马铃薯X病毒)。许多真核表达载体以商品提供。对合适表达载体的选择在本领域技术人员的能力之内。
在一些实施方案中,所述表达载体是适于在酵母细胞中表达的载体。说明性实例为pPIC3K(Invitrogen,Carlsbad,CA)、pPIC9K(Invitrogen)、pAO815(Invitrogen)、pGAPZ(Invitrogen)、pYC2/CT(Invitrogen)、pYD1酵母展示载体(Invitrogen)、pESC载体(Stratagene,La Jolla,CA)、pESC-HIS载体(Stratagene)和pHIPX4(Gietl等,Mutational analysis of theN-terminal topgenic signal of watermelon glyoxysomal malate dehydrogenaseusing the heterologous host Hansenula polymorpha.Proc.Natl.Acad.Sci.USA 1994,vol 91,3151-3155)。在一些实施方案中,所述表达载体是适于在酵母细胞中表达的质粒。在一些实施方案中,所述表达载体是适于在巴斯德毕赤酵母中表达的质粒。在一些实施方案中,所述表达载体是pPIC9K。
在一些实施方案中,所述表达载体是适于在丝状真菌细胞中表达的载体。说明性实例为pPTR(TaKaRa Bio Inc.,Shiga,Japan)、pDG1(ATCC目录号53005)、pAB366(ATCC目录号77134)、pAB520(ATCC目录号77137)、质粒pYG1.2(Liu等,Construction of recombinant expressionplasmid for Aspergillus niger,Journal of Tongji University(Medical science)2001,vol 22,1-3)、pTAex3(Sakuradani等,D6-Fatty acid desaturase from anarachidonic acid-producing Mortierella fungus Gene cloning and itsheterologous expression in a fungus,Aspergillus,Gene 1999,vol 238,445-453)、pSa123(Gomi等,Integrative transformation of Aspergillus oryzaewith a plasmid containing the Aspergillus nidulans argB gene,Agric.Biol.Chem.1987,vol 51,2549-2555)、pNAN8142(Hiroyuki等,Expression ofAspergillus oryzae Phytase Gene in Aspergillus oryzae RIB40 niaD,Journalof bioscience and bioengineering,2006,Vol 102,No.6,564-567)。在一些实施方案中,所述表达载体是适于在丝状真菌细胞中表达的质粒。在一些实施方案中,所述表达载体是适于在黑曲霉中表达的质粒。在一些实施方案中,所述表达载体是pYG1.2。含有pYG1.2质粒的大肠杆菌DH5α菌株(大肠杆菌DH5α/pYG1.2)在2009年7月27日按照布达佩斯条约的规定和条件保藏于中国武汉大学的中国典型培养物保藏中心(CCTCC),保藏号为CCTCC M 209165。
在另一个方面中,本公开内容提供了表达载体,其包含编码来自微生物的羧酸酯酶或其变体的基因以及能促进该羧酸酯酶或其变体在真核细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
在另一个方面中,本公开内容提供了表达载体,其包含编码羧酸酯酶或其变体的基因以及能促进该羧酸酯酶或其变体在丝状真菌细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
在一些实施方案中,所述羧酸酯酶或其变体的氨基酸序列与SEQ IDNO:9、11、13或15的氨基酸序列具有至少70%的序列同一性。在一些实施方案中,所述羧酸酯酶或其变体的氨基酸序列与SEQ ID NO:9、11、13或15的氨基酸序列具有至少90%的序列同一性。在一些实施方案中,所述羧酸酯酶或其变体的核苷酸序列与SEQ ID NO:10、12、14或16的核苷酸序列具有至少70%的序列同一性。在一些实施方案中,所述羧酸酯酶或其变体的核苷酸序列与SEQ ID NO:10、12、14或16的核苷酸序列具有至少90%的序列同一性。
术语“调节序列”包括对于表达本公开内容的羧酸酯酶或其变体而言是必要或有利的任何组分。这些调节序列可包括但不限于启动子序列、转录终止子、前导序列和多腺苷酸化序列。术语“可操纵连接”指基因序列以允许编码羧酸酯酶或其变体之基因表达的方式与一个或多个调节序列直接或间接地连接或关联。所述基因编码序列和所述一个或多个调节序列可位于同一多核苷酸分子上,并以允许编码羧酸酯酶或其变体之基因表达的方式定位。所述基因编码序列和所述一个或多个调节序列可位于不同的多核苷酸分子上,但该调节序列能够发挥功能来影响编码羧酸酯酶或其变体之基因的表达。
所述调节序列可含有适当的启动子序列。本文使用的“启动子序列”指控制与其可操纵连接的DNA序列转录的DNA区段。启动子序列包括足以进行RNA聚合酶的识别、结合和转录起始的特定序列。此外,启动子序列可包括调节RNA聚合酶的这种识别、结合和转录起始活性的序列。这些序列可影响同一分子或不同分子上的转录。启动子序列的功能(根据调节的性质)可以是组成型的,或者可由刺激诱导。适于在真核细胞中进行转录控制的任何启动子序列均可使用。在一些实施方案中,所述启动子序列适于在酵母细胞和/或丝状真菌细胞中进行转录控制。酵母细胞中合适的启动子序列的说明性实例为TEF启动子、CYC启动子、ADH1启动子、3-磷酸甘油激酶启动子、甘油醛-3-磷酸脱氢酶(GAFDH或GAP)启动子、半乳糖激酶(GAL1)启动子、半乳糖差向异构酶启动子和醇脱氢酶(ADH1)启动子。丝状真菌细胞中合适的启动子序列的说明性实例为α-淀粉酶启动子、葡萄糖淀粉酶启动子、醇脱氢酶启动子(Kinghorn等,Applied moleculargenetics of filamentous fungi,Springer press,1992,第18页)。在一些实施方案中,所述启动子为诱导型启动子,该启动子能够应答于化学或物理刺激而开启或关闭羧酸酯酶的表达。诱导型启动子的说明性实例为AOX1启动子(可由甲醇诱导)、GAL1启动子(可由半乳糖诱导)、CUP启动子(可由Cu2+诱导)(Wei Xiao,Yeast protocols,第二版,Humana Press,2005,p320)和alc A启动子(可由醇诱导)。
调节序列可含有适当的转录终止子序列,该序列是被真核细胞RNA聚合酶识别以终止转录的序列。终止子序列可与编码羧酸酯酶或其变体之核苷酸序列的3′端可操纵地连接。可在真核细胞中发挥功能的任何终止子序列均可用于本公开内容。在一些实施方案中,终止子序列可以是在酵母细胞或丝状真菌细胞中具有转录终止活性的核苷酸序列。
调节序列还可含有适当的前导序列,该序列是对于真核细胞的翻译具有重要性的mRNA非翻译区。前导序列可与编码羧酸酯酶或其变体之核苷酸序列的5′端可操纵地连接。可在真核细胞中发挥功能的任何前导序列均可用于本公开内容。在一些实施方案中,前导序列可以是可在酵母细胞或丝状真菌细胞中发挥功能的核苷酸序列。
调节序列还可含有多聚腺苷酸化序列,这是可与羧酸酯酶基因序列的3′端可操纵连接的序列,其在转录时被真核细胞识别成向所转录的mRNA添加多聚腺苷酸残基的信号。可在真核细胞中发挥功能的任何多聚腺苷酸化序列均可用于本公开内容。在一些实施方案中,所述多聚腺苷酸化序列可在酵母细胞或丝状真菌细胞中发挥功能。
在另一个方面中,本公开内容提供了表达载体,其任选地还包含用于选择性鉴定该表达载体的标志物基因。标志物基因是编码蛋白质的基因,所述蛋白质可作为用于鉴定包含该基因之细胞的选择标志物。典型的标志物基因编码具有一个或多个以下特征的蛋白质:i)赋予对抗生素或其他有毒物质(如氨苄青霉素、新霉素、氨甲喋呤等)的抗性;ii)对营养缺陷进行回补(complement);以及iii)提供无法从培养基中获得的必要养分。标志物基因可以为诱导型或非诱导型的,并且一般允许进行阳性选择。用于酵母宿主细胞的合适标志物基因包括但不限于Ade2、His3、Leu2、Lys2、Met3、Trp1、Ura3以及新霉素或卡那霉素或氨苄青霉素抗性基因。用于丝状真菌宿主细胞的合适标志物基因包括但不限于amdS(乙酰胺酶)、argB(鸟氨酸氨甲酰基转移酶)、bar(膦丝菌素乙酰转移酶)、hygB(潮霉素磷酸转移酶)、niaD(硝酸还原酶)、pyrG(乳清苷-5′-磷酸脱羧酶)、sC(硫酸腺苷酰转移酶)、trpC(氨基苯甲酸合成酶)以及它们的等同物。
在另一个方面中,本公开内容提供了一种产生羧酸酯酶或其变体的方法,其中所表达的羧酸酯酶或其变体分泌到真核细胞的外面。在一些实施方案中,编码羧酸酯酶或其变体的基因与编码信号肽的信号序列连接。本文使用的术语“信号肽”指这样的氨基酸序列,其与羧酸酯酶或其变体连接,并使得所表达的羧酸酯酶或其变体能够被转运/分泌到细胞膜的外面。在一些实施方案中,信号肽可以与羧酸酯酶或其变体的氨基端连接。在一些实施方案中,信号肽可被肽酶切割,以从羧酸酯酶或其变体上除去该信号肽。
信号序列可以是一种或多种天然存在的羧酸酯酶信号序列,或者是加入羧酸酯酶中的外源信号序列。在一些实施方案中,所述信号序列可在酵母细胞和/或丝状真菌细胞中发挥功能。可在酵母细胞中发挥功能的信号肽的说明性实例为鸡溶菌酶信号肽(CLSP)、酿酒酵母α-因子的信号肽和酿酒酵母转化酶的信号肽。可在丝状真菌细胞中发挥功能的信号肽的说明性实例为米曲霉TAKA淀粉酶的信号肽、黑曲霉中性淀粉酶的信号肽、黑曲霉葡萄糖淀粉酶的信号肽、米赫根毛霉(Rhizomucor miehei)天冬氨酸蛋白酶的信号肽、特异腐质霉(Humicola insolens)纤维素酶的信号肽以及柔毛腐质霉(Humicola lanuginosa)脂肪酶的信号肽。
本文使用的术语“宿主细胞”指对表达载体的转化、转染、转导等敏感的细胞。
可以使用本领域中任何合适的方法将表达载体引入真核细胞中,所述方法包括但不限于电穿孔;氯化钙、氯化锂、乙酸锂/聚乙二醇、磷酸钙、DEAE-葡聚糖、脂质体介导的DNA摄入;原生质球(spheroplasting);注射;显微注射;微粒轰击;噬菌体感染;病毒感染或其他成熟的方法。或者,可对含有目的基因序列的表达载体进行体外转录,并可通过熟知的方法(如注射)将所得mRNA引入宿主细胞中(参阅Kubo等,Location of aregion of the muscarinic acetylcholine receptor involved in selective effectorcoupling,FEBS Letts.1988,vol 241,119)。
在一些实施方案中,可通过诸如以下的方法将表达载体引入酵母细胞中:原生质体转化(参阅如Spencer等,Genetic manipulation ofnon-conventional yeasts by conventional and non-conventional methods.JBasic Microbiol.,1988,vol 28,No.5,321-333)、感受态细胞转化(参阅如Gietz等,Frozen competent yeast cells that can be transformed with highefficiency using the LiAc/SS carrier DNA/PEG method,Nat Protoc.2007;2(1):1-4)、电穿孔(参阅如Suga等,High-efficiency electroporation byfreezing intact yeast cells with addition of calcium,Curr Genet.,2003,vol 43,No.3,206-211)或者通过细胞之间的接触进行接合(参阅如Nishikawa等,Trans-kingdom conjugation offers a powerful gene targeting:tool in yeast,1998,Genetic Analysis:Biomolecular Engineering,vol 14,No.3,65-73)。
在一些实施方案中,可通过原生质体转化将表达载体引入丝状真菌细胞中,其包括原生质体分离、再生和融合的步骤(参阅Arora等,Handbookof fungal biotechnology,第二版,CRC Press,2004,第9-24页)。用于转化丝状真菌细胞的合适方法已描述于多种出版物中(参阅如Ruiz等,Strategies for the transformation of filamentous fungi,J Appl Microbiol.,2002,vol 92,No.2,189-195;Hynes等,Genetic transformation offilamentous fungi,Journal of Genetics,1996,vol 75,No.3,297-311)。
在另一个方面中,本公开内容提供了包含表达载体的真核细胞,其中所述表达载体中含有编码来自微生物的羧酸酯酶或其变体的基因以及能够促进所述羧酸酯酶或其变体在真核细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。在一些实施方案中,所述真核细胞为酵母细胞。在一些实施方案中,所述真核细胞为巴斯德毕赤酵母。
在另一个方面中,本公开内容提供了包含表达载体的真核细胞,所述表达载体中含有编码羧酸酯酶或其变体的基因以及能够促进所述羧酸酯酶或其变体在丝状真菌细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。在一些实施方案中,所述丝状真菌细胞为黑曲霉。
细胞培养
经改造而表达本公开内容的羧酸酯酶或其变体的真核细胞可以在适于表达该羧酸酯酶或其变体的条件下在任何合适的培养基中进行培养。例如,所述细胞可通过摇瓶培养、实验室或工业发酵罐中的小规模或大规模发酵(包括连续发酵、分批发酵、分批补料发酵或固态发酵)来进行培养。培养可在包含碳源、氮源和无机盐的合适的营养培养基中进行。合适的培养基可从商业供应商获得,或者可使用市售的成分来制备。
可以调节培养条件(如温度、pH、孵育时间以及诱导物的存在)以使得能够更高地表达羧酸酯酶。培养条件可由本领域技术人员进行调节。在一些实施方案中,可通过以下步骤来确定培养条件:在广泛的条件下培养经改造而表达羧酸酯酶或其变体的细胞,测量羧酸酯酶或其变体的表达,以及选择允许相对高水平地表达羧酸酯酶或其变体的培养条件。
细胞培养中合适的温度、pH和孵育时间一般取决于宿主细胞。在一些实施方案中,培养温度可为约20℃至约80℃、约30℃至约70℃、约30℃至约60℃、约30℃至约50℃或者约30℃至约40℃。在一些实施方案中,培养温度为约20℃、约25℃、约30℃、约35℃、约37℃、约40℃、约50℃、约60℃、约70℃或约80℃。
在一些实施方案中,培养pH可为约2至约8.5、约3至约8.5、约4至约8.5、约5至约8.5、约6至约8.5或者约7至约8.5。在一些实施方案中,培养pH为约2、约3、约4、约5、约6、约7、约8或约8.5。
在一些实施方案中,孵育时间可为至少1天、至少2天、至少3天或至少4天。在一些实施方案中,孵育时间可为1天至10天、2天至9天、3天至8天或者4天至7天。在一些实施方案中,孵育时间为1天、2天、3天、4天、5天、6天、7天、8天、9天或10天。
在一些实施方案中,可在诱导物存在下培养真核细胞,所述诱导物可诱导羧酸酯酶或其变体的表达。可以基于与编码羧酸酯酶或其变体之基因可操纵连接的诱导型启动子来选择诱导物。在一个说明性实施方案中,在甲醇存在下培养真核细胞,以诱导与编码羧酸酯酶或其变体之基因可操纵连接的AOX1启动子。在另一说明性实施方案中,在半乳糖存在下培养真核细胞,以诱导GAL1启动子。在另一说明性实施方案中,在Cu2+存在下培养真核细胞,以诱导CUP启动子。在另一说明性实施方案中,在一种或多种醇存在下培养真核细胞,以诱导alc A启动子。细胞培养基中诱导物的量可由本领域技术人员进行调整,以允许相对更高水平地表达羧酸酯酶或其变体。
可使用本领域的成熟技术来测定羧酸酯酶或其变体的表达水平。在一些实施方案中,通过对所转录mRNA的量或所翻译蛋白质的量进行定量来测定羧酸酯酶或其变体的表达水平
在一个说明性实施方案中,可通过Northern印迹分析来测定mRNA水平(Alwine等,Method for detection of specific RNAs in agarose gels bytransfer to diazobenzyloxymethyl-paper and hybridization with DNA probes,Proc.Natl.Acad.Sci.USA 1977,vol 74,5350-5354;Bird,Size separationand quantification of mRNA by northern analysis,Methods Mol.Biol.1998,vol 105,325-36)。简言之,从细胞中分离poly(A)RNA,通过凝胶电泳进行分离,印迹到支持表面上(例如硝酸纤维素或Immobilon-Ny转移膜(Millipore Corp.,Bedford,Mass.)),并与能与目的mRNA杂交的标记(例如荧光标记或放射性标记)寡核苷酸探针一起孵育。在另一说明性实施方案中,可通过定量RT-PCR(综述参阅Freeman等,Quantitative RT-PCR:pitfalls and potential,Biotechniques 1999,vol 26,112-122)或半定量RT-PCR分析(Ren等,Lipopolysaccharide-induced expression of IP-10 mRNA in ratbrain and in cultured rat astrocytes and microglia,Mol.Brain Res.1998,vol59,256-263)来测定mRNA水平。根据该技术,从细胞中分离poly(A)RNA,用于合成cDNA,并将所得cDNA与PCR引物一起孵育,所述引物能与模板杂交并扩增该模板序列,以产生与目的mRNA的细胞水平成正比的PCR产物水平。
在一个说明性实例中,通过电泳(如十二烷基硫酸钠聚丙烯酰胺凝胶电泳(SDS-PAGE))来检测所表达的羧酸酯酶或其变体,可以使用商品扫描仪(例如Bio-Rad的GS-800光密度计)对SDS-PAGE上羧酸酯酶条带的光密度进行扫描,以对蛋白质进行定量。在另一说明性实例中,使用特异性识别羧酸酯酶或其变体的抗体,通过Western印迹分析来检测所表达的羧酸酯酶或其变体。在另一说明性实施方案中,通过使用底物测量其酶活性来检测所表达的羧酸酯酶或其变体。
可以使用本领域已知的方法来测定所表达羧酸酯酶的酶活性。可以通过测量底物的消失或产物的形成来表征酶活性。所述测量可以是分光测量、放射性测量、比色测量或者基于高效液相色谱的测量。羧酸酯酶反应的任何合适的底物均可使用。羧酸酯酶底物的说明性实例为乙酸萘酯(NA)、乙酸对硝基苯酯(p-NPA)、硫代丁酸甲酯(MtB)或14C标记的酯。在一个说明性实施方案中,可通过对生色试剂Fast Blue B盐与α-萘酚之间所形成的复合体(其为底物α-萘基乙酸酯被羧酸酯酶水解的产物)进行分光测量来定量羧酸酯酶的酶活性。
在另一个方面中,本公开内容提供了一种产生羧酸酯酶或其变体的方法,其中所述羧酸酯酶或其变体以至少约12mg/L以及高达100mg/L的量产生。在一些实施方案中,羧酸酯酶或其变体的产量为至少12mg/L或至少15mg/L或至少17mg/L或至少19mg/L。在一些实施方案中,羧酸酯酶或其变体的产量为1mg/L至100mg/L、10mg/L至100mg/L、15mg/L至100mg/L、20mg/L至100mg/L、30mg/L至100mg/L、10mg/L至50mg/L、15mg/L至50mg/L、20mg/L至50mg/L或30mg/L至50mg/L。在一些实施方案中,羧酸酯酶或其变体以约1mg/L、约5mg/L、约10mg/L、约12mg/L、约15mg/L、约20mg/L、约30mg/L、约40mg/L、约50mg/L或约100mg/L的量产生。
分离所表达的羧酸酯酶
可以使用本领域已知的标准方法从细胞培养基中分离所表达的羧酸酯酶或其变体,包括但不限于离心、过滤、萃取、喷雾干燥、蒸发或沉淀(参阅I.M.Rosenberg编辑,Protein Analysis and Purification:BenchtopTechniques,1996,Birkhauser,Boston,Cambridge,Mass.;Janson等,Protein Purification,1989,VCH Publish ers,New York))。
还可以通过本领域已知的多种方法进一步纯化所表达的羧酸酯酶,所述方法包括但不限于色谱法(如离子交换色谱、亲和色谱、疏水色谱、色谱聚焦和尺寸排阻色谱)、电泳法(如制备型等电聚焦、SDS-PAGE)和差异性溶解度(如硫酸铵沉淀)。
在一些实施方案中,使用基于抗体的方法来分离和纯化所表达的羧酸酯酶或其变体。可以使用本领域已知且已实践过的方法来产生和分离能与羧酸酯酶或其变体结合的抗体。可以通过在与抗体缀合的固相基质上的色谱(如免疫沉淀)从细胞裂解物或培养基上清液中纯化羧酸酯酶或其变体(参阅Harlow等,Using Antibodies:A Laboratory Manual,Cold SpringHarbor Laboratory,1999,Cold Spring Harbor,N.Y.)。
在另一个方面中,本公开内容提供了分离的羧酸酯酶或其变体。本文使用的“分离的羧酸酯酶”指其中基本不含有在本文所述生产方法中与其相关的其他细胞组分的羧酸酯酶。“基本不含有”包括这样的羧酸酯酶制备物,其中含有的不是目的羧酸酯酶或其变体的其他细胞组分或其他污染物低于约50%、40%、30%、20%、10%、5%或1%(干重)。在一些实施方案中,分离的羧酸酯酶中含有的不是目的羧酸酯酶或其变体的污染物低于50%、40%、30%、20%、10%、5%或1%(干重)。
含有所表达羧酸酯酶的组合物及其用途
在另一个方面中,本公开内容提供了一种包含真核细胞以及该真核细胞所表达的羧酸酯酶或其变体的组合物。在一些实施方案中,本公开内容提供了一种包含真核细胞以及该真核细胞所表达的微生物羧酸酯酶或其变体的组合物。在一些实施方案中,本公开内容提供了一种包含丝状真菌细胞以及该丝状真菌细胞所表达的羧酸酯酶或其变体的组合物。在一个说明性实施方案中,所述组合物直接得自细胞培养物,所述细胞培养物中含有经改造而表达编码羧酸酯酶或其变体之基因的真核细胞。可对所述细胞培养物进行过滤或离心或其他处理,以除去培养基、细胞碎片和/或其他不想要的物质。还可对所述细胞培养物进行本领域技术人员认为适当的其他纯化处理,以提高其中羧酸酯酶的浓度。所述组合物可以液体形式或干燥固体形式制备。
在另一个方面中,本公开内容提供了一种包含通过本文所述方法产生的分离的羧酸酯酶或其变体的组合物。在一些实施方案中,所述含有分离的羧酸酯酶或其变体的组合物中包含的不是目的羧酸酯酶或其变体的污染物不高于50%、40%、30%、20%、10%、5%或1%(干重)。
本公开内容的组合物可用于多种生物、农业和药物应用中。本公开内容的组合物可用于将含有羧酸酯基团的化合物转化成无羧酸酯基团的化合物,包括将所述含有羧酸酯基团的化合物与羧酸酯酶或其变体一起孵育。在一个说明性实施方案中,该组合物用于将含有羧酸酯基团的前药转化成无羧酸酯基团的药物。在另一说明性实施方案中,该组合物用于将含有羧酸酯基团的杀虫剂转化成无羧酸酯基团的去毒性杀虫剂。
在另一个方面中,本公开内容提供了一种使杀虫剂去毒性的方法,所述方法包括将杀虫剂与本文提供的组合物一起孵育。本文使用的“杀虫剂”指用于对害虫(如昆虫)、植物病原体和杂草进行杀伤、抵制或发挥对抗作用的化学药剂。一些杀虫剂的化学结构中含有一个或多个羧酸酯基团。利用羧酸酯酶对羧酸酯基团进行水解可将有毒的杀虫剂转化成无毒物质。这可用于清除农产品(如蔬菜和水果)或环境(如水和土壤)中不想要的杀虫剂残留。这还可用于在中毒事件中减轻或消除杀虫剂的毒效应。可通过羧酸酯酶去毒性的杀虫剂的说明性实例为有机磷酸酯类杀虫剂、氨基甲酸酯类杀虫剂和拟除虫菊酯类杀虫剂。
在一些实施方案中,所述组合物可以经改造而表达羧酸酯酶或其变体之宿主细胞的细胞提取物或培养物上清液的形式提供,或者可以分离的羧酸酯酶或其变体的形式提供。待使用组合物的量可由实施本方法的人员根据需要来确定。在一些实施方案中,待使用的所述组合物的量可取决于待去毒性的样品中所含杀虫剂的量和类型,以及该组合物水解该特定杀虫剂的酶活性。在一个说明性实施方案中,将0.1nmol羧酸酯酶与含有1nmol的含羧酸酯基团之杀虫剂的样品一起孵育;该羧酸酯酶在孵育4小时后降解了约100%的杀虫剂。在另一个说明性实施方案中,将0.1nmol羧酸酯酶与含有2nmol的含羧酸酯基团之杀虫剂的样品一起孵育;该羧酸酯酶在孵育6小时后降解了约85%的杀虫剂。
孵育时间可由实施本方法的人员根据需要来确定。在一些实施方案中,孵育时间可为约1小时至3周、约3小时至约7天以及约4小时至约3天,例如约1小时、约1.5小时、约3小时、约6小时、约9小时、约1天、约3天、约7天、约9天、约12天、约15天和约3周。本领域技术人员可对其他孵育条件(如温度、pH、辅助因子的存在情况)进行选择,从而以更高的百分比对杀虫剂进行去毒性。
在一个实施方案中,所处理样品中杀虫剂的量可减少至少30%、至少40%、至少50%、至少60%、至少70%、至少80%或至少90%。在一些实施方案中,所处理样品中杀虫剂的量可减少30%至100%、40%至90%、50%至80%或者60%至70%。在一些实施方案中,所处理样品中杀虫剂的量可减少30%、40%、50%、60%、70%、80%、90%、95%、99%和100%。
在另一个方面中,本公开内容提供了一种将前药转化成药物的方法,所述方法包括将该前药与本文提供的组合物一起孵育。一些前药中可含有使该前药无药物活性的羧酸酯基团。本文所述羧酸酯酶能水解羧酸酯基团并将该无活性前药转化成活性药物。在一个说明性实例中,通过羧酸酯酶将伊立替康(一种抗癌前药)转化成活性药物化合物7-乙基-10-羟基喜树碱(一种拓扑异构酶I抑制剂)(Yoon等,Activation of a camptothecinprodrug by specific carboxylesterases as predicted by quantitativestructure-activity relationship and molecular docking studies,Mol CancerTher 2003,vol 2,1171)。在另一个说明性实例中,通过羧酸酯酶将前药奥塞米韦转化成活性药物奥塞米韦羧酸酯(Shi等,anti-influenza viral prodrugoseltamivir is activated by carboxylesterase hce1 and the activation isinhibited by anti-platelet agent clopidogrel,J.Phar.Exp.Ther.2006,vol 319,1477-1484)。
实施例
公开以下实施例以帮助理解本公开内容,不应将其理解为以任何方式限制本发明的范围,本发明的范围由其后的权利要求书限定。
材料和培养基
1%琼脂糖凝胶:400mg琼脂糖、39.2ml H2O、0.8ml 50×TAE和1μg溴化乙锭。
LB培养基(1L):10g胰蛋白胨、5g酵母提取物、10g NaCl,pH7.4。
LB平板:含有1.5%琼脂的LB培养基。
RDB平板:1M山梨醇、2%葡萄糖、1.34%酵母氮源(YNB)、2%琼脂、4×10-5%生物素和0.005%氨基酸。
YPD培养基:1%酵母提取物、2%胰蛋白胨和2%葡萄糖。
YPD平板:1%酵母提取物、2%胰蛋白胨、2%葡萄糖和2%琼脂粉。
BMMY培养基:1%酵母提取物、2%胰蛋白胨、100mM磷酸钾缓冲液(pH 6.0)、1.34%YNB、4×10-5%生物素和0.5%甲醇。
MN培养液(400ml):16ml MN盐溶液(250ml H2O中的37.5g硝酸钠、3.25g氯化钾、9.5g磷酸二氢钾)、0.4ml微量元素(每100ml中含有2.2g七水合硫酸锌、1.1g硼酸、0.5g四水合氯化锰、0.5g七水合硫酸亚铁、0.17g六水合氯化钴、0.16g五水合硫酸铜、0.15g钼酸钠、5g乙二胺四乙酸二钠,pH 6.5)、6g葡萄糖、0.4g酸水解酪蛋白、8ml 50×MgSO4溶液(250ml H2O中的6.5g MgSO4·7H2O),pH 6.5。
MN+URI培养液(400ml):0.4g尿苷、400ml MN培养液。
MN+SORB琼脂培养基(1L):40ml MN盐溶液、1ml微量元素、10g葡萄糖、218.64g山梨醇、15g琼脂、20ml 50×MgSO4溶液,pH6.5。
STC缓冲液(300ml):65.6g山梨醇、0.36g Tris碱、2.2g CaCl2·2H2O,pH 7.5。
PEG溶液(100ml):60g PEG6000、0.12g Tris、0.74g CaCl2,pH7.5。
NM缓冲液(500ml):29.25g NaCl、2.132g MES,pH 5.8。
MM缓冲液(200ml):59.15g MgSO4·7H2O、0.8g MES,pH 5.8。
6×SDS-PAGE上样缓冲液:300mM Tris-HCl(pH 6.8)、12%(w/v/)SDS、0.6%(w/v)溴酚蓝、60%(v/v)甘油、6%(w/v)β-巯基乙醇。
PBST缓冲液:补充了0.1(v/v)Tween 20的0.01M磷酸盐缓冲盐水,pH 7.2。
MGY培养基:1.34%YNB、1%甘油、4×10-5%生物素。
酶稀释缓冲液(100ml):3.5g NaCl、0.11g CaCl2、1g葡萄糖,pH5.8。
溶壁酶:0.2g溶壁酶溶于酶稀释缓冲液中。
BSA:12mg/ml BSA溶于酶稀释缓冲液中。
12%聚丙烯酰胺分离胶(10ml):4ml H2O、3.3ml 30%聚丙烯酰胺、2.5ml 1.5M Tris-HCl(pH 8.8)、0.1ml 10%SDS、0.1ml 10%AP、4μl TEMED。
5%聚丙烯酰胺成层胶(5ml):3.44ml H2O、0.83ml 30%聚丙烯酰胺、0.63ml 0.5M Tris-HCl(pH 6.8)、0.05ml 10%SDS、0.05ml 10%AP、4μl TEMED。
所有的培养基和溶液均已灭菌。
实施例1:在黑曲霉M54菌株中表达重组羧酸酯酶。
通过PCR从嗜热脂肪地芽孢杆菌CICC 20156菌株(购自中国上海的中国工业微生物菌种保藏中心)中扩增羧酸酯酶基因,其中使用CICC20156菌株的基因组DNA作为模板,并使用以下引物:正向引物P1(SEQID NO:33)和反向引物P2(SEQ ID NO:34)。该引物含有来自SEQ IDNO:9所示的羧酸酯酶基因的序列。正向引物含有Xba I位点和KEX2位点,而反向引物包含HpaI位点以及编码六聚组氨酸标签的核苷酸(His-tag,His-tag的编码序列以波浪线标出),所述六聚组氨酸标签由可用于亲和力纯化和抗体检测的六个组氨酸残基组成。反向引物中含有与羧酸酯酶基因3′末端核苷酸序列的一部分相连的His-tag编码序列。使用引物P1和P2产生的PCR产物称为CarE-his基因(SEQ ID NO:35),其编码羧酸酯酶在C端与His-tag融合而成的融合蛋白。
表4.引物P1和P2以及CarE-his基因的核苷酸序列。
| 名称 | 序列 |
| P1 | 5’CGTCTAGAAAGAGAATGATGAAAATTGTTCCGCCG 3’(SEQ ID NO: |
所述PCR在具有以下组成的50μl反应体系中进行:5μl 10×Pfu缓冲液(天根生化科技(北京)有限公司)、4μl dNTP混合物(天根生化科技(北京)有限公司)、1μl正向引物P1、1μl反向引物P2、1μl模板DNA、1μl pfu DNA聚合酶(天根生化科技(北京)有限公司)和37μl双蒸水。PCR中使用以下的循环:95℃ 5分钟,然后是95℃45秒、60℃ 45秒和72℃ 90秒的30个循环;其后是72℃ 5分钟。接着将PCR产物置于1%琼脂糖凝胶上进行电泳,切下780bp的条带并使用凝胶提取试剂盒(天根生化科技(北京)有限公司),根据生产商的说明进行凝胶提取。
将纯化的PCR产物插入pYG1.2载体中,该载体使用之前已公开的方法来构建(Liu,Zhongbin等,Construction of recombinant expressionplasmid for Aspergillus niger,Journal of Tongji University(medicalscience),2001,22,vol 3,1-3)。pYG1.2载体含有来自黑曲霉ATCC 12049菌株的pyrG基因、gla A编码序列和调节序列。下面简要描述产生pYG1.2载体的步骤。
使用市售质粒pUC18(Fermentas Inc.,Burlington,Canada)来构建pIGF载体。pUC18载体含有赋予氨苄青霉素抗性的β-内酰胺酶基因。将4.8kb的片段插入pUC18载体中,以产生pIGF载体,该片段含有gla A编码序列(编码葡萄糖淀粉酶的前498个氨基酸)、2.0kb的上游调节序列(gla A的启动子)和2.3kb的下游调节序列(gla A的终止子序列)。插入gla A编码序列来帮助提高靶蛋白的表达水平。图1显示了pIGF载体结构的示意图。所插入的gla A编码序列中包含可用于产生与gla A的融合蛋白的Xba I/Hap I克隆位点(示于图1)。
接着向pIGF载体中插入黑曲霉ATCC12049菌株的pyrG基因,以产生pYG1.2载体。以pAB4.1质粒(该质粒含有来自黑曲霉ATCC9029菌株的pyrG基因,由Institute of Food Research,Norwich,UK提供)为模板作PCR获得含有pyrG基因保守序列的600bp片段,用32P标记该片段,并以其为探针作噬菌斑杂交(Sambrook,J等,A laboratory manual,New York:Cold Spring Harbor Laboratory Press,1989)从ATCC 12049菌株的基因文库中筛选出含pyrG基因的9.8kb的核酸片段,进一步用XhoI消化以获得2.3kb的片段,通过限制性酶谱分析和PCR证实该片段含有ATCC 12049菌株的pyrG基因。
将所得的2.3kb片段与经XhoI消化的pIGF载体之线性化片段连接在一起。将连接产物转化进大肠杆菌(E.coli)DH5α中,并挑取在含有氨苄青霉素的LB平板上生长的克隆。进行PCR以鉴定含有所插入的pyrG基因的阳性克隆。增殖所鉴定的阳性克隆,以提取质粒,并将重组质粒测序以证实pyrG基因的成功插入。将一种证实后的质粒命名为pYG1.2,并用作以下分子克隆研究中的载体。pYG1.2的示意图示于图2。含有pYG1.2质粒的大肠杆菌DH5α菌株(大肠杆菌DH5α/pYG1.2)于2009年7月27日保藏于中国武汉大学的CCTCC。保藏号为CCTCCM 209165。该保藏物将在布达佩斯条约的规定和条件下维持。可通过常规质粒提取法从大肠杆菌DH5α/pYG1.2菌株中回收质粒pYG1.2。
为了将羧酸酯酶基因插入pYG1.2载体中,将纯化的CarE-his基因产物和pYG1.2质粒分别同时用XbaI和HpaI(New Englan Biolabs,Inc.,Ipswich,MA)进行消化,其后在1%琼脂糖凝胶上进行电泳并通过凝胶提取进行纯化。使用T4连接酶(Takara Bio.Inc.,Japan)将消化的CarE-his基因产物与消化的pYG1.2片段连接在一起。
通过以下步骤转化大肠杆菌DH5α:将感受态大肠杆菌DH5α细胞与20μl连接产物混合,在冰上孵育30分钟,42℃热激90秒,在冰上孵育5分钟,其后加入200μl LB培养基并在37℃下在摇床上孵育45分钟。用50μl所得细菌培养物接种含有氨苄青霉素的LB平板,并在37℃下孵育过夜。
通过在与上述相同的PCR反应条件下使用个体大肠杆菌菌落作为模板进行PCR,对在含有氨苄青霉素的LB平板上生长的各个大肠杆菌菌落进行筛选。通过在1%琼脂糖凝胶上进行电泳来表征PCR产物,将显示780bp条带的阳性菌落在含有氨苄青霉素的LB培养基中增殖过夜,之后使用质粒提取试剂盒(天根生化科技(北京)有限公司),根据生产商的说明进行质粒纯化。
通过用XbaI和HpaI消化并随后进行电泳来表征所得的重组质粒。对阳性质粒进行测序,结果证实已将羧酸酯酶基因成功插入pYG1.2质粒中。将该重组质粒命名为pYG1.2-CarE-his,并用于后续的研究。
使用黑曲霉M54菌株来表达羧酸酯酶。黑曲霉M54是一种pyrG缺陷型菌株,其不能在不含尿苷的培养基上生长。通过使黑曲霉ATCC12049菌株暴露于UV照射并筛选pyrG基因缺陷型菌株来获得黑曲霉M54,如上文所述(Liu,Zhongbin等,Construction of pyrGauxotrophic Aspergillus niger strain,Journal of microbiology,2001,21,vol 3,15-16)。该菌株已在2009年6月14日保藏于中国武汉大学的CCTCC。保藏号为CCTCC M 209121。该保藏物将在布达佩斯条约的规定和条件下维持。
使用质粒pYG1.2-CarE-his来转化黑曲霉M54的原生质体。原生质体按以下制备:以4×108的细胞密度用黑曲霉M54悬液接种含有200ml MN+URI培养液的1L瓶,并在30℃下以200rpm在摇床上孵育24小时;收集经处理的细胞并将1g细胞(湿重)悬于10ml冰冷的MM缓冲液中;在30℃下以80rpm轻柔摇动10分钟后向细胞悬液中加入1ml溶壁酶和0.5ml BSA,然后以50rpm摇动2小时;接着在4℃下将悬液以1500rpm离心2分钟,转移出上清液并与NM缓冲液混合,随后在4℃下以2500rpm离心10分钟;将沉淀与冰冷的STC缓冲液混合至总体积50ml,其后在4℃下以2000rpm离心10分钟,获得乳白色原生质体,接着将其悬于1ml冰冷的STC缓冲液中。
分别用pYG1.2-CarE-his质粒、pYG1.2质粒(阳性对照)和空白缓冲液(阴性对照)转化原生质体。在15ml管中将3.0μg质粒DNA与100μl原生质体悬液混合,并在室温下孵育25分钟。将总体积1250μl的PEG溶液逐滴加入质粒与原生质体的混合物中并轻柔混合,随后在室温下孵育20分钟。加入冰冷的STC缓冲液将管充满,其后轻柔混合直至PEG溶液被完全稀释。在4℃下以2000rpm离心10分钟后,将细胞重悬于300μl STC缓冲液中,从其中取出100μl接种到不含尿苷的MN+SORB琼脂培养基平板上,并在30℃下孵育3至5天,以选择能在该选择培养基上生长的转化体。未转化的黑曲霉M54将不能在不含尿苷的培养基上生长,而用pYG1.2-CarE-his和pYG1.2转化的黑曲霉均能在不含尿苷的培养基上生长,表明所述质粒在黑曲霉M54中成功表达。
使用SDS-PAGE来表征羧酸酯酶的表达。在30℃下,将转化有pYG1.2-CarE-his的黑曲霉M54在不含尿苷的MN培养液中以200rpm培养5天。接着取出该培养物的上清液并通过电泳进行分析。平行地对转化有pYG1.2的黑曲霉M54的上清液以及未转化黑曲霉M54的上清液进行分析,作为阴性对照。将15μl上清液与3μl 6×上样SDS-PAGE缓冲液混合并煮沸5分钟,之后上样到SDS-PAGE胶上,所述SDS-PAGE胶由12%的聚丙烯酰胺分离胶和5%的聚丙烯酰胺成层胶组成。在120V下将样品恒压电泳3至4小时。接着在室温下用考马斯亮蓝将胶染色30分钟并洗涤,以显示蛋白质条带。在转化有pYG1.2-CarE-his的黑曲霉M54的上清液的泳道中观察到了29.0KD的独特蛋白质条带,而在阴性对照中则未观察到(图3)。
通过Western印迹进一步证实羧酸酯酶的表达。通过SDS-PAGE电泳对转化有pYG1.2-CarE-his的黑曲霉M54的上清液进行分离,并通过施加0.65mA/cm2强度的电流而将胶上的蛋白质向聚偏二氟乙烯(PVDF)膜上转移1小时。用脱脂乳封闭PVDF膜,其后在4℃下与适当稀释的抗his抗体(IgG型抗体)孵育过夜,并在室温下与二抗(抗IgG抗体)孵育90分钟。用PBST洗涤后,使PVDF膜与X光胶片接触适当的时间,其后显影并照相。使用未转化黑曲霉M54的上清液和转化有pYG1.2载体的黑曲霉M54的上清液作为阴性对照。使用带His标签的组蛋白作为阳性对照。在pYG1.2-CarE-his转化体中观察到了29KD的独特条带,而在阴性对照中则没有检测到条带(图4),这证实了羧酸酯酶在pYG1.2-CarE-his转化体中的表达。
实施例2:在巴斯德毕赤酵母GS115中表达羧酸酯酶
羧酸酯酶的DNA序列含有内部的XhoI位点CTCGAG,使用PCR,通过位点特异性沉默突变将其改变成CTCGAA(以双线标出)。分别使用SEQ ID NO:36-37所示的引物P3和P4以及SEQ ID NO:38-39所示的引物P5和P6进行两个独立的PCR反应,其中P3含有Xho I位点、KEX2位点和KEX1位点,P4和P5含有沉默突变,P6含有EcoRI位点和His-tag的六个组氨酸密码子(His-tag的编码序列以波浪线标出)。PCR条件与实施例1中所述相同。在1%琼脂糖凝胶上电泳后,通过凝胶提取来纯化来自这两个独立反应的PCR产物。来自这两个反应的PCR产物具有重叠序列(显示为表5中P4和P5的下划线部分),这将允许这两种产物在一种产物的5′末端和另一产物的3′末端彼此结合。因此,将这两种纯化的PCR产物混合在一起进行第三轮PCR,以产生将这两个模板组合在一起的PCR产物。将第三轮PCR的产物在1%琼脂糖凝胶上进行电泳,切下约800bp的条带并通过凝胶提取进行纯化。
表5.引物P3、P4、P5和P6以及CarE-his基因的核苷酸序列。
| 名称 | 序列 |
| P3 | 5’CGCTCGAGAAAAGAGAGGCTGAAGCTATGATGAAAATTGTTCCG 3’(SEQ ID NO:36) |
| P4 | 5’CGCGCGCATATTCGAGGACGCCTTCGTAC 3’(SEQ IDNO:37) |
| P5 | 5’CGTCCTCGAATATGCGCGCGAGTATAAAA 3’(SEQ IDNO:38) |
| P6 | 5’CGGAATTCTTAATGGTGATGGTGATGGTGCCAATCTAAC |
将纯化的最终PCR产物和pBluescriptII-SKM(Stratagene,LaJolla,CA)分别同时使用XhoI和EcoRI进行消化,其后在1%琼脂糖凝胶中电泳并通过凝胶提取进行纯化。pBluescriptII-SKM含有赋予氨苄青霉素抗性的β-内酰胺酶基因。在将消化的PCR产物和消化的pBluescriptII-SKM片段连接之后,按照与实施例1所述相同的方法用此连接产物转化大肠杆菌DH5α。使用PCR对在含有氨苄青霉素的LB平板上生长的各个大肠杆菌菌落进行筛选,并通过PCR将所鉴定的阳性菌落增殖,其后纯化质粒。通过用XhoI和EcoRI消化纯化的质粒其后进行电泳来证实靶基因的插入。将所得的重组质粒命名为pBluescriptII-SKM-CarE-his,并用于以下的分子克隆实验。
用XhoI和EcoRI消化质粒pBluescriptII-SKM-CarE-his,并与经相同限制性酶消化的pPIC9质粒(Invitrogen)的片段连接。使用此连接产物转化大肠杆菌DH5α,以获得按照与实施例1所述相同方法通过PCR和XhoI及EcoRI酶消化鉴定的阳性重组体菌落。纯化来自阳性菌落的质粒并命名为pPIC9-CarE-his。
用BamHI和EcoRI消化质粒pPIC9-CarE-his,并将含有羧酸酯酶基因的片段与经相同限制性酶消化的pPIC9K质粒(Invitrogen)片段连接。pPIC9K质粒带有卡那霉素抗性基因,其赋予宿主细胞对卡那霉素及与卡那霉素具有结构相似性的某些抗生素的抗性。使用连接产物转化大肠杆菌DH5α,以获得按照与实施例1所述相同方法通过PCR和BamHI及EcoRI酶消化而鉴定的阳性重组体菌落。电泳结果显示出1100bp的靶标条带,其对应于同时含有羧酸酯酶基因以及从pPIC9质粒中引入的α-因子分泌信号序列的消化片段。将该重组质粒命名为pPIC9K-CarE-his并用于后续的研究。
用BglII消化pPIC9K-CarE-his质粒,接着纯化以获得线性化的质粒DNA。将巴斯德毕赤酵母GS115菌株的80μl悬液与所述线性化的质粒DNA混合,并在冰冷的试管中平衡5分钟,其后以1500v、25μF和200Ω进行电泳,并立即加入1ml冰冷的1M山梨醇。将混合物在冰上放置2至3小时,接着接种到RDB平板上。将平板在30℃下孵育3至5天。
从RDB平板上洗下所有菌落,并用YPD培养基稀释到约106个细胞/ml。将100μl稀释的巴斯德毕赤酵母细胞接种到补充有不同浓度(0.25mg/ml、1mg/ml和2mg/ml)G418(购自Invitrogen,是一种与卡那霉素结构相似的氨基糖苷抗生素)的YPD平板上,其后在30℃下孵育。
挑取在补充有最高浓度G418的平板上生长的各个菌落,并分别接种到3ml MGY培养基中,接着在30℃下孵育并以250rpm摇动,直至OD600达到2至6。在室温下以1500g离心5分钟后,将沉淀重悬于3ml BMMY培养基(补充有5‰甲醇)中,并在30℃下孵育并以250rpm摇动,以诱导靶基因表达。每24小时向培养物中加入甲醇(5‰),同时对培养物采样。96小时后,对不同时间间隔的样品进行SDS-PAGE分析。
按照与实施例1所述相同的方法,使用SDS-PAGE电泳和Western印迹对样品的上清液进行分析。平行地对转化有pPIC9K载体的巴斯德毕赤酵母GS115的上清液进行分析,作为阴性对照。从电泳和Western印迹获得的结果均显示,在转化有pPIC9K-CarE-his的巴斯德毕赤酵母GS115的上清液中存在29.0KD的独特蛋白质条带,而在阴性对照中则不存在(图4和图5)。
挑取相对高水平地表达羧酸酯酶的pPIC9K-CarE-his转化体,增殖并保存于-80℃。
实施例3:表征羧酸酯酶的活性
如下测定羧酸酯酶活性:将α-萘基乙酸酯分别与转化的黑曲霉M54的上清液和转化的巴斯德毕赤酵母GS115的上清液在37℃ pH 7.0孵育10分钟,其后立即终止反应。在600nm处测量吸光度。酶活性单位计算为每分钟从0.03M α-萘基乙酸酯溶液中释放1μmol α-萘酚所需的酶量。
为了确定酶产生最高的孵育时间,在孵育的第1天、第2天、第3天、第4天、第5天和第6天在含有羧酸酯酶的培养物上清液中取样,接着测量羧酸酯酶活性。如图6所示,重组羧酸酯酶活性在已转化黑曲霉M54中在孵育5天后达到峰值,在已转化巴斯德毕赤酵母GS115中在孵育4天后达到峰值,其后开始下降。通过凝胶成像分析系统测得,在转化的黑曲霉M54的第5天培养中所产生羧酸酯酶的量为15.3mg/ml,在转化的巴斯德毕赤酵母GS115的第4天培养物中所产生的量为30.7mg/ml。结果提示,在转化的黑曲霉M54和转化的巴斯德毕赤酵母GS115中,对于羧酸酯酶的产生而言合适的孵育时间分别为5天和4天。转化的巴斯德毕赤酵母GS115中重组羧酸酯酶的产量高于转化的黑曲霉M54中的。
为了测试羧酸酯酶活性的pH依赖性,在37℃下和pH值为5至11的缓冲液中,将含有由转化的黑曲霉M54和转化的巴斯德毕赤酵母GS115表达的羧酸酯酶的培养物上清液与底物一起孵育30分钟,其后进行酶促反应和活性测定。结果显示,重组羧酸酯酶在pH 8.0下显示最高的酶活性(将其定义为100%相对酶活性),在6.0至8.5的pH范围内显示高于75%的相对酶活性,当pH低于6.0或高于8.5时活性下降(图7)。
为了测定就羧酸酯酶活性而言合适的反应温度,在20℃至80℃的温度下,将含有羧酸酯酶的培养物上清液与底物溶液在pH 7.0下孵育30分钟,其后测定酶活性。转化的黑曲霉M54和转化的巴斯德毕赤酵母GS115所表达的羧酸酯酶均在60℃下显示出最高的酶活性(将其定义为100%相对酶活性),在70℃和80℃的更高温度下活性降低(图9)。
为了测试重组羧酸酯酶的热稳定性,将含有转化的黑曲霉M54和转化的巴斯德毕赤酵母GS115所表达羧酸酯酶的培养物上清液分别在40℃至80℃的温度下孵育10分钟和30分钟,随后在冰浴中冷却。将底物溶液加入酶孵育物中,随后在37℃下使酶促反应进行10分钟,并进行活性测定。结果显示,在60℃孵育10分钟后,重组羧酸酯酶保留近100%的酶活性,在70℃孵育30分钟后则表现出约60%的相对酶活性(图8)。
实施例4:在多形汉逊酵母中表达来自嗜热地芽孢杆菌HTA426菌株的羧酸酯酶。
使用从已知DNA序列中设计的引物,通过PCR从嗜热地芽孢杆菌HTA426菌株的cDNA文库中克隆来自嗜热地芽孢杆菌HTA426菌株的羧酸酯酶(登记号BA000043,SEQ ID NO:10)。纯化PCR产物并用适当的限制性酶消化,接着与pHIPX4载体连接(Gietl等,Mutationalanalysis of the N-terminal topogenic signal of watermelon glyoxysomalmalate dehydrogenase using the heterologous host Hansenula polymorpha.Proc Natl Acad Sci USA 1994,vol 91,31513155)。在大肠杆菌DH5α中增殖已插入羧酸酯酶基因的重组pHIPX4质粒,接着线性化以使用电穿孔转化多形汉逊酵母菌株leu 1.1(Gleeson等,Transformation of themethylotrophic yeast Hansenula polymorpha.J Gen Microbiol 1986,vol,132,3459-65)。
使用无亮氨酸的培养基筛选已转化的汉逊酵母。对在无亮氨酸培养基平板上生长的单克隆进行孵育,并以α-萘基乙酸酯作为底物,使用酶测定来表征羧酸酯酶的表达。
实施例5:在米曲霉中表达来自高温烷烃芽孢杆菌菌株的羧酸酯酶。
使用从已知DNA序列中设计的引物,通过PCR从高温烷烃芽孢杆菌菌株的cDNA文库中克隆来自高温烷烃芽孢杆菌菌株的羧酸酯酶(登记号AF327065,SEQ ID NO:4)。纯化PCR产物并用适当的限制性酶消化,接着与带有精氨酸合成基因的pSal23载体连接(Gomi等,Integrativetransformation of Aspergillus oryzae with a plasmid containing theAspergillus nidulans argB gene.Agric.Biol.Chem.1987,vol 51,2549-2555)。在大肠杆菌DH5α中增殖已插入羧酸酯酶基因的重组pSal23质粒,接着线性化以使用米曲霉M-2-3原生质体转化缺少精氨酸合成基因的米曲霉M-2-3(Ozeki等,Construction of a promoter probevector autonomously maintained in Aspergillus and characterization of promoter regions derived from A.niger and A.oryzae genomes.Biosci.Biotech.Biochem.1996,vol 60,383-389)。
使用无精氨酸的培养基筛选已转化的米曲霉。对在无精氨酸培养基平板上生长的单克隆进行孵育,并以α-萘基乙酸酯作为底物,使用酶测定来表征羧酸酯酶的表达。
实施例6:纯化所表达的羧酸酯酶
将转化有pPIC9K-CarE-his的巴斯德毕赤酵母GS115在30℃下培养4天。收获培养基并以0.2μm滤器过滤,之后添加NaAzide至终浓度为0.01%。向每升所收获的培养基中加入100ml甘油、30ml 5MNaCl、10ml 1M咪唑和50ml Ni-NTA superflow树脂,其后在室温下以150rpm进行回转运动(Gyro-rotary motion)30至40分钟。以3750rpm将树脂珠离心10分钟。将珠子装入柱中,并用含有50mM Tris(pH8.0)、300mM NaCl、10%甘油和10mM咪唑的洗涤缓冲液洗柱,直至280nm的UV吸收稳定。接着用含有50mM Tris pH 8.0、300mM NaCl、10%甘油和250mM咪唑的洗脱缓冲液将柱进行洗脱,直至洗脱液中检测不到蛋白质。通过电泳分析洗脱的级分,以鉴定出含有单个靶蛋白条带的级分。收集靶标级分并测试其量和酶活性。
一般陈述
对于本文使用的基本上任何复数和/或单数术语,本领域技术人员能够就本文和/或本申请而言适当地将复数解读为单数和/或将单数解读为复数。
本领域技术人员将会理解,一般而言,本文中,尤其是所附权利要求书(例如,所附权利要求书的正文)中所用的术语通常意为“开放式”术语(例如,术语“包括”应该解释为“包括但不限于”,术语“具有”应该解释为“至少具有”,等等)。本领域技术人员还应该明白,有意引述权利要求的具体数目,则该意图会在权利要求中明确地表述出来,如果没有这种表述,则不存在该意图。例如,为了便于理解,下面所附的权利要求书中可使用引导性短语“至少一项”及“一项或多项”来引述权利要求。然而,不应将使用这种引导性短语解释成通过不定冠词“一”引述权利要求即意味着将包含该权利要求引述的任何特定权利要求限定为仅包含一项该引述的实施方案,甚至在同一权利要求包含引导性短语“一项或多项”或“至少一项”以及诸如“一”之列的不定冠词(例如,“一”应该解释成意味着“至少一项”或“一项或多项”)时也是如此;这同样适用于使用“所述”、“该”等定冠词来引述权利要求的情况。此外,即使明确表述了所引述权利要求的具体数目,本领域技术人员也应该认识到,这种表述应该解释成意指至少是所表述的数目(例如,只说“两项引述”,而没有其它修饰语,则意为至少两项引述,或者说两项或更多项引述)。
另外,当以马库什组的形式描述本发明的特征或方面时,本领域技术人员会公认,本文也因此描述了该马库什组的任何个体成员或亚组成员。
本领域技术人员会理解,对于任何及全部目的,例如就提供书面描述而言,本文公开的所有范围还包括任何及全部可能的小范围及其小范围的组合。任何列出的范围都可以很容易地认为是对同一范围分成相等的至少两部分、三部分、四部分、五部分、十部分等而进行了描述。作为非限定性的实例,本文所述的每个范围都可以容易地分为前三分之一、中间三分之一和后三分之一等。本领域技术人员还会理解,所有诸如“多达”、“至少”、“多于”、“少于”等的词语均包括所提到的数值,并指可以如上所述分成小范围的范围。最后,本领域技术人员会理解,范围包括每个个体成员。因此,例如,含有1-3个细胞的组是指含有1个、2个或3个细胞的组,依此类推。
本发明不仅限于本申请中所述的具体实施方案,这些实施方案旨在举例说明本发明的多个方面。对于本领域技术人员而言显而易见的是,可以进行许多修改和改变,而不偏离其构思和范围。根据上文的描述,本发明范围内除本文所列举以外的功能上等同的方法和组合物对于本领域技术人员而言将是很明显的。这些修改和改变旨在落入所附权利要求书的范围内。本发明仅受到所附权利要求书以及这些权利要求所赋予等同方案的全部范围的限制。应当理解,本发明不受限于具体的方法、试剂、化合物组合物或生物体系,它们当然是可以变化的。还应当理解,本文使用的术语仅用于描述具体的实施方案,而并非旨在限制。
本文中公开了多个方面和实施方式,而其它方面和实施方式对于本领域技术人员将是显而易见的。本文中公开的多个方面和实施方式的目的在于举例说明,并非旨在限定,真正的范围和精神由下面的权利要求书表明。
序列表
<110>同济大学
<120>羧酸酯酶的重组表达
<130>32052.6509.US00
<160>40
<170>PatentIn version 3.5
<210>1
<211>246
<212>PRT
<213>嗜热地芽孢杆菌(Geobacillus kaustophilus)
<400>1
Met Lys Ile Val Pro Pro Lys Pro Phe Phe Phe Glu Ala Gly Glu Arg
1 5 10 15
Ala Val Leu Leu Leu His Gly Phe Thr Gly Asn Ser Ala Asp Val Arg
20 25 30
Met Leu Gly Arg Phe Leu Glu Ser Lys Gly Tyr Thr Cys His Ala Pro
35 40 45
Ile Tyr Lys Gly His Gly Val Pro Pro Glu Glu Leu Val His Thr Gly
50 55 60
Pro Asp Asp Trp Trp Gln Asp Val Met Asn Gly Tyr Gln Phe Leu Lys
65 70 75 80
Asn Lys Gly Tyr Glu Lys Ile Ala Val Ala Gly Leu Ser Leu Gly Gly
85 90 95
Val Phe Ser Leu Lys Leu Gly Tyr Thr Val Pro Ile Gln Gly Ile Val
100 105 110
Thr Met Cys Ala Pro Met Tyr Ile Lys Ser Glu Glu Thr Met Tyr Glu
115 120 125
Gly Val Leu Glu Tyr Ala Arg Glu Tyr Lys Lys Arg Glu Gly Lys Ser
130 135 140
Glu Glu Gln Ile Glu Gln Glu Met Glu Arg Phe Lys Gln Thr Pro Met
145 150 155 160
Lys Thr Leu Lys Ala Leu Gln Glu Leu Ile Ala Asp Val Arg Ala His
165 170 175
Leu Asp Leu Val Tyr Ala Pro Thr Phe Val Val Gln Ala Arg His Asp
180 185 190
Glu Met Ile Asn Pro Asp Ser Ala Asn Ile Ile Tyr Asn Glu Ile Glu
195 200 205
Ser Pro Val Lys Gln Ile Lys Trp Tyr Glu Gln Ser Gly His Val Ile
210 215 220
Thr Leu Asp Gln Glu Lys Asp Gln Leu His Glu Asp Ile Tyr Ala Phe
225 230 235 240
Leu Glu Ser Leu Asp Trp
245
<210>2
<211>741
<212>DNA
<213>嗜热地芽孢杆菌
<400>2
ttaccaatct aacgattcaa gaaatgcata aatatcttca tgcagctgat ctttttcttg 60
atcaagcgta atcacatggc ctgattgctc ataccatttg atttgtttga ccggcgattc 120
aatttcgtta taaatgatgt tcgcgctgtc tggattgatc atctcatcat ggcgcgcttg 180
gacgacgaac gtcggtgcat aaaccaaatc aaggtgggcg cgcacatcgg caatgagttc 240
ttgcaaggct ttcaacgtct tcatcggcgt ttgtttgaac cgttccattt cctgttcgat 300
ttgttcctct gatttccctt cccgcttttt atactcgcgc gcatactcga gcacaccttc 360
gtacatcgtt tcttcgcttt tgatgtacat cggcgcgcac atcgtcacaa tgccttgtat 420
aggtacagtg tagcctaatt tgagagaaaa tacgcctcca agcgacaatc cagccacggc 480
aattttttcg tagcctttgt ttttcaaaaa ctgatagccg ttcatgacgt cttgccacca 540
atcatccggt ccggtgtgga cgagctcttc cggcggcacg ccatgccctt tgtaaatcgg 600
agcgtggcac gtataccctt tcgattccaa gaatcgccca agcatccgaa cgtcggcgga 660
attgccggta aacccatgca aaagcagcac cgcccgctcc ccggcttcaa agaaaaacgg 720
cttcggcgga acaattttca t 741
<210>3
<211>246
<212>PRT
<213>高温烷烃地芽孢杆菌(Geobacillus thermoleovorans)
<400>3
Met Met Lys Ile Val Pro Pro Lys Pro Phe Phe Phe Glu Ala Gly Glu
1 5 10 15
Arg Ala Val Leu Leu Leu His Gly Phe Thr Gly Asn Ser Ala Asp Val
20 25 30
Arg Met Leu Gly Arg Phe Leu Glu Ser Lys Gly Tyr Thr Cys His Ala
35 40 45
Pro Ile Thr Lys Gly Met Val Pro Pro Glu Glu Leu Val His Thr Gly
50 55 60
Pro Asp Asp Trp Trp Gln Asp Val Met Asn Gly Tyr Gln Phe Leu Lys
65 70 75 80
Asn Lys Gly Tyr Glu Lys Ile Ala Val Ala Gly Leu Ser Leu Gly Gly
85 90 95
Val Phe Ser Leu Lys Leu Gly Tyr Thr Val Pro Ile Glu Gly Ile Val
100 105 110
Thr Met Cys Ala Pro Met Tyr Ile Lys Ser Glu Glu Thr Met Tyr Glu
115 120 125
Gly Val Leu Glu Tyr Ala Arg Glu Tyr Lys Lys Arg Glu Gly Lys Ser
130 135 140
Glu Glu Gln Ile Glu Gln Glu Met Glu Arg Phe Lys Gln Thr Pro Met
145 150 155 160
Lys Thr Leu Lys Ala Leu Gln Glu Leu Ile Ala Asp Val Arg Ala His
165 170 175
Leu Asp Leu Val Tyr Ala Arg Thr Phe Val Val Gln Ala Arg His Asp
180 185 190
Lys Met Ile Asn Pro Asp Ser Ala Asn Ile Ile Tyr Asn Glu Ile Glu
195 200 205
Ser Pro Val Lys Gln Ile Lys Trp Tyr Glu Gln Ser Gly His Val Ile
210 215 220
Thr Leu Asp Gln Glu Lys Asp Gln Leu His Glu Asp Ile Tyr Ala Phe
225 230 235 240
Leu Glu Ser Leu Asp Trp
245
<210>4
<211>741
<212>DNA
<213>高温烷烃地芽孢杆菌
<400>4
atgatgaaaa ttgttccgcc gaagccgttt ttctttgaag ccggggagcg ggcggtgctg 60
cttttgcatg ggtttaccgg caattccgcc gacgttcgga tgcttgggcg attcttggaa 120
tcgaaagggt atacgtgcca cgctccgatt acaaagggca tggtgccgcc ggaagagctc 180
gtccacaccg gaccggatga ttggtggcaa gacgtcatga acggctatca gtttttgaaa 240
aacaaaggct acgaaaaaat tgccgtggct ggattgtctc ttggaggcgt attttctctc 300
aaattaggct acactgtacc tatagaaggc attgtgacga tgtgcgcgcc gatgtacatc 360
aaaagcgaag aaacgatgta cgaaggtgtg ctcgagtatg cgcgcgagta taaaaagcgg 420
gaagggaaat cagaggaaca aatcgaacag gaaatggaac ggttcaaaca aacgccgatg 480
aagacgttga aagccttgca agaactcatt gccgatgtgc gcgcccacct tgatttggtt 540
tatgcacgca cgttcgtcgt ccaagcgcgc catgataaga tgatcaatcc agacagcgcg 600
aacatcattt ataacgaaat tgaatcgccg gtcaaacaaa tcaaatggta tgagcaatca 660
ggccatgtga ttacgcttga tcaagaaaaa gatcagctgc atgaagatat ttatgcattt 720
cttgaatcgt tagattggta a 741
<210>5
<211>256
<212>PRT
<213>肠道沙门氏菌(Salmonella enterica)
<400>5
Met Asn Asp Ile Trp Trp Gln Thr Tyr Gly Glu Gly Ash Cys His Leu
1 5 10 15
Val Leu Leu His Gly Trp Gly Leu Asn Ala Glu Val Trp His Cys Ile
20 25 30
Arg Glu Glu Leu Gly Ser His Phe Thr Leu His Leu Val Asp Leu Pro
35 40 45
Gly Tyr Gly Arg Ser Ser Gly Phe Gly Ala Met Thr Leu Glu Glu Met
50 55 60
Thr Ala Gln Val Ala Lys Asn Ala Pro Asp Gln Ala Ile Trp Leu Gly
65 70 75 80
Trp Ser Leu Gly Gly Leu Val Ala Ser Gln Met Ala Leu Thr His Pro
85 90 95
Glu Arg Val Gln Ala Leu Val Thr Val Ala Ser Ser Pro Cys Phe Ser
100 105 110
Ala Arg Glu Gly Trp Pro Gly Ile Lys Pro Glu Ile Leu Gly Gly Phe
115 120 125
Gln Gln Gln Leu Ser Asp Asp Phe Gln Arg Thr Val Glu Arg Phe Leu
130 135 140
Ala Leu Gln Thr Leu Gly Thr Glu Thr Ala Arg Gln Asp Ala Arg Thr
145 150 155 160
Leu Lys Ser Val Val Leu Ala Gln Pro Met Pro Asp Val Glu Val Leu
165 170 175
Asn Gly Gly Leu Glu Ile Leu Lys Thr Val Asp Leu Arg Glu Ala Leu
180 185 190
Lys Asn Val Asn Met Pro Phe Leu Arg Leu Tyr Gly Tyr Leu Asp Gly
195 200 205
Leu Val Pro Arg Lys Ile Ala Pro Leu Leu Asp Thr Leu Trp Pro His
210 215 220
Ser Thr Ser Gln Ile Met Ala Lys Ala Ala His Ala Pro Phe Ile Ser
225 230 235 240
His Pro Ala Ala Phe Cys Gln Ala Leu Met Thr Leu Lys Ser Ser Leu
245 250 255
<210>6
<211>771
<212>DNA
<213>肠道沙门氏菌
<400>6
ttacagcgat gattttagcg tcatcagcgc ctgacaaaac gccgccggat gcgagataaa 60
cggcgcatgg gccgccttcg ccattatctg tgatgtactg tgcggccata acgtatcgag 120
caaaggcgcg attttacgcg gcaccagacc gtccagataa ccatacaaac gcaaaaacgg 180
catgttcaca tttttaagcg cttctcgtag atcgaccgtt tttaagattt ccagtccgcc 240
attgagcacc tctacatccg gcataggctg cgccagcact acgcttttta aggtgcgggc 300
atcctgacgc gccgtctccg tccctaacgt ttgcagcgcc agaaaacgct ccaccgtgcg 360
ctgaaaatcg tcgctaagct gctgctggaa tccgccgagg atttctggtt ttattcccgg 420
ccacccctca cgcgcgctaa agcatggcga agaggcgact gtcaccagcg cctgaacgcg 480
ttcagggtgg gtgagcgcca tctgactcgc caccaggccg cccaggctcc agccaagcca 540
gatagcctgg tccggcgcgt ttttcgctac ctgcgccgtc atctcttcaa gcgtcatggc 600
gccaaacccc gagctgcgac catagcccgg caagtcgacc agatgcagcg taaaatgcga 660
gccgagttcc tcgcgaatgc aatgccatac ctccgcgttc aatccccatc cgtgcagcag 720
cacaagatga caatttccct cgccgtaggt ctgccaccag atgtcattca t 771
<210>7
<211>541
<212>PRT
<213>烟曲霉(Aspergillus fumigatus)
<400>7
Met Val Ile Ser Thr Lys Tyr Ile Phe Ala Leu Cys Val Leu Leu Leu
1 5 10 15
Thr Phe Ser Leu Ser Ser Ala Tyr Glu Asp Pro Leu Val Glu Leu Asp
20 25 30
Tyr Gly Gln Phe Gln Gly Lys Tyr Asp Ser Thr Tyr Asn Leu Ser Tyr
35 40 45
Phe Arg Lys Ile Pro Phe Ala Ala Pro Pro Thr Gly Glu Asn Arg Phe
50 55 60
Arg Ala Pro Gln Pro Pro Leu Arg Ile Thr His Gly Val Tyr Asp Thr
65 70 75 80
Asp Gln Asp Phe Asp Met Cys Pro Gln Arg Thr Val Asn Gly Ser Glu
85 90 95
Asp Cys Leu Tyr Leu Gly Leu Phe Ser Arg Pro Trp Asp Ala Thr Val
100 105 110
Ala Pro Ser Ser Ala Ser Arg Pro Val Leu Val Val Phe Tyr Gly Gly
115 120 125
Gly Phe Ile Gln Gly Ser Ala Ser Phe Thr Leu Pro Pro Ser Ser Tyr
130 135 140
Pro Ile Leu Asn Val Thr Glu Leu Asn Asp Tyr Val Val Ile Tyr Pro
145 150 155 160
Asn Tyr Arg Val Asn Ala Phe Gly Phe Leu Pro Gly Lys Ala Ile Lys
165 170 175
Arg Ser Pro Thr Ser Asp Leu Asn Pro Gly Leu Leu Asp Gln Gln Tyr
180 185 190
Val Leu Lys Trp Val Gln Lys Tyr Ile His His Phe Gly Gly Asp Pro
195 200 205
Arg Asn Val Thr Ile Trp Gly Gln Ser Ala Gly Ala Gly Ser Val Val
210 215 220
Ala Gln Val Leu Ala Asn Gly Arg Gly Asn Gln Pro Lys Leu Phe Asp
225 230 235 240
Lys Ala Leu Val Ser Ser Pro Phe Trp Pro Lys Thr Tyr Ala Tyr Asp
245 250 255
Ala Pro Glu Ala Glu Ala Ile Tyr Asp Gln Leu Val Thr Leu Thr Gly
260 265 270
Cys Ala Asn Ala Thr Asp Ser Leu Ala Cys Leu Lys Ser Val Asp Val
275 280 285
Gln Thr Ile Arg Asp Ala Asn Leu Ile Ile Ser Ala Ser His Thr Tyr
290 295 300
Asn Thr Ser Ser Tyr Thr Trp Ala Pro Val Ile Asp Gly Glu Phe Leu
305 310 315 320
Gln Glu Ser Leu Thr Thr Ala Val Ala Arg Arg Lys Ile Lys Thr His
325 330 335
Phe Val Phe Gly Met Tyr Asn Thr His Glu Gly Glu Asn Phe Leu Pro
340 345 350
Ser Gly Leu Gly Lys Thr Ala Thr Thr Ala Gly Phe Asn Ser Ser Ala
355 360 365
Ala Ser Phe His Thr Trp Leu Thr Gly Phe Leu Pro Gly Leu Ser Pro
370 375 380
Lys His Ile Ala Leu Leu Glu Thr Lys Tyr Tyr Pro Pro Thr Gly Glu
385 390 395 400
Thr Glu Thr Ile Asp Leu Tyr Asn Thr Thr Leu Val Arg Ala Gly Leu
405 410 415
Val Tyr Arg Asp Leu Val Leu Ala Cys Pro Ala Tyr Trp Leu Thr Ser
420 425 430
Ala Ala Arg Arg Arg Gly Tyr Leu Gly Glu Tyr Thr Ile Ser Pro Ala
435 440 445
Lys His Ala Ser Asp Thr Ile Tyr Trp Asn Arg Val Asn Pro Ile Gln
450 455 460
Gln Thr Asp Pro Leu Ile Tyr Asp Gly Phe Ala Gly Ala Phe Gly Ser
465 470 475 480
Phe Phe Gln Thr Gly Asp Pro Asn Ala His Lys Leu Thr Asn Ala Ser
485 490 495
Glu Lys Gly Val Pro Val Leu Glu Lys Thr Gly Glu Glu Trp Val Ile
500 505 510
Ala Pro Asp Gly Phe Ala Thr Ala Gln Val Ser Phe Leu Lys Glu Arg
515 520 525
Cys Asp Phe Trp Glu Ser Val Gly Glu Arg Val Pro Val
530 535 540
<210>8
<211>1626
<212>DNA
<213>烟曲霉
<400>8
atggtgatct caacgaagta tatatttgcc ctttgcgtcc tcttgctgac cttctcactc 60
tctagcgcct acgaagatcc cctcgtcgag ctcgactatg ggcagttcca gggcaaatat 120
gactctacgt ataatctctc atacttccgc aagatcccct ttgcggcgcc tccaacgggg 180
gagaaccggt ttagagcccc tcagccacct ctaaggatca cgcatggcgt ctatgacact 240
gatcaggact ttgacatgtg cccgcaacgc acggtcaatg gctccgaaga ctgcctctac 300
cttggcctgt tctcacgacc gtgggatgct acagtggctc cctcgtctgc ttctagacca 360
gtcctggtag tcttctacgg tggtggcttc atccaaggca gcgcttcgtt tacactacct 420
ccgtcctcat atccaatcct gaacgtcacc gagctgaatg actatgtggt catctacccc 480
aactaccggg tcaatgcatt tggtttcctt ccgggcaagg cgatcaagcg atctccaacg 540
tctgatctca accccggcct cttggaccag cagtacgttc tcaagtgggt gcagaaatac 600
attcaccact tcggcggtga ccctcgcaac gtcacgatct ggggccaatc cgccggcgcc 660
ggctcagtgg ttgcgcaggt tctcgccaac ggacgaggca accaacccaa gctcttcgac 720
aaagcgctcg tcagctcgcc cttctggcca aagacctacg cctacgacgc ccccgaagca 780
gaagccatct acgaccagct tgtcactctc accggctgcg ccaatgccac cgactccctc 840
gcctgcctga aatccgtcga cgtccagacc atccgcgacg caaacctcat catcagcgcc 900
agccacacct acaacacaag ctcctacacc tgggcccccg tcatcgacgg cgaattcctc 960
caagaatccc tcaccaccgc cgtcgcccgc cgcaaaatca aaacccactt cgtcttcggc 1020
atgtacaaca cccacgaggg cgagaacttc ctcccctccg gcctgggcaa gaccgctaca 1080
accgctgggt tcaactcctc tgctgctagc ttccacacct ggctgacggg cttcctgccg 1140
ggtctctcgc ccaagcacat cgccctcctc gaaaccaagt actacccgcc caccggcgaa 1200
acagagacaa tcgacctgta caacacgacg ctcgtccgcg cgggcctggt ctacagggat 1260
cttgtcctcg cctgtccggc gtactggctt acctcggccg caagacggag gggctatcta 1320
ggtgaatata cgatttcgcc ggctaagcac gcgagcgata ccatctattg gaaccgagtg 1380
aacccgatcc agcagactga tccactgatt tatgacggct tcgcaggcgc ttttgggagt 1440
ttcttccaga cgggcgatcc gaatgcgcat aagctgacga atgcgtcgga gaagggtgtg 1500
ccggttctcg agaagaccgg ggaggagtgg gtgattgctc cggatgggtt tgcaaccgcg 1560
caggtgtcgt ttttaaagga gaggtgtgat ttctgggagt cggtggggga gcgggttcct 1620
gtctga 1626
<210>9
<211>247
<212>PRT
<213>嗜热脂肪地芽孢杆菌(Geobacillus stearothermophilus)
<400>9
Met Met Lys Ile Val Pro Pro Lys Pro Phe Phe Phe Glu Ala Gly Glu
1 5 10 15
Arg Ala Val Leu Leu Leu His Gly Phe Thr Gly Asn Ser Ala Asp Val
20 25 30
Arg Met Leu Gly Arg Phe Leu Glu Ser Lys Gly Tyr Thr Cys His Ala
35 40 45
Pro Ile Tyr Lys Gly His Gly Val Pro Pro Glu Glu Leu Val His Thr
50 55 60
Gly Pro Asp Asp Trp Trp Gln Asp Val Met Asn Gly Tyr Glu Phe Leu
65 70 75 80
Lys Asn Lys Gly Tyr Glu Lys Ile Ala Val Ala Gly Leu Ser Leu Gly
85 90 95
Gly Val Phe Ser Leu Lys Leu Gly Tyr Thr Val Pro Ile Glu Gly Ile
100 105 110
Val Thr Met Cys Ala Pro Met Tyr Ile Lys Ser Glu Glu Thr Met Tyr
115 120 125
Glu Gly Val Leu Glu Tyr Ala Arg Glu Tyr Lys Lys Arg Glu Gly Lys
130 135 140
Ser Glu Glu Gln Ile Glu Gln Glu Met Glu Lys Phe Lys Gln Thr Pro
145 150 155 160
Met Lys Thr Leu Lys Ala Leu Gln Glu Leu Ile Ala Asp Val Arg Asp
165 170 175
His Leu Asp Leu Ile Tyr Ala Pro Thr Phe Val Val Gln Ala Arg His
180 185 190
Asp Glu Met Ile Asn Pro Asp Ser Ala Asn Ile Ile Tyr Asn Glu Ile
195 200 205
Glu Ser Pro Val Lys Gln Ile Lys Trp Tyr Glu Gln Ser Gly His Val
210 215 220
Ile Thr Leu Asp Gln Glu Lys Asp Gln Leu His Glu Asp Ile Tyr Ala
225 230 235 240
Phe Leu Glu Ser Leu Asp Trp
245
<210>10
<211>744
<212>DNA
<213>嗜热脂肪地芽孢杆菌
<400>10
atgatgaaaa ttgttccgcc gaagccgttt ttctttgaag ccggggagcg ggcggtgctg 60
ctgttgcatg ggtttaccgg caattccgct gatgttcgga tgctcgggcg ttttttagaa 120
tccaaaggct atacgtgcca tgcgcctatt tacaaaggac acggcgtgcc gcctgaggag 180
ctcgtccaca ccgggccgga tgactggtgg caagatgtca tgaacggcta cgagtttttg 240
aaaaacaagg gctacgaaaa aatcgccgtc gccggactgt cgcttggagg cgtattttca 300
ttgaaattag gttacactgt acctatagag ggcattgtga cgatgtgcgc gccgatgtac 360
atcaaaagcg aggaaacgat gtacgaaggc gtgctcgagt atgcgcgcga gtataaaaag 420
cgggaaggaa aatcagagga gcagatcgaa caggagatgg agaagttcaa gcagacgccg 480
atgaagacgt taaaggcgct gcaggagctg atcgccgatg tgcgtgacca tcttgatttg 540
atttatgccc cgacgtttgt tgtgcaggcg cgccatgatg agatgatcaa cccggacagc 600
gcgaacatca tttataacga aattgaatcg ccggtcaaac aaatcaagtg gtatgagcaa 660
tcaggccatg tgattacgct tgatcaagaa aaagatcagc tgcatgaaga tatttatgca 720
tttcttgaat cgttagattg gtaa 744
<210>11
<211>176
<212>PRT
<213>嗜热脂肪地芽孢杆菌
<400>11
Lys Leu Gly Glu Lys Glu Leu Leu Asp Arg Ile Asn Arg Glu Val Gly
1 5 10 15
Pro Val Pro Glu Glu Ala Ile Arg Tyr Tyr Lys Glu Thr Ala Glu Pro
20 25 30
Ser Ala Pro Thr Trp Gln Thr Trp Leu Arg Ile Met Thr Tyr Arg Val
35 40 45
Phe Val Glu Gly Met Leu Arg Thr Ala Asp Ala Gln Ala Ala Gln Gly
50 55 60
Ala Asp Val Tyr Met Tyr Arg Phe Asp Tyr Glu Thr Pro Val Phe Gly
65 70 75 80
Gly Gln Leu Lys Ala Cys His Ala Leu Glu Leu Pro Phe Val Phe His
85 90 95
Asn Leu His Gln Pro Gly Val Ala Asn Phe Val Gly Asn Arg Pro Glu
100 105 110
Arg Glu Ala Ile Ala Asn Glu Met His Tyr Ala Trp Leu Ser Phe Ala
115 120 125
Arg Thr Gly Asp Pro Asn Gly Ala His Leu Pro Glu Ala Trp Pro Ala
130 135 140
Tyr Thr Asn Glu Arg Lys Ala Ala Phe Val Phe Ser Ala Ala Ser His
145 150 155 160
Val Glu Asp Asp Pro Phe Gly Arg Glu Arg Ala Ala Trp Gln Gly Arg
165 170 175
<210>12
<211>531
<212>DNA
<213>嗜热脂肪地芽孢杆菌
<400>12
aagcttggcg aaaaggaact tcttgaccga atcaaccggg aagtcgggcc ggtgccagaa 60
gaggccatcc gctattacaa agaaacggcg gagccgtcgg cgcctacttg gcaaacgtgg 120
cttcgcatca tgacgtaccg cgtatttgtc gaggggatgc tgcggacggc ggacgcccaa 180
gcggcgcaag gggcggatgt gtacatgtac cgctttgact atgagacgcc ggtgttcggc 240
ggccagctga aagcatgcca cgcgctcgag ctgccgtttg tgtttcacaa tctccatcag 300
ccgggcgtcg cgaatttcgt cggcaaccgg ccggagcgcg aggcgatcgc caatgaaatg 360
cattacgctt ggctctcgtt tgcccgcacc ggagacccga acggtgctca cttgccggaa 420
gcgtggccgg cgtatacgaa cgagcgcaag gcggcctttg tcttttcggc cgccagccat 480
gtcgaggacg acccgttcgg ccgcgagcgg gcggcatggc aaggacgcta g 531
<210>13
<211>498
<212>PRT
<213>嗜热脂肪地芽孢杆菌
<400>13
Met Glu Arg Thr Val Val Glu Thr Arg Tyr Gly Arg Leu Arg Gly Glu
1 5 10 15
Met Asn Glu Gly Val Phe Val Trp Lys Gly Ile Pro Tyr Ala Lys Ala
20 25 30
Pro Val Gly Glu Arg Arg Phe Leu Pro Pro Glu Pro Pro Asp Ala Trp
35 40 45
Asp Gly Val Arg Glu Ala Thr Ser Phe Gly Pro Val Val Met Gln Pro
50 55 60
Ser Asp Pro Ile Phe Ser Gly Leu Leu Gly Arg Met Ser Glu Ala Pro
65 70 75 80
Ser Glu Asp Gly Leu Tyr Leu Asn Ile Trp Ser Pro Ala Ala Asp Gly
85 90 95
Lys Lys Arg Pro Val Leu Phe Trp Ile His Gly Gly Ala Phe Leu Phe
100 105 110
Gly Ser Gly Ser Ser Pro Trp Tyr Asp Gly Thr Ala Phe Ala Lys His
115 120 125
Gly Asp Val Val Val Val Thr Ile Asn Tyr Arg Met Asn Val Phe Gly
130 135 140
Phe Leu His Leu Gly Asp Ser Phe Gly Glu Ala Tyr Ala Gln Ala Gly
145 150 155 160
Asn Leu Gly Ile Leu Asp Gln Val Ala Ala Leu Arg Trp Val Lys Glu
165 170 175
Asn Ile Ala Ala Phe Gly Gly Asp Pro Asp Asn Ile Thr Ile Phe Gly
180 185 190
Glu Ser Ala Gly Ala Ala Ser Val Gly Val Leu Leu Ser Leu Pro Glu
195 200 205
Ala Ser Gly Leu Phe Arg Arg Ala Met Leu Gln Ser Gly Ser Gly Ser
210 215 220
Leu Leu Leu Arg Ser Pro Glu Thr Ala Met Ala Met Thr Glu Arg Ile
225 230 235 240
Leu Asp Lys Ala Gly Ile Arg Pro Gly Asp Arg Glu Arg Leu Leu Ser
245 250 255
Ile Pro Ala Glu Glu Leu Leu Arg Ala Ala Leu Ser Leu Gly Pro Gly
260 265 270
Val Met Tyr Gly Pro Val Val Asp Gly Arg Val Leu Arg Arg His Pro
275 280 285
Ile Glu Ala Leu Arg Tyr Gly Ala Ala Ser Gly Ile Pro Ile Leu Ile
290 295 300
Gly Val Thr Lys Asp Glu Tyr Asn Leu Phe Thr Leu Thr Asp Pro Ser
305 310 315 320
Trp Thr Lys Leu Gly Glu Lys Glu Leu Leu Asp Arg Ile Asn Arg Glu
325 330 335
Val Gly Pro Val Pro Glu Glu Ala Ile Arg Tyr Tyr Lys Glu Thr Ala
340 345 350
Glu Pro Ser Ala Pro Thr Trp Gln Thr Trp Leu Arg Ile Met Thr Tyr
355 360 365
Arg Val Phe Val Glu Gly Met Leu Arg Thr Ala Asp Ala Gln Ala Ala
370 375 380
Gln Gly Ala Asp Val Tyr Met Tyr Arg Phe Asp Tyr Glu Thr Pro Val
385 390 395 400
Phe Gly Gly Gln Leu Lys Ala Cys His Ala Leu Glu Leu Pro Phe Val
405 410 415
Phe His Asn Leu His Gln Pro Gly Val Ala Asn Phe Val Gly Asn Arg
420 425 430
Pro Glu Arg Glu Ala Ile Ala Asn Glu Met His Tyr Ala Trp Leu Ser
435 440 445
Phe Ala Arg Thr Gly Asp Pro Asn Gly Ala His Leu Pro Glu Ala Trp
450 455 460
Pro Ala Tyr Thr Asn Glu Arg Lys Ala Ala Phe Val Phe Ser Ala Ala
465 470 475 480
Ser His Val Glu Asp Asp Pro Phe Gly Arg Glu Arg Ala Ala Trp Gln
485 490 495
Gly Arg
<210>14
<211>1497
<212>DNA
<213>嗜热脂肪地芽孢杆菌
<400>14
atggagcgaa ccgttgttga aacaaggtac ggacggttgc gcggggaaat gaatgaaggc 60
gtttttgttt ggaaaggaat tccgtacgcg aaagcgccgg tcggtgagcg ccggtttttg 120
ccgccggagc cgcccgatgc gtgggatggg gtgcgggagg cgacatcgtt cggtcctgtc 180
gtgatgcagc cgtcggatcc gattttcagc ggattgctcg ggcggatgag cgaggcgccg 240
agcgaagacg ggctgtactt gaacatttgg tcgccggcgg cggatgggaa gaagcgcccg 300
gtgttgtttt ggattcacgg cggcgccttt ttgttcggtt cgggttcttc gccgtggtat 360
gacgggacgg cgttcgcgaa acacggcgat gtcgttgtcg tgacgatcaa ctaccgaatg 420
aacgtgtttg gctttttgca tctcggtgat tcgttcggcg aagcgtacgc gcaagccggg 480
aatctcggca ttttggacca agtggcggcg ctgcgctggg tgaaggagaa cattgcggcg 540
tttggtggtg atccggacaa catcacgatt ttcggtgaat cggccggagc ggcgagcgtc 600
ggcgtgctgt tgtcgcttcc ggaggccagc gggctgtttc ggcgcgccat gttgcaaagc 660
ggttcgggat cgcttcttct ccgttcgccg gagaccgcga tggcgatgac cgaacgcatt 720
cttgataagg ctggcatccg tccgggcgac cgcgaacggc tgttgtcgat tcctgccgag 780
gagctgctgc gggcggcgct gtcgctcggt ccaggggtca tgtacggtcc ggtggtggat 840
ggccgcgtat tgcgccgcca tccgatcgaa gcgctccgct acggggcggc cagcggcatt 900
ccgattctca tcggcgtgac gaaagacgag tacaacttgt ttaccttgac ggatccgtca 960
tggacaaagc ttggcgaaaa ggaacttctt gaccgaatca accgggaagt cgggccggtg 1020
ccagaagagg ccatccgcta ttacaaagaa acggcggagc cgtcggcgcc tacttggcaa 1080
acgtggcttc gcatcatgac gtaccgcgta tttgtcgagg ggatgctgcg gacggcggac 1140
gcccaagcgg cgcaaggggc ggatgtgtac atgtaccgct ttgactatga gacgccggtg 1200
ttcggcggcc agctgaaagc atgccacgcg ctcgagctgc cgtttgtgtt tcacaatctc 1260
catcagccgg gcgtcgcgaa tttcgtcggc aaccggccgg agcgcgaggc gatcgccaat 1320
gaaatgcatt acgcttggct ctcgtttgcc cgcaccggag acccgaacgg tgctcacttg 1380
ccggaagcgt ggccggcgta tacgaacgag cgcaaggcgg cctttgtctt ttcggccgcc 1440
agccatgtcg aggacgaccc gttcggccgc gagcgggcgg catggcaagg acgctag 1497
<210>15
<211>226
<212>PRT
<213>嗜热脂肪地芽孢杆菌
<400>15
Met His Asn Asp Leu Ala Tyr Glu Tyr Asp Ile His Leu Pro Ser Gly
1 5 10 15
Gly Glu Ala Gly Lys Lys Tyr Pro Ala Val Phe Ala Leu His Gly Ile
20 25 30
Gly Tyr Asp Glu Gln Tyr Met Leu Thr Leu Val Lys Asp Leu Lys Glu
35 40 45
Glu Phe Ile Leu Ile Gly Ile Arg Gly Asp Leu Pro Tyr Glu Asp Gly
50 55 60
Tyr Ala Tyr Tyr Tyr Leu Lys Glu Tyr Gly Lys Pro Glu Arg Lys Met
65 70 75 80
Phe Asp Asp Ser Ile Gly Lys Leu Lys His Phe Ile Glu Tyr Ala Leu
85 90 95
Asn Gln Tyr Pro Ile Asp Ser Asp Arg Val Tyr Leu Ile Gly Phe Ser
100 105 110
Gln Gly Ala Ile Leu Ser Met Ser Leu Ala Leu Ile Leu Gly Asp Lys
115 120 125
Ile Lys Gly Ile Ala Ala Met Asn Gly Tyr Ile Pro Ser Phe Val Lys
130 135 140
Glu Glu Tyr Pro Leu Gln Pro Ile Ser His Leu Ser Val Phe Leu Thr
145 150 155 160
Gln Gly Glu Ser Asp His Ile Phe Pro Leu Asn Ile Gly Gln Glu Asn
165 170 175
Tyr Glu Tyr Leu Arg Gln His Ala Gly Ala Val Lys Tyr Thr Ile Tyr
180 185 190
Pro Ala Gly His Glu Ile Ser Gln Asp Asn Gln Arg Asp Ile Val Ser
195 200 205
Trp Leu Arg His Asp Ala Phe His His Asn Ser Asn Lys Ala Thr Asn
210 215 220
Pro Ala
225
<210>16
<211>681
<212>DNA
<213>嗜热脂肪地芽孢杆菌
<400>16
atgcataatg acttggcata tgaatatgac attcatcttc cttctggcgg agaagcgggg 60
aaaaagtatc cggctgtttt cgcgttgcac ggcatcgggt atgacgaaca atacatgctt 120
actttagtga aagatttaaa agaagaattt attttaatag gcattagagg ggatctcccg 180
tatgaagatg gatatgccta ttattatttg aaagaatatg gaaagccaga acggaaaatg 240
ttcgatgata gcataggcaa attaaagcac ttcattgaat atgcattaaa ccaatatccg 300
attgattccg atcgagtgta tttgatcggg tttagtcaag gcgccatttt aagtatgtct 360
ctcgccttga tactgggcga taaaattaaa gggattgccg caatgaacgg atatatacca 420
tcgtttgtga aggaagaata tccgttgcag cctatcagtc acttgtctgt gtttctcacc 480
caaggcgaat cagatcatat ttttccttta aatattgggc aggaaaatta tgaatacttg 540
cgccagcatg cgggggctgt gaagtatacc atttatccgg caggacatga aatatcgcaa 600
gacaaccaac gcgacatcgt ttcatggctg cgtcatgatg cattccatca caattccaat 660
aaggcaacaa atcccgcatg a 681
<210>17
<211>454
<212>PRT
<213>人(Homo sapiens)
<400>17
Glu His Cys Leu Tyr Leu Asn Ile Tyr Thr Pro Ala Asp Leu Thr Lys
1 5 10 15
Lys Asn Arg Leu Pro Val Met Val Trp Ile His Gly Gly Gly Leu Met
20 25 30
Val Gly Ala Ala Ser Thr Tyr Asp Gly Leu Ala Leu Ala Ala His Glu
35 40 45
Asn Val Val Val Val Thr Ile Gln Tyr Arg Leu Gly Ile Trp Gly Phe
50 55 60
Phe Ser Thr Gly Asp Glu His Ser Arg Gly Asn Trp Gly His Leu Asp
65 70 75 80
Gln Val Ala Ala Leu Arg Trp Val Gln Asp Asn Ile Ala Ser Phe Gly
85 90 95
Gly Asn Pro Gly Ser Val Thr Ile Phe Gly Glu Ser Ala Gly Gly Glu
100 105 110
Ser Val Ser Val Leu Val Leu Ser Pro Leu Ala Lys Asn Leu Phe His
115 120 125
Arg Ala Ile Ser Glu Ser Gly Val Ala Leu Thr Ser Val Leu Val Lys
130 135 140
Lys Gly Asp Val Lys Pro Leu Ala Glu Gln Ile Ala Ile Thr Ala Gly
145 150 155 160
Cys Lys Thr Thr Thr Ser Ala Ala Met Val His Cys Leu Arg Gln Lys
165 170 175
Thr Glu Glu Glu Leu Leu Glu Thr Thr Leu Lys Ile Gly Asn Ser Tyr
180 185 190
Leu Trp Thr Tyr Arg Glu Thr Gln Arg Glu Ser Thr Leu Leu Gly Thr
195 200 205
Val Ile Asp Gly Met Leu Leu Leu Lys Thr Pro Glu Glu Leu Gln Arg
210 215 220
Glu Arg Asn Phe His Thr Val Pro Tyr Met Val Gly Ile Asn Lys Gln
225 230 235 240
Glu Phe Gly Trp Leu Ile Pro Met Gln Leu Met Ser Tyr Pro Leu Ser
245 250 255
Glu Gly Gln Leu Asp Gln Lys Thr Ala Met Ser Leu Leu Gly Ser Pro
260 265 270
Ile Pro Leu Phe Ala Ile Ala Lys Glu Leu Ile Pro Glu Ala Thr Glu
275 280 285
Lys Tyr Leu Gly Gly Thr Asp Asp Thr Val Lys Lys Lys Asp Leu Ile
290 295 300
Leu Asp Leu Ile Ala Asp Val Met Phe Gly Val Pro Ser Val Ile Val
305 310 315 320
Ala Arg Asn His Arg Asp Ala Gly Ala Pro Thr Tyr Met Tyr Glu Phe
325 330 335
Gln Tyr Arg Pro Ser Phe Ser Ser Asp Met Lys Pro Lys Thr Val Ile
340 345 350
Gly Asp His Gly Asp Glu Leu Phe Ser Val Phe Gly Ala Pro Phe Leu
355 360 365
Lys Glu Gly Ala Ser Glu Glu Glu Ile Arg Leu Ser Lys Met Val Met
370 375 380
Lys Phe Trp Ala Asn Phe Ala Arg Asn Gly Asn Pro Asn Gly Lys Gly
385 390 395 400
Leu Pro His Trp Pro Glu Tyr Asn Gln Lys Glu Gly Tyr Leu Gln Ile
405 410 415
Gly Ala Asn Thr Gln Ala Ala Gln Lys Leu Lys Asp Lys Glu Val Ala
420 425 430
Phe Trp Thr Asn Leu Phe Ala Lys Lys Ala Val Glu Lys Pro Pro Gln
435 440 445
Thr Asp His Ile Glu Leu
450
<210>18
<211>1367
<212>DNA
<213>人
<400>18
ctgaacactg tctttacctc aatatttaca ctcctgcaga cttgaccaag aaaaacaggc 60
tgccggtgat ggtgtggatc cacggagggg ggctgatggt gggtgcggca tcaacctatg 120
atgggctggc ccttgctgcc catgaaaacg tggtggtggt gaccattcaa tatcgcctgg 180
gcatctgggg attcttcagc acaggggatg aacacagccg ggggaactgg ggtcacctgg 240
accaggtggc tgccctgcgc tgggtccagg acaacattgc cagctttgga gggaacccag 300
gctctgtgac catctttgga gagtcagcgg gaggagaaag tgtctctgtt cttgttttgt 360
ctccattggc caagaacctc ttccaccggg ccatttctga gagtggcgtg gccctcactt 420
ctgttctggt gaagaaaggt gatgtcaagc ccttggctga gcaaattgct atcactgctg 480
ggtgcaaaac caccacctct gctgctatgg ttcactgcct gcgacagaag acggaagagg 540
agctcttgga gacgacattg aaaattggaa attcttatct ctggacttac agggagaccc 600
agagagagtc aacccttctg ggcactgtga ttgatgggat gctgctgctg aaaacacctg 660
aagagcttca acgtgaaagg aatttccaca ctgtccccta catggtcgga attaacaagc 720
aggagtttgg ctggttgatt ccaatgcagt tgatgagcta tccactctcc gaagggcaac 780
tggaccagaa gacagccatg tcactccttg gaagtcctat ccccttgttt gccattgcta 840
aggaactgat tccagaagcc actgagaaat acttaggagg aacagacgac actgtcaaaa 900
agaaagacct gatcctggac ttgatagcag atgtgatgtt tggtgtccca tctgtgattg 960
tggcccggaa ccacagagat gctggagcac ccacctacat gtatgagttt cagtaccgtc 1020
caagcttctc atcagacatg aaacccaaga cggtgatagg agaccacggg gatgagctct 1080
tctccgtctt tggggcccca tttttaaaag agggtgcctc agaagaggag atcagactta 1140
gcaagatggt gatgaaattc tgggccaact ttgctcgcaa tggaaacccc aatgggaaag 1200
ggctgcccca ctggccagag tacaaccaga aggaagggta tctgcagatt ggtgccaaca 1260
cccaggcggc ccagaagctg aaggacaaag aagtagcttt ctggaccaac ctctttgcca 1320
agaaggcagt ggagaagcca ccccagacag accacataga gctgtga 1367
<210>19
<211>565
<212>PRT
<213>小鼠(Mus musculus)
<400>19
Met Trp Leu Cys Ala Leu Ser Leu Ile Ser Leu Thr Ala Cys Leu Ser
1 5 10 15
Leu Gly His Pro Ser Leu Pro Pro Val Val His Thr Val His Gly Lys
20 25 30
Val Leu Gly Lys Tyr Val Thr Leu Glu Gly Phe Ser Gln Pro Val Ala
35 40 45
Val Phe Leu Gly Val Pro Phe Ala Lys Pro Pro Leu Gly Ser Leu Arg
50 55 60
Phe Ala Pro Pro Glu Pro Ala Glu Pro Trp Ser Phe Val Lys His Thr
65 70 75 80
Thr Ser Tyr Pro Pro Leu Cys Tyr Gln Asn Pro Glu Ala Ala Leu Arg
85 90 95
Leu Ala Glu Arg Phe Thr Asn Gln Arg Lys Ile Ile Pro His Lys Phe
100 105 110
Ser Glu Asp Cys Leu Tyr Leu Asn Ile Tyr Thr Pro Ala Asp Leu Thr
115 120 125
Gln Asn Ser Arg Leu Pro Val Met Val Trp Ile His Gly Gly Gly Leu
130 135 140
Val Ile Asp Gly Ala Ser Thr Tyr Asp Gly Val Pro Leu Ala Val His
145 150 155 160
Glu Asn Val Val Val Val Val Ile Gln Tyr Arg Leu Gly Ile Trp Gly
165 170 175
Phe Phe Ser Thr Glu Asp Glu His Ser Arg Gly Asn Trp Gly His Leu
180 185 190
Asp Gln Val Ala Ala Leu His Trp Val Gln Asp Asn Ile Ala Asn Phe
195 200 205
Gly Gly Asn Pro Gly Ser Val Thr Ile Phe Gly Glu Ser Ala Gly Gly
210 215 220
Glu Ser Val Ser Val Leu Val Leu Ser Pro Leu Ala Lys Asn Leu Phe
225 230 235 240
His Arg Ala Ile Ala Gln Ser Ser Val Ile Phe Asn Pro Cys Leu Phe
245 250 255
Gly Arg Ala Ala Arg Pro Leu Ala Lys Lys Ile Ala Ala Leu Ala Gly
260 265 270
Cys Lys Thr Thr Thr Ser Ala Ala Met Val His Cys Leu Arg Gln Lys
275 280 285
Thr Glu Asp Glu Leu Leu Glu Val Ser Leu Lys Met Lys Phe Gly Thr
290 295 300
Val Asp Phe Leu Gly Asp Pro Arg Glu Ser Tyr Pro Phe Leu Pro Thr
305 310 315 320
Val Ile Asp Gly Val Leu Leu Pro Lys Ala Pro Glu Glu Ile Leu Ala
325 330 335
Glu Lys Ser Phe Asn Thr Val Pro Tyr Met Val Gly Ile Asn Lys His
340 345 350
Glu Phe Gly Trp Ile Ile Pro Met Phe Leu Asp Phe Pro Leu Ser Glu
355 360 365
Arg Lys Leu Glu Gln Lys Thr Ala Ala Ser Ile Leu Trp Gln Ala Tyr
370 375 380
Pro Ile Leu Asn Ile Ser Glu Lys Leu Ile Pro Ala Ala Ile Glu Lys
385 390 395 400
Tyr Leu Gly Gly Thr Glu Asp Pro Ala Thr Met Thr Asp Leu Phe Leu
405 410 415
Asp Leu Ile Gly Asp Ile Met Phe Gly Val Pro Ser Val Ile Val Ser
420 425 430
Arg Ser His Arg Asp Ala Gly Ala Pro Thr Tyr Met Tyr Glu Tyr Gln
435 440 445
Tyr Arg Pro Ser Phe Val Ser Asp Asp Arg Pro Gln Glu Leu Leu Gly
450 455 460
Asp His Ala Asp Glu Leu Phe Ser Val Trp Gly Ala Pro Phe Leu Lys
465 470 475 480
Glu Gly Ala Ser Glu Glu Glu Ile Asn Leu Ser Asn Met Val Met Lys
485 490 495
Phe Trp Ala Asn Phe Ala Arg Asn Gly Asn Pro Asn Gly Glu Gly Leu
500 505 510
Pro His Trp Pro Glu Tyr Asp Gln Lys Glu Gly Tyr Leu Gln Ile Gly
515 520 525
Val Pro Ala Gln Ala Ala His Arg Leu Lys Asp Lys Glu Val Asp Phe
530 535 540
Trp Thr Glu Leu Arg Ala Lys Glu Thr Ala Glu Arg Ser Ser His Arg
545 550 555 560
Glu His Val Glu Leu
565
<2l0>20
<211>1698
<212>DNA
<213>小鼠
<400>20
atgtggctct gtgctttgag tctgatctct ctcactgctt gcttgagtct gggacaccca 60
tccttaccgc ctgtggtaca caccgttcat ggcaaagtcc tggggaagta tgtcacctta 120
gaaggattct cacagcctgt ggccgtcttc ctgggagtcc cctttgccaa gccccctctt 180
ggatctctga ggtttgctcc accagagcct gcagagccct ggagcttcgt gaagcacacc 240
acttcctacc ctcctttgtg ctaccaaaac ccagaggcag cattgaggct cgctgagcgc 300
ttcaccaacc aaaggaagat cattccccac aaattttctg aggactgtct ctacctcaac 360
atttatactc ctgctgactt aacacagaac agcaggttgc ccgtgatggt gtggatacat 420
ggaggtggac ttgtgataga tggagcatca acctatgatg gagtgcccct ggctgtccat 480
gaaaatgtgg ttgtagtggt cattcagtat cgcctgggca tctggggatt cttcagcaca 540
gaggatgaac acagccgggg gaactggggt cacttggacc aggtggctgc actacattgg 600
gtccaagaca acattgccaa ctttgggggc aacccaggat ctgtgactat cttcggcgag 660
tcagcaggag gtgaaagtgt ctctgttctt gtgttaagcc cactggccaa gaacctcttc 720
cacagggcca tcgctcagag tagtgtcatt ttcaatcctt gcctttttgg gagagctgcc 780
agacccttgg ctaagaaaat tgctgctctt gctggctgta aaaccaccac ctccgctgcc 840
atggttcact gcctgcgcca gaagactgaa gatgagctct tggaggtctc actgaaaatg 900
aaatttggga ctgttgattt tcttggagac cccagagaga gctatccctt cctccctact 960
gtgattgatg gagtgttgct gccaaaggca ccagaagaga ttctggctga gaagagtttc 1020
aacactgtcc cctacatggt gggcatcaac aagcatgagt ttggctggat cattccaatg 1080
tttttggact tcccactctc tgaaagaaaa ctggaacaga agacagctgc atccatcctg 1140
tggcaggcct acccaattct taacatctct gaaaagctga ttccagcagc tattgaaaag 1200
tatttaggag ggacagaaga ccctgccaca atgacagacc tgttcctgga cttgattgga 1260
gacattatgt tcggtgtccc atctgtaatc gtgtcccgta gtcacagaga tgctggagcc 1320
ccaacctaca tgtatgaata tcagtatcgc ccaagttttg tatcagacga tagaccccag 1380
gaattgttag gagaccacgc tgatgaactc ttttctgtat ggggagcccc gtttttaaaa 1440
gagggtgctt cagaagaaga gatcaacctc agcaacatgg tgatgaaatt ctgggccaac 1500
tttgctcgga atgggaaccc taatggtgaa gggctgcctc attggccaga atatgaccag 1560
aaggaaggat accttcagat tggagtccca gcacaggcag cccataggct gaaagacaag 1620
gaagtggact tttggactga gctcagagcc aaggaaacag cagagaggtc atcccatagg 1680
gaacatgttg aactgtga 1698
<210>21
<211>553
<212>PRT
<213>非洲爪蟾(Xenopus laevis)
<400>21
Met Ala Leu Trp Ala Ser Leu Ala Leu Ala Phe Ala Ser Leu Val Ala
1 5 10 15
Val Ser Gln Ala Ala Ser Leu Gly Val Val Tyr Thr Glu Gly Gly Phe
20 25 30
Val Glu Gly Thr Ser Lys Lys Ile Gly Ile Leu Phe Pro Asn Tyr Ile
35 40 45
Asp Val Phe Lys Gly Ile Pro Phe Ala Ala Pro Pro Lys Ala Phe Glu
50 55 60
Lys Ala Gln Leu His Pro Gly Trp Ser Gly Thr Leu Lys Ala Thr Asn
65 70 75 80
Phe Lys Glu Arg Cys Leu Gln Ser Thr Leu Thr Gln Thr Asn Val Arg
85 90 95
Gly Asp Leu Asp Cys Leu Tyr Leu Asn Ile Trp Val Pro Gln Thr Arg
100 105 110
Ser Ser Val Ser Thr Asn Leu Pro Val Met Val Trp Ile Tyr Gly Gly
115 120 125
Ala Phe Leu Leu Gly Ser Ser Gln Gly Ala Asn Val Leu Asp Asn Tyr
130 135 140
Leu Tyr Asp Gly Glu Glu Leu Ala Leu Arg Gly Asn Val Ile Val Val
145 150 155 160
Thr Leu Asn Tyr Arg Leu Gly Pro Leu Gly Phe Leu Ser Thr Gly Asp
165 170 175
Ser Asn Leu Pro Gly Asn Tyr Gly Leu Trp Asp Gln His Met Ala Ile
180 185 190
Ala Trp Val Lys Arg Asn Ile Ala Ala Phe Gly Gly Asn Pro Asp Asn
195 200 205
Ile Thr Ile Phe Gly Glu Ser Ala Gly Gly Ala Ser Val Ser Leu Gln
210 215 220
Thr Leu Ser Pro Tyr Asn Lys Gly Leu Ile Lys Arg Ala Ile Ser Gln
225 230 235 240
Ser Gly Val Gly Met Ser Pro Trp Ala Leu Gln Ser Asn Pro Leu Phe
245 250 255
Trp Thr Thr Lys Val Ala Glu Lys Val Gly Cys Pro Val His Asp Thr
260 265 270
Ala Ala Met Ala Asn Cys Leu Arg Ile Ser Asp Pro Lys Ala Val Thr
275 280 285
Leu Ala Tyr Lys Leu Asp Pro Ser Val Leu Glu Tyr Pro Ala Val Tyr
290 295 300
Tyr Leu Gly Ile Ser Pro Val Ile Asp Gly Asp Phe Ile Pro Asp Glu
305 310 315 320
Pro Met Asn Leu Phe Ala Asn Ala Ala Asp Val Asp Tyr Met Ala Gly
325 330 335
Val Asn Asn Met Asp Ala His Leu Phe Ala Gly Ile Asp Leu Pro Val
340 345 350
Ile Asn Gln Pro Leu Gln Lys Ile Ser Glu Ala Glu Val Tyr Asn Leu
355 360 365
Val Gln Gly Leu Thr Leu Thr Lys Ile Ser Ser Ala Leu Glu Thr Ala
370 375 380
Tyr Asn Leu Tyr Thr Ala Asn Trp Gly Pro Asn Pro Glu Gln Glu Asn
385 390 395 400
Met Lys Arg Thr Val Ile Asp Leu Glu Thr Asp Tyr Leu Phe Leu Val
405 410 415
Pro Thr Gln Glu Ala Leu Ala Leu His Ser Met Asn Ala Arg Ser Gly
420 425 430
Arg Thr Tyr Asn Tyr Val Phe Ser Leu Pro Thr Arg Met Pro Ile Tyr
435 440 445
Pro Ser Trp Val Gly Ala Asp His Ala Asp Asp Leu Gln Tyr Val Phe
450 455 460
Gly Lys Pro Phe Gln Thr Pro Leu Ala Tyr Arg Pro Lys Asp Arg Asp
465 470 475 480
Val Ser Ala Ala Met Ile Ala Tyr Trp Thr Asn Phe Ala Ala Thr Gly
485 490 495
Asp Pro Asn Gln Gly Pro Ser Lys Val Pro Thr Asp Trp Leu Pro Tyr
500 505 510
Thr Thr His Leu Gly Gln Tyr Leu Glu Ile Asn Asp Asn Met Ser Tyr
515 520 525
Gln Ser Val Lys Gln Ser Leu Arg Ser Pro Tyr Val Lys Phe Trp Ala
530 535 540
His Thr Tyr Arg Asn Met Ala Asn Val
545 550
<210>22
<211>1662
<212>DNA
<213>非洲爪蟾
<400>22
atggctcttt gggcttctct tgccctggca tttgccagtc tggtggcagt gtcccaggct 60
gcttcactgg gagtggttta caccgagggt ggatttgtgg aaggtaccag caagaaaatt 120
gggatcctgt tcccgaatta cattgatgtt tttaagggca tcccgtttgc tgctccacca 180
aaagcctttg agaaggcaca actgcaccca ggctggtcag gtacattaaa agccacaaac 240
tttaaggaac ggtgcttaca atccacctta acccaaacaa atgtccgtgg tgatttagac 300
tgcctctacc tgaacatctg ggttcctcag acccgctctt cagtgtccac caacctacca 360
gtcatggttt ggatctacgg tggggccttc ttgctcggtt catctcaggg ggccaacgtg 420
ttggataact atctgtatga tggagaggag ctcgctctcc gtggcaatgt cattgtggtg 480
accttgaact acagactggg accattgggc tttctgagta ctggagactc taaccttcct 540
ggcaactatg gactgtggga tcaacacatg gccatcgcct gggtgaaaag gaacattgct 600
gcatttggtg ggaaccctga taacatcacc atatttggag agtctgctgg aggcgccagt 660
gtctcccttc agaccctgtc tccatacaac aaaggactga tcaagcgagc catcagccag 720
agtggggtgg gcatgtcccc ttgggcactt cagagcaacc cacttttctg gaccacaaag 780
gtggctgaaa aagttggatg tcctgttcat gacacagccg ctatggcaaa ctgcttgagg 840
atttcagacc ctaaggctgt cactttagcc tataaactgg acccgtctgt cctggagtat 900
cccgctgttt actacttggg catctcccca gtcattgatg gtgatttcat tcctgatgaa 960
ccaatgaatc tctttgctaa tgcggcggat gtggattaca tggcaggtgt aaacaacatg 1020
gatgcacatt tgtttgcagg catagatctg ccagttatca atcagcctct tcagaagatt 1080
tctgaggccg aagtctataa tctggtgcag ggtttgaccc tgactaaaat ctccagtgcc 1140
ttggaaactg cctacaacct ttacacggcc aactggggac ccaaccctga gcaggagaat 1200
atgaaaagaa ctgtcataga cttagagacg gactatcttt tcctggtccc tacccaagag 1260
gcactggctc ttcactccat gaatgctcgg agtggacgga cttacaacta cgtgttctct 1320
ttgccgactc gcatgcccat ttaccccagc tgggtcggag ccgatcatgc agatgatttg 1380
cagtacgtgt tcgggaaacc cttccagact ccattggctt acagacccaa ggatagagat 1440
gtctctgccg ccatgattgc ctattggacc aactttgctg caactggtga ccccaaccaa 1500
ggaccctcca aagtgcccac cgattggctg ccttacacca ctcaccttgg ccagtacctg 1560
gaaatcaacg acaacatgtc ttaccaatct gtaaagcaga gtctacgttc cccttatgtg 1620
aaattctggg cccacactta ccgcaatatg gccaacgtgt aa 1662
<210>23
<211>556
<212>PRT
<213>鸡(Gallus gallus)
<400>23
Met Ala His Trp Ala Ile Leu Ser Phe Ala Leu Cys Cys Cys Leu Gly
1 5 10 15
Val Ala Gln Ala Ala Thr Leu Gly Val Val Leu Thr Glu Gly Gly Phe
20 25 30
Val Glu Gly Glu Ser Lys Arg Arg Gly Leu Phe Gly Ser Tyr Val Asp
35 40 45
Ile Phe Arg Gly Ile Pro Phe Ala Ala Pro Pro Lys Ala Leu Gln Asp
50 55 60
Pro Gln Pro His Pro Gly Trp Asp Gly Thr Leu Lys Ala Lys Lys Phe
65 70 75 80
Lys Asn Arg Cys Met Gln Met Thr Leu Thr Gln Thr Asp Val Arg Gly
85 90 95
Lys Glu Asp Cys Leu Tyr Leu Asn Ile Trp Ile Pro Gln Gly Lys Arg
100 105 110
Glu Val Ser Thr Asn Leu Pro Val Met Val Trp Ile Tyr Gly Gly Ala
115 120 125
Phe Leu Leu Gly Gly Gly Gln Gly Ala Asn Phe Leu Asp Asn Tyr Leu
130 135 140
Tyr Asp Gly Glu Glu Ile Ala Val Arg Gly Asn Val Ile Val Val Thr
145 150 155 160
Leu Asn Tyr Arg Val Gly Pro Leu Gly Phe Leu Ser Thr Gly Asp Pro
165 170 175
Asn Met Pro Gly Asn Tyr Gly Leu Lys Asp Gln His Met Ala Ile Ala
180 185 190
Trp Val Lys Arg Asn Ile Lys Ala Phe Gly Gly Asp Pro Asp Asn Ile
195 200 205
Thr Ile Phe Gly Glu Ser Ala Gly Ala Ala Ser Val Ser Leu Gln Ile
210 215 220
Leu Ser Pro Lys Asn Ala Gly Leu Phe Lys Arg Ala Ile Ser Gln Ser
225 230 235 240
Gly Val Ser Leu Cys Ser Trp Val Ile Gln Lys Asp Pro Leu Thr Trp
245 250 255
Ala Lys Lys Val Gly Glu Gln Val Gly Cys Pro Thr Asp Asn Thr Thr
260 265 270
Val Leu Ala Asn Cys Leu Arg Ala Thr Asp Pro Lys Ala Leu Thr Leu
275 280 285
Ala His His Val Glu Leu Ile Ser Leu Pro Gly Pro Leu Val His Thr
290 295 300
Leu Ser Ile Thr Pro Val Val Asp Gly Asp Phe Leu Pro Asp Met Pro
305 310 315 320
Glu Asn Leu Phe Ala Asn Ala Ala Asp Ile Asp Tyr Ile Ala Gly Val
325 330 335
Asn Asn Met Asp Gly His Phe Phe Ala Gly Phe Asp Leu Pro Ala Ile
340 345 350
Asn Arg Pro Leu Gln Lys Ile Thr Ala Ser Asp Val Tyr Asn Leu Val
355 360 365
Lys Gly Leu Thr Ala Asp Arg Gly Glu Arg Gly Ala Asn Leu Thr Tyr
370 375 380
Asp Leu Tyr Thr Glu Leu Trp Gly Asp Asn Pro Glu Gln Gln Val Met
385 390 395 400
Lys Arg Thr Val Val Asp Leu Ala Thr Asp Tyr Ile Phe Leu Ile Pro
405 410 415
Thr Gln Trp Thr Leu Asn Leu His His Lys Asn Ala Arg Ser Gly Lys
420 425 430
Thr Tyr Ser Tyr Leu Phe Ser Gln Pro Ser Arg Met Pro Ile Tyr Pro
435 440 445
Ser Trp Val Gly Ala Asp His Ala Asp Asp Leu Gln Tyr Val Phe Gly
450 455 460
Lys Pro Phe Ala Thr Pro Leu Gly Tyr Leu Pro Lys His Arg Thr Val
465 470 475 480
Ser Ser Ala Met Ile Ala Tyr Trp Thr Asn Phe Ala Arg Thr Gly Asp
485 490 495
Pro Asn Ser Gly Asn Ser Glu Val Pro Ile Thr Trp Pro Pro Tyr Thr
500 505 510
Thr Glu Gly Gly Tyr Tyr Leu Glu Ile Asn Asn Lys Ile Asn Tyr Asn
515 520 525
Ser Val Lys Gln Asn Leu Arg Thr Pro Tyr Val Asn Tyr Trp Asn Ser
530 535 540
Val Tyr Leu Asn Leu Pro Leu Ile Ala Ser Thr Ser
545 550 555
<210>24
<211>1671
<212>DNA
<213>鸡
<400>24
atggctcact gggcgattct gagctttgcc ttgtgctgct gcctcggggt agcacaggcc 60
gcaactctgg gtgtggtgct caccgaggga ggttttgtgg aaggcgagag taaacgacgg 120
ggactctttg ggagctatgt ggatatcttc agagggatcc cttttgctgc cccgccaaag 180
gcactgcaag acccccaacc tcatcctggc tgggacggaa cactgaaagc aaaaaaattt 240
aagaatcgct gcatgcagat gacacttacc caaactgatg tccgtgggaa ggaggactgc 300
ctctatctga acatctggat ccctcaaggg aagagagaag tctccaccaa cttgccagtg 360
atggtctgga tctacggtgg tgccttcctt cttggagggg gtcaaggagc caacttcctt 420
gacaactacc tctatgatgg tgaggagatc gccgtgcggg gcaatgtgat tgtggtgacc 480
ctcaactatc gtgtggggcc cctgggcttc ctcagcactg gagacccaaa catgccaggg 540
aactacgggc tgaaggatca gcacatggct attgcctggg tgaagaggaa tatcaaggcc 600
tttggaggcg acccagacaa catcaccatc tttggggagt cagctggtgc tgccagtgtc 660
tccctgcaga tattgtcccc aaagaacgca ggtctgttca agagagccat cagccaaagc 720
ggtgtcagtc tgtgcagctg ggtcatccaa aaggacccac tcacttgggc taaaaaggtt 780
ggagagcagg tgggctgccc cacagacaac accacggtct tggccaactg cctccgtgcc 840
actgacccca aagccctgac actggcccac cacgtggaac tgatctccct gcctggtccc 900
ctggttcata cactctccat cactcctgtt gttgatggag acttcctccc tgacatgcca 960
gagaacctct ttgccaatgc tgctgacatc gactacattg ctggggtcaa caacatggat 1020
ggacatttct ttgctggctt tgatttacct gctatcaacc gtccacttca gaaaatcact 1080
gcgagcgatg tctataactt ggtcaaagga ctaactgcag acaggggtga gagaggagcc 1140
aacttgacgt acgatctcta cacagagttg tggggtgaca acccagagca acaagtcatg 1200
aagagaacag tggtggacct ggctaccgac tacattttcc tgattcccac acagtggaca 1260
ctaaacctgc accacaagaa tgcccggagt ggcaagacat acagctactt gttctcccag 1320
ccatctcgaa tgcccatcta tccaagctgg gtaggggcag accacgctga tgacttgcag 1380
tacgtgtttg ggaaaccctt tgccacccct ctaggctacc tgcccaagca caggacagtc 1440
tcatctgcca tgattgctta ttggaccaat tttgccagga ctggtgaccc caacagtggg 1500
aattcagagg tgcccattac ctggccaccc tacaccactg agggtggtta ctacctggaa 1560
atcaacaaca aaataaacta taattcagtg aaacagaatc tgagaacccc atacgtgaac 1620
tactggaatt cagtctatct aaatctgcca ctgattgcca gcacatccta g 1671
<210>25
<211>544
<212>PRT
<213>黑腹果蝇(Drosophila melanogaster)
<400>25
Met Ser Ile Phe Lys Arg Leu Leu Cys Leu Thr Leu Leu Trp Ile Ala
1 5 10 15
Ala Leu Glu Ser Glu Ala Asp Pro Leu Ile Val Glu Ile Thr Asn Gly
20 25 30
Lys Ile Arg Gly Lys Asp Asn Gly Leu Tyr Tyr Ser Tyr Glu Ser Ile
35 40 45
Pro Tyr Ala Glu His Pro Thr Gly Ala Leu Arg Phe Glu Ala Pro Gln
50 55 60
Pro Tyr Ser His His Trp Thr Asp Val Phe Asn Ala Thr Gln Ser Pro
65 70 75 80
Val Glu Cys Met Gln Trp Asn Gln Phe Ile Asn Glu Asn Asn Lys Leu
85 90 95
Met Gly Asp Glu Asp Cys Leu Thr Val Ser Ile Tyr Lys Pro Lys Lys
100 105 110
Pro Asn Arg Ser Ser Phe Pro Val Val Val Leu Leu His Gly Gly Ala
115 120 125
Phe Met Phe Gly Ser Gly Ser Ile Tyr Gly His Asp Ser Ile Met Arg
130 135 140
Glu Gly Thr Leu Leu Val Val Lys Ile Ser Tyr Arg Leu Gly Pro Leu
145 150 155 160
Gly Phe Ala Ser Thr Gly Asp Arg His Leu Pro Gly Asn Tyr Gly Leu
165 170 175
Lys Asp Gln Arg Leu Ala Leu Gln Trp Ile Lys Lys Asn Ile Ala His
180 185 190
Phe Gly Gly Met Pro Asp Asn Ile Val Leu Ile Gly His Ser Ala Gly
195 200 205
Gly Ala Ser Ala His Leu Gln Leu Leu His Glu Asp Phe Lys His Leu
210 215 220
Ala Lys Gly Ala Ile Ser Val Ser Gly Asn Ala Leu Asp Pro Trp Val
225 230 235 240
Ile Gln Gln Gly Gly Arg Arg Arg Ala Phe Glu Leu Gly Arg Ile Val
245 250 255
Gly Cys Gly His Thr Asn Val Ser Ala Glu Leu Lys Asp Cys Leu Lys
260 265 270
Ser Lys Pro Ala Ser Asp Ile Val Ser Ala Val Arg Ser Phe Leu Val
275 280 285
Phe Ser Tyr Val Pro Phe Ser Ala Phe Gly Pro Val Val Glu Pro Ser
290 295 300
Asp Ala Pro Asp Ala Phe Leu Thr Glu Asp Pro Arg Ala Val Ile Lys
305 310 315 320
Ser Gly Lys Phe Ala Gln Val Pro Trp Ala Val Thr Tyr Thr Thr Glu
325 330 335
Asp Gly Gly Tyr Asn Ala Ala Gln Leu Leu Glu Arg Asn Lys Leu Thr
340 345 350
Gly Glu Ser Trp Ile Asp Leu Leu Asn Asp Arg Trp Phe Asp Trp Ala
355 360 365
Pro Tyr Leu Leu Phe Tyr Arg Asp Ala Lys Lys Thr Ile Lys Asp Met
370 375 380
Asp Asp Leu Ser Phe Asp Leu Arg Gln Gln Tyr Leu Ala Asp Arg Arg
385 390 395 400
Phe Ser Val Glu Ser Tyr Trp Asn Val Gln Arg Met Phe Thr Asp Val
405 410 415
Leu Phe Lys Asn Ser Val Pro Ser Ala Ile Asp Leu His Arg Lys Tyr
420 425 430
Gly Lys Ser Pro Val Tyr Ser Phe Val Tyr Asp Asn Pro Thr Asp Ser
435 440 445
Gly Val Gly Gln Leu Leu Ser Asn Arg Thr Asp Val His Phe Gly Thr
450 455 460
Val His Gly Asp Asp Phe Phe Leu Ile Phe Asn Thr Ala Ala Tyr Arg
465 470 475 480
Ile Gly Ile Arg Pro Asp Glu Glu Val Ile Ser Lys Lys Phe Ile Gly
485 490 495
Met Leu Glu Asp Phe Ala Leu Asn Asp Lys Gly Thr Leu Thr Phe Gly
500 505 510
Glu Cys Asn Phe Gln Asn Asn Val Asn Ser Lys Glu Tyr Gln Val Leu
515 520 525
Arg Ile Ser Arg Asn Ala Cys Lys Asn Glu Glu Tyr Ala Arg Phe Pro
530 535 540
<210>26
<211>1635
<212>DNA
<213>黑腹果蝇
<400>26
atgagtatat tcaaacggct gttgtgcctg actttgctgt ggatagcagc tttagaatct 60
gaagctgatc ccttgattgt tgagataaca aatggaaaaa tccgtggcaa agataatggg 120
ttgtactaca gctacgaatc gattccctat gccgagcatc caactggtgc cctccgtttt 180
gaagcacctc agccgtatag tcatcattgg actgatgttt tcaatgccac gcagtctcca 240
gttgagtgca tgcagtggaa tcagtttata aacgaaaaca ataagctgat gggtgatgag 300
gattgcttaa cggtaagcat ctataagcca aagaaaccca atcggagcag ctttcctgtc 360
gtagtactcc tgcatggagg tgctttcatg ttcggtagtg gatccatata tggacacgac 420
tccattatgc gtgagggaac tttgcttgtg gtaaaaataa gctatcgtct tggaccattg 480
ggttttgcaa gtaccggcga tagacacttg ccgggaaact atggtctaaa ggatcaacgt 540
ctggccctac aatggatcaa gaagaacatt gctcactttg gtggaatgcc agataatatt 600
gtgctcattg gtcactctgc aggcggtgct tcggctcatt tgcagctgtt gcacgaggat 660
ttcaaacatt tggccaaagg agcgatttcg gtgagcggca atgcattgga tccttgggtc 720
atacagcagg gtggacgacg acgtgcattt gaactgggtc gtattgtcgg ttgtggacac 780
acaaatgtct ccgcagaact caaggactgc ttgaagtcta agccggctag cgatatagtc 840
tctgctgtcc gaagcttcct tgtgttttcc tatgtaccct tcagtgcttt tggacctgtt 900
gtggagccgt cagatgcacc agacgccttt ctaaccgagg acccaagagc agtgattaag 960
agcgggaagt ttgcccaagt cccttgggct gtgacgtaca ccactgagga cgggggatac 1020
aacgctgctc agctgttgga aagaaacaaa ttaactggcg agagttggat tgacctactc 1080
aatgatcgat ggtttgattg ggcaccatac ttgctcttct atcgggacgc caagaaaacc 1140
atcaaagata tggatgatct ttcatttgat ctcaggcagc agtatctagc agatcggcga 1200
ttcagtgtgg aaagttattg gaacgtgcag cgaatgttta ctgatgttct tttcaagaat 1260
agcgtgccaa gtgcaataga tcttcaccga aagtatggca aaagtccggt ttattctttt 1320
gtctacgata atcctaccga ttccggagtg ggtcaattgc tttccaatcg aacagatgta 1380
cattttggta ctgtccacgg agatgacttt ttcttgattt tcaatacagc tgcataccgt 1440
atcggcattc gtccggatga agaagttatt tcaaaaaagt ttataggtat gctggaggat 1500
ttcgcactca acgataaggg aacattaaca tttggagaat gtaatttcca aaataatgtg 1560
aacagcaagg aatatcaagt gctgcgtatt tcacgaaacg cttgtaaaaa cgaggaatat 1620
gctcggtttc cctaa 1635
<210>27
<211>570
<212>PRT
<213>家蚕(Bombyx mori)
<400>27
Met Cys Thr Lys Phe Ala Val Leu Leu Tyr Tyr Val Ile Val Gly Ser
1 5 10 15
Val Arg Ala Tyr Ser Ser Pro Ala Ala Ser Pro Pro Ser Ser Cys Asn
20 25 30
Val Val Ala Gln Thr Glu Ser Gly Trp Val Cys Gly Arg Thr Arg Arg
35 40 45
Ala Glu Ala Ser Thr Leu Tyr Ala Ser Phe Arg Gly Val Pro Tyr Ala
50 55 60
Lys Gln Pro Val Gly Glu Leu Arg Phe Lys Glu Leu Gln Pro Ala Glu
65 70 75 80
Pro Trp Thr Asp Tyr Leu Asp Ala Thr Glu Glu Gly Pro Val Cys Tyr
85 90 95
Gln Thr Asp Val Leu Tyr Gly Ser Leu Met Lys Pro His Gly Met Asp
100 105 110
Glu Ala Cys Ile Tyr Ala Asn Ile His Val Pro Leu Asn Ala Leu Pro
115 120 125
Ala Ala Gly Glu Thr Pro Thr Lys Pro Gly Leu Pro Ile Leu Val Phe
130 135 140
Ile His Gly Gly Gly Phe Ala Phe Gly Ser Gly Asp Ala Asp Leu Tyr
145 150 155 160
Gly Pro Glu Tyr Leu Val Thr Arg Asn Val Val Val Ile Thr Phe Asn
165 170 175
Tyr Arg Leu Asn Phe Phe Gly Phe Phe Ser Leu Asp Thr Pro Lys Val
180 185 190
Pro Gly Asn Asn Gly Leu Arg Asp Met Val Thr Leu Leu Arg Trp Val
195 200 205
Lys Arg Asn Ala Arg Ala Phe Gly Gly Asn Pro Asp Asn Val Thr Leu
210 215 220
Ala Gly Gln Ser Ala Gly Ala Ala Ala Ala His Leu Leu Thr Leu Ser
225 230 235 240
Lys Ala Thr Glu Gly Leu Val Ser Arg Ala Ile Leu Met Ser Gly Ala
245 250 255
Gly Thr Ser Thr Phe Phe Thr Thr Ser Pro Ile Phe Ser Gln Ser Ile
260 265 270
Asn Lys Ile Leu Phe Ser Ile Leu Gly Val Asn Ser Thr Asn Pro Asp
275 280 285
Glu Ile His Glu Lys Leu Val Ala Met Pro Val Glu Lys Leu Asn Glu
290 295 300
Ala Asn Arg Ile Leu Ile Asp Gln Ile Gly Leu Thr Thr Phe Phe Pro
305 310 315 320
Val Val Glu Thr Pro His Pro Gly Ile Thr Thr Ile Leu Asp Glu Asp
325 330 335
Pro Asn Ile Leu Val Gln Gln Gly Arg Gly Lys Asp Ile Pro Leu Ile
340 345 350
Ile Gly Phe Thr Asn Ser Glu Cys His Met Phe Gln His Arg Phe Glu
355 360 365
Gln Ile Asp Ile Val Ser Lys Ile Asn Glu Asn Pro Ala Ile Leu Val
370 375 380
Pro Ser Asn Leu Leu Tyr Ser Ser Thr Pro Glu Thr Ile Ala Leu Val
385 390 395 400
Ser Asn Gln Ile Ser Gln Arg Tyr Phe Asn Gly Ser Val Asp Leu Glu
405 410 415
Gly Phe Ile Asn Met Cys Thr Asp Ser Tyr Tyr Lys Tyr Pro Ala Met
420 425 430
Lys Leu Ala Glu Lys Arg Ser Ala Ala Gly Asp Ala Pro Val Phe Leu
435 440 445
Tyr Gln Phe Ser Tyr Asp Gly Tyr Ser Val Phe Lys Gln Ala Phe His
450 455 460
Leu His Phe Asn Gly Ala Gly His Ala Asp Asp Leu Thr Tyr Val Leu
465 470 475 480
Lys Val Asn Ser Ala Ser Gly Thr Ser Ser Ser Gln Lys Ala Asp Asp
485 490 495
Glu Met Lys Tyr Trp Met Thr Thr Phe Val Thr Asn Phe Met Arg Cys
500 505 510
Ser Ala Pro Met Cys Asp Glu Thr Thr Ala Trp Pro Pro Val Thr Pro
515 520 525
Arg Glu Leu Gln Tyr Gln Asp Ile Ile Thr Pro Asn Leu Cys His Gln
530 535 540
Thr Ser Leu Thr Lys Glu Gln Leu Glu Met Lys Asn Phe Phe Asp Lys
545 550 555 560
Ile His Asn Gly Gly Glu Ser Arg Leu Lys
565 570
<210>28
<211>1713
<212>DNA
<213>家蚕
<400>28
atgtgtacca aattcgctgt attactatat tacgttatcg tgggctcagt aagggcatac 60
tcaagcccag cagcgtcgcc gccgtcgtcg tgcaatgtgg tcgcgcagac ggagtcaggc 120
tgggtgtgtg gccgtactcg ccgggcggaa gcaagcactt tatacgccag tttccgggga 180
gtgccttatg ccaagcaacc agtcggagaa cttcgattta aggaattaca accagcagag 240
ccatggaccg actacctaga tgccaccgag gaaggtccag tttgctacca gacagacgtt 300
ctttatggaa gtctaatgaa acctcacggc atggatgagg catgcatcta cgccaatata 360
catgtgcctt tgaacgccct gccggcagct ggtgagacgc ctacgaagcc tggtcttcca 420
atattagtct ttattcacgg aggtggcttc gcgtttggat ctggtgatgc tgacctatat 480
ggaccggagt atcttgtcac aagaaacgtt gttgtcatca cttttaacta caggttgaat 540
ttctttggat ttttctcatt ggatactcct aaagtccccg gaaacaatgg tcttagggac 600
atggtgactc tgctccgttg ggtgaagagg aacgccagag cctttggagg taatcctgac 660
aacgtgacct tggcgggcca gagcgctggg gccgctgctg ctcaccttct caccttatcc 720
aaggccactg aaggcttagt ttcaagggct atattgatga gcggtgctgg aacatccact 780
ttctttacaa catctcctat cttctcccag tccatcaaca aaatcttgtt ttccatcctt 840
ggcgtcaact ccactaatcc tgatgagata cacgagaaac tcgtcgccat gccggttgaa 900
aaactgaatg aagccaacag aatattgatt gatcaaatcg gccttaccac ttttttccca 960
gtggtagaaa caccgcatcc cggaattacc actatattag atgaagatcc aaatattctg 1020
gtccagcaag gccgcggtaa agacataccc ttgattatag gtttcacgaa ttctgaatgc 1080
catatgttcc agcatagatt tgaacagatc gatatagtat ctaagatcaa tgaaaatcca 1140
gcaatcttag ttccttccaa tctactgtac tcctcgactc ctgagacgat tgctttggtc 1200
tcgaatcaaa tcagccaaag atacttcaat ggtagcgtag atctggaggg ctttatcaat 1260
atgtgtaccg atagttacta caagtaccca gccatgaagt tggccgagaa gagatctgcg 1320
gcaggtgatg ctccggtatt tctgtaccag ttctcttacg acggttacag cgtgttcaag 1380
caagcctttc atttgcattt caacggtgct ggacacgcgg acgacttgac atacgtgctg 1440
aaagtgaatt ctgcgtcagg gactagttca tcacaaaaag cagacgatga aatgaaatat 1500
tggatgacga cgttcgtcac aaactttatg cgatgcagtg ctcctatgtg cgatgaaact 1560
acagcgtggc caccagttac accgcgggaa ctacaatacc aagacattat tacaccaaac 1620
ttatgccacc aaactagtct taccaaagaa caactcgaaa tgaagaattt cttcgataag 1680
atccataatg gaggtgaaag cagacttaag taa 1713
<210>29
<211>360
<212>PRT
<213>拟南芥(Arabidopsis thaliana)
<400>29
Met Trp Thr Ser Lys Thr Ile Ser Phe Thr Leu Phe Ile Thr Thr Thr
1 5 10 15
Leu Leu Gly Ser Cys Asn Ala Ser Ala Lys Ala Lys Thr Gln Pro Leu
20 25 30
Phe Pro Ala Ile Leu Ile Phe Gly Asp Ser Thr Val Asp Thr Gly Asn
35 40 45
Asn Asn Tyr Pro Ser Gln Thr Ile Phe Arg Ala Lys His Val Pro Tyr
50 55 60
Gly Ile Asp Leu Pro Asn His Ser Pro Asn Gly Arg Phe Ser Asn Gly
65 70 75 80
Lys Ile Phe Ser Asp Ile Ile Ala Thr Lys Leu Asn Ile Lys Gln Phe
85 90 95
Val Pro Pro Phe Leu Gln Pro Asn Leu Thr Asp Gln Glu Ile Val Thr
100 105 110
Gly Val Cys Phe Ala Ser Ala Gly Ala Gly Tyr Asp Asp Gln Thr Ser
115 120 125
Leu Thr Thr Gln Ala Ile Arg Val Ser Glu Gln Pro Asn Met Phe Lys
130 135 140
Ser Tyr Ile Ala Arg Leu Lys Ser Ile Val Gly Asp Lys Lys Ala Met
145 150 155 160
Lys Ile Ile Asn Asn Ala Leu Val Val Val Ser Ala Gly Pro Asn Asp
165 170 175
Phe Ile Leu Asn Tyr Tyr Glu Val Pro Ser Trp Arg Arg Met Tyr Pro
180 185 190
Ser Ile Ser Asp Tyr Gln Asp Phe Val Leu Ser Arg Leu Asn Asn Phe
195 200 205
Val Lys Glu Leu Tyr Ser Leu Gly Cys Arg Lys Ile Leu Val Gly Gly
210 215 220
Leu Pro Pro Met Gly Cys Leu Pro Ile Gln Met Thr Ala Gln Phe Arg
225 230 235 240
Asn Val Leu Arg Phe Cys Leu Glu Gln Glu Asn Arg Asp Ser Val Leu
245 250 255
Tyr Asn Gln Lys Leu Gln Lys Leu Leu Pro Gln Thr Gln Ala Ser Leu
260 265 270
Thr Gly Ser Lys Ile Leu Tyr Ser Asp Val Tyr Asp Pro Met Met Glu
275 280 285
Met Leu Gln Asn Pro Ser Lys Tyr Gly Phe Lys Glu Thr Thr Arg Gly
290 295 300
Cys Cys Gly Thr Gly Phe Leu Glu Thr Ser Phe Met Cys Asn Ala Tyr
305 310 315 320
Ser Ser Met Cys Gln Asn Arg Ser Glu Phe Leu Phe Phe Asp Ser Ile
325 330 335
His Pro Ser Glu Ala Thr Tyr Asn Tyr Ile Gly Asn Val Leu Asp Thr
340 345 350
Lys Ile Arg Gly Trp Leu Lys Ala
355 360
<210>30
<211>1083
<212>DNA
<213>拟南芥
<400>30
atgtggactt ctaaaaccat aagcttcact ctcttcatca caacaacact tctcgggtcc 60
tgcaacgcat ctgcaaaggc caaaacgcaa ccgctattcc cagcgattct aatctttggt 120
gattcaacag tcgacacagg caacaataat tacccttcac aaacaatctt cagagctaaa 180
catgttcctt acggaattga tctcccaaac cactcaccta acggaagatt ctcaaacggg 240
aaaattttct ccgacataat cgcaaccaaa ctcaacatca aacagtttgt tcctcctttc 300
ttacaaccaa atctcaccga ccaagaaatt gtaaccggag tctgtttcgc atcagcaggt 360
gccggttacg atgaccaaac cagtctcacg acacaagcga ttcgtgtctc ggaacaacca 420
aatatgttca agagttacat tgctcgtctt aagagtatcg taggagacaa gaaagccatg 480
aagatcataa acaatgcttt ggtggttgtg agtgcagggc ctaatgattt catcttgaat 540
tattacgagg ttccctcatg gcgtcgcatg tatcctagca tttctgatta ccaagatttt 600
gttcttagta ggcttaacaa tttcgtgaag gagctttaca gcctaggttg ccggaaaatt 660
ttggtcggag gtttaccgcc aatgggatgt ttaccgattc aaatgactgc tcaattccgc 720
aacgtcctaa ggttttgctt ggaacaagag aacagagact ctgttttata caatcagaaa 780
cttcagaagc tcttacctca gacacaagca tctcttacag gaagcaagat cctttactct 840
gatgtctatg accctatgat ggagatgctc caaaacccta gcaaatacgg atttaaagag 900
acgacgagag gatgttgtgg aacagggttc ttggagacga gcttcatgtg taatgcttat 960
tcttccatgt gtcagaatcg ctcggagttt ctgttctttg actcgattca tccatctgaa 1020
gctacctaca attacattgg taatgttctg gatactaaga ttcgtgggtg gcttaaggct 1080
taa 1083
<210>31
<211>300
<212>PRT
<213>苹果(Malus pumila)
<400>31
Met Glu Pro Ile Asn Asp Glu Ile Ala Arg Glu Phe Arg Phe Phe Arg
1 5 10 15
Val Tyr Lys Asp Gly Arg Ile Glu Ile Phe Tyr Lys Thr Gln Lys Val
20 25 30
Pro Pro Ser Thr Asp Glu Ile Thr Gly Val Gln Ser Lys Asp Ile Thr
35 40 45
Ile Gln Pro Glu Pro Ala Val Ser Ala Arg Ile Phe Leu Pro Lys Ile
50 55 60
His Glu Pro Ala Gln Lys Leu Pro Val Leu Leu Tyr Leu His Gly Gly
65 70 75 80
Gly Phe Ile Phe Glu Ser Ala Phe Ser Pro Ile Tyr His Asn Phe Val
85 90 95
Gly Arg Leu Ala Ala Glu Ala His Ala Val Val Val Ser Val Glu Tyr
100 105 110
Gly Leu Phe Pro Asp Arg Pro Val Pro Ala Cys Tyr Glu Asp Ser Trp
115 120 125
Ala Ala Leu Lys Trp Leu Ala Ser His Ala Ser Gly Asp Gly Thr Glu
130 135 140
Ser Trp Leu Asn Lys Tyr Ala Asp Phe Asp Arg Leu Phe Ile Gly Gly
145 150 155 160
Asp Ser Gly Gly Ala Asn Leu Ser His Tyr Leu Ala Val Arg Val Gly
165 170 175
Ser Leu Gly Gln Pro Asp Leu Lys Ile Gly Gly Val Val Leu Val His
180 185 190
Pro Phe Phe Gly Gly Leu Glu Glu Asp Asp Gln Met Phe Leu Tyr Met
195 200 205
Cys Thr Glu Asn Gly Gly Leu Glu Asp Arg Arg Leu Arg Pro Pro Pro
210 215 220
Glu Asp Phe Lys Arg Leu Ala Cys Gly Lys Met Leu Ile Phe Phe Ala
225 230 235 240
Ala Gly Asp His Leu Arg Gly Ala Gly Gln Leu Tyr Tyr Glu Asp Leu
245 250 255
Lys Lys Ser Glu Trp Gly Gly Ser Val Asp Val Val Glu His Gly Glu
260 265 270
Gly His Val Phe His Leu Phe Asn Ser Asp Cys Glu Asn Ala Ala Asp
275 280 285
Leu Val Lys Lys Phe Gly Ser Phe Ile Asn Gln Lys
290 295 300
<210>32
<211>903
<212DNA
<213>苹果
<400>32
atggagccaa tcaacgacga gattgctcgt gaatttcgct tcttccgggt gtacaaagac 60
ggtcgcatag aaatattcta caagacacaa aaggtccccc cttcgactga cgaaatcact 120
ggtgtccaat ccaaggacat cacaattcaa cccgaacccg ccgtttctgc ccgtatcttc 180
cttcccaaga tccacgagcc ggcccaaaag ctccccgttc tcctctacct ccacggcggt 240
gggtttatct tcgagtctgc cttctctcct atttatcaca acttcgtcgg acgattggca 300
gctgaagccc acgcagtcgt agtgtccgtc gaatacgggt tgttcccgga tcgccccgta 360
cccgcttgct atgaagactc atgggcggcg ctcaaatggc tcgcgtccca cgctagtggg 420
gatggaaccg agtcgtggtt aaacaagtat gctgactttg accggttgtt tataggcggg 480
gacagcggtg gagcaaattt gtcgcactat ttggctgtcc gggtcgggtc cctcgggcaa 540
ccggatttga agattggtgg agttgtgctg gtgcatccgt tctttggggg cttggaggag 600
gacgaccaaa tgtttctgta catgtgtacg gagaacggtg ggttggagga tcgtaggctg 660
aggccgcccc cagaggattt caaaaggcta gcttgcggga agatgttgat atttttcgcg 720
gcgggagacc atctgagagg ggcgggccag ctgtactatg aggacctgaa aaagagtgag 780
tggggcggga gtgtcgacgt ggtggagcat ggtgaaggac atgtgtttca cttgttcaat 840
tcggactgtg agaatgctgc ggacttggtg aaaaaatttg gatccttcat caaccaaaag 900
tag 903
<210>33
<211>35
<212>DNA
<213>人工序列
<220>
<223>引物
<400>33
cgtctagaaa gagaatgatg aaaattgttc cgccg 35
<210>34
<211>46
<212>DNA
<213>人工序列
<220>
<223>引物
<400>34
cggttaactt aatggtgatg gtgatggtgc caatctaacg attcaa 46
<210>35
<211>762
<212>DNA
<213>人工序列
<220>
<223>CarE-his
<400>35
atgatgaaaa ttgttccgcc gaagccgttt ttctttgaag ccggggagcg ggcggtgctg 60
cttttgcatg ggtttaccgg caattccgcc gacgttcgga tgcttgggcg attcttggaa 120
tcgaaagggt atacgtgcca cgctccgatt tacaaagggc atggcgtgcc gccggaagag 180
ctcgtccaca ccggaccgga tgattggtgg caagacgtca tgaacggcta tcagtttttg 240
aaaaacaaag gctacgaaaa aattgccgtg gctggattgt cgcttggagg cgtattttct 300
ctcaaattag gctacactgt acctacacaa ggcattgtga cgatgtgcgc gccgatgtac 360
atcaaaagcg aagaaacgat gtacgaaggt gtgctcgagt atgcgcgcga gtataaaaag 420
cgggaaggga aatcagagga acaaatcgaa caggaaatgg aacggttcaa acaaacgccg 480
atgaagacgt tgaaagcctt gcaagaactc attgccgatg tgcgcgccca ccttgatttg 540
gtttatgcac cgacgttcgt cgtccaagcg cgccatgatg agatgatcaa tccagacagc 600
gcgaacatca tttataacga aattgaatcg ccggtcaaac aaatcaaatg gtatgagcaa 660
tcaggccatg tgattacgct tgatcaagaa aaagatcagc tgcatgaaga tatttatgca 720
tttcttgaat cgttagattg gcaccatcac catcaccatt ga 762
<210>36
<211>44
<212>DNA
<213>人工序列
<220>
<223>引物
<400>36
cgctcgagaa aagagaggct gaagctatga tgaaaattgt tccg 44
<210>37
<211>29
<212>DNA
<213>人工序列
<220>
<223>引物
<400>37
cgcgcgcata ttcgaggacg ccttcgtac 29
<210>38
<211>29
<212>DNA
<213>人工序列
<220>
<223>引物
<400>38
cgtcctcgaa tatgcgcgcg agtataaaa 29
<210>39
<211>47
<212>DNA
<213>人工序列
<220>
<223>引物
<400>39
cggaattctt aatggtgatg gtgatggtgc caatctaacg attcaag 47
<210>40
<211>762
<212>DNA
<213>人工序列
<220>
<223>突变的CarE-his
<400>40
atgatgaaaa ttgttccgcc gaagccgttt ttctttgaag ccggggagcg ggcggtgctg 60
cttttgcatg ggtttaccgg caattccgcc gacgttcgga tgcttgggcg attcttggaa 120
tcgaaagggt atacgtgcca cgctccgatt tacaaagggc atggcgtgcc gccggaagag 180
ctcgtccaca ccggaccgga tgattggtgg caagacgtca tgaacggcta tcagtttttg 240
aaaaacaaag gctacgaaaa aattgccgtg gctggattgt cgcttggagg cgtattttct 300
ctcaaattag gctacactgt acctacacaa ggcattgtga cgatgtgcgc gccgatgtac 360
atcaaaagcg aagaaacgat gtacgaaggc gtcctcgaat atgcgcgcga gtataaaaag 420
cgggaaggga aatcagagga acaaatcgaa caggaaatgg aacggttcaa acaaacgccg 480
atgaagacgt tgaaagcctt gcaagaactc attgccgatg tgcgcgccca ccttgatttg 540
gtttatgcac cgacgttcgt cgtccaagcg cgccatgatg agatgatcaa tccagacagc 600
gcgaacatca tttataacga aattgaatcg ccggtcaaac aaatcaaatg gtatgagcaa 660
tcaggccatg tgattacgct tgatcaagaa aaagatcagc tgcatgaaga tatttatgca 720
tttcttgaat cgttagattg gcaccatcac catcaccatt aa 762
Claims (10)
1.一种用于产生蛋白质的方法,所述方法包括:
将经改造而表达编码来自微生物的羧酸酯酶或其变体之基因的真核细胞在适于表达该羧酸酯酶或其变体的条件下进行培养。
2.权利要求1的方法,其中所述真核细胞是酵母细胞。
3.一种用于产生蛋白质的方法,所述方法包括:
将经改造而表达编码羧酸酯酶或其变体之基因的真核细胞在适于表达该羧酸酯酶或其变体的条件下进行培养,其中所述真核细胞是丝状真菌细胞。
4.权利要求3的方法,其中所述丝状真菌细胞来自曲霉(Aspergillus)属。
5.一种表达载体,其包含:
编码微生物羧酸酯酶或其变体的基因;
和
能促进该微生物羧酸酯酶或其变体在真核细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
6.一种表达载体,其包含:
编码羧酸酯酶或其变体的基因;
和
能促进该羧酸酯酶或其变体在丝状真菌细胞中表达的调节序列,其中所述调节序列与该基因可操纵地连接。
7.一种包含权利要求5和6中任一项的表达载体的真核细胞。
8.一种组合物,其包含:
真核细胞;和
该真核细胞所表达的微生物羧酸酯酶或其变体。
9.一种组合物,其包含:
丝状真菌细胞;和
该丝状真菌细胞所表达的微生物羧酸酯酶或其变体。
10.一种将含有羧酸酯基团的化合物转化成无羧酸酯基团的化合物的方法,包括:
将该化合物与通过权利要求1和3中任一项的方法所产生的羧酸酯酶或其变体一起孵育。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2009101668391A CN101993861A (zh) | 2009-08-31 | 2009-08-31 | 羧酸酯酶的重组表达 |
| US13/127,652 US20110212504A1 (en) | 2009-08-31 | 2010-06-02 | Recombinant expression of carboxylesterases |
| PCT/CN2010/073449 WO2011023009A1 (en) | 2009-08-31 | 2010-06-02 | Recombinant expression of carboxylesterases |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2009101668391A CN101993861A (zh) | 2009-08-31 | 2009-08-31 | 羧酸酯酶的重组表达 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN101993861A true CN101993861A (zh) | 2011-03-30 |
Family
ID=43627215
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2009101668391A Pending CN101993861A (zh) | 2009-08-31 | 2009-08-31 | 羧酸酯酶的重组表达 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20110212504A1 (zh) |
| CN (1) | CN101993861A (zh) |
| WO (1) | WO2011023009A1 (zh) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102559718A (zh) * | 2011-12-08 | 2012-07-11 | 上海交通大学 | 嗜热羧酸酯酶基因工程菌的构建及其酶的应用 |
| CN104031898A (zh) * | 2013-03-04 | 2014-09-10 | 中国农业大学 | 一种真菌羧酸酯酶及其编码基因与应用 |
| CN113444705A (zh) * | 2021-06-29 | 2021-09-28 | 武汉华美生物工程有限公司 | 一种高纯度高活性的Ces1C蛋白及其表达载体与应用 |
| CN114410665A (zh) * | 2021-12-27 | 2022-04-29 | 安徽农业大学 | 高效催化没食子酸甲酯生物合成的基因及其应用 |
| CN114717216A (zh) * | 2022-03-31 | 2022-07-08 | 石河子大学 | 一种棉酚降解酶CCE001a基因、氨基酸序列及其应用 |
| CN114774387A (zh) * | 2022-03-22 | 2022-07-22 | 上海威高医疗技术发展有限公司 | 嗜热羧酸酯酶突变体及其应用 |
| CN119372177A (zh) * | 2024-12-23 | 2025-01-28 | 安徽农业大学 | 一种茚虫威降解酶及其编码基因、生产菌和应用 |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10059969B1 (en) | 2014-10-03 | 2018-08-28 | Abbvie Inc. | Process for the preparation of (S)-2-amino-non-8-enoic acid |
| CN110461812A (zh) | 2017-02-01 | 2019-11-15 | 艾伯维公司 | 用于制备(±)-2-(二氟甲基)-1-(烷氧基羰基)-环丙烷甲酸和(±)-2-(乙烯基)-1-(烷氧基羰基)-环丙烷甲酸的酶促方法 |
| MX2020006222A (es) * | 2017-12-13 | 2020-08-31 | Codexis Inc | Polipeptidos de carboxiesterasa para acoplamiento de amidas. |
| CN109943515B (zh) * | 2019-04-30 | 2021-07-27 | 江南大学 | 一种产羧酸酯酶的重组菌及其应用 |
| US20210379164A1 (en) * | 2020-06-04 | 2021-12-09 | Ecto Development Corporation | Enzymes and methods for preventing and treating pyrethroid exposure in animals |
| EP4149277A4 (en) * | 2021-05-21 | 2024-11-20 | Cambridge Crops, Inc. d/b/a Mori | SYSTEMS AND METHODS FOR MANUFACTURING SILK FIBROIN SOLUTION AND POWDERS CONTAINING SILK FIBROIN |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1246152A (zh) * | 1996-09-30 | 2000-03-01 | 金克克国际有限公司 | 酯酶及其编码dna和掺入此dna的载体和宿主细胞 |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| BRPI0820615B1 (pt) * | 2007-12-06 | 2020-05-12 | Novozymes A/S | Célula microbiana hospedeira recombinante, métodos para produzir um polipeptídeo tendo atividade de acetil-xilano esterase, método para produzir uma proteína e método de degradar um material contendo xilano |
-
2009
- 2009-08-31 CN CN2009101668391A patent/CN101993861A/zh active Pending
-
2010
- 2010-06-02 WO PCT/CN2010/073449 patent/WO2011023009A1/en not_active Ceased
- 2010-06-02 US US13/127,652 patent/US20110212504A1/en not_active Abandoned
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1246152A (zh) * | 1996-09-30 | 2000-03-01 | 金克克国际有限公司 | 酯酶及其编码dna和掺入此dna的载体和宿主细胞 |
Non-Patent Citations (2)
| Title |
|---|
| HOSAM E. EWIS等: "Molecular cloing and characterization of two thermostable carboxyl esterases from Geobacillus stearothermophilus", 《GENE》 * |
| JAENTGES,U.K.等: "ACA01540", 《GENBANK》 * |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102559718A (zh) * | 2011-12-08 | 2012-07-11 | 上海交通大学 | 嗜热羧酸酯酶基因工程菌的构建及其酶的应用 |
| CN102559718B (zh) * | 2011-12-08 | 2013-09-11 | 上海交通大学 | 嗜热羧酸酯酶基因工程菌的构建及其酶的应用 |
| CN104031898A (zh) * | 2013-03-04 | 2014-09-10 | 中国农业大学 | 一种真菌羧酸酯酶及其编码基因与应用 |
| CN104031898B (zh) * | 2013-03-04 | 2016-04-27 | 中国农业大学 | 一种真菌羧酸酯酶及其编码基因与应用 |
| CN113444705A (zh) * | 2021-06-29 | 2021-09-28 | 武汉华美生物工程有限公司 | 一种高纯度高活性的Ces1C蛋白及其表达载体与应用 |
| CN114410665A (zh) * | 2021-12-27 | 2022-04-29 | 安徽农业大学 | 高效催化没食子酸甲酯生物合成的基因及其应用 |
| CN114410665B (zh) * | 2021-12-27 | 2024-01-16 | 安徽农业大学 | 高效催化没食子酸甲酯生物合成的基因及其应用 |
| CN114774387A (zh) * | 2022-03-22 | 2022-07-22 | 上海威高医疗技术发展有限公司 | 嗜热羧酸酯酶突变体及其应用 |
| CN114717216A (zh) * | 2022-03-31 | 2022-07-08 | 石河子大学 | 一种棉酚降解酶CCE001a基因、氨基酸序列及其应用 |
| CN119372177A (zh) * | 2024-12-23 | 2025-01-28 | 安徽农业大学 | 一种茚虫威降解酶及其编码基因、生产菌和应用 |
| CN119372177B (zh) * | 2024-12-23 | 2025-04-29 | 安徽农业大学 | 一种茚虫威降解酶及其编码基因、生产菌和应用 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20110212504A1 (en) | 2011-09-01 |
| WO2011023009A1 (en) | 2011-03-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101993861A (zh) | 羧酸酯酶的重组表达 | |
| CN111378585B (zh) | 用于表达外源基因的毕赤酵母突变株 | |
| KR102789173B1 (ko) | 메틸영양성 효모를 유전적으로 조작하는 발현 구축물 및 방법 | |
| JP2009543560A (ja) | 真菌細胞中の遺伝子発現を上昇させる方法 | |
| US6316220B1 (en) | DNA encoding a transcription factor for an α-amylase promoter in filamentous fungi | |
| Jiang et al. | Cell surface display of functionally active lipases from Yarrowia lipolytica in Pichia pastoris | |
| KR20210013585A (ko) | 돌연변이체 리파아제 및 이의 용도 | |
| Lee | Production of heterologous microbial lipases by yeasts | |
| EP2220110A1 (en) | Fungal pepc inhibitor | |
| KR102181315B1 (ko) | 활성 개선된 리조무코르 미에헤이 유래 변이 리파제 및 효모를 이용한 재조합 생산 방법 | |
| WO2004055177A2 (en) | Artificial esterases | |
| KR102170228B1 (ko) | 에스터전이반응 효율이 향상된 변이 리파제 및 이를 이용한 바이오디젤 전환 기술 | |
| CN102260331A (zh) | 毕赤酵母壁蛋白Gcw34及其表面展示系统和构建方法 | |
| CN102276707A (zh) | 毕赤酵母壁蛋白Gcw19及其表面展示系统和构建方法 | |
| CN115433725A (zh) | 甘油单-二酰酯脂肪酶突变体及其应用 | |
| Martínez-Corona et al. | Lipase activity of recombinant KmYJR107Wp and KmLIP3p enzymes expressed in Saccharomyces cerevisiae BY4742 from Kluyveromyces marxianus L2029 | |
| US20230250406A1 (en) | Mutant lipase and use thereof | |
| US8940505B2 (en) | Production of heterologous extracellular polypeptides in Yarrowia lipolytica | |
| Büyük et al. | A Preliminary Study on Purification and Characterization of Lipase (s) Produced by Cryptococcus Diffluens D44 | |
| CN119410606B (zh) | 一种脂肪酶突变体及其制备方法和应用 | |
| WO2009144245A1 (en) | Method for increasing growth rate | |
| CN102260336A (zh) | 毕赤酵母壁蛋白Gcw5及其表面展示系统和构建方法 | |
| CN110114455A (zh) | 真菌中蛋白质的选择性分泌技术 | |
| EP4569101A1 (en) | Synthetic formolase pathway | |
| WO2025140482A1 (zh) | 可改善目标多肽表达效果的多核苷酸构建体及其应用 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20110330 |