JP2003000240A - DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzae - Google Patents
DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzaeInfo
- Publication number
- JP2003000240A JP2003000240A JP2001190356A JP2001190356A JP2003000240A JP 2003000240 A JP2003000240 A JP 2003000240A JP 2001190356 A JP2001190356 A JP 2001190356A JP 2001190356 A JP2001190356 A JP 2001190356A JP 2003000240 A JP2003000240 A JP 2003000240A
- Authority
- JP
- Japan
- Prior art keywords
- ser
- ala
- pro
- glu
- gln
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 105
- 239000007787 solid Substances 0.000 title claims abstract description 100
- 240000006439 Aspergillus oryzae Species 0.000 title claims description 56
- 235000002247 Aspergillus oryzae Nutrition 0.000 title claims description 51
- 108020004414 DNA Proteins 0.000 title claims description 12
- 108091023040 Transcription factor Proteins 0.000 title abstract description 22
- 102000040945 Transcription factor Human genes 0.000 title abstract description 22
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 38
- 238000004519 manufacturing process Methods 0.000 claims abstract description 18
- 241000588724 Escherichia coli Species 0.000 claims description 17
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 10
- 238000013518 transcription Methods 0.000 claims description 10
- 230000035897 transcription Effects 0.000 claims description 10
- 239000013598 vector Substances 0.000 claims description 8
- 238000000034 method Methods 0.000 claims description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 2
- 229940023064 escherichia coli Drugs 0.000 claims 3
- 108091026890 Coding region Proteins 0.000 claims 1
- 108090000790 Enzymes Proteins 0.000 abstract description 23
- 102000004190 Enzymes Human genes 0.000 abstract description 21
- 230000000694 effects Effects 0.000 abstract description 17
- 238000009630 liquid culture Methods 0.000 abstract description 16
- 241000228212 Aspergillus Species 0.000 abstract description 9
- 239000002773 nucleotide Substances 0.000 abstract description 7
- 125000003729 nucleotide group Chemical group 0.000 abstract description 7
- 238000010367 cloning Methods 0.000 abstract description 3
- 230000014509 gene expression Effects 0.000 description 50
- 101150051757 glaB gene Proteins 0.000 description 29
- 229940088598 enzyme Drugs 0.000 description 18
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 17
- 230000001105 regulatory effect Effects 0.000 description 17
- 230000002103 transcriptional effect Effects 0.000 description 15
- 238000003556 assay Methods 0.000 description 14
- 238000004458 analytical method Methods 0.000 description 13
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 12
- 101710185494 Zinc finger protein Proteins 0.000 description 12
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 12
- 102000053187 Glucuronidase Human genes 0.000 description 11
- 108010060309 Glucuronidase Proteins 0.000 description 11
- 101150108358 GLAA gene Proteins 0.000 description 10
- 102100022624 Glucoamylase Human genes 0.000 description 10
- OBZKMHQCWJWLEJ-GWPKAZDLSA-L [H+].[H+].[Zn++].N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O Chemical compound [H+].[H+].[Zn++].N[C@@H](C[S-])C(O)=O.N[C@@H](C[S-])C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O.N[C@@H](Cc1c[n-]cn1)C(O)=O OBZKMHQCWJWLEJ-GWPKAZDLSA-L 0.000 description 10
- 239000000499 gel Substances 0.000 description 10
- 238000011144 upstream manufacturing Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 9
- 230000007613 environmental effect Effects 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 9
- 230000001771 impaired effect Effects 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 230000035882 stress Effects 0.000 description 8
- 101100520819 Arabidopsis thaliana At5g66631 gene Proteins 0.000 description 7
- 101100184274 Candida albicans (strain SC5314 / ATCC MYA-2876) MNL1 gene Proteins 0.000 description 7
- 101100078102 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MSN2 gene Proteins 0.000 description 7
- 108091081024 Start codon Proteins 0.000 description 7
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 7
- 210000004027 cell Anatomy 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 239000011701 zinc Substances 0.000 description 7
- 229910052725 zinc Inorganic materials 0.000 description 7
- 108010011619 6-Phytase Proteins 0.000 description 6
- 108091005508 Acid proteases Proteins 0.000 description 6
- 241000209094 Oryza Species 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 229940085127 phytase Drugs 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 102000005572 Cathepsin A Human genes 0.000 description 5
- 108010059081 Cathepsin A Proteins 0.000 description 5
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 5
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- 235000007164 Oryza sativa Nutrition 0.000 description 5
- 102000004139 alpha-Amylases Human genes 0.000 description 5
- 108090000637 alpha-Amylases Proteins 0.000 description 5
- 229940024171 alpha-amylase Drugs 0.000 description 5
- 108010005400 cutinase Proteins 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 235000009566 rice Nutrition 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- 235000015099 wheat brans Nutrition 0.000 description 5
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 4
- 108091060211 Expressed sequence tag Proteins 0.000 description 4
- 241000223218 Fusarium Species 0.000 description 4
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 238000012258 culturing Methods 0.000 description 4
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000004448 titration Methods 0.000 description 4
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 3
- 108020004635 Complementary DNA Proteins 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 229920002472 Starch Polymers 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 101150095344 niaD gene Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 102000037983 regulatory factors Human genes 0.000 description 3
- 108091008025 regulatory factors Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000003938 response to stress Effects 0.000 description 3
- 239000008107 starch Substances 0.000 description 3
- 235000019698 starch Nutrition 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- 239000004382 Amylase Substances 0.000 description 2
- 102000013142 Amylases Human genes 0.000 description 2
- 108010065511 Amylases Proteins 0.000 description 2
- 241000219194 Arabidopsis Species 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 2
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- 244000294411 Mirabilis expansa Species 0.000 description 2
- 235000015429 Mirabilis expansa Nutrition 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 2
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 235000019418 amylase Nutrition 0.000 description 2
- 101150008194 argB gene Proteins 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 101150066555 lacZ gene Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 101150039489 lysZ gene Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 235000013536 miso Nutrition 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000008844 regulatory mechanism Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 235000019992 sake Nutrition 0.000 description 2
- 235000013555 soy sauce Nutrition 0.000 description 2
- 101150101900 uidA gene Proteins 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- BXLDDWZOTGGNOJ-SZMVWBNQSA-N Arg-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N BXLDDWZOTGGNOJ-SZMVWBNQSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- ODBSSLHUFPJRED-CIUDSAMLSA-N Asn-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ODBSSLHUFPJRED-CIUDSAMLSA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000131386 Aspergillus sojae Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- CIBLYQCAZRYWHY-UHFFFAOYSA-N Cys-Leu-Phe-Cys Chemical compound SCC(N)C(=O)NC(CC(C)C)C(=O)NC(C(=O)NC(CS)C(O)=O)CC1=CC=CC=C1 CIBLYQCAZRYWHY-UHFFFAOYSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- PEZINYWZBQNTIX-NAKRPEOUSA-N Cys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N PEZINYWZBQNTIX-NAKRPEOUSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 101100321669 Fagopyrum esculentum FA02 gene Proteins 0.000 description 1
- 101150014526 Gla gene Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- MIHTTYXBXIRRGV-AVGNSLFASA-N His-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MIHTTYXBXIRRGV-AVGNSLFASA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- 102000017286 Histone H2A Human genes 0.000 description 1
- 108050005231 Histone H2A Proteins 0.000 description 1
- 101000578940 Homo sapiens PDZ domain-containing protein MAGIX Proteins 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- YBHQCJILTOVLHD-YVMONPNESA-N Mirin Chemical compound S1C(N)=NC(=O)\C1=C\C1=CC=C(O)C=C1 YBHQCJILTOVLHD-YVMONPNESA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 229910002651 NO3 Inorganic materials 0.000 description 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 1
- 102100028326 PDZ domain-containing protein MAGIX Human genes 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101001062854 Rattus norvegicus Fatty acid-binding protein 5 Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 241000235343 Saccharomycetales Species 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- 108010074506 Transfer Factor Proteins 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 1
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 235000013334 alcoholic beverage Nutrition 0.000 description 1
- 239000003392 amylase inhibitor Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 239000012148 binding buffer Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 239000002537 cosmetic Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 230000009429 distress Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Landscapes
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
(57)【要約】
【解決手段】 麹菌の固体培養発現遺伝子を制御する転
写因子(sgbR1、sgbR2、sgbR3)をコー
ドする遺伝子のクローニングが行われ、その塩基配列も
決定された。
【効果】 固体培養での異種タンパク質の効率的生産が
可能となるだけでなく、麹菌の液体培養及び固体培養で
の酵素生産の包括的改善を行うことが可能となる。(57) Abstract: Cloning of a gene encoding a transcription factor (sgbR1, sgbR2, sgbR3) controlling a gene expressed in a solid culture of Aspergillus was performed, and its nucleotide sequence was also determined. [Effect] Not only the efficient production of a heterologous protein in solid culture can be achieved, but also the enzyme production in liquid culture and solid culture of Aspergillus can be comprehensively improved.
Description
【0001】[0001]
【発明の属する技術分野】本発明は麹菌の固体培養応答
性cis因子に結合し該因子下流の遺伝子の転写を制御
するタンパク質、該タンパク質をコードする遺伝子、該
遺伝子を含有する組み換えベクター、該組み換えベクタ
ーを含有する形質転換体、該遺伝子を含有する組み換え
麹菌、該形質転換体を用いる転写因子タンパク質の製造
方法、タンパク質を麹菌体内で発現させ、固体培養発現
遺伝子群の生産を制御する方法に関するものである。TECHNICAL FIELD The present invention relates to a protein that binds to a solid culture responsive cis factor of Aspergillus oryzae and controls transcription of a gene downstream of the factor, a gene encoding the protein, a recombinant vector containing the gene, and the recombinant vector. A transformant containing a vector, a recombinant koji mold containing the gene, a method for producing a transcription factor protein using the transformant, a method for expressing a protein in the koji mold and controlling the production of a solid culture-expressed gene group Is.
【0002】[0002]
【従来の技術】麹菌を含む全生物の遺伝子の転写は、R
NAポリメラーゼにより行われる。RNAポリメラーゼ
は二本鎖DNAを鋳型に、プライマー非依存的にリボヌ
クレオシドリン酸を3′方向にむかって重合する。真核
生物のRNAポリメラーゼには3種類の酵素が存在し、
I型はrRNA、II型はmRNA、III型はtRNAの
合成を司る。真核生物のRNAポリメラーゼはそれ自身
では正しい位置からの特異的転写を行うことが出来ず、
基本転写因子群を必要としており、転写の基本水準維持
に関与するプロモーターと呼ばれる領域に結合する。し
かし、RNAポリメラーゼによるRNAの合成量は、様
々な培養条件すなわち細胞の増殖や外部環境の変化に応
じて変動するのが一般的である。この外部環境の変化へ
の応答は、遺伝子制御領域のエンハンサーとよばれる領
域に結合することによってRNAポリメラーゼの転写開
始を正または負に制御する転写制御因子によって担われ
ている。2. Description of the Related Art The transcription of genes of all organisms including Aspergillus oryzae is
It is performed by NA polymerase. RNA polymerase uses double-stranded DNA as a template to polymerize ribonucleoside phosphate toward the 3'direction in a primer-independent manner. There are three types of enzymes in eukaryotic RNA polymerase,
Type I controls the synthesis of rRNA, type II controls the synthesis of mRNA, and type III controls the synthesis of tRNA. Eukaryotic RNA polymerases, by themselves, cannot direct specific transcription from the correct position,
It requires a group of basic transcription factors and binds to a region called a promoter involved in maintaining the basic level of transcription. However, the amount of RNA synthesized by RNA polymerase generally fluctuates according to various culture conditions, that is, cell growth and changes in the external environment. The response to this change in the external environment is carried by a transcriptional regulatory factor that positively or negatively controls the initiation of transcription of RNA polymerase by binding to a region called an enhancer of the gene regulatory region.
【0003】麹菌を含む糸状菌においてもこのような遺
伝子発現制御の報告は多数存在する。このような例とし
ては、麹菌のデンプンによるアミラーゼ誘導機構、グル
コースによるアミラーゼ抑制機構、アンモニウムによる
プロテアーゼ抑制機構などがあげられる。これは主に液
体培養を中心とした実験系により解明されている。There are many reports on such gene expression control in filamentous fungi including Aspergillus oryzae. Examples of such a mechanism include an amylase induction mechanism by starch of Aspergillus oryzae, an amylase inhibition mechanism by glucose, and a protease inhibition mechanism by ammonium. This has been clarified mainly by experimental systems centered on liquid culture.
【0004】しかし、アスペルギルス・オリゼーやアス
ペルギルス・ソーヤ、アスペルギルス・アワモリなどの
アスペルギルス属の麹菌は、従来清酒、醤油、味噌、み
りん醸造など我が国古来の醸造に用いられており、その
利用に当たっての培養形態は麹とよばれる固体培養であ
る。この固体培養の特徴は、液体培養に比べてタンパク
質生産力が非常に高いことである。これら醸造産業では
多くの有用タンパク質が、液体培養では生産されず、固
体培養で特異的に大量に生産されることが知られてい
る。However, Aspergillus oryzae of the genus Aspergillus such as Aspergillus oryzae, Aspergillus soja, and Aspergillus awamori have been conventionally used in sake brewing such as sake, soy sauce, miso, and mirin brewing, and the culture form for its use. Is a solid culture called koji. The feature of this solid culture is that protein productivity is much higher than that of liquid culture. It is known in the brewing industry that many useful proteins are not produced in liquid culture but are produced in large quantities specifically in solid culture.
【0005】近年、この原因のひとつには、固体培養で
特異的に発現する遺伝子が存在することが報告されてい
る。例えば、麹菌のグルコアミラーゼ生産においては、
液体培養で発現するグルコアミラーゼ遺伝子(gla
A)とは異なるglaB遺伝子が固体培養で強力に発現
することにより、大量の酵素タンパク質が固体培養で生
産されることが実証された。しかしながら、固体培養で
の遺伝子発現の制御機構、すなわち転写制御因子の解析
はほとんど行われておらず、なぜglaB遺伝子のよう
に固体培養特異的遺伝子群が、固体培養でのみ発現する
かは明らかになっていない。In recent years, it has been reported that one of the causes is a gene specifically expressed in solid culture. For example, in the production of glucoamylase from Aspergillus,
Glucoamylase gene (gla) expressed in liquid culture
Strong expression of the glaB gene different from A) in solid culture demonstrated that a large amount of enzyme protein was produced in solid culture. However, the control mechanism of gene expression in solid culture, that is, the transcriptional regulatory factor has hardly been analyzed, and it is clear why solid culture-specific genes such as glaB gene are expressed only in solid culture. is not.
【0006】もし、このような固体培養での遺伝子発現
制御を司る転写制御因子が取得できれば、固体培養での
高発現機構が明らかとなり、さらに発現量を高める検討
も可能になる。また本来遺伝子が全く発現しない液体培
養においても、当該遺伝子を大量に発現することも可能
となる。すなわち、これらの転写制御因子は固体培養で
の遺伝子発現、酵素生産を決定する重要な因子であり、
麹菌での効率的な物質生産のために転写制御因子の取得
が望まれている。[0006] If a transcriptional regulatory factor that controls gene expression in such solid culture can be obtained, the mechanism of high expression in solid culture will be clarified, and it will be possible to study to further increase the expression level. Further, even in liquid culture in which the gene is not originally expressed at all, it becomes possible to express the gene in a large amount. That is, these transcription control factors are important factors that determine gene expression and enzyme production in solid culture,
The acquisition of transcriptional regulatory factors is desired for efficient substance production in Aspergillus.
【0007】[0007]
【発明が解決しようとする課題】本発明は、前記課題の
解決を意図するものである。本発明の目的は、麹菌の固
体培養応答性cis因子に結合し該因子下流の遺伝子の
転写を制御する3種類の転写制御因子を提供することに
ある。また本発明の目的は、このような3種類の転写制
御因子をコードするDNAを提供することにある。本発
明の他の目的は本遺伝子を含有する組み換えベクターと
この組み換えベクターを含有する形質転換体を提供する
ことにある。また本発明は上記形質転換体を用いる転写
因子タンパク質の製造方法、ならびにタンパク質を麹菌
体内で発現させ、固体培養発現遺伝子群の生産を包括的
に制御することを目的とする。SUMMARY OF THE INVENTION The present invention is intended to solve the above problems. It is an object of the present invention to provide three types of transcription control factors that bind to solid culture responsive cis factor of Aspergillus and control the transcription of genes downstream of the factor. Another object of the present invention is to provide DNAs encoding such three types of transcription control factors. Another object of the present invention is to provide a recombinant vector containing this gene and a transformant containing this recombinant vector. Another object of the present invention is to produce a transcription factor protein using the above transformant and to comprehensively control the production of a solid culture-expressed gene group by expressing the protein in a koji mold.
【0008】[0008]
【課題を解決するための手段】本発明は、上記目的を達
成するためになされたものであって、先ずはじめに、麹
菌(アスペルギルス・オリゼー:Aspergillu
s oryzae)に着目した。麹菌A.oryzae
は清酒、醤油、味噌などの我が国の伝統的発酵産業で使
用されてきた糸状菌である。本菌株の特徴は、上記発酵
産業で固体培養により有用なタンパク質を非常に大量に
生産することである。本菌株が持つ高い蛋白生産能と醸
造微生物としての安全性から、異種蛋白生産の宿主とし
て注目されている(Biotechnology,6,
1419(1988)、特開昭62−272988)。The present invention has been made in order to achieve the above-mentioned object. First, aspergillus (Aspergillus oryzae: Aspergillus).
s oryzae). Aspergillus A. oryzae
Is a filamentous fungus that has been used in Japan's traditional fermentation industry, such as sake, soy sauce, and miso. A feature of this strain is that it produces a very large amount of useful proteins by solid culture in the fermentation industry. Due to the high protein-producing ability of this strain and its safety as a brewing microorganism, it has attracted attention as a host for heterologous protein production (Biotechnology, 6, 6.
1419 (1988), JP-A-62-272988).
【0009】しかし、このような麹菌の異種蛋白生産に
おける報告は液体培養によるものが非常に多い。醸造産
業での培養形態が固体培養であるにもかかわらず、異種
蛋白生産が液体培養で行われることが多い原因として、
固体培養での遺伝子発現メカニズムは、液体培養に比べ
て未解明な点が多いことがあげられる。しかし、固体培
養は、液体培養に比べてタンパク質分泌生産量も多く、
また固体培養特異的に生産される酵素群も非常に種類が
多い特徴を有している。そこで、固体培養での遺伝子発
現メカニズムを分子レベルで解明することが出来れば、
古来より非常に有用な固体培養を介しての異種蛋白生産
が可能となる。However, there are many reports on such heterologous protein production of Aspergillus oryzae by liquid culture. Despite the fact that the culture form in the brewing industry is solid culture, the reason why heterologous protein production is often carried out in liquid culture is
The gene expression mechanism in solid culture has many unclear points compared to liquid culture. However, solid culture produces a larger amount of protein secretion than liquid culture,
In addition, the enzyme group produced in solid culture has a characteristic that there are many kinds. Therefore, if we can elucidate the gene expression mechanism in solid culture at the molecular level,
Since ancient times, it has become possible to produce heterologous proteins through very useful solid culture.
【0010】発明者らは、既に、A.oryzaeの固
体培養特異的に発現するglaBプロモーターを用いた
固体培養での異種蛋白生産に成功しており、Asper
gillus属などの近種の遺伝子であれば、その生産
能はさらに増大することが認められた。特に、A.or
yzaeの遺伝子を、A.oryzaeのglaBプロ
モーター制御下で発現させた場合、非常に大量のタンパ
ク質が固体培養により生産されることを見いだした。
(特願平11−154271、特願2000−3675
4)よって、このglaBプロモーターを代表とする固
体培養特異的に発現するプロモーター配列を制御する転
写制御因子を明らかに出来れば、これらの固体培養を用
いた麹菌の異種蛋白の生産性の改善につながり、大変産
業的に意義がある点にはじめて着目した。The inventors have already reported that A. oryzae has succeeded in producing a heterologous protein in solid culture using the glaB promoter that is specifically expressed in solid culture.
It was confirmed that the productivity of genes of the near species such as the genus gillus is further increased. In particular, A. or
The yzae gene was cloned into A. It was found that when expressed under the control of the oryzae glaB promoter, a very large amount of protein was produced by solid culture.
(Japanese Patent Application No. 11-154271, Japanese Patent Application 2000-3675)
4) Therefore, if a transcriptional regulatory factor that controls a promoter sequence specifically expressed in solid culture, which is represented by the glaB promoter, can be clarified, it will lead to improvement in productivity of heterologous protein of Aspergillus oryzae using these solid cultures. For the first time, I focused on the point that it has great industrial significance.
【0011】本発明者らは、麹菌の固体培養特異的に発
現する遺伝子glaBの発現条件を検討するために、g
laBプロモーター配列の直後に大腸菌uidA遺伝子
を連結したレポーター解析を試みた。その結果、gla
Bプロモーターは固体培養において低水分活性、高温度
および菌糸伸長障害によりその発現が高められることを
明らかにした。また、このような固体培養中の低水分活
性、高温度および菌糸伸長障害という環境因子がgla
Bプロモーター中のどの領域に応答するのかを調査し
た。glaBプロモーターの種々の欠失変異体を作製
し、それぞれについてレポーターアッセイを行った結
果、上記環境因子に応答する領域を同定した。The present inventors have investigated the expression conditions of the gene glaB, which is specifically expressed in solid culture of Aspergillus oryzae.
An attempt was made to analyze a reporter in which the E. coli uidA gene was ligated immediately after the laB promoter sequence. As a result, gla
It was revealed that the B promoter was enhanced in solid culture by low water activity, high temperature and impaired hyphal elongation. In addition, environmental factors such as low water activity, high temperature and impaired hyphal elongation during solid culture are gla.
We investigated which region in the B promoter is responsible. Various deletion mutants of the glaB promoter were prepared, and a reporter assay was performed on each of them to identify the region responsive to the above environmental factors.
【0012】glaBプロモーター中の開始コドン上流
350から254bpに存在する97bpの部位特異的
欠失により、固体培養中の低水分活性、高温度および菌
糸伸長障害という環境因子応答性が失われる。この97
bpはglaBプロモーターが固体培養特異的に発現す
るために必須のcis因子であることが示唆された。こ
れを確認するために、この97bpの領域を異種遺伝子
プロモーターに挿入する実験を行った。麹菌で液体培養
特異的に発現し、固体培養でほとんど発現しないgla
Aプロモーターの−205の位置に上記97bpの領域
を挿入した結果、このキメラプロモーターは、固体培養
で強く発現するようになった。これは上記97bpの領
域が固体培養特異的に発現するために必須のcis因子
であることを表している。次に、この97bpの領域を
麹菌内へ複数導入することにより、転写制御因子の不足
を招き、promoter titrationという
現象が起こるのかどうかを確認した。A site-specific deletion of 97 bp located 350 to 254 bp upstream of the start codon in the glaB promoter abolishes the environmental factor responsiveness of low water activity, high temperature and impaired hyphal elongation in solid culture. This 97
It was suggested that bp is an essential cis factor for the glaB promoter to be expressed specifically in solid culture. In order to confirm this, an experiment was performed to insert this 97 bp region into a heterologous gene promoter. Gla, which is specifically expressed in Aspergillus oryzae in liquid culture and is rarely expressed in solid culture
As a result of inserting the 97 bp region at the -205 position of the A promoter, this chimeric promoter became strongly expressed in solid culture. This indicates that the above 97 bp region is an essential cis factor in order to express specifically in solid culture. Next, it was confirmed whether introduction of a plurality of 97 bp regions into Aspergillus oryzae causes a deficiency of transcriptional regulatory factors and causes a phenomenon called promoter titration.
【0013】その結果、本領域を麹菌内へ複数コピー導
入した麹菌を育種後、米麹の固体培養を行い、種々の酵
素生産性について検討した。その結果、本麹菌は、グル
コアミラーゼのみならず、α−アミラーゼ、酸性プロテ
アーゼ、酸性カルボキシペプチダーゼ、フィターゼなど
についても親株に対する菌体当たりの酵素生産量が、2
0−40%に低下した。これはこの97bpの領域には
固体培養での遺伝子発現を正に制御する転写制御因子が
結合し、またこの領域に結合する転写制御因子はグルコ
アミラーゼのみならず、他の固体培養で生産される酵素
群の発現も制御することが推察された。そこで、この9
7bpの領域に結合する転写制御因子を取得すれば、麹
菌の固体培養で発現する遺伝子群の制御機構の解明につ
ながるとの新規着想を得た。As a result, after breeding koji mold in which multiple copies of this region were introduced into koji mold, solid culture of rice koji was carried out and various enzyme productivity was examined. As a result, the Aspergillus oryzae produces not only glucoamylase but also α-amylase, acid protease, acid carboxypeptidase, phytase, etc. with an enzyme production amount of 2 per cell relative to the parent strain.
It fell to 0-40%. This 97 bp region is bound by a transcriptional regulatory factor that positively regulates gene expression in solid culture, and the transcriptional regulatory factor that binds to this region is produced not only by glucoamylase but also by other solid culture. It was speculated that the expression of the enzyme group is also controlled. So, this 9
We obtained a new idea that the acquisition of a transcriptional regulatory factor that binds to the 7-bp region will lead to the elucidation of the regulatory mechanism of the genes expressed in solid culture of Aspergillus oryzae.
【0014】このような転写制御因子は、固体培養中の
低水分活性、高温度および菌糸伸長障害という環境因子
を認識することが示唆されており、本因子は温度や乾燥
などのストレスや菌糸伸長障害ストレスに応答する可能
性が示唆された。既に植物のシロイヌナズナでは構造的
にC2H2タイプのzinc fingerタンパク質の
中に、ストレス応答性のものが多いという知見が得られ
ている。またマウスでは全遺伝子の約1%がC2H2タイ
プのzinc fingerタンパク質であるといわれ
ている。そこで、我々は麹菌ESTライブラリーの中か
らC2H2タイプのzinc fingerタンパク質の
抽出を行った。It has been suggested that such a transcription control factor recognizes environmental factors such as low water activity, high temperature and impaired hyphal elongation in solid culture. This factor is stressed by temperature and drought and hyphal elongation. The possibility of responding to distress stress was suggested. It has already been found that in plant Arabidopsis thaliana, many structurally C 2 H 2 type zinc finger proteins are responsive to stress. Further, it is said that about 1% of all genes in mice are C 2 H 2 type zinc finger proteins. Therefore, we extracted the C 2 H 2 type zinc finger protein from the Aspergillus oryzae EST library.
【0015】その結果、小麦フスマを用いた固体培養に
のみ存在する3種類のC2H2タイプのzinc fin
gerタンパク質をESTクローンから抽出するのに成
功した。1つは出芽酵母のストレス応答性を制御するC
2H2タイプのzinc fingerタンパク質である
MSN2と相同性を示し、これをsgbR1と命名し
た。もう1つはFusarium属のクチナーゼ遺伝子
の上流域に結合するC2H2タイプのzinc fing
erタンパク質と相同性を示し、これをsgbR2と命
名した。最後の1つは分裂酵母Schizosacch
aromyces pombeの機能未知なC2H2タイ
プのzinc fingerタンパク質と相同性を示
し、これをsgbR3と命名した。これらの3つのC2
H2タイプのzinc fingerタンパク質が上記
97bpの領域に結合するかどうかを検討するために、
大腸菌においてそのタンパク質を発現させた。As a result, three types of C 2 H 2 type zinc fin which exist only in solid culture using wheat bran
The ger protein was successfully extracted from the EST clone. One is C, which controls the stress response of Saccharomyces cerevisiae.
It showed homology to MSN2, which is a 2 H 2 type zinc finger protein, and was named sgbR1. The other is a C 2 H 2 -type zinc finger that binds to the upstream region of the Fusarium cutinase gene.
It showed homology with the er protein and was named sgbR2. The last one is the fission yeast Schizosacc
indicates unknown function C 2 H 2 type of zinc finger protein homologous aromyces pombe, and was named SgbR3. These three C 2
In order to examine whether the H 2 -type zinc finger protein binds to the 97 bp region,
The protein was expressed in E. coli.
【0016】発現タンパク質を用いてゲルシフトアッセ
イを行った結果、この97bpの領域にsgbR1とs
gbR2の2つの因子が結合することが確認された。ま
た、これら3つの転写制御因子の機能を決定するために
コサプレッション法を用いて、それぞれの遺伝子の発現
を抑制させた麹菌を構築した。固体培養を行った結果、
いずれもグルコアミラーゼのみならず、α−アミラー
ゼ、酸性プロテアーゼ、酸性カルボキシペプチダーゼ、
フィターゼなどについても親株に対する菌体当たりの酵
素生産量が30−70%に低下した。よって両転写制御
因子は、固体培養特異的な遺伝子群の発現を正に制御す
る因子であることが明らかとなった。これらの転写因子
の活性を調節することにより固体培養での遺伝子発現を
自由にコントロールできるものとの新しい観点に本発明
者らはたった。As a result of gel shift assay using the expressed protein, sgbR1 and sgbR1
It was confirmed that two factors of gbR2 bind. In addition, the co-suppression method was used to determine the functions of these three transcription control factors, and koji mold in which the expression of each gene was suppressed was constructed. As a result of performing solid culture,
Not only glucoamylase, but also α-amylase, acid protease, acid carboxypeptidase,
With respect to phytase and the like, the enzyme production per bacterial cell was reduced to 30-70% with respect to the parent strain. Therefore, it was revealed that both transcription control factors positively control the expression of the gene group specific to solid culture. The present inventors have found a new viewpoint that gene expression in solid culture can be freely controlled by regulating the activity of these transcription factors.
【0017】以下、本発明の詳細について述べる。The details of the present invention will be described below.
【0018】本発明者らは、既に麹菌の固体培養特異的
に発現する遺伝子としてグルコアミラーゼ遺伝子gla
Bを発見したことを報告した(特開平10−8496
8)。さらに、本遺伝子を固体培養で発現させる環境因
子を決定するために、大腸菌由来β−グルクロニダーゼ
(GUS)をレポーターとしてglaBプロモーターの
発現条件を検討した(H.Ishidaら、Curr. Genet., 37,
p.373-379、2000)。The present inventors have already identified the glucoamylase gene gla as a gene that is specifically expressed in solid culture of Aspergillus oryzae.
It was reported that B was discovered (Japanese Patent Laid-Open No. 10-8496).
8). Furthermore, in order to determine the environmental factor for expressing this gene in solid culture, expression conditions of the glaB promoter were examined using Escherichia coli-derived β-glucuronidase (GUS) as a reporter (H. Ishida et al., Curr. Genet., 37,
p.373-379, 2000).
【0019】すなわち、高発現システムを開発するため
のプロモーター解析システムとして、具体的には図22
に示すような、プロモーター解析プラスミドpNGUS
を用いた。このプラスミドは、形質転換用マーカーであ
るA.oryzaeのniaD遺伝子(E.S.Unk
leら、Mol.Gen.Genet.,218,p.
99−104,1989)と、レポーター遺伝子である
大腸菌のβ−グルクロニダーゼ(GUS)をコードする
uidA遺伝子(R.A.Jeffersonら、Pr
oc.Natl.Acad.Sci.,p.8447−
8451,1986)を含む。このプラスミドのuid
A遺伝子の上流域(例えばSa1I、PstIサイト)
に、検討しようとする種々の遺伝子プロモーターあるい
はその一部を挿入し、構築されたプロモーター解析用プ
ラスミドをA.oryzae(Aspergillus
oryzae O−1013:本菌株は、工業技術院
生命工学工業技術研究所(現、独立行政法人 産業技術
総合研究所 特許生物寄託センター)にFERM P−
16528として既に寄託されている)のniaD変異
株(硝酸資化能欠損株、Nitrate Reduct
ase欠損:例えばAspergillus oryz
ae 1013−niaD:本菌株は、工業技術院生命
工学工業技術研究所(現、独立行政法人 産業技術総合
研究所 特許生物寄託センター)にFERM P−17
707として寄託されている)に導入し、導入プラスミ
ドが宿主染色体のniaD locusに1コピーだけ
導入された形質転換体を選択した。そしてこれらの形質
転換体のGUS活性を測定することにより、プロモータ
ー活性の指標とした。なお、プロモーター解析用プラス
ミドpNGUSを大腸菌に導入してなる形質転換体は、
Escherichiacoli IN 113と命名
され、経済産業省産業技術総合研究所生命工学工業技術
研究所(現、独立行政法人 産業技術総合研究所 特許
生物寄託センター)にFERM P−18254として
寄託されている。That is, as a promoter analysis system for developing a high expression system, specifically, FIG.
Promoter analysis plasmid pNGUS as shown in
Was used. This plasmid is A. oryzae niaD gene (ES Unk
le et al., Mol. Gen. Genet. 218, p.
99-104, 1989) and a reporter gene E. coli β-glucuronidase (GUS) encoding the uidA gene (RA Jefferson et al., Pr.
oc. Natl. Acad. Sci. , P. 8447-
8451, 1986). Uid of this plasmid
Upstream region of A gene (for example, Sa1I and PstI sites)
Various gene promoters to be studied or a part thereof were inserted into A. oryzae (Aspergillus
oryzae O-1013: This strain was transferred to the Institute of Biotechnology, National Institute of Advanced Industrial Science and Technology (now, National Institute of Advanced Industrial Science and Technology, Patent Biological Deposit Center) under FERM P-
16528 already deposited), niaD mutant strain (nitrate-utilizing strain, Nitrate Reduct)
ase deficiency: For example, Aspergillus oryz
ae 1013-niaD: This strain is FERM P-17 at the Institute of Biotechnology, Institute of Industrial Science and Technology (now the National Institute of Advanced Industrial Science and Technology, Patent Biological Deposit Center).
(Deposited as 707), and a transformant in which only one copy of the introduced plasmid was introduced into niaD locus of the host chromosome was selected. Then, the GUS activity of these transformants was measured and used as an index of the promoter activity. The transformant obtained by introducing the promoter analysis plasmid pNGUS into E. coli is
It has been named Escherichia coli IN 113, and has been deposited as FERM P-18254 at the Institute of Biotechnology, National Institute of Advanced Industrial Science and Technology (now the National Institute of Advanced Industrial Science and Technology, Patent Biological Deposit Center) of the Ministry of Economy, Trade and Industry.
【0020】上記解析システムを用いてglaBプロモ
ーターの発現条件を検討した結果、glaB遺伝子は、
固体培養における低水分活性、高温度、菌糸伸長障害の
因子により、その発現が高められることを明らかとした
(特開平11−225746)。このような固体培養に
おける低水分活性、高温度、菌糸伸長障害の環境因子に
応答する領域を同定するために、種々のglaBプロモ
ーターの欠失変異体を作製した。その結果、開始コドン
上流350から254bpに存在する97bpの欠失に
より、固体培養におけるglaBプロモーター活性は完
全に失われ、この97bpが固体培養における低水分活
性、高温度、菌糸伸長障害の環境因子に応答する領域で
あることを同定した(特開平11−243965)。As a result of examining the expression conditions of the glaB promoter using the above analysis system, the glaB gene was found to be
It was clarified that its expression is enhanced by factors of low water activity, high temperature, and hyphal elongation disorder in solid culture (JP-A-11-225746). In order to identify the region responding to environmental factors of low water activity, high temperature, and hyphal elongation disorder in solid culture, various deletion mutants of glaB promoter were prepared. As a result, due to the deletion of 97 bp existing upstream from the start codon 350 to 254 bp, the glaB promoter activity in solid culture was completely lost, and this 97 bp was an environmental factor for low water activity, high temperature and impaired hyphal elongation in solid culture. It was identified to be a response area (Japanese Patent Laid-Open No. 11-243965).
【0021】次に、この97bp(その塩基配列を配列
表の配列番号7(図21)に示す)の領域が、実際に固
体培養における遺伝子発現を正に制御するのかを検討し
た。本領域を固体培養では発現が微弱なA.oryza
e由来glaAプロモーターの−205bpの位置に挿
入し、得られたキメラプロモーターのGUSレポーター
アッセイを行った(図1)。その結果、キメラプロモー
ターは固体培養におけるGUS発現量がネイティブのも
のの約50倍に増大した。よってこの97bpの領域
が、実際に固体培養における遺伝子発現を正に制御する
ことが明らかとなった。Next, it was examined whether the 97 bp region (the nucleotide sequence of which is shown in SEQ ID NO: 7 (FIG. 21) of the sequence listing) actually positively regulates gene expression in solid culture. This region has a weak expression in solid culture. oryza
It was inserted into the -205 bp position of the e-derived glaA promoter, and the resulting chimeric promoter was subjected to GUS reporter assay (Fig. 1). As a result, the GUS expression level of the chimeric promoter in solid culture was increased about 50 times that of the native one. Therefore, it was revealed that this 97 bp region actually positively controls gene expression in solid culture.
【0022】次に、この97bpの領域を麹菌へ多コピ
ー導入することによって、プロモータータイトレーショ
ン実験を行った。97bpの領域を8コピータンデムに
リピートさせたフラグメントをA.nidulans由
来argBマーカーとともにA.oryzae M−2
−3(arg−)へ導入した。サザン解析の結果、本9
7bpの領域が80−300コピー導入された形質転換
体が得られた。これらの形質転換体のうち5株について
固体培養を行った結果、図2に示すようにグルコアミラ
ーゼのみならず、α−アミラーゼ、酸性プロテアーゼ、
酸性カルボキシペプチダーゼ、フィターゼなどについて
も親株に対する菌体当たりの酵素生産量が、20−40
%に低下した。これは人為的に導入した97bpの領域
に多数の転写制御因子が結合したため、転写制御因子が
不足し、その結果、酵素生産量が低下したものと推察さ
れた。Next, a promoter titration experiment was carried out by introducing multiple copies of this 97 bp region into Aspergillus oryzae. The fragment in which the 97 bp region was repeated in 8 copy tandem was A. nigerans derived argB marker together with A. oryzae M-2
-3 (arg-). As a result of Southern analysis, this book 9
A transformant having 80-300 copies of the 7-bp region was obtained. As a result of solid culture of 5 strains among these transformants, as shown in FIG. 2, not only glucoamylase but also α-amylase, acid protease,
Regarding acid carboxypeptidase, phytase, etc., the enzyme production amount per bacterial cell with respect to the parent strain was 20-40.
Fell to%. It was speculated that this is because the transcription control factor was insufficient because a large number of transcription control factors bound to the artificially introduced 97 bp region, and as a result, the enzyme production amount decreased.
【0023】これらの結果から、この97bpの領域に
は固体培養での遺伝子発現を正に制御する転写制御因子
が結合し、また、この領域に結合する転写制御因子はグ
ルコアミラーゼのみならず、他の固体培養で生産される
酵素群の発現も制御することが推察された。そこで、こ
の97bpの領域に結合する転写制御因子を取得すれ
ば、麹菌の固体培養で発現する遺伝子群の制御機構の解
明につながると発明者らは考えた。From these results, a transcription control factor that positively controls gene expression in solid culture binds to this 97 bp region, and the transcription control factor that binds to this region is not only glucoamylase but also other It was presumed that the expression of the enzyme group produced in the solid culture of the above was also controlled. Therefore, the present inventors considered that the acquisition of a transcriptional regulatory factor that binds to this 97 bp region would lead to the elucidation of the regulatory mechanism of the genes expressed in solid culture of Aspergillus oryzae.
【0024】本発明者らは、このような転写制御因子
は、固体培養中の低水分活性、高温度および菌糸伸長障
害という環境因子を認識することが示唆されており、本
因子は温度や乾燥などのストレスや菌糸伸長障害ストレ
スに応答する可能性が示唆された。既に植物のシロイヌ
ナズナでは構造的にC2H2タイプのzinc fing
erタンパク質の中に、ストレス応答性のものが多いと
いう知見が得られている。そこで、我々は麹菌ESTラ
イブラリーの中からC2H2タイプのzinc fing
erタンパク質の抽出を行った。その結果、小麦フスマ
を用いた固体培養にのみ存在する3種類のC2H2タイプ
のzinc fingerタンパク質のESTクローン
を抽出するのにはじめて成功した。It has been suggested by the present inventors that such a transcription control factor recognizes environmental factors such as low water activity, high temperature and impaired hyphal elongation in solid culture. It was suggested that it may respond to such stress and hyphal elongation disorder stress. Already in plants, Arabidopsis is structurally a C 2 H 2 type of zinc finger.
It has been found that many er proteins are responsive to stress. Therefore, we selected a C 2 H 2 type zinc finger from the Aspergillus oryzae EST library.
The er protein was extracted. As a result, they succeeded for the first time in extracting EST clones of three types of C 2 H 2 -type zinc finger proteins that exist only in solid culture using wheat bran.
【0025】1つは出芽酵母のC2H2タイプのzinc
fingerタンパク質であるMSN2と相同性を示
す。MSN2は出芽酵母においてSTREとよばれる配
列に結合し、複数のストレス応答性遺伝子群の発現を正
に制御することが知られている。もう1つはFusar
ium属のクチナーゼ遺伝子の上流域に結合するC2H2
タイプのzinc fingerタンパク質と相同性を
示した。これは、Fusarium属が植物の葉に感染
する際に分泌するクチナーゼの発現を正に制御する転写
制御因子であり、Fusarium属が植物の葉に対し
て付着のシグナルに応答する因子と考えられる。最後の
1つは分裂酵母Schizosaccharomyce
s pombeの機能未知なC2H2タイプのzinc
fingerタンパク質と相同性を示した。このような
3つのC2H2タイプのzincfingerタンパク質
の部分cDNAをプローブとして、A.oryzaeの
λEMBL3ライブラリーからゲノム遺伝子のクローニ
ングを行った。The first is the C 2 H 2 type zinc of budding yeast.
It shows homology with MSN2 which is a finger protein. It is known that MSN2 binds to a sequence called STRE in Saccharomyces cerevisiae and positively regulates the expression of multiple stress-responsive genes. The other is Fusar
C 2 H 2 binding to upstream region of cutinase gene of ium
It showed homology with the type of zinc finger protein. This is a transcriptional regulatory factor that positively regulates the expression of cutinase secreted when Fusarium genus infects plant leaves, and is considered to be a factor in which Fusarium genus responds to an adhesion signal to plant leaves. The last one is the fission yeast Schizosaccharomyces
C 2 H 2 type zinc with unknown function of s pombe
It showed homology with the finger protein. The partial cDNAs of the three C 2 H 2 type zincfinger proteins were used as probes. A genomic gene was cloned from the oryzae λEMBL3 library.
【0026】これらのDNAシーケンスを行った結果、
MSN2と相同性のある転写因子は、そのアミノ酸配列
を配列番号1(図6〜8)に示し、その塩基配列を配列
番号2(図9、10)に示したが、配列番号1、2に示
しすように、717アミノ酸残基をコードして、イント
ロンを持たない遺伝子であった。また本遺伝子はS.c
erevisiaeの様々なストレス応答遺伝子のST
RE配列に結合するMSN2と相同性を示す。これをs
gbR1と命名した。As a result of performing these DNA sequences,
A transcription factor homologous to MSN2 has its amino acid sequence shown in SEQ ID NO: 1 (FIGS. 6 to 8) and its nucleotide sequence shown in SEQ ID NO: 2 (FIGS. 9 and 10). As shown, it was a gene encoding 717 amino acid residues and having no intron. In addition, this gene is S. c
ST of various stress response genes of Erevisiae
It shows homology with MSN2 binding to the RE sequence. S this
It was named gbR1.
【0027】一方、フザリウム(Fusarium)属
クチナーゼ制御因子と相同性のある転写因子クローン
は、そのアミノ酸配列を配列番号3(図11〜13)に
示し、その塩基配列を配列番号4(図14、15)に示
したが、配列番号3、4に示すように、597アミノ酸
残基をコードして、1つのイントロンを有する遺伝子で
あった。これをsgbR2と命名した。On the other hand, the transcription factor clone having homology with the Fusarium cutinase regulatory factor has its amino acid sequence shown in SEQ ID NO: 3 (FIGS. 11 to 13) and its nucleotide sequence shown in SEQ ID NO: 4 (FIG. 14, FIG. As shown in 15), it was a gene encoding 597 amino acid residues and having one intron as shown in SEQ ID NOs: 3 and 4. This was named sgbR2.
【0028】最後の分裂酵母シゾサッカロミセス・ポン
ベ(Schizosaccharomyces pom
be)の機能未知なC2H2タイプのzinc fing
erタンパク質と相同性を示す転写因子クローンは、そ
のアミノ酸配列を配列番号5(図16〜18)に示し、
その塩基配列を配列番号6(図19、20)に示した
が、配列番号5、6に示すように、468アミノ酸残基
をコードして、2つのイントロンを有する遺伝子であっ
た。これをsgbR3と命名した。The last fission yeast Schizosaccharomyces pomb
be) Function of unknown C 2 H 2 type zinc finger
The transcription factor clone showing homology with the er protein has its amino acid sequence shown in SEQ ID NO: 5 (FIGS. 16 to 18),
Its base sequence is shown in SEQ ID NO: 6 (FIGS. 19 and 20), and as shown in SEQ ID NOs: 5 and 6, it was a gene encoding 468 amino acid residues and having two introns. This was named sgbR3.
【0029】これらの因子の機能を決定するために、コ
サブレッション法を用いて、それぞれの遺伝子の発現を
抑制させた麹菌を構築した。麹菌内で構成的に高発現す
るhistoneH2Aプロモーター下流にsgbR
1、sgbR2及びsgbR3の開始コドンから500
bpの部分断片を連結させ、これをA.nidulan
sのsCマーカーを用いて麹菌内へマルチコピーで導入
した。得られた麹菌を用いて固体培養を行った結果、図
4に示すようにグルコアミラーゼのみならず、α−アミ
ラーゼ、酸性プロテアーゼ、酸性カルボキシルペプチダ
ーゼ、フィターゼなどについても親株に対する菌体当た
りの酵素生産量がいずれも30−70%に低下した。よ
って、これらの転写制御因子は、固体培養特異的な遺伝
子群の発現を正に制御する因子であることが明らかとな
った。In order to determine the function of these factors, the koji mold in which the expression of each gene was suppressed was constructed by using the cossablation method. SgbR is downstream of the histoneH2A promoter that is constitutively highly expressed in Aspergillus oryzae.
1, 500 from the start codon of sgbR2 and sgbR3
The partial fragment of bp was ligated, and this was nidulan
Using the sC marker of s, it was introduced into Koji mold in multiple copies. As a result of carrying out solid culture using the obtained koji mold, not only glucoamylase but also α-amylase, acid protease, acid carboxylpeptidase, phytase, etc. as shown in FIG. In all cases fell to 30-70%. Therefore, it was revealed that these transcription control factors are factors that positively control the expression of gene groups specific to solid culture.
【0030】次に、これらの遺伝子が、glaBプロモ
ーターのcis因子である97bpとの結合能を有する
かどうかをゲルシフトアッセイにより確認した。sgb
R1、sgbR2及びsgbR3のcDNAを取得する
ために、蒸し米を用いた固体培養菌体から調製したmR
NAをもとにRT−PCRを行った。この際に使用した
プライマーの両端には制限酵素サイトBamHI(sg
bR1)、EcoRI(sgbR2)、SmaI(sg
bR3)を付加した。得られたクローンをpUC118
のlacZプロモーターの正の方向にサブクローニング
し、得られた発現ベクター(それぞれ、pSGBR1、
pSGBR2、pSGBR3と命名)を大腸菌JM10
9に形質転換した。アンピシリン含有培地で培養し、ア
ンピシリン耐性株を選択し、得られた形質転換体は、そ
れぞれ、エシェリヒア・コリ(Escherichia
coli)SGBR1、同SGBR2、同SGBR3
と命名し、これを独立行政法人 産業技術総合研究所
特許生物寄託センターに、それぞれ、FERM P−1
8357、FERM P−18358、FERMP−1
8359として寄託した。この形質転換体は、sgbR
1、sgbR2、sgbR3タンパク質をコードする遺
伝子を含有しており、IPTG誘導によって、lacZ
プロモーター支配下でsgbR1、sgbR2、sgb
R3タンパク質が発現することを確認した。Next, it was confirmed by gel shift assay whether these genes have the binding ability to 97 bp which is a cis factor of the glaB promoter. sgb
MR prepared from solid-cultured cells using steamed rice to obtain cDNAs for R1, sgbR2 and sgbR3
RT-PCR was performed based on NA. The restriction enzyme site BamHI (sg
bR1), EcoRI (sgbR2), SmaI (sg
bR3) was added. The obtained clone was designated as pUC118.
Of the expression vector (pSGBR1, pSGBR1,
(named pSGBR2 and pSGBR3) was used for Escherichia coli JM10
Transformed into 9. The transformants obtained by culturing in an ampicillin-containing medium and selecting an ampicillin-resistant strain were respectively transformed into Escherichia coli (Escherichia coli).
coli) SGBR1, SGBR2, SGBR3
And named it, National Institute of Advanced Industrial Science and Technology
FERM P-1 at the Patent Biological Deposit Center
8357, FERM P-18358, FERMP-1
Deposited as 8359. This transformant is sgbR
1, which contains genes encoding sgbR2 and sgbR3 proteins, and by IPTG induction, lacZ
SgbR1, sgbR2, sgb under the control of promoter
It was confirmed that the R3 protein was expressed.
【0031】ゲルシフトアッセイは、タンパク質として
大腸菌の無細胞調製液を用い、プローブとしてはgla
Bプロモーターのcis因子である97bpの2本鎖D
NAの二量体をフルオレッセンラベルしたものを用い
た。室温で30分間インキュベートした後に、ポリアク
リルアミドゲル電気泳動を行った。泳動ゲルは、正電荷
メンブレンへアルカリ転写し、抗原抗体反応によりフル
オレッセンの検出を行った。図5に示すようにsgbR
1とsgbR2はglaBプロモーターのcis因子で
ある97bpとの結合により、遅延したバンドが確認さ
れた。またこのバンドは過剰の非ラベルプローブの添加
により消失した。よってsgbR1とsgbR2はとも
にglaBプロモーターのcis因子である97bpへ
の結合能を有することが確認された。In the gel shift assay, a cell-free preparation of Escherichia coli was used as the protein, and gla was used as the probe.
97 bp double-stranded D which is a cis factor of B promoter
The dimer of NA was labeled with fluorescein. After incubating at room temperature for 30 minutes, polyacrylamide gel electrophoresis was performed. The electrophoresis gel was alkali-transferred to a positively charged membrane, and fluorescein was detected by an antigen-antibody reaction. As shown in Figure 5, sgbR
A delayed band was confirmed by binding of 1 and sgbR2 with 97 bp which is a cis factor of the glaB promoter. Also, this band disappeared due to the addition of excess unlabeled probe. Therefore, it was confirmed that both sgbR1 and sgbR2 have the ability to bind to 97 bp, which is the cis factor of the glaB promoter.
【0032】以上の結果、sgbR1、sgbR2及び
sgbR3は麹菌の固体培養におけるタンパク質生産を
正に制御する転写制御因子であることが明らかとなっ
た。From the above results, it was revealed that sgbR1, sgbR2 and sgbR3 are transcriptional control factors that positively control protein production in solid culture of Aspergillus oryzae.
【0033】本転写制御因子(転写因子ということもあ
る)である、sgbR1、sgbR2、sgbR3が得
られたことによって、麹菌の液体培養及び固体培養での
酵素生産の包括的改善を行うことが可能となり、工業的
にも非常に有望な遺伝子であることが示唆された。By obtaining sgbR1, sgbR2 and sgbR3, which are the present transcriptional regulatory factors (sometimes referred to as transcription factors), it is possible to comprehensively improve enzyme production in liquid culture and solid culture of Aspergillus oryzae. Therefore, it was suggested that the gene is very promising industrially.
【0034】[0034]
【実施例1】glaBプロモーター中の固体培養高発現
に必須な領域の同定
glaBプロモーターの固体培養高発現に必須な領域を
欠失変異により開始コドン上流350から254bpの
97bpであることを明らかにした。次に、固体培養高
発現の機能を付与するのに必須な領域の同定を試みた。
glaBプロモーターの開始コドン上流350から下流
へ97、61、27bpを固体培養では発現が微弱であ
るglaAプロモーター中に挿入し、キメラプロモータ
ーの固体培養高発現能を比較した。挿入位置はglaA
プロモーターのデンプン誘導に必須なcis因子である
RegionIIIaの上流末端が存在する開始コドン
上流205bpの位置に決定した。Example 1 Identification of a region essential for high expression in solid culture in the glaB promoter It was clarified that a region essential for high expression in solid culture of the glaB promoter was 97 bp from the start codon 350 to 254 bp upstream by deletion mutation. . Next, an attempt was made to identify a region essential for imparting the function of high expression in solid culture.
97, 61, and 27 bp were inserted downstream from the start codon 350 of the glaB promoter into the glaA promoter, which expression is weak in solid culture, and the high expression ability of the chimeric promoter in solid culture was compared. The insertion position is glaA
It was determined at a position 205 bp upstream of the initiation codon in which the upstream end of Region IIIa, which is a cis factor essential for starch induction of the promoter, is present.
【0035】上記3種のフラグメントをglaAプロモ
ーターの−205の位置に挿入し、得られたキメラプロ
モーターをプロモーター解析用プラスミドpNGUS
(図22)のSalIサイトへサブクローニングし、n
iaDをマーカーとして麹菌へ形質転換後、固体培養に
おけるGUSアッセイを行った。その結果、97bpを
挿入したキメラプロモーターは、ネイティブのものの約
50倍のGUS発現量を示した。よって、glaBプロ
モーターの開始コドン上流350から254までの97
bp(配列番号7、図21)が、固体培養高発現に必須
なcis因子であることが証明された(図1)。図中、
GUS activityはGUS活性(U/mg−タ
ンパク質)を示し、単一炭素源としてGlcはグルコー
ス、Stはデンプンを使用したことを示す。The above three kinds of fragments were inserted at the −205 position of the glaA promoter, and the resulting chimeric promoter was used as a plasmid pNGUS for promoter analysis.
Subcloning into the SalI site of (Fig. 22)
After transformation into Aspergillus oryzae using iaD as a marker, GUS assay in solid culture was performed. As a result, the chimeric promoter having 97 bp inserted showed a GUS expression level about 50 times that of the native one. Therefore, 97 from 350 to 254 upstream of the start codon of the glaB promoter.
It was proved that bp (SEQ ID NO: 7, FIG. 21) is a cis factor essential for high expression in solid culture (FIG. 1). In the figure,
GUS activity shows GUS activity (U / mg-protein), Glc is glucose and St is starch as a single carbon source.
【0036】[0036]
【実施例2】glaBプロモーターcis因子97bp
を用いたプロモータータイトレーションアッセイ
glaBプロモーターのcis因子97bpに結合する
固体培養高発現に関わる転写制御因子が、glaB遺伝
子以外の遺伝子を制御するかを確認するために、プロモ
ータータイトレーションアッセイを行った。97bpの
上流側にSalI、下流側にXhoIの制限酵素サイト
を付加し、pBluescript IISK+にサブク
ローニングした。このXhoIサイトに新たな97bp
のSalIサイトを結合させる操作を3回繰り返すこと
によって、97bpの8量体が得られた。これをA.n
idulans由来argBをマーカーとしてA.or
yzae M−2−3(argB−)へ導入した。サザ
ン解析の結果、本97bpの領域が80−300コピー
導入された形質転換体が得られた。Example 2 GlaB promoter cis factor 97 bp
A promoter titration assay was carried out in order to confirm whether or not the transcription control factor involved in high expression in solid culture that binds to the cis factor 97 bp of the glaB promoter controls genes other than the glaB gene. A restriction enzyme site of SalI was added to the upstream side of 97 bp, and a restriction enzyme site of XhoI was added to the downstream side, and subcloning was performed into pBluescript II SK + . New 97bp on this XhoI site
By repeating the operation for binding the SalI site of 3 times, a 97 bp octamer was obtained. A. n
A. duluans-derived argB as a marker. or
It was introduced into yzae M-2-3 (argB-). As a result of Southern analysis, a transformant into which 80 to 300 copies of the 97 bp region were introduced was obtained.
【0037】これらの形質転換体のうち5株について固
体培養を行った結果、図2に示すようにグルコアミラー
ゼのみならず、α−アミラーゼ、酸性プロテアーゼ、酸
性カルボキシペプチダーゼ、フィターゼなどについても
親株に対する菌体当たりの酵素生産量が、20−40%
に低下した。これは人為的に導入した97bpの領域に
固体培養での遺伝子発現を正に制御する転写制御因子が
結合し、本転写制御因子の不足により、上記酵素遺伝子
群の効率的な転写が行われなくなったためと考えられ
る。よってこの領域に結合する転写制御因子はグルコア
ミラーゼのみならず、他の固体培養で生産される酵素群
の遺伝子発現も制御することが推察された。As a result of solid culture of 5 of these transformants, as shown in FIG. 2, not only glucoamylase but also α-amylase, acid protease, acid carboxypeptidase, phytase, etc. 20-40% of enzyme production per body
Fell to. This is because a transcriptional regulatory factor that positively regulates gene expression in solid culture binds to the artificially introduced 97 bp region, and due to the lack of this transcriptional regulatory factor, efficient transcription of the enzyme gene group is not performed. It is thought to be a tame. Therefore, it was speculated that the transcription control factor binding to this region controls not only glucoamylase but also gene expression of other enzyme groups produced in solid culture.
【0038】[0038]
【実施例3】C2H2タイプzinc fingerタン
パク質のESTライブラリーからのクローニング
97bpに結合する転写制御因子は、固体培養中の低水
分活性、高温度および菌糸伸長障害という環境因子を認
識することが示唆されており、本因子は温度や乾燥及び
菌糸伸長障害などのストレスに応答する因子である可能
性が示唆された。既に植物のシロイヌナズナでは構造的
にC2H2タイプのzinc fingerタンパク質の
中に、ストレス応答性のものが多いという知見を得てい
る。そこで、我々は麹菌ESTライブラリーの中からC
2H2タイプのzinc fingerタンパク質の抽出
を行った。その結果、小麦フスマを用いた固体培養にの
み存在する3種類のC2H2タイプのzinc fing
erタンパク質のESTクローンを抽出した。Example 3 Cloning of C 2 H 2 Type Zinc Finger Protein from EST Library A transcription factor that binds to 97 bp recognizes environmental factors such as low water activity, high temperature and impaired hyphal elongation in solid culture. It was suggested that this factor may be a factor that responds to stress such as temperature, drought and impaired hyphal elongation. It has already been found that in Arabidopsis plants, many structurally C 2 H 2 type zinc finger proteins are responsive to stress. Therefore, we selected C from the koji mold EST library.
Extraction of 2 H 2 type zinc finger protein was performed. As a result, three types of C 2 H 2 type zinc fingering that exist only in solid culture using wheat bran
An EST clone of the er protein was extracted.
【0039】この3つのC2H2タイプのzinc fi
ngerタンパク質の部分cDNAをプローブとして、
Gene Imageキット(アマシャムファルマシア
社製)を用いて、アスペルギルス・オリゼーO−101
3株(FERM P−16528)の約3000個のラ
ムダEMBL3ゲノムDNAライブラリーをプラークハ
イブリダイゼーションによりスクリーニングし、陽性を
示すクローンを得た。These three C 2 H 2 type zinc fi
Using a partial cDNA of the nger protein as a probe,
Using the Gene Image kit (manufactured by Amersham Pharmacia), Aspergillus oryzae O-101
About 3000 lambda EMBL3 genomic DNA libraries of 3 strains (FERM P-16528) were screened by plaque hybridization to obtain positive clones.
【0040】上記ゲノム遺伝子のクローニングの結果、
MSN2と相同性のある転写因子は、配列番号1(アミ
ノ酸配列:図6〜8)、配列番号2(塩基配列:図9、
10)に示すように、717アミノ酸残基をコードし
て、イントロンを持たない遺伝子であった。また本遺伝
子はS.cerevisiaeの様々なストレス応答遺
伝子のSTRE配列に結合するMSN2と相同性を示
す。これをsgbR1と命名した。一方、Fusari
um属クチナーゼ制御因子と相同性のある転写因子クロ
ーンは、配列番号3(アミノ酸配列:図11〜13)、
配列番号4(塩基配列:図14、15)に示すように、
597アミノ酸残基をコードして、1つのイントロンを
有する遺伝子であった。これをsgbR2と命名した。
最後の分裂酵母Schizosaccharomyce
s pombeの機能未知なC2H2タイプのzinc
fingerタンパク質と相同性を示す転写因子クロー
ンは、配列番号5(アミノ酸配列:図16〜18)、配
列番号6(塩基配列:図19、20)に示すように、4
68アミノ酸残基をコードして、2つのイントロンを有
する遺伝子であった。これをsgbR3と命名した。As a result of cloning the above-mentioned genomic gene,
Transcription factors homologous to MSN2 include SEQ ID NO: 1 (amino acid sequence: FIGS. 6 to 8), SEQ ID NO: 2 (base sequence: FIG. 9,
As shown in 10), it was a gene encoding 717 amino acid residues and having no intron. In addition, this gene is S. It shows homology with MSN2 binding to the STRE sequences of various stress response genes of S. cerevisiae. This was named sgbR1. On the other hand, Fusari
A transcription factor clone homologous to the um genus cutinase regulatory factor is SEQ ID NO: 3 (amino acid sequence: FIGS. 11 to 13),
As shown in SEQ ID NO: 4 (base sequence: FIGS. 14 and 15),
It was a gene encoding 597 amino acid residues and having one intron. This was named sgbR2.
The last fission yeast Schizosaccharomyces
C 2 H 2 type zinc with unknown function of s pombe
Transcription factor clones showing homology with the finger protein are 4 as shown in SEQ ID NO: 5 (amino acid sequence: FIGS. 16 to 18) and SEQ ID NO: 6 (base sequence: FIGS. 19 and 20).
It was a gene encoding 68 amino acid residues and having two introns. This was named sgbR3.
【0041】[0041]
【実施例4】sgbR1、sgbR2及びsgbR3の
ノザン解析
sgbR1、sgbR2及びsgbR3の発現条件を解
析するためにノザン解析を行った。液体培養(2%デキ
ストリン、1%ペプトン、0.5%イーストエキス、
0.05% MgSO4、0.1% KCl、0.1%
K2HPO4、0.001% FeSO4)、小麦フス
マを用いた固体培養、蒸し米を用いた固体培養菌体より
RNAの抽出を行った。RNA抽出にはISOGEN
(ニッポンジーン社製)を用いた。Example 4 Northern Analysis of sgbR1, sgbR2 and sgbR3 Northern analysis was performed to analyze the expression conditions of sgbR1, sgbR2 and sgbR3. Liquid culture (2% dextrin, 1% peptone, 0.5% yeast extract,
0.05% MgSO 4 , 0.1% KCl, 0.1%
K 2 HPO 4 , 0.001% FeSO 4 ), solid culture using wheat bran, and RNA extraction from solid culture cells using steamed rice. ISOGEN for RNA extraction
(Manufactured by Nippon Gene Co., Ltd.) was used.
【0042】得られたRNA各20μgを1%ホルムア
ルデヒド変性アガロースゲル電気泳動を行った。泳動
後、HybondN+メンブレン(アマシャムファルマ
シア社製)ヘアルカリトランスファー後、Gene I
mageキット(アマシャムファルマシア社製)を用い
てノザン解析を行った。図3(電気泳動パターンの写
真:図面代用写真)に示したように両遺伝子共に液体培
養、小麦フスマを用いた固体培養、蒸米を用いた固体培
養で発現が認められた。20 μg of each of the obtained RNAs was subjected to 1% formaldehyde-denaturing agarose gel electrophoresis. After electrophoresis, Hybond N + membrane (manufactured by Amersham Pharmacia) was alkali-transferred, and then Gene I
Northern analysis was performed using a image kit (manufactured by Amersham Pharmacia). As shown in FIG. 3 (photograph of electrophoretic pattern: photograph in lieu of drawing), expression of both genes was observed in liquid culture, solid culture using wheat bran, and solid culture using steamed rice.
【0043】[0043]
【実施例5】sgbR1、sgbR2及びsgbR3の
コサプレッション法によるジーンサイレンシング株の解
析
sgbR1、sgbR2及びsgbR3の機能を決定す
るためにコサプレッション法を用いて、両遺伝子の発現
を抑制させた麹菌の構築を検討した。麹菌内で構成的に
高発現するA.oryzae由来histoneH2A
プロモーター(685bp)下流にsgbR1、sgb
R2及びsgbR3の開始コドン(それぞれ、配列番号
2、4、6において、186番目、1288番目、50
6番目のATG)から500bpの部分断片を連結さ
せ、これをアスペルギルス・ニドランス(A.nidu
lans)のsCマーカーを用いて麹菌内へマルチコピ
ーで導入した。Example 5 Analysis of Gene Silencing Strains by Cosuppression Method of sgbR1, sgbR2 and sgbR3 Using the cosuppression method to determine the functions of sgbR1, sgbR2 and sgbR3, the expression of both genes was suppressed in Aspergillus oryzae. Considered construction. A. which is constitutively highly expressed in Aspergillus oryzae. oryzae-derived histone H2A
SgbR1 and sgb downstream of the promoter (685 bp)
Initiation codons of R2 and sgbR3 (in SEQ ID NOS: 2, 4, and 6, 186th, 1288th, 50th, respectively)
A partial fragment of 500 bp from the 6th ATG) was ligated, and this was ligated to Aspergillus nidulans (A. nidu).
was introduced into koji mold in multiple copies using the sC marker of lans).
【0044】得られた麹菌を用いて固体培養を行った結
果、図4に示すようにグルコアミラーゼのみならず、α
−アミラーゼ、酸性プロテアーゼ、酸性カルボキシペプ
チダーゼ、フィターゼなどについても親株に対する菌体
当たりの酵素生産量が30−70%に低下した。よっ
て、これらの転写制御因子は、固体培養特異的な遺伝子
群の発現を正に制御する因子であることが明らかとなっ
た。As a result of carrying out solid culture using the obtained koji mold, as shown in FIG. 4, not only glucoamylase but α
-For amylase, acid protease, acid carboxypeptidase, phytase, etc., the enzyme production per cell was reduced to 30-70% with respect to the parent strain. Therefore, it was revealed that these transcription control factors are factors that positively control the expression of gene groups specific to solid culture.
【0045】[0045]
【実施例6】sgbR1、sgbR2及びsgbR3の
97bp cis因子とのゲルシフトアッセイ
これらの転写制御因子が、glaBプロモーターのci
s因子である97bpとの結合能を有するかどうかをゲ
ルシフトアッセイにより確認した。sgbR1、sgb
R2及びsgbR3のcDNAを取得するために、蒸し
米を用いた固体培養菌体から調製したmRNAをもとに
RT−PCRを行った。この際に使用したプライマーの
両端には制限酵素サイトBamHI(sgbR1)、E
coRI(sgbR2)及びSmaI(sgbR3)を
付加した。得られたクローンをpUC118のlacZ
プロモーターの正の方向にサブクローニングし、得られ
た発現ベクター(pSGBR1、pSGBR2、pSG
BR3)を大腸菌JM109に形質転換した。アンピシ
リン含有培地で培養し、アンピシリン耐性株を選択し、
得られた形質転換体は、Escherichia co
li SGBR1、Escherichia coli
SGBR2、Escherichia coli S
GBR3と命名し、これを独立行政法人 産業技術総合
研究所 特許生物寄託センターにFERM P−183
57、FERM P−18358、FERM P−18
359としてそれぞれ寄託した。Example 6 Gel shift assay of sgbR1, sgbR2 and sgbR3 with 97 bp cis factor These transcription factors are ci of the glaB promoter.
It was confirmed by gel shift assay whether or not it has the binding ability to 97 bp which is an s factor. sgbR1, sgb
In order to obtain the R2 and sgbR3 cDNAs, RT-PCR was performed based on the mRNA prepared from solid-cultured bacterial cells using steamed rice. The restriction enzyme sites BamHI (sgbR1), E
coRI (sgbR2) and SmaI (sgbR3) were added. The obtained clone was lacZ of pUC118.
Expression vectors (pSGBR1, pSGBR2, pSG) obtained by subcloning in the positive direction of the promoter
BR3) was transformed into E. coli JM109. Culture in ampicillin-containing medium, select ampicillin resistant strain,
The resulting transformant was Escherichia co
li SGBR1, Escherichia coli
SGBR2, Escherichia coli S
Named GBR3 and named it FERM P-183 at National Institute of Advanced Industrial Science and Technology, Patent Biological Depository Center.
57, FERM P-18358, FERM P-18
359, respectively.
【0046】形質転換体は、転写因子sgbR1、sg
bR2、sgbR3タンパク質をコードする遺伝子すべ
て含有している。この点は、形質転換体をOD660が、
0.6になるまで37℃にて培養後、最終1mM IP
TGと最終1mM ZnSO 4を添加し、さらに30℃
で6時間培養を行い、このIPTG誘導により、lac
Zプロモーター支配下でsbgR1、sgbR2及びs
gbR3タンパク質を発現させてこれを確認した。The transformants are the transcription factors sgbR1 and sg.
All genes encoding bR2 and sgbR3 proteins
Contained. In this respect, the transformants were OD660But,
After culturing at 37 ℃ until 0.6, the final 1 mM IP
TG and final 1 mM ZnSO FourIs added, and further 30 ° C
Culturing was carried out for 6 hours, and by this IPTG induction, lac
SbgR1, sgbR2 and s under the control of the Z promoter
This was confirmed by expressing the gbR3 protein.
【0047】ゲルシフトアッセイは、タンパク質として
結合バッファー(最終1mM MgCl2、0.5mM
EDTA、0.5mM DTT、50mM KCl、
5mM Tris−HCl pH7.5、0.05μg
/μl polydIdC、4% glycerol)
に対して透析した大腸菌の無細胞調製液10μgを用
い、プローブとしてはglaBプロモーターのcis因
子である97bpの2本鎖DNAの二量体をGene
Imageキット(アマシャムファルマシア社製)によ
るフルオレッセンラベルしたものを0.8pmol用い
た。両者をバッファー(最終1mM MgCl2、0.
5mM EDTA、0.5mM DTT、50mM K
Cl、5mM Tris−HCl pH7.5、0.0
5μg/μl polydIdC、4% glycer
ol)中で室温30分間インキュベートした後に、2〜
15%グラジエントポリアクリルアミドゲル電気泳動を
4℃、120Vで行った。泳動ゲルは、正電荷メンブレ
ンへアルカリ転写し、抗原抗体反応によりフルオレッセ
ンの検出をGene Imageキット(アマシャムフ
ァルマシア社製)により行った。The gel shift assay was performed using binding buffer (final 1 mM MgCl 2 , 0.5 mM) as protein.
EDTA, 0.5 mM DTT, 50 mM KCl,
5 mM Tris-HCl pH 7.5, 0.05 μg
/ Μl polydIdC, 4% glycerol)
10 μg of a cell-free preparation of Escherichia coli dialyzed against was used as a probe, and a dimer of 97 bp double-stranded DNA which is a cis factor of the glaB promoter was used as a gene.
0.8 pmol of fluorescein-labeled with Image kit (manufactured by Amersham Pharmacia) was used. Both were buffered (final 1 mM MgCl 2 , 0.
5 mM EDTA, 0.5 mM DTT, 50 mM K
Cl, 5 mM Tris-HCl pH 7.5, 0.0
5 μg / μl polydIdC, 4% glycer
2) after incubation at room temperature for 30 minutes in
15% gradient polyacrylamide gel electrophoresis was performed at 120C at 4 ° C. The electrophoretic gel was alkali-transferred to a positively charged membrane, and fluorescein was detected by an antigen-antibody reaction using a Gene Image kit (manufactured by Amersham Pharmacia).
【0048】図5(ゲルシフトアッセイの結果を示すパ
ターンの写真:図面代用写真)に示すようにsgbR1
とsgbR2はglaBプロモーターのcis因子であ
る97bpとの結合により、遅延したバンドが確認され
た。またこのバンドは過剰の非ラベルプローブの添加に
より消失した。よってsgbR1とsgbR2は、とも
にglaBプロモーターのcis因子である97bpへ
の結合能を有することが確認された。sgbR3は97
bpへの結合を確認できなかったが、コサプレッション
により固体培養での酵素遺伝子群の発現を低下させたこ
とから、sgbR1やsgbR2と相互作用することに
より、固体培養での酵素遺伝子発現を制御することが示
唆された。As shown in FIG. 5 (photograph of pattern showing results of gel shift assay: photograph substituting for drawing), sgbR1
A delayed band was confirmed by binding of sgbR2 and 97 bp, which is a cis factor of the glaB promoter. Also, this band disappeared due to the addition of excess unlabeled probe. Therefore, it was confirmed that both sgbR1 and sgbR2 have the ability to bind to 97 bp, which is the cis factor of the glaB promoter. sgbR3 is 97
Although the binding to bp could not be confirmed, the expression of the enzyme gene group in solid culture was reduced by cosuppression. Therefore, the enzyme gene expression in solid culture is controlled by interacting with sgbR1 and sgbR2. It has been suggested.
【0049】[0049]
【発明の効果】固体培養は、醸造産業の培養形態であ
り、液体培養に比べてタンパク質分泌生産量も多く、ま
た培養特異的に生産される酵素群も非常に種類が多い特
徴を有する。本発明によりこの固体培養での遺伝子発現
に関わる転写制御因子が、はじめて明らかとなった。固
体培養は様々な産業で実用化されている培養法であり、
本培養法における発現メカニズムが分子レベルで解明で
きれば当該産業の発展に著しく貢献できるだけでなく固
体培養での異種タンパク質生産を可能にするものであ
る。また本発明は麹菌の酵素を麹菌で生産させるため、
生産される酵素蛋白は非常に安全性が高く、食品、医薬
品、化粧品産業などへも応用が可能な画期的な技術であ
る。よって本転写制御因子が得られたことによって、麹
菌の液体培養及び固体培養での酵素生産の包括的改善を
行うことが可能となり、工業的にも非常に有望な遺伝子
であることが示唆された。EFFECTS OF THE INVENTION Solid culture is a culture form of the brewing industry, and has a characteristic that the amount of secretory protein produced is larger than that of liquid culture, and that the enzyme groups produced in a culture-specific manner have a great variety. According to the present invention, a transcription control factor involved in gene expression in this solid culture was clarified for the first time. Solid culture is a culture method that has been put to practical use in various industries.
If the expression mechanism in the present culturing method can be elucidated at the molecular level, it will not only significantly contribute to the development of the industry but also enable heterologous protein production in solid culture. Further, in the present invention, since the enzyme of koji mold is produced by koji mold,
The enzyme protein produced is extremely safe and is an epoch-making technology that can be applied to the food, pharmaceutical, cosmetics industries, etc. Therefore, it was possible to comprehensively improve enzyme production in liquid culture and solid culture of Aspergillus oryzae by obtaining this transcriptional regulatory factor, and it was suggested that this gene is a very promising gene in industry. .
【0050】[0050]
【配列表】 SEQUENCE LISTING <110> National Research Institute of Brewing ; Gekkeikan Inc., Ltd. <120> DNA-encoding Transfer factor Regulating Expression of Koji-mo ld Gene in Solid Culture <130> 6431 <141> 2001-6-27 <160> 7 <210> 1 <211> 717 <212> PRT <213> Aspergillus oryzae <400> 1 Met Ser Met Tyr Pro Ser Ile Arg Asn Lys Gly Phe Ile Ser Glu Pro 1 5 10 15 Thr Ser Thr Pro Phe Ile Asn Phe Gln Asp Pro Val Phe Ile Gln Asp 20 25 30 Glu Phe Leu Pro Asp Leu Ser Gln Glu Ala Ala Leu Gln Phe Tyr Asn 35 40 45 Gln Gln Leu Pro Arg Leu Gln Ser Pro Phe Gln Tyr His Ser Gly Leu 50 55 60 Glu Phe Tyr Ser Pro Pro Ser Ser Ala Pro Ser Ser Pro Thr Ser Ser 65 70 75 80 Pro Ala Ser Ser Thr Thr His Leu Pro Ala Thr Thr Ala Val Ala Ala 85 90 95 Glu Tyr Pro Asn Ser Phe Glu Ala Pro Ser Met Thr Pro Val Ser Ser 100 105 110 Tyr Asn Ser Ala Phe Asn Ile Gln His Thr Ala Thr Pro Pro Ser Ser 115 120 125 Gln Pro Tyr Leu Ser Gln Tyr Pro Gln Tyr Leu Phe Glu Asn Ser Pro 130 135 140 Met Leu Ala Gln Gln Pro Val Ala Asn Thr His Gly Trp Glu Asn Asn 145 150 155 160 Leu Gln Leu Leu Ser Ser Ala Arg Met Gln His Lys Ala Ser Pro Ser 165 170 175 Thr Ser Ser Thr Arg Ser Ala Pro Ala Gly Ser Ser Tyr Gln Arg Ser 180 185 190 Asn Ala Ser Thr Leu Ser Lys Pro Leu Pro Thr Pro Val Gln Thr Pro 195 200 205 Ile Gln Asn Ser Phe Leu Ala Ala Pro Tyr Gln Gln Asn Tyr Asp Thr 210 215 220 Ser Val His Asp Gly Ser Gln Ala Glu Ala Glu Met Val Arg Arg Ala 225 230 235 240 Val Met Glu Gln Gln Gln Lys Gln Gln Gln Gln Gln Ser His His Gln 245 250 255 His Gln Pro Ser Asp Tyr Ser Leu Ala Pro Ser Val Ser Ser Val Ser 260 265 270 His Asn Ser Pro Val Thr Pro Gln Ile Lys Pro Glu Glu Leu Asp Glu 275 280 285 Ala Ser Lys Ala Met Val Asn Gly Glu Lys Arg Tyr Pro Asp Ile Asp 290 295 300 Arg Trp Met Asp Asp Tyr Leu His Leu Asp Ala Phe Ala Asp Tyr Asn 305 310 315 320 Asn His Asn Gly Asn Asn Leu Pro Ile Gly Ile Pro Lys Ile Asn Arg 325 330 335 Thr Met Ser Asp Ile Tyr Gln Asp Glu Leu Tyr Asn Pro Ala Leu Met 340 345 350 Pro Thr Pro Gln Val Ser Lys Gln Thr Thr Asn Gln Gln Asn Leu Leu 355 360 365 Asn Pro Phe Arg Asn Val Phe Ala Asp Arg Leu Gln Ala Ala Asn Gln 370 375 380 Gly His Met Thr Ala Arg Ser His Ser Pro Val Val Asn Met His Arg 385 390 395 400 Asp Arg Ser Pro Phe Arg Gln Asn Ser Pro Leu Ala Ala Glu Tyr Asn 405 410 415 Asn Gly Phe Gln Gln Pro Gln Met Ala Thr Ser Val Pro Met Thr Gln 420 425 430 Asn Val Gly Gln Ser Gln Gly Glu Gly Glu Pro Lys Thr Met Ser Pro 435 440 445 Lys Asp Ala Leu Leu Asp Phe Asn Glu Gly Asp Asp Ala Gly Ile Pro 450 455 460 Leu Phe Pro Thr Ser Gln Pro Asp Phe Asn Leu Gly Glu Ala Leu Gly 465 470 475 480 Leu Arg Arg Glu Ser Ser Ser Ser Phe Pro Gln Ser Gln Asn Phe Thr 485 490 495 Ser Met Glu Ser Phe Pro Thr Gln Tyr Thr Thr Pro Asn Gly Leu Pro 500 505 510 Gln Gln Tyr Pro Phe Ala Gln Gln Gln Gln Asp His Gln Gln Gln Gln 515 520 525 Gln Asn Asn Leu Leu His Gln Thr Pro Glu Phe Pro Ala Ser Leu Pro 530 535 540 His Phe Glu Ser Thr Asn Ser Asp Ala Gly Val Asn Asn Gly Val Ala 545 550 555 560 Ser Pro Pro Ala Gln Pro Thr Met Ala Met Arg Pro Val Lys Glu Glu 565 570 575 Ile Thr Arg Pro Glu Arg Thr Ser Ala Asp Ser Gly Thr Tyr Thr Cys 580 585 590 Thr Tyr His Gly Cys Thr Leu Arg Phe Glu Thr Pro Thr Lys Leu Gln 595 600 605 Lys His Lys Arg Glu Ala His Arg Gln Thr Thr Pro Gly Gly His Leu 610 615 620 Val Gly Arg Asp Thr Ser Ala Arg Asn Ser Gln Ala Gly Pro His Lys 625 630 635 640 Cys Glu Arg Ile Asn Pro Ser Thr Gly Lys Pro Cys Asn Ser Val Phe 645 650 655 Ser Arg Pro Tyr Asp Leu Thr Arg His Glu Asp Thr Ile His Asn Ala 660 665 670 Arg Lys Gln Lys Val Arg Cys His Leu Cys Thr Glu Glu Lys Thr Phe 675 680 685 Ser Arg Asn Asp Ala Leu Thr Arg His Met Arg Val Val His Pro Glu 690 695 700 Val Asp Trp Pro Gly Lys Gln Arg Arg Arg Gly Arg Glu 705 710 715 717 <210> 2 <211> 2362 <212> DNA <213> Aspergillus oryzae <400> 2 ccacttcata ttcgcagtgt tcggggataa agataatcgg atcttgggcc gttggaactt 60 ttatttggag gcttgggtgc catacctagt acttaacttc tggaacctcc cccagtggcg 120 aatcctctga acttccccat ccactccggc gtcataccct tgacgtccat ttgttgaaag 180 agaacatgtc catgtaccct tcgatacgca ataagggttt tatcagtgaa ccaacatcta 240 cccccttcat caatttccaa gaccccgttt tcatccagga cgagttcctt cccgacttgt 300 cgcaggaggc cgcattacag ttctacaacc agcaactacc caggctccaa tcacctttcc 360 agtaccactc gggtcttgaa ttttactctc cgccatcttc cgcgccgtcg tcaccgactt 420 cctctccagc ctcttccacc acccacctgc cagcgaccac agcggtggct gcggaatacc 480 ccaactcctt tgaggctcct tcgatgacac ccgtgtcctc ctataactct gcattcaaca 540 ttcagcatac cgcgacacca ccgtcctcac agccatactt gtcacaatac ccccaatacc 600 tttttgaaaa ctccccaatg ttggcgcaac agcctgtcgc aaacacccac ggctgggaaa 660 acaacctgca gctgctgagc tcggctcgca tgcagcacaa ggcatcaccg agtacctcgt 720 cgactcgatc tgcacctgca gggagctcgt accaaagatc taacgcttct acgctgtcga 780 agcctctacc cactcccgtc cagacaccta tccagaactc gtttctcgcc gcgccctatc 840 agcagaacta tgacacttcg gtgcatgatg gtagccaagc tgaagcggag atggtgagac 900 gggctgtgat ggaacagcag cagaagcagc aacagcaaca aagtcaccat cagcatcagc 960 caagtgatta ctcactggca ccttcggtat catcagtgag ccacaactca cctgttactc 1020 ctcagataaa accggaggaa cttgacgagg cctcgaaggc gatggtcaat ggtgagaaac 1080 gatatcctga tatagaccga tggatggatg attacttaca tctcgatgcc tttgcagact 1140 acaataacca caacggaaac aaccttccga tcggcattcc caaaatcaac cgaactatgt 1200 cggacatcta ccaagatgag ctctataacc cagctctcat gccgactcct caagtctcca 1260 aacagactac gaaccagcag aatcttctga acccgttcag aaacgtcttt gctgatcgac 1320 tgcaagccgc caatcagggc cacatgactg cacggtctca ttctcctgtt gttaacatgc 1380 accgggatcg gtctcctttc cgacagaact cgcccctggc tgctgagtac aacaatgggt 1440 tccagcagcc ccagatggcg accagtgtgc ctatgactca aaatgtaggc caaagtcaag 1500 gagaaggaga accgaagact atgtctccta aggatgccct gttggacttc aacgagggag 1560 atgacgccgg gattcctttg ttcccaacaa gtcaacctga cttcaacctt ggtgaagcac 1620 tcggcctccg acgtgaaagc tcatcatcct tcccgcagtc gcagaacttt acttccatgg 1680 agtcattccc cacgcaatac actacaccga atggacttcc ccagcagtat cccttcgcgc 1740 agcagcaaca agaccatcag cagcaacaac agaacaacct cctgcaccag acccccgaat 1800 tccctgcatc ccttcctcac tttgagtcta cgaacagtga tgccggggtc aacaacggtg 1860 tggcttctcc accggcacag cccacaatgg cgatgcgccc agtgaaggaa gagatcaccc 1920 gccctgagcg cacttctgcg gacagtggca cttacacttg cacttaccac ggctgcacgc 1980 ttcggtttga aacgcctacg aagctgcaga agcacaagcg cgaggcccac cgtcagacta 2040 cgcctggcgg ccacttggtt gggcgcgata cttccgcacg caactctcag gccggtcccc 2100 acaaatgtga gcggatcaat ccgtctacgg gaaagccatg caattctgtt ttctcccggc 2160 cttacgatct tacccggcac gaggacacga tccacaacgc acgcaagcag aaggtccggt 2220 gtcatctgtg cacggaagaa aagacctttt cgcgtaatga tgcgctgacc cggcacatgc 2280 gagtggtgca ccctgaggtc gattggcccg gcaagcaaag aaggagaggc agggagtaag 2340 cctgaatggg gttgtggtgt ca 2362 <210> 3 <211> 597 <212> PRT <213> Aspergillus oryzae <400> 3 Met Ala Pro Thr Val Gln Gly Gln Pro Ser Phe Ala Tyr Tyr Pro Thr 1 5 10 15 Ala Asp Ser Gln Arg Gln Gln Tyr Thr Ser His Pro Ala Glu Pro Gln 20 25 30 Pro Tyr Tyr Gly Gln Ile Gln Ala Phe Pro Gln Gln His Cys Leu Pro 35 40 45 Glu Gln Gln Pro Val Tyr Asn Ala Gln Pro Met Met Asn Met His Gln 50 55 60 Met Ala Thr Thr Asn Ala Phe Arg Gly Ala Met Asn Met Thr Pro Ile 65 70 75 80 Ala Ser Pro Gln Pro Ser His Leu Lys Pro Thr Ile Val Val Gln Gln 85 90 95 Gly Ser Pro Ala Leu Met Pro Leu Asp Thr Arg Phe Val Gly Asp Tyr 100 105 110 Tyr Ser Phe Pro Ser Thr Pro Pro Leu Ser Thr Ala Gly Ser Ser Ile 115 120 125 Ser Ser Pro Pro Ser Ser Ser Gly Thr Leu His Thr Pro Ile Asn Asp 130 135 140 Cys Phe Phe Ser Phe Glu Lys Val Glu Gly Val Lys Glu Gly Cys Glu 145 150 155 160 Ser Asp Val His Ala Glu Leu Leu Ala Ser Ala Asp Trp Thr Arg Ser 165 170 175 Ala Thr Ser Pro Pro Met Thr Pro Val Phe Ile Asn Ala Asn Ser Leu 180 185 190 Thr Ala Ser Gln Ser Ser Asp Phe Leu Ser Ala His Gly Ser Cys Pro 195 200 205 Ser Leu Ser Pro Ser Pro Leu Leu Val Ser Ser Val Phe Thr Pro Ser 210 215 220 Gln Ser Ala Phe Pro Val Glu Gln Ala Thr Ser Asp Phe Cys Asp Pro 225 230 235 240 Arg Gln Leu Thr Val Glu Ser Ser Val Asn Thr Ser Ser Pro Ala Glu 245 250 255 Leu Pro Pro Leu Pro Thr Leu Ser Cys Asp Glu Glu Glu Pro Lys Val 260 265 270 Val Leu Gly Ser Glu Ala Val Thr Leu Pro Val His Glu Asp Pro Ser 275 280 285 Pro Ala Tyr Thr Ser Ser Thr Glu Asp Pro Leu Ser Ser Leu Pro Thr 290 295 300 Phe Asp Ser Phe Ser Asp Leu Asp Ser Glu Asp Glu Phe Val Asn Arg 305 310 315 320 Leu Val Asp Phe His Pro Ser Gly Asn Thr Tyr Tyr Val Gly Glu Lys 325 330 335 Arg Gln Arg Leu Ser Ala Tyr Ser Phe Asp Asp Glu Glu Phe Leu Ser 340 345 350 Glu His Ser Leu Glu Asp Ser Ser Asp Asp Leu Glu Leu Ala His Ser 355 360 365 Gly Leu Ser Phe Leu Gly Cys Ala Asp Phe Ala Pro Ala Gln Ser Asp 370 375 380 Ala Ser Glu Thr Ala Asp Glu Met Lys Thr Lys Lys Arg Ser Asn Ser 385 390 395 400 Arg Lys Ser Leu Lys Arg Ala Asn Ser Glu Asp Gln Asp Ala Leu Lys 405 410 415 Lys Ala Gln Ala Pro Ile Asn Ser Arg Ala Asn Ser Thr Glu Ala Asn 420 425 430 Val Ala Gln Gln Ala Ala Ala Pro Ser Cys Ser Ala Ser Glu Ala Asn 435 445 440 Val Ser Ser Ser Cys Glu Ala Pro Ser Val Pro Val Ser Val Asn Arg 450 455 460 Arg Gly Arg Lys Gln Ser Leu Thr Asp Asp Pro Ser Lys Thr Phe Val 465 470 475 480 Cys Ser Leu Cys Ser Arg Arg Phe Arg Arg Gln Glu His Leu Lys Arg 485 490 495 His Tyr Arg Ser Leu His Thr Gln Asp Lys Pro Phe Glu Cys Asn Glu 500 505 510 Cys Gly Lys Lys Phe Ser Arg Ser Asp Asn Leu Ala Gln His Ala Arg 515 520 525 Thr His Ala Gly Gly Ser Ile Val Met Gly Val Leu Asp Thr Asn Asn 530 535 540 Ala Ser Glu Arg Tyr Asp Asn Arg Asp Ala Ser Thr Met Gly Ala Val 545 550 555 560 Leu Tyr Glu Ala Ala Thr Leu Arg Arg Pro Ser Arg Arg Leu Ala Asn 565 570 575 His Pro Lys Met Gly Phe Leu Thr His Pro His Arg Ser Ser Ala Arg 580 585 590 Gln Glu Ala Gln Ala 595 597 <210> 4 <211> 3167 <212> DNA <213> Aspergillus oryzae <400> 4 ggggccggtt gggggccctc ccggggttgg gttacgggaa aagtttcaag ggccaaagga 60 aaaaaaaggt aaaaaagatt tagggaaaaa aaggccaaaa aaaagggcct tggaaaaagg 120 aattcccaag aacctttccc ccaaaaggtt aattggggta ggggaaccat tggggtttct 180 taccaaggtt tttcccttta tcctttgccc ccttcaggga aaaatttact tattatccta 240 gggaatttgg ccccccccag gggggggatc cctgggattt aagggggggt gccccccggc 300 ccccccctaa gggggttccc ccctttttag ggattttttt ttttttggaa attaaatttc 360 ctttcccttt ttttttaaaa aaaataaaat cttttcggtt ttattttatt ttttataata 420 aaaaaattcc taattataat tataaatccg ggtattttta gggataaatt ctaggtttta 480 actaccctcc ctcccttttc cccgggcccc cccccctcca aatagacaag acggactaaa 540 actaaaaagt acggctccag tcctacgact aactctcact acgcacggcg ctactaagtc 600 gtagcacaag cagatcagtc tctcttttat ccactgtcca tccttctggc cgtctgtccc 660 tcttcccttt cccatatcgg ccctgtctct cgtttccccg cgactgtcca ttagtcctct 720 ttatttctta gtcttttcct tttttccctt tttagacttg tctcgattct ctgtccttcc 780 cttcgttatt tcttattttt tggctttgat ccccatctct ctcttctctc tcttctttta 840 ctcatcatct gtctccggta cctttgatcg tgatcgtccg aatccaaaag gtggctcgta 900 tcgaccagag gttcctgtcc ttccttctat cagatataac tataccgtac attcacactc 960 agaagcccag aagacgttgc atattttata aaccctaatt cggactgcac tcttctgcct 1020 tcagtcattc attcatccag aacaagagac atctacattc attcaaccta cagtgctcga 1080 catctatcat tagcaccaag ctgcctcgac atcaatctgg aacatcctga cttagactaa 1140 acatacttaa taatacccaa gaggttaagc agtgttcgct tgatcctcat cacattcatc 1200 attggttcgt ggaaggatga ctatatatca tacaggatgc ccatactgat ccctcgtctt 1260 ctagtcacaa tggacgctac atacaccatg gcaccaacgg tgcaaggaca accatcattt 1320 gcatactacc ccactgctga ttcacaaaga caacaataca caagccaccc ggctgagccc 1380 cagccatact atggacagat tcaggccttc cctcagcaac actgcctacc agagcaacaa 1440 ccggtctaca atgctcaacc catgatgaac atgcatcaga tggctaccac caatgccttc 1500 cgcggagcaa tgaacatgac tcccatcgcc tctccccaac catctcacct gaagcccaca 1560 atcgtcgtcc agcagggttc cccagccctc atgcctttgg acacaaggtt cgtcggtgac 1620 tactatagct tcccttctac ccctccactc tccaccgcgg gaagttctat cagcagcccg 1680 ccatccagca gtggcacctt gcacacccca atcaacgact gcttcttctc attcgaaaag 1740 gtggagggtg taaaggaggg ttgtgaaagt gacgtccatg ccgagctttt ggccagtgcc 1800 gactggactc ggtcagcaac ttcgcctccc atgactcctg gtatgttttc ttctcatcag 1860 ttgagtaggg ggttggaacc agtctaactt cttctagtct tcatcaacgc caattctctc 1920 accgccagtc agagctcaga cttcctctct gcccacgggt cttgcccctc tctttctcct 1980 tcgcctctcc tggtatcttc cgtgtttact ccgtctcagt cggctttccc tgttgagcag 2040 tcgcctctcc tggtatcttc cgtgtttact ccgtctcagt cggctttccc tgttgagcag 2040 gccacctccg acttctgtga ccctcggcag ctgactgtcg agtcttccgt caacacatcc 2100 tctcccgctg agcttcctcc tttgcccact ctctcctgcg atgaggagga accgaaggtg 2160 gtcctgggaa gcgaggctgt gactttgcca gttcatgagg acccatcacc cgcttacact 2220 agctctaccg aggaccccct cagttcgttg cccacttttg acagcttctc cgaccttgat 2280 tcggaggacg agtttgtcaa ccgcttggtt gacttccatc ccagcggtaa tacttactac 2340 gtgggcgaaa agagacagcg cctcagcgca tactccttcg atgacgagga gttcttgagc 2400 gagcacagct tggaagattc ttcagacgac ctggagttgg cccactctgg cctttctttc 2460 ctgggatgcg ccgatttcgc tccggctcaa agtgatgcca gcgagaccgc tgacgagatg 2520 aagacaaaga agcgaagcaa ctcccgcaag tctctcaaga gagcaaactc tgaggaccag 2580 gatgctctca agaaggcaca ggcaccgatc aacagccggg ccaacagcac agaggccaac 2640 gtggctcagc aggcagctgc tccgtcctgc tcagcctcgg aagccaatgt atctagctcc 2700 tgcgaggccc catctgttcc tgtgtcggtc aaccgccggg gtcgtaagca atccctgacg 2760 gatgatcctt ccaagacctt tgtctgctct ctctgctctc gtcgtttccg tcgtcaggag 2820 caccttaagc gtcactaccg ctctctgcac acccaggaca agcccttcga gtgcaacgag 2880 tgtggtaaga agttttctcg gagtgacaac cttgcgcagc atgccaggac ccatgccggt 2940 ggctctatcg tgatgggtgt cctggacacg aacaacgcat ctgaacggta tgataatcgc 3000 gacgcgagta ccatgggtgc tgtgctctat gaagctgcca cgctgcggcg accaagtcga 3060 cgactagcga atcatccgaa gatggggttt ctgacgcacc ctcatcgatc gtcggcccgc 3120 caagaagcgc aagcgtgacg agcactttag ttgcccccat tgccggt 3167 <210> 5 <211> 468 <212> PRT <213> Aspergillus oryzae <400> 5 Met Ala Ser Ile Asp Pro Asn Val Pro Thr Thr Thr Leu Pro Tyr Thr 1 5 10 15 Cys Asn Thr Cys Leu Val Ala Phe Arg Gly Ser Asp Ala Gln Arg Asp 20 25 30 His Met Arg Lys Asp Trp His Leu Tyr Asn Met Lys Arg Arg Ile Ala 35 40 45 Ser Leu Pro Pro Val Ser Gln Glu Val Phe Asn Asp Lys Val Leu Ala 50 55 60 Ala Lys Ala Thr Thr Ser Ala Ala Ala Ala Lys Ala Ser Phe Glu Lys 65 70 75 80 Thr Cys Val Ala Cys Gln Lys Thr Phe Phe Ser Glu Asn Ser Tyr Gln 85 90 95 Asn His Val Lys Ser Ser Lys His Lys Ala Arg Glu Ala Gln Met Leu 100 105 110 Arg Asp Ser Ala Asp Asp Ala Ser Ser Val Met Ser Ser Thr Phe Ser 115 120 125 Leu Gly Glu Pro Val Asn Lys Pro Arg Glu Arg Ser Glu Val Ser Lys 130 135 140 Val Thr Glu Ser Leu Lys Asn Ala Thr Ile Glu Glu Asp Asp Glu Asp 145 150 155 160 Glu Glu Met Glu Glu Gln Gly Phe Ser Ala Ser Arg Cys Leu Phe Cys 165 170 175 Asn Glu Lys Ser Ser Asp Leu Gln Gln Asn Thr Glu His Met Phe Lys 180 185 190 Thr His Gly Met Phe Ile Pro Glu Lys Asp Tyr Leu Val Asp Leu Glu 195 200 205 Gly Leu Val His Tyr Leu Tyr Arg Lys Ile Asn Glu Asn Ser Glu Cys 210 215 220 Leu Tyr Cys His Ala Val Arg Asn Asn Pro Glu Gly Ala Arg Thr His 225 230 235 240 Met Arg Asp Lys Gly His Cys Met Ile Ala Phe Glu Lys Gln Asp Glu 245 250 255 Gln Val Glu Ile Gly Gln Phe Tyr Asp Phe Arg Ser Thr Tyr Ser Asp 260 265 270 Gly Glu Gly Glu Asp Glu Glu Asp Ser Ile Met Glu Asp Gly Gly Val 275 280 285 Lys Val Asp Gly Glu Asp Asp Glu Gly Trp Glu Thr Glu Thr Ser Ala 290 295 300 Ser Ser Met Asp Asp Asp Glu Asp Glu Leu Asp Asp Thr Lys Gly Gln 305 310 315 320 Val Tyr Ala Thr Glu Phe Glu Leu His Leu Pro Ser Gly Arg Thr Ala 325 330 335 Gly His Arg Ser Leu Ala Lys Tyr Tyr Arg Gln Asn Leu Arg Asn Tyr 340 345 350 Pro Thr Ala Glu Glu Arg Ala Ala Arg Gln Leu Ala Ile Glu Asn Gly 355 360 365 Glu Ile Glu Glu Glu Glu Pro Lys Pro Arg Gly Arg Asp Leu Asn Arg 370 375 380 Ala Val Pro Glu Ala Phe Ala Ala Glu Leu Glu Arg Lys Asp Arg Thr 385 390 395 400 Arg Ala Gln Arg Gln Glu Lys Arg Tyr Thr Ala Arg Val Asn Arg Ala 405 410 415 Ala Asn Asn Gln Lys His Phe Arg Val Cys Phe Ile Pro Leu Leu Ile 420 425 430 Ile Arg Gln Ala Asn Thr Pro Ser Arg Ile Pro Ser Cys Ser Arg Phe 435 440 445 Leu Tyr Leu Ile Cys Leu Arg Ala Cys Trp His Cys Leu Ala Lys Ser 450 455 460 Trp Ser Tyr Gly 465 468 <210> 6 <211> 2337 <212> DNA <213> Aspergillus oryzae <400> 6 tggtcgacgg cccgggctgg tatcatacgg actgacacac cagcctcagg cgccaagcat 60 gaggccggct gactaacatt aaatggaaga agtatgatcc gatcgttgat gcactaggct 120 tgatatagat agggccttcc cagacaggat tagtggaggg gcacgtgcaa tcaacgtcat 180 gttctctaaa agcggcatgg gcaatggttt tcaatgtgtg gtccgggtta tcggcgacat 240 cgattactag cccgatcttt tctgtcgagg ggcaatattg cgcccgcttc actttctcgc 300 tagatacttc ctaccggctt ttatttctaa tcaggcccaa aactttctgt gacgactacg 360 cctggttttg ctttttcatt gctcttgcaa aaacgaatac ccccgcgcaa tatatattcc 420 cacaaccgat cgccttttca tcaaatttat ttctgtgata cccacctttt ttttttttgt 480 tccatatcag gaactgtcag aaatcatggc ctccatcgat ccaaatgtac caacaactac 540 gttgccatat acctgcaaca cctgtctcgt tgcttttcgc ggtagcgatg ctcagcggga 600 tcatatgcgc aaggactggc agtaagtttg ttggaaaata tacgactcag aatgcattac 660 taatgttcgg tcgactcaag tctctataac atgaagcgcc gcatcgcgtc tctgccccca 720 gtgtcccagg aggtttttaa cgacaaggtc cttgctgcca aagccacaac tagtgctgcc 780 gccgccaaag cttcctttga gaagacatgt gtcgcctgcc agaagacatt cttcagcgag 840 aactcgtatc agaaccacgt gaagagttcc aagcacaagg cccgtgaggc acagatgctt 900 agagacagcg ccgatgatgc atcgtctgtc atgagttcta ctttctctct gggcgagcca 960 gtcaacaagc ctcgtgagcg gtcggaggtg tcgaaagtta cggagagtct caagaacgct 1020 accatcgaag aggatgacga agatgaggaa atggaagagc agggcttctc ggcctcccgt 1080 tgtcttttct gtaatgagaa atcttcagat cttcaacaaa acaccgagca catgttcaag 1140 acccacggca tgttcatacc agagaaggac tatcttgttg atttggaggg acttgtccac 1200 tatctttatc ggaagatcaa tgagaacagt gaatgtttat actgccatgc cgtccgaaac 1260 ggtgagggtg aggatgagga ggactcgatc atggaggatg gtggtgtcaa ggtggatggc 1440 gaggatgatg agggatggga aaccgaaact tctgcatctt ctatggatga cgatgaggat 1500 gagctcgacg acacgaaggg acaggtctat gcgacagagt ttgagctgca tttgccctca 1560 ggccgcactg caggtcaccg atcgctcgcc aaatactacc gccagaactt gcgtaactac 1620 cctacagcgg aagaacgggc agctcgtcag ctggctattg aaaatggcga gattgaggag 1680 gaggaaccga aacccagggg tcgcgacctc aatcgggccg ttgttagccg tggcaatggc 1740 ggtattgggt atgatcggtg ctactgacag ccagaagcat ttgctgctga gcttgaacgc 1800 aaagaccgga ctcgtgccca gcgacaagag aagcggtaca cagctcgggt caaccgggct 1860 gccaataacc agaagcactt cagggtatgt ttcatccctt tactgataat aagacaagct 1920 aacaccccct ctaggatccc ctcctgcagt agattcctgt atctgatttg tttgcgtgca 1980 tgttggcatt gtttggcaaa atcatggagt tacggctgaa tgacctgtag atactcatgg 2040 caagataaaa agtgattatt aagaagatga aggatattac ttcgcattcc tgctatcttt 2100 actattttag acagtgctgc atcttatgac ggatataccg aaggtctacc agactgaagg 2160 tgaatcaagt cctacatcta accccccgac caaaccctaa gtcaccacat ctcacagaac 2220 cgatagcaaa cttacaatca agcaccataa caacaaacca ggaccacaaa taataatata 2280 tgaacatcat atcactttac cagcccgggc cgtcgaccac gcgtgcccta tagtgag 2337 <210> 7 <211> 97 <212> DNA <213> Artificial Sequence <400> 7 gagaactaag agaatggcgg cacgggcaga tgtcgggatg atcactttta gtcectccga 60 cgcaatatcg atttcaattg gatccgccac gctgcat 97[Sequence list] SEQUENCE LISTING <110> National Research Institute of Brewing; Gekkeikan Inc., Ltd. <120> DNA-encoding Transfer factor Regulating Expression of Koji-mo ld Gene in Solid Culture <130> 6431 <141> 2001-6-27 <160> 7 <210> 1 <211> 717 <212> PRT <213> Aspergillus oryzae <400> 1 Met Ser Met Tyr Pro Ser Ile Arg Asn Lys Gly Phe Ile Ser Glu Pro 1 5 10 15 Thr Ser Thr Pro Phe Ile Asn Phe Gln Asp Pro Val Phe Ile Gln Asp 20 25 30 Glu Phe Leu Pro Asp Leu Ser Gln Glu Ala Ala Leu Gln Phe Tyr Asn 35 40 45 Gln Gln Leu Pro Arg Leu Gln Ser Pro Phe Gln Tyr His Ser Gly Leu 50 55 60 Glu Phe Tyr Ser Pro Pro Ser Ser Ala Pro Ser Ser Pro Thr Ser Ser 65 70 75 80 Pro Ala Ser Ser Thr Thr His Leu Pro Ala Thr Thr Ala Val Ala Ala 85 90 95 Glu Tyr Pro Asn Ser Phe Glu Ala Pro Ser Met Thr Pro Val Ser Ser 100 105 110 Tyr Asn Ser Ala Phe Asn Ile Gln His Thr Ala Thr Pro Pro Ser Ser 115 120 125 Gln Pro Tyr Leu Ser Gln Tyr Pro Gln Tyr Leu Phe Glu Asn Ser Pro 130 135 140 Met Leu Ala Gln Gln Pro Val Ala Asn Thr His Gly Trp Glu Asn Asn 145 150 155 160 Leu Gln Leu Leu Ser Ser Ala Arg Met Gln His Lys Ala Ser Pro Ser 165 170 175 Thr Ser Ser Thr Arg Ser Ala Pro Ala Gly Ser Ser Tyr Gln Arg Ser 180 185 190 Asn Ala Ser Thr Leu Ser Lys Pro Leu Pro Thr Pro Val Gln Thr Pro 195 200 205 Ile Gln Asn Ser Phe Leu Ala Ala Pro Tyr Gln Gln Asn Tyr Asp Thr 210 215 220 Ser Val His Asp Gly Ser Gln Ala Glu Ala Glu Met Val Arg Arg Ala 225 230 235 240 Val Met Glu Gln Gln Gln Lys Gln Gln Gln Gln Gln Ser His His Gln 245 250 255 His Gln Pro Ser Asp Tyr Ser Leu Ala Pro Ser Val Ser Ser Val Ser 260 265 270 His Asn Ser Pro Val Thr Pro Gln Ile Lys Pro Glu Glu Leu Asp Glu 275 280 285 Ala Ser Lys Ala Met Val Asn Gly Glu Lys Arg Tyr Pro Asp Ile Asp 290 295 300 Arg Trp Met Asp Asp Tyr Leu His Leu Asp Ala Phe Ala Asp Tyr Asn 305 310 315 320 Asn His Asn Gly Asn Asn Leu Pro Ile Gly Ile Pro Lys Ile Asn Arg 325 330 335 Thr Met Ser Asp Ile Tyr Gln Asp Glu Leu Tyr Asn Pro Ala Leu Met 340 345 350 Pro Thr Pro Gln Val Ser Lys Gln Thr Thr Asn Gln Gln Asn Leu Leu 355 360 365 Asn Pro Phe Arg Asn Val Phe Ala Asp Arg Leu Gln Ala Ala Asn Gln 370 375 380 Gly His Met Thr Ala Arg Ser His Ser Pro Val Val Asn Met His Arg 385 390 395 400 Asp Arg Ser Pro Phe Arg Gln Asn Ser Pro Leu Ala Ala Glu Tyr Asn 405 410 415 Asn Gly Phe Gln Gln Pro Gln Met Ala Thr Ser Val Pro Met Thr Gln 420 425 430 Asn Val Gly Gln Ser Gln Gly Glu Gly Glu Pro Lys Thr Met Ser Pro 435 440 445 Lys Asp Ala Leu Leu Asp Phe Asn Glu Gly Asp Asp Ala Gly Ile Pro 450 455 460 Leu Phe Pro Thr Ser Gln Pro Asp Phe Asn Leu Gly Glu Ala Leu Gly 465 470 475 480 Leu Arg Arg Glu Ser Ser Ser Ser Phe Pro Gln Ser Gln Asn Phe Thr 485 490 495 Ser Met Glu Ser Phe Pro Thr Gln Tyr Thr Thr Pro Asn Gly Leu Pro 500 505 510 Gln Gln Tyr Pro Phe Ala Gln Gln Gln Gln Asp His Gln Gln Gln Gln 515 520 525 Gln Asn Asn Leu Leu His Gln Thr Pro Glu Phe Pro Ala Ser Leu Pro 530 535 540 His Phe Glu Ser Thr Asn Ser Asp Ala Gly Val Asn Asn Gly Val Ala 545 550 555 560 Ser Pro Pro Ala Gln Pro Thr Met Ala Met Arg Pro Val Lys Glu Glu 565 570 575 Ile Thr Arg Pro Glu Arg Thr Ser Ala Asp Ser Gly Thr Tyr Thr Cys 580 585 590 Thr Tyr His Gly Cys Thr Leu Arg Phe Glu Thr Pro Thr Lys Leu Gln 595 600 605 Lys His Lys Arg Glu Ala His Arg Gln Thr Thr Pro Gly Gly His Leu 610 615 620 Val Gly Arg Asp Thr Ser Ala Arg Asn Ser Gln Ala Gly Pro His Lys 625 630 635 640 Cys Glu Arg Ile Asn Pro Ser Thr Gly Lys Pro Cys Asn Ser Val Phe 645 650 655 Ser Arg Pro Tyr Asp Leu Thr Arg His Glu Asp Thr Ile His Asn Ala 660 665 670 Arg Lys Gln Lys Val Arg Cys His Leu Cys Thr Glu Glu Lys Thr Phe 675 680 685 Ser Arg Asn Asp Ala Leu Thr Arg His Met Arg Val Val His Pro Glu 690 695 700 Val Asp Trp Pro Gly Lys Gln Arg Arg Arg Gly Arg Glu 705 710 715 717 <210> 2 <211> 2362 <212> DNA <213> Aspergillus oryzae <400> 2 ccacttcata ttcgcagtgt tcggggataa agataatcgg atcttgggcc gttggaactt 60 ttatttggag gcttgggtgc catacctagt acttaacttc tggaacctcc cccagtggcg 120 aatcctctga acttccccat ccactccggc gtcataccct tgacgtccat ttgttgaaag 180 agaacatgtc catgtaccct tcgatacgca ataagggttt tatcagtgaa ccaacatcta 240 cccccttcat caatttccaa gaccccgttt tcatccagga cgagttcctt cccgacttgt 300 cgcaggaggc cgcattacag ttctacaacc agcaactacc caggctccaa tcacctttcc 360 agtaccactc gggtcttgaa ttttactctc cgccatcttc cgcgccgtcg tcaccgactt 420 cctctccagc ctcttccacc acccacctgc cagcgaccac agcggtggct gcggaatacc 480 ccaactcctt tgaggctcct tcgatgacac ccgtgtcctc ctataactct gcattcaaca 540 ttcagcatac cgcgacacca ccgtcctcac agccatactt gtcacaatac ccccaatacc 600 tttttgaaaa ctccccaatg ttggcgcaac agcctgtcgc aaacacccac ggctgggaaa 660 acaacctgca gctgctgagc tcggctcgca tgcagcacaa ggcatcaccg agtacctcgt 720 cgactcgatc tgcacctgca gggagctcgt accaaagatc taacgcttct acgctgtcga 780 agcctctacc cactcccgtc cagacaccta tccagaactc gtttctcgcc gcgccctatc 840 agcagaacta tgacacttcg gtgcatgatg gtagccaagc tgaagcggag atggtgagac 900 gggctgtgat ggaacagcag cagaagcagc aacagcaaca aagtcaccat cagcatcagc 960 caagtgatta ctcactggca ccttcggtat catcagtgag ccacaactca cctgttactc 1020 ctcagataaa accggaggaa cttgacgagg cctcgaaggc gatggtcaat ggtgagaaac 1080 gatatcctga tatagaccga tggatggatg attacttaca tctcgatgcc tttgcagact 1140 acaataacca caacggaaac aaccttccga tcggcattcc caaaatcaac cgaactatgt 1200 cggacatcta ccaagatgag ctctataacc cagctctcat gccgactcct caagtctcca 1260 aacagactac gaaccagcag aatcttctga acccgttcag aaacgtcttt gctgatcgac 1320 tgcaagccgc caatcagggc cacatgactg cacggtctca ttctcctgtt gttaacatgc 1380 accgggatcg gtctcctttc cgacagaact cgcccctggc tgctgagtac aacaatgggt 1440 tccagcagcc ccagatggcg accagtgtgc ctatgactca aaatgtaggc caaagtcaag 1500 gagaaggaga accgaagact atgtctccta aggatgccct gttggacttc aacgagggag 1560 atgacgccgg gattcctttg ttcccaacaa gtcaacctga cttcaacctt ggtgaagcac 1620 tcggcctccg acgtgaaagc tcatcatcct tcccgcagtc gcagaacttt acttccatgg 1680 agtcattccc cacgcaatac actacaccga atggacttcc ccagcagtat cccttcgcgc 1740 agcagcaaca agaccatcag cagcaacaac agaacaacct cctgcaccag acccccgaat 1800 tccctgcatc ccttcctcac tttgagtcta cgaacagtga tgccggggtc aacaacggtg 1860 tggcttctcc accggcacag cccacaatgg cgatgcgccc agtgaaggaa gagatcaccc 1920 gccctgagcg cacttctgcg gacagtggca cttacacttg cacttaccac ggctgcacgc 1980 ttcggtttga aacgcctacg aagctgcaga agcacaagcg cgaggcccac cgtcagacta 2040 cgcctggcgg ccacttggtt gggcgcgata cttccgcacg caactctcag gccggtcccc 2100 acaaatgtga gcggatcaat ccgtctacgg gaaagccatg caattctgtt ttctcccggc 2160 cttacgatct tacccggcac gaggacacga tccacaacgc acgcaagcag aaggtccggt 2220 gtcatctgtg cacggaagaa aagacctttt cgcgtaatga tgcgctgacc cggcacatgc 2280 gagtggtgca ccctgaggtc gattggcccg gcaagcaaag aaggagaggc agggagtaag 2340 cctgaatggg gttgtggtgt ca 2362 <210> 3 <211> 597 <212> PRT <213> Aspergillus oryzae <400> 3 Met Ala Pro Thr Val Gln Gly Gln Pro Ser Phe Ala Tyr Tyr Pro Thr 1 5 10 15 Ala Asp Ser Gln Arg Gln Gln Tyr Thr Ser His Pro Ala Glu Pro Gln 20 25 30 Pro Tyr Tyr Gly Gln Ile Gln Ala Phe Pro Gln Gln His Cys Leu Pro 35 40 45 Glu Gln Gln Pro Val Tyr Asn Ala Gln Pro Met Met Asn Met His Gln 50 55 60 Met Ala Thr Thr Asn Ala Phe Arg Gly Ala Met Asn Met Thr Pro Ile 65 70 75 80 Ala Ser Pro Gln Pro Ser His Leu Lys Pro Thr Ile Val Val Gln Gln 85 90 95 Gly Ser Pro Ala Leu Met Pro Leu Asp Thr Arg Phe Val Gly Asp Tyr 100 105 110 Tyr Ser Phe Pro Ser Thr Pro Pro Leu Ser Thr Ala Gly Ser Ser Ile 115 120 125 Ser Ser Pro Pro Ser Ser Ser Gly Thr Leu His Thr Pro Ile Asn Asp 130 135 140 Cys Phe Phe Ser Phe Glu Lys Val Glu Gly Val Lys Glu Gly Cys Glu 145 150 155 160 Ser Asp Val His Ala Glu Leu Leu Ala Ser Ala Asp Trp Thr Arg Ser 165 170 175 Ala Thr Ser Pro Pro Met Thr Pro Val Phe Ile Asn Ala Asn Ser Leu 180 185 190 Thr Ala Ser Gln Ser Ser Asp Phe Leu Ser Ala His Gly Ser Cys Pro 195 200 205 Ser Leu Ser Pro Ser Pro Leu Leu Val Ser Ser Val Phe Thr Pro Ser 210 215 220 Gln Ser Ala Phe Pro Val Glu Gln Ala Thr Ser Asp Phe Cys Asp Pro 225 230 235 240 Arg Gln Leu Thr Val Glu Ser Ser Val Asn Thr Ser Ser Pro Ala Glu 245 250 255 Leu Pro Pro Leu Pro Thr Leu Ser Cys Asp Glu Glu Glu Pro Lys Val 260 265 270 Val Leu Gly Ser Glu Ala Val Thr Leu Pro Val His Glu Asp Pro Ser 275 280 285 Pro Ala Tyr Thr Ser Ser Thr Glu Asp Pro Leu Ser Ser Leu Pro Thr 290 295 300 Phe Asp Ser Phe Ser Asp Leu Asp Ser Glu Asp Glu Phe Val Asn Arg 305 310 315 320 Leu Val Asp Phe His Pro Ser Gly Asn Thr Tyr Tyr Val Gly Glu Lys 325 330 335 Arg Gln Arg Leu Ser Ala Tyr Ser Phe Asp Asp Glu Glu Phe Leu Ser 340 345 350 Glu His Ser Leu Glu Asp Ser Ser Asp Asp Leu Glu Leu Ala His Ser 355 360 365 Gly Leu Ser Phe Leu Gly Cys Ala Asp Phe Ala Pro Ala Gln Ser Asp 370 375 380 Ala Ser Glu Thr Ala Asp Glu Met Lys Thr Lys Lys Arg Ser Asn Ser 385 390 395 400 Arg Lys Ser Leu Lys Arg Ala Asn Ser Glu Asp Gln Asp Ala Leu Lys 405 410 415 Lys Ala Gln Ala Pro Ile Asn Ser Arg Ala Asn Ser Thr Glu Ala Asn 420 425 430 Val Ala Gln Gln Ala Ala Ala Pro Ser Cys Ser Ala Ser Glu Ala Asn 435 445 440 Val Ser Ser Ser Cys Glu Ala Pro Ser Val Pro Val Ser Val Asn Arg 450 455 460 Arg Gly Arg Lys Gln Ser Leu Thr Asp Asp Pro Ser Lys Thr Phe Val 465 470 475 480 Cys Ser Leu Cys Ser Arg Arg Phe Arg Arg Gln Glu His Leu Lys Arg 485 490 495 His Tyr Arg Ser Leu His Thr Gln Asp Lys Pro Phe Glu Cys Asn Glu 500 505 510 Cys Gly Lys Lys Phe Ser Arg Ser Asp Asn Leu Ala Gln His Ala Arg 515 520 525 Thr His Ala Gly Gly Ser Ile Val Met Gly Val Leu Asp Thr Asn Asn 530 535 540 Ala Ser Glu Arg Tyr Asp Asn Arg Asp Ala Ser Thr Met Gly Ala Val 545 550 555 560 Leu Tyr Glu Ala Ala Thr Leu Arg Arg Pro Ser Arg Arg Leu Ala Asn 565 570 575 His Pro Lys Met Gly Phe Leu Thr His Pro His Arg Ser Ser Ala Arg 580 585 590 Gln Glu Ala Gln Ala 595 597 <210> 4 <211> 3167 <212> DNA <213> Aspergillus oryzae <400> 4 ggggccggtt gggggccctc ccggggttgg gttacgggaa aagtttcaag ggccaaagga 60 aaaaaaaggt aaaaaagatt tagggaaaaa aaggccaaaa aaaagggcct tggaaaaagg 120 aattcccaag aacctttccc ccaaaaggtt aattggggta ggggaaccat tggggtttct 180 taccaaggtt tttcccttta tcctttgccc ccttcaggga aaaatttact tattatccta 240 gggaatttgg ccccccccag gggggggatc cctgggattt aagggggggt gccccccggc 300 ccccccctaa gggggttccc ccctttttag ggattttttt ttttttggaa attaaatttc 360 ctttcccttt ttttttaaaa aaaataaaat cttttcggtt ttattttatt ttttataata 420 aaaaaattcc taattataat tataaatccg ggtattttta gggataaatt ctaggtttta 480 actaccctcc ctcccttttc cccgggcccc cccccctcca aatagacaag acggactaaa 540 actaaaaagt acggctccag tcctacgact aactctcact acgcacggcg ctactaagtc 600 gtagcacaag cagatcagtc tctcttttat ccactgtcca tccttctggc cgtctgtccc 660 tcttcccttt cccatatcgg ccctgtctct cgtttccccg cgactgtcca ttagtcctct 720 ttatttctta gtcttttcct tttttccctt tttagacttg tctcgattct ctgtccttcc 780 cttcgttatt tcttattttt tggctttgat ccccatctct ctcttctctc tcttctttta 840 ctcatcatct gtctccggta cctttgatcg tgatcgtccg aatccaaaag gtggctcgta 900 tcgaccagag gttcctgtcc ttccttctat cagatataac tataccgtac attcacactc 960 agaagcccag aagacgttgc atattttata aaccctaatt cggactgcac tcttctgcct 1020 tcagtcattc attcatccag aacaagagac atctacattc attcaaccta cagtgctcga 1080 catctatcat tagcaccaag ctgcctcgac atcaatctgg aacatcctga cttagactaa 1140 acatacttaa taatacccaa gaggttaagc agtgttcgct tgatcctcat cacattcatc 1200 attggttcgt ggaaggatga ctatatatca tacaggatgc ccatactgat ccctcgtctt 1260 ctagtcacaa tggacgctac atacaccatg gcaccaacgg tgcaaggaca accatcattt 1320 gcatactacc ccactgctga ttcacaaaga caacaataca caagccaccc ggctgagccc 1380 cagccatact atggacagat tcaggccttc cctcagcaac actgcctacc agagcaacaa 1440 ccggtctaca atgctcaacc catgatgaac atgcatcaga tggctaccac caatgccttc 1500 cgcggagcaa tgaacatgac tcccatcgcc tctccccaac catctcacct gaagcccaca 1560 atcgtcgtcc agcagggttc cccagccctc atgcctttgg acacaaggtt cgtcggtgac 1620 tactatagct tcccttctac ccctccactc tccaccgcgg gaagttctat cagcagcccg 1680 ccatccagca gtggcacctt gcacacccca atcaacgact gcttcttctc attcgaaaag 1740 gtggagggtg taaaggaggg ttgtgaaagt gacgtccatg ccgagctttt ggccagtgcc 1800 gactggactc ggtcagcaac ttcgcctccc atgactcctg gtatgttttc ttctcatcag 1860 ttgagtaggg ggttggaacc agtctaactt cttctagtct tcatcaacgc caattctctc 1920 accgccagtc agagctcaga cttcctctct gcccacgggt cttgcccctc tctttctcct 1980 tcgcctctcc tggtatcttc cgtgtttact ccgtctcagt cggctttccc tgttgagcag 2040 tcgcctctcc tggtatcttc cgtgtttact ccgtctcagt cggctttccc tgttgagcag 2040 gccacctccg acttctgtga ccctcggcag ctgactgtcg agtcttccgt caacacatcc 2100 tctcccgctg agcttcctcc tttgcccact ctctcctgcg atgaggagga accgaaggtg 2160 gtcctgggaa gcgaggctgt gactttgcca gttcatgagg acccatcacc cgcttacact 2220 agctctaccg aggaccccct cagttcgttg cccacttttg acagcttctc cgaccttgat 2280 tcggaggacg agtttgtcaa ccgcttggtt gacttccatc ccagcggtaa tacttactac 2340 gtgggcgaaa agagacagcg cctcagcgca tactccttcg atgacgagga gttcttgagc 2400 gagcacagct tggaagattc ttcagacgac ctggagttgg cccactctgg cctttctttc 2460 ctgggatgcg ccgatttcgc tccggctcaa agtgatgcca gcgagaccgc tgacgagatg 2520 aagacaaaga agcgaagcaa ctcccgcaag tctctcaaga gagcaaactc tgaggaccag 2580 gatgctctca agaaggcaca ggcaccgatc aacagccggg ccaacagcac agaggccaac 2640 gtggctcagc aggcagctgc tccgtcctgc tcagcctcgg aagccaatgt atctagctcc 2700 tgcgaggccc catctgttcc tgtgtcggtc aaccgccggg gtcgtaagca atccctgacg 2760 gatgatcctt ccaagacctt tgtctgctct ctctgctctc gtcgtttccg tcgtcaggag 2820 caccttaagc gtcactaccg ctctctgcac acccaggaca agcccttcga gtgcaacgag 2880 tgtggtaaga agttttctcg gagtgacaac cttgcgcagc atgccaggac ccatgccggt 2940 ggctctatcg tgatgggtgt cctggacacg aacaacgcat ctgaacggta tgataatcgc 3000 gacgcgagta ccatgggtgc tgtgctctat gaagctgcca cgctgcggcg accaagtcga 3060 cgactagcga atcatccgaa gatggggttt ctgacgcacc ctcatcgatc gtcggcccgc 3120 caagaagcgc aagcgtgacg agcactttag ttgcccccat tgccggt 3167 <210> 5 <211> 468 <212> PRT <213> Aspergillus oryzae <400> 5 Met Ala Ser Ile Asp Pro Asn Val Pro Thr Thr Thr Leu Pro Tyr Thr 1 5 10 15 Cys Asn Thr Cys Leu Val Ala Phe Arg Gly Ser Asp Ala Gln Arg Asp 20 25 30 His Met Arg Lys Asp Trp His Leu Tyr Asn Met Lys Arg Arg Ile Ala 35 40 45 Ser Leu Pro Pro Val Ser Gln Glu Val Phe Asn Asp Lys Val Leu Ala 50 55 60 Ala Lys Ala Thr Thr Ser Ala Ala Ala Ala Lys Ala Ser Phe Glu Lys 65 70 75 80 Thr Cys Val Ala Cys Gln Lys Thr Phe Phe Ser Glu Asn Ser Tyr Gln 85 90 95 Asn His Val Lys Ser Ser Lys His Lys Ala Arg Glu Ala Gln Met Leu 100 105 110 Arg Asp Ser Ala Asp Asp Ala Ser Ser Val Met Ser Ser Thr Phe Ser 115 120 125 Leu Gly Glu Pro Val Asn Lys Pro Arg Glu Arg Ser Glu Val Ser Lys 130 135 140 Val Thr Glu Ser Leu Lys Asn Ala Thr Ile Glu Glu Asp Asp Glu Asp 145 150 155 160 Glu Glu Met Glu Glu Gln Gly Phe Ser Ala Ser Arg Cys Leu Phe Cys 165 170 175 Asn Glu Lys Ser Ser Asp Leu Gln Gln Asn Thr Glu His Met Phe Lys 180 185 190 Thr His Gly Met Phe Ile Pro Glu Lys Asp Tyr Leu Val Asp Leu Glu 195 200 205 Gly Leu Val His Tyr Leu Tyr Arg Lys Ile Asn Glu Asn Ser Glu Cys 210 215 220 Leu Tyr Cys His Ala Val Arg Asn Asn Pro Glu Gly Ala Arg Thr His 225 230 235 240 Met Arg Asp Lys Gly His Cys Met Ile Ala Phe Glu Lys Gln Asp Glu 245 250 255 Gln Val Glu Ile Gly Gln Phe Tyr Asp Phe Arg Ser Thr Tyr Ser Asp 260 265 270 Gly Glu Gly Glu Asp Glu Glu Asp Ser Ile Met Glu Asp Gly Gly Val 275 280 285 Lys Val Asp Gly Glu Asp Asp Glu Gly Trp Glu Thr Glu Thr Ser Ala 290 295 300 Ser Ser Met Asp Asp Asp Glu Asp Glu Leu Asp Asp Thr Lys Gly Gln 305 310 315 320 Val Tyr Ala Thr Glu Phe Glu Leu His Leu Pro Ser Gly Arg Thr Ala 325 330 335 Gly His Arg Ser Leu Ala Lys Tyr Tyr Arg Gln Asn Leu Arg Asn Tyr 340 345 350 Pro Thr Ala Glu Glu Arg Ala Ala Arg Gln Leu Ala Ile Glu Asn Gly 355 360 365 Glu Ile Glu Glu Glu Glu Pro Lys Pro Arg Gly Arg Asp Leu Asn Arg 370 375 380 Ala Val Pro Glu Ala Phe Ala Ala Glu Leu Glu Arg Lys Asp Arg Thr 385 390 395 400 Arg Ala Gln Arg Gln Glu Lys Arg Tyr Thr Ala Arg Val Asn Arg Ala 405 410 415 Ala Asn Asn Gln Lys His Phe Arg Val Cys Phe Ile Pro Leu Leu Ile 420 425 430 Ile Arg Gln Ala Asn Thr Pro Ser Arg Ile Pro Ser Cys Ser Arg Phe 435 440 445 Leu Tyr Leu Ile Cys Leu Arg Ala Cys Trp His Cys Leu Ala Lys Ser 450 455 460 Trp Ser Tyr Gly 465 468 <210> 6 <211> 2337 <212> DNA <213> Aspergillus oryzae <400> 6 tggtcgacgg cccgggctgg tatcatacgg actgacacac cagcctcagg cgccaagcat 60 gaggccggct gactaacatt aaatggaaga agtatgatcc gatcgttgat gcactaggct 120 tgatatagat agggccttcc cagacaggat tagtggaggg gcacgtgcaa tcaacgtcat 180 gttctctaaa agcggcatgg gcaatggttt tcaatgtgtg gtccgggtta tcggcgacat 240 cgattactag cccgatcttt tctgtcgagg ggcaatattg cgcccgcttc actttctcgc 300 tagatacttc ctaccggctt ttatttctaa tcaggcccaa aactttctgt gacgactacg 360 cctggttttg ctttttcatt gctcttgcaa aaacgaatac ccccgcgcaa tatatattcc 420 cacaaccgat cgccttttca tcaaatttat ttctgtgata cccacctttt ttttttttgt 480 tccatatcag gaactgtcag aaatcatggc ctccatcgat ccaaatgtac caacaactac 540 gttgccatat acctgcaaca cctgtctcgt tgcttttcgc ggtagcgatg ctcagcggga 600 tcatatgcgc aaggactggc agtaagtttg ttggaaaata tacgactcag aatgcattac 660 taatgttcgg tcgactcaag tctctataac atgaagcgcc gcatcgcgtc tctgccccca 720 gtgtcccagg aggtttttaa cgacaaggtc cttgctgcca aagccacaac tagtgctgcc 780 gccgccaaag cttcctttga gaagacatgt gtcgcctgcc agaagacatt cttcagcgag 840 aactcgtatc agaaccacgt gaagagttcc aagcacaagg cccgtgaggc acagatgctt 900 agagacagcg ccgatgatgc atcgtctgtc atgagttcta ctttctctct gggcgagcca 960 gtcaacaagc ctcgtgagcg gtcggaggtg tcgaaagtta cggagagtct caagaacgct 1020 accatcgaag aggatgacga agatgaggaa atggaagagc agggcttctc ggcctcccgt 1080 tgtcttttct gtaatgagaa atcttcagat cttcaacaaa acaccgagca catgttcaag 1140 acccacggca tgttcatacc agagaaggac tatcttgttg atttggaggg acttgtccac 1200 tatctttatc ggaagatcaa tgagaacagt gaatgtttat actgccatgc cgtccgaaac 1260 ggtgagggtg aggatgagga ggactcgatc atggaggatg gtggtgtcaa ggtggatggc 1440 gaggatgatg agggatggga aaccgaaact tctgcatctt ctatggatga cgatgaggat 1500 gagctcgacg acacgaaggg acaggtctat gcgacagagt ttgagctgca tttgccctca 1560 ggccgcactg caggtcaccg atcgctcgcc aaatactacc gccagaactt gcgtaactac 1620 cctacagcgg aagaacgggc agctcgtcag ctggctattg aaaatggcga gattgaggag 1680 gaggaaccga aacccagggg tcgcgacctc aatcgggccg ttgttagccg tggcaatggc 1740 ggtattgggt atgatcggtg ctactgacag ccagaagcat ttgctgctga gcttgaacgc 1800 aaagaccgga ctcgtgccca gcgacaagag aagcggtaca cagctcgggt caaccgggct 1860 gccaataacc agaagcactt cagggtatgt ttcatccctt tactgataat aagacaagct 1920 aacaccccct ctaggatccc ctcctgcagt agattcctgt atctgatttg tttgcgtgca 1980 tgttggcatt gtttggcaaa atcatggagt tacggctgaa tgacctgtag atactcatgg 2040 caagataaaa agtgattatt aagaagatga aggatattac ttcgcattcc tgctatcttt 2100 actattttag acagtgctgc atcttatgac ggatataccg aaggtctacc agactgaagg 2160 tgaatcaagt cctacatcta accccccgac caaaccctaa gtcaccacat ctcacagaac 2220 cgatagcaaa cttacaatca agcaccataa caacaaacca ggaccacaaa taataatata 2280 tgaacatcat atcactttac cagcccgggc cgtcgaccac gcgtgcccta tagtgag 2337 <210> 7 <211> 97 <212> DNA <213> Artificial Sequence <400> 7 gagaactaag agaatggcgg cacgggcaga tgtcgggatg atcactttta gtcectccga 60 cgcaatatcg atttcaattg gatccgccac gctgcat 97
【図1】glaBプロモーター中の固体培養高発現に必
須な領域の同定を示す。FIG. 1 shows the identification of a region essential for high expression in solid culture in the glaB promoter.
【図2】glaBプロモーターcis因子を用いたプロ
モータータイトレーションアッセイを示す。FIG. 2 shows a promoter titration assay using the glaB promoter cis factor.
【図3】sgbR1、sgbR2及びsgbR3のノー
ザン解析用のパターンを示す図面代用写真である。FIG. 3 is a drawing-substituting photograph showing a pattern for Northern analysis of sgbR1, sgbR2, and sgbR3.
【図4】sgbR1、sgbR2及びsgbR3のコサ
プレッションアッセイを示す。FIG. 4 shows cosuppression assays for sgbR1, sgbR2 and sgbR3.
【図5】sgbR1、sgbR2及びsgbR3のゲル
シフトアッセイのパターンを示す図面代用写真である。FIG. 5 is a drawing-substituting photograph showing a pattern of a gel shift assay of sgbR1, sgbR2, and sgbR3.
【図6】本転写因子sgbR1タンパク質のアミノ酸配
列を示す。FIG. 6 shows the amino acid sequence of the present transcription factor sgbR1 protein.
【図7】同上続きを示す。FIG. 7 shows the continuation of the above.
【図8】同上続きを示す。FIG. 8 shows the continuation of the above.
【図9】本転写因子sgbR1遺伝子の塩基配列を示
す。FIG. 9 shows the nucleotide sequence of the present transcription factor sgbR1 gene.
【図10】同上続きを示す。FIG. 10 shows the continuation of the above.
【図11】本転写因子sgbR2タンパク質のアミノ酸
配列を示す。FIG. 11 shows the amino acid sequence of the present transcription factor sgbR2 protein.
【図12】同上続きを示す。FIG. 12 shows the continuation of the above.
【図13】同上続きを示す。FIG. 13 shows the continuation of the above.
【図14】本転写因子sgbR2遺伝子の塩基配列を示
す。FIG. 14 shows the nucleotide sequence of the transcription factor sgbR2 gene.
【図15】同上続きを示す。FIG. 15 shows the continuation of the above.
【図16】本転写因子sgbR3タンパク質のアミノ酸
配列を示す。FIG. 16 shows the amino acid sequence of the present transcription factor sgbR3 protein.
【図17】同上続きを示す。FIG. 17 shows the continuation of the above.
【図18】同上続きを示す。FIG. 18 shows the continuation of the above.
【図19】本転写因子sgbR3遺伝子の塩基配列を示
す。FIG. 19 shows the nucleotide sequence of the transcription factor sgbR3 gene.
【図20】同上続きを示す。FIG. 20 shows the continuation of the above.
【図21】固体培養発現遺伝子cis因子(97bp)
の塩基配列を示す。FIG. 21: Solid culture expression gene cis factor (97 bp)
The base sequence of is shown.
【図22】プロモーター解析用プラスミドpNGUSを
示す。FIG. 22 shows a plasmid pNGUS for promoter analysis.
───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.7 識別記号 FI テーマコート゛(参考) C12N 5/10 C12N 15/00 A C12P 21/02 5/00 A (72)発明者 秦 洋二 京都市伏見区片原町300 月桂冠株式会社 総合研究所内 (72)発明者 川戸 章嗣 京都市伏見区下鳥羽小柳町24 月桂冠株式 会社総合研究所内 (72)発明者 赤尾 健 広島県東広島市鏡山三丁目7番1号 独立 行政法人 酒類総合研究所内 Fターム(参考) 4B024 AA20 BA80 DA11 FA02 4B064 AG01 CA19 CC24 4B065 AA26X AA63X AA63Y AB01 CA24 4H045 AA10 AA20 BA10 CA15 FA74─────────────────────────────────────────────────── ─── Continuation of front page (51) Int.Cl. 7 Identification code FI theme code (reference) C12N 5/10 C12N 15/00 A C12P 21/02 5/00 A (72) Inventor Yoji Hata Fushimi, Kyoto 300 Katahara-cho, Gekkei-cho Co., Ltd. in the General Research Institute (72) Inventor Shoji Kawado 24 Shimotoba Koyanagi-cho, Fushimi-ku, Kyoto City In the General Research Institute, Keikei-sha Co., Ltd. (72) Ken Akao 3-7-1 Kagamiyama, Higashihiroshima City, Hiroshima Prefecture No. F-term in the National Institute of Alcoholic Beverages (Reference) 4B024 AA20 BA80 DA11 FA02 4B064 AG01 CA19 CC24 4B065 AA26X AA63X AA63Y AB01 CA24 4H045 AA10 AA20 BA10 CA15 FA74
Claims (8)
示すアミノ酸配列の少なくともひとつを有する、固体培
養応答性のcis因子下流の遺伝子の転写を制御するタ
ンパク質。1. A protein which controls the transcription of a gene downstream of a cis factor which is responsive to solid-state culture, and which has at least one of the amino acid sequences shown in SEQ ID NOs: 1, 3 and 5, respectively.
配列の少なくともひとつを有する、固体培養応答性のc
is因子下流の遺伝子の転写を制御するタンパク質をコ
ードする遺伝子のDNA。2. A solid-culture responsive c having at least one of the nucleotide sequences shown in SEQ ID NOs: 2, 4, and 6, respectively.
DNA of a gene encoding a protein that controls the transcription of a gene downstream of the is factor.
もコーディング領域を含んでなる組換えベクター。3. A recombinant vector comprising at least a coding region of the DNA according to claim 2.
GBR2、又はpSGBR3。4. A recombinant vector pSGBR1 or pS
GBR2, or pSGBR3.
を導入してなる形質転換体。5. A transformant obtained by introducing the recombinant vector according to claim 3 or 4.
coli SGBR1(FERM P−18357)、
又はEscherichia coli SGBR2
(FERM P−18358)、又はEscheric
hia coli SGBR3(FERM P−183
59)。6. A transformant, Escherichia
coli SGBR1 (FERM P-18357),
Or Escherichia coli SGBR2
(FERM P-18358), or Escheric
hia coli SGBR3 (FERM P-183
59).
麹菌において、固体培養応答性のcis因子下流の遺伝
子の転写を制御するタンパク質を生産させる方法。7. A method for producing a protein which controls transcription of a gene downstream of a cis factor responsive to solid culture in the koji mold into which the DNA according to claim 2 is introduced.
で発現させ、固体培養発現遺伝子群の生産を制御する方
法。8. A method of controlling the production of a solid culture-expressed gene group by expressing the protein according to claim 1 in Aspergillus oryzae.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2001190356A JP2003000240A (en) | 2001-06-22 | 2001-06-22 | DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzae |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2001190356A JP2003000240A (en) | 2001-06-22 | 2001-06-22 | DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzae |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| JP2003000240A true JP2003000240A (en) | 2003-01-07 |
Family
ID=19029135
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2001190356A Withdrawn JP2003000240A (en) | 2001-06-22 | 2001-06-22 | DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzae |
Country Status (1)
| Country | Link |
|---|---|
| JP (1) | JP2003000240A (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2007034782A1 (en) | 2005-09-26 | 2007-03-29 | Noda Institute For Scientific Research | Recombinant vector capable of increasing secretion of koji mold protease |
| WO2015100856A1 (en) * | 2013-12-30 | 2015-07-09 | 广东启智生物科技有限公司 | Genetic recombinant saccharomyces cerevisiae capable of degrading and utilizing kitchen wastes |
| JP2021528985A (en) * | 2018-06-27 | 2021-10-28 | ベーリンガー インゲルハイム エルツェーファウ ゲゼルシャフト ミット ベシュレンクテル ハフツング ウント コンパニー コマンディトゲゼルシャフト | Means and methods for increasing protein expression by the use of transcription factors |
| CN119799530A (en) * | 2024-12-31 | 2025-04-11 | 优甜生物科技(北京)有限公司 | Recombinant yeast cells and their use in producing thaumatin |
-
2001
- 2001-06-22 JP JP2001190356A patent/JP2003000240A/en not_active Withdrawn
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2007034782A1 (en) | 2005-09-26 | 2007-03-29 | Noda Institute For Scientific Research | Recombinant vector capable of increasing secretion of koji mold protease |
| US7842799B2 (en) | 2005-09-26 | 2010-11-30 | Noda Institute For Scientific Research | Recombinant vector capable of increasing secretion of Koji mold protease |
| WO2015100856A1 (en) * | 2013-12-30 | 2015-07-09 | 广东启智生物科技有限公司 | Genetic recombinant saccharomyces cerevisiae capable of degrading and utilizing kitchen wastes |
| US10584359B2 (en) | 2013-12-30 | 2020-03-10 | Guangdong Recyclean Low-Carbon Technology | Genetically recombinant Saccharomyces cerevisiae for degrading kitchen waste |
| JP2021528985A (en) * | 2018-06-27 | 2021-10-28 | ベーリンガー インゲルハイム エルツェーファウ ゲゼルシャフト ミット ベシュレンクテル ハフツング ウント コンパニー コマンディトゲゼルシャフト | Means and methods for increasing protein expression by the use of transcription factors |
| JP7645076B2 (en) | 2018-06-27 | 2025-03-13 | ベーリンガー インゲルハイム エルツェーファウ ゲゼルシャフト ミット ベシュレンクテル ハフツング ウント コンパニー コマンディトゲゼルシャフト | Means and methods for increasing protein expression through the use of transcription factors - Patents.com |
| CN119799530A (en) * | 2024-12-31 | 2025-04-11 | 优甜生物科技(北京)有限公司 | Recombinant yeast cells and their use in producing thaumatin |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102253479B1 (en) | Promoter variant | |
| KR102504198B1 (en) | Expression constructs and methods of genetically engineering methylotrophic yeast | |
| EP2106447B1 (en) | Method for methanol independent induction from methanol inducible promoters in pichia | |
| CN109161550B (en) | SlbHLH59 gene for regulating and controlling ascorbic acid content of tomato fruits and application method | |
| Sakamoto et al. | Aspergillus oryzae atfB encodes a transcription factor required for stress tolerance in conidia | |
| WO1997044470A1 (en) | Novel yeast promoters suitable for expression cloning in yeast and heterologous expression of proteins in yeast | |
| Sophianopoulou et al. | The proline transport protein of Aspergillus nidulans is very similar to amino acid transporters of Saccharomyces cerevisiae | |
| Ehrlich et al. | Alteration of Different Domains in AFLR Affects Aflatoxin Pathway Metabolism inAspergillus parasiticusTransformants | |
| WO1996032475A2 (en) | Methods for preparing dna-binding proteins | |
| EP3265475B1 (en) | Constitutive yeast llp promoter-based expression system | |
| CN100432099C (en) | Transcription factor | |
| JP2003000240A (en) | DNA encoding a transcription factor that controls a gene expressed in a solid culture of Aspergillus oryzae | |
| CN110358753B (en) | Fusion protein based on CjCas9 and VPR core domain, corresponding DNA targeting activation system and its application | |
| CA2485119A1 (en) | Promoters with modified transcription efficiency | |
| Janus et al. | Identification of a minimal cre1 promoter sequence promoting glucose-dependent gene expression in the β-lactam producer Acremonium chrysogenum | |
| JPWO2001083787A1 (en) | Novel promoter and gene expression method using said promoter | |
| US7285399B2 (en) | Compositions and methods using the yeast YMR107W promoter | |
| JP4587750B2 (en) | Candida utilis-derived gene encoding transcriptional activation protein | |
| JPWO2006046694A1 (en) | Gene screening method using yeast with ergosterol synthase expressed inducibly | |
| JP4671394B2 (en) | Promoter DNA from Candida utilis | |
| Hynes et al. | Regulation of the amdS gene in Aspergillus nidulans | |
| Hynes et al. | Regulation of acetamide catabolism | |
| Yu et al. | Protein phosphatase 2A: identification in Oryza sativa of the gene encoding the regulatory A subunit | |
| CN121294509A (en) | TaYAB5 protein and application of related biological material thereof in regulating and controlling plant spike length | |
| CN101805745B (en) | Method for quickly screening yeast with high-expression target proteins |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20080430 |
|
| RD02 | Notification of acceptance of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7422 Effective date: 20080430 |
|
| A761 | Written withdrawal of application |
Free format text: JAPANESE INTERMEDIATE CODE: A761 Effective date: 20090622 |