US6004790A - Cyclomaltodextrin glucanotransferase variants - Google Patents
Cyclomaltodextrin glucanotransferase variants Download PDFInfo
- Publication number
- US6004790A US6004790A US08/947,965 US94796597A US6004790A US 6004790 A US6004790 A US 6004790A US 94796597 A US94796597 A US 94796597A US 6004790 A US6004790 A US 6004790A
- Authority
- US
- United States
- Prior art keywords
- variant
- amino acid
- cgtase
- alanine
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108010025880 Cyclomaltodextrin glucanotransferase Proteins 0.000 title claims abstract description 512
- 238000003780 insertion Methods 0.000 claims abstract description 52
- 230000037431 insertion Effects 0.000 claims abstract description 52
- 238000006467 substitution reaction Methods 0.000 claims abstract description 37
- 238000012217 deletion Methods 0.000 claims abstract description 32
- 230000037430 deletion Effects 0.000 claims abstract description 32
- 235000001014 amino acid Nutrition 0.000 claims description 121
- 150000001413 amino acids Chemical group 0.000 claims description 108
- 229940024606 amino acid Drugs 0.000 claims description 106
- 230000000694 effects Effects 0.000 claims description 75
- 241000193752 Bacillus circulans Species 0.000 claims description 59
- 230000004075 alteration Effects 0.000 claims description 55
- 235000004279 alanine Nutrition 0.000 claims description 53
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 52
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 claims description 48
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 42
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 42
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 41
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 40
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 39
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 33
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 31
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 30
- 235000005772 leucine Nutrition 0.000 claims description 30
- 239000004475 Arginine Substances 0.000 claims description 23
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 23
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 23
- 235000009697 arginine Nutrition 0.000 claims description 23
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 22
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 22
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 21
- 235000013922 glutamic acid Nutrition 0.000 claims description 21
- 239000004220 glutamic acid Substances 0.000 claims description 21
- 239000004471 Glycine Substances 0.000 claims description 20
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 20
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 20
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 19
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 18
- 229960000310 isoleucine Drugs 0.000 claims description 18
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 18
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 15
- 235000003704 aspartic acid Nutrition 0.000 claims description 15
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 15
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 13
- 241000193385 Geobacillus stearothermophilus Species 0.000 claims description 13
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 13
- 235000004554 glutamine Nutrition 0.000 claims description 13
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 12
- 235000009582 asparagine Nutrition 0.000 claims description 12
- 229960001230 asparagine Drugs 0.000 claims description 12
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 12
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 11
- 241000178960 Paenibacillus macerans Species 0.000 claims description 10
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 10
- 241000193407 Bacillus ohbensis Species 0.000 claims description 9
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 9
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 8
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 8
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 8
- 241000588747 Klebsiella pneumoniae Species 0.000 claims description 6
- 241001235134 Thermoanaerobacter sp. ATCC 53627 Species 0.000 claims description 6
- 239000004472 Lysine Substances 0.000 claims description 5
- 241000194108 Bacillus licheniformis Species 0.000 claims description 4
- 108700023372 Glycosyltransferases Proteins 0.000 claims 1
- 102000051366 Glycosyltransferases Human genes 0.000 claims 1
- 239000000758 substrate Substances 0.000 abstract description 131
- 125000000539 amino acid group Chemical group 0.000 abstract description 94
- 230000027455 binding Effects 0.000 abstract description 68
- 238000000034 method Methods 0.000 abstract description 66
- 239000013604 expression vector Substances 0.000 abstract description 18
- 239000002243 precursor Substances 0.000 abstract description 18
- 102000004190 Enzymes Human genes 0.000 description 236
- 108090000790 Enzymes Proteins 0.000 description 236
- 229940088598 enzyme Drugs 0.000 description 236
- 229920000858 Cyclodextrin Polymers 0.000 description 193
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 121
- 230000003993 interaction Effects 0.000 description 104
- WHGYBXFWUBPSRW-FOUAGVGXSA-N beta-cyclodextrin Chemical class OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO WHGYBXFWUBPSRW-FOUAGVGXSA-N 0.000 description 98
- 125000003275 alpha amino acid group Chemical group 0.000 description 96
- 230000035772 mutation Effects 0.000 description 91
- 230000015572 biosynthetic process Effects 0.000 description 83
- 229910052739 hydrogen Inorganic materials 0.000 description 83
- 108020004414 DNA Proteins 0.000 description 75
- 239000000047 product Substances 0.000 description 73
- 230000036961 partial effect Effects 0.000 description 66
- 238000002474 experimental method Methods 0.000 description 63
- 229960004853 betadex Drugs 0.000 description 62
- 235000011175 beta-cyclodextrine Nutrition 0.000 description 59
- 229920002472 Starch Polymers 0.000 description 58
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 58
- 239000001116 FEMA 4028 Substances 0.000 description 56
- 239000002773 nucleotide Substances 0.000 description 56
- 125000003729 nucleotide group Chemical group 0.000 description 56
- 235000019698 starch Nutrition 0.000 description 55
- 239000008107 starch Substances 0.000 description 55
- 230000007423 decrease Effects 0.000 description 54
- 229940097362 cyclodextrins Drugs 0.000 description 51
- 229910052757 nitrogen Inorganic materials 0.000 description 45
- HFHDHCJBZVLPGP-RWMJIURBSA-N alpha-cyclodextrin Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO HFHDHCJBZVLPGP-RWMJIURBSA-N 0.000 description 43
- 210000004027 cell Anatomy 0.000 description 43
- 229920001450 Alpha-Cyclodextrin Polymers 0.000 description 42
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 38
- GDSRMADSINPKSL-HSEONFRVSA-N gamma-cyclodextrin Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CO)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO GDSRMADSINPKSL-HSEONFRVSA-N 0.000 description 38
- 239000001257 hydrogen Substances 0.000 description 38
- 238000004519 manufacturing process Methods 0.000 description 37
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 34
- 229940043377 alpha-cyclodextrin Drugs 0.000 description 33
- 238000001042 affinity chromatography Methods 0.000 description 31
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 31
- 108090000623 proteins and genes Proteins 0.000 description 31
- 239000013615 primer Substances 0.000 description 30
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 26
- 230000003247 decreasing effect Effects 0.000 description 25
- 229940080345 gamma-cyclodextrin Drugs 0.000 description 25
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 22
- 230000005764 inhibitory process Effects 0.000 description 22
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 22
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 22
- 235000014469 Bacillus subtilis Nutrition 0.000 description 21
- 241000588724 Escherichia coli Species 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- 238000006243 chemical reaction Methods 0.000 description 20
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 20
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 20
- 241000186338 Thermoanaerobacter sp. Species 0.000 description 19
- 230000009878 intermolecular interaction Effects 0.000 description 19
- 238000010276 construction Methods 0.000 description 18
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 18
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 18
- 102220617269 Protein-glutamine gamma-glutamyltransferase K_Y89G_mutation Human genes 0.000 description 17
- 229910052740 iodine Inorganic materials 0.000 description 17
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 17
- 101100290347 Arabidopsis thaliana MBS2 gene Proteins 0.000 description 16
- 244000063299 Bacillus subtilis Species 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 16
- 241000186339 Thermoanaerobacter Species 0.000 description 16
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 16
- 108091034117 Oligonucleotide Proteins 0.000 description 15
- 238000011534 incubation Methods 0.000 description 15
- 229910052698 phosphorus Inorganic materials 0.000 description 15
- 229920001592 potato starch Polymers 0.000 description 15
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 14
- 229920001542 oligosaccharide Polymers 0.000 description 14
- 150000002482 oligosaccharides Chemical class 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 14
- 230000002829 reductive effect Effects 0.000 description 14
- 239000000243 solution Substances 0.000 description 14
- 239000004382 Amylase Substances 0.000 description 13
- 108010065511 Amylases Proteins 0.000 description 13
- 238000007363 ring formation reaction Methods 0.000 description 13
- 239000000126 substance Substances 0.000 description 13
- 239000013598 vector Substances 0.000 description 13
- 102000013142 Amylases Human genes 0.000 description 12
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 12
- 102220509110 Sphingosine 1-phosphate receptor 1_E146Y_mutation Human genes 0.000 description 12
- 235000019418 amylase Nutrition 0.000 description 12
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 12
- 108010077515 glycylproline Proteins 0.000 description 12
- KJFMBFZCATUALV-UHFFFAOYSA-N phenolphthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2C(=O)O1 KJFMBFZCATUALV-UHFFFAOYSA-N 0.000 description 12
- 102000004169 proteins and genes Human genes 0.000 description 12
- 102220007344 rs111033666 Human genes 0.000 description 12
- 238000002741 site-directed mutagenesis Methods 0.000 description 12
- 108090000637 alpha-Amylases Proteins 0.000 description 11
- 102000004139 alpha-Amylases Human genes 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 108010089804 glycyl-threonine Proteins 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 230000004048 modification Effects 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 229910052700 potassium Inorganic materials 0.000 description 11
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 10
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 10
- 108010093581 aspartyl-proline Proteins 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- 230000002209 hydrophobic effect Effects 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 230000003197 catalytic effect Effects 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 8
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 230000003248 secreting effect Effects 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 241000193749 Bacillus coagulans Species 0.000 description 7
- 241001288713 Escherichia coli MC1061 Species 0.000 description 7
- 241000282326 Felis catus Species 0.000 description 7
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 7
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 7
- 108010076504 Protein Sorting Signals Proteins 0.000 description 7
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 7
- 125000003118 aryl group Chemical group 0.000 description 7
- 230000015556 catabolic process Effects 0.000 description 7
- 239000008103 glucose Substances 0.000 description 7
- 239000008187 granular material Substances 0.000 description 7
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 241000193755 Bacillus cereus Species 0.000 description 6
- 241000193747 Bacillus firmus Species 0.000 description 6
- 241000194107 Bacillus megaterium Species 0.000 description 6
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 6
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 6
- 102000035195 Peptidases Human genes 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 6
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- 241000006383 Salimicrobium halophilum Species 0.000 description 6
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 229940024171 alpha-amylase Drugs 0.000 description 6
- 238000005571 anion exchange chromatography Methods 0.000 description 6
- 229940054340 bacillus coagulans Drugs 0.000 description 6
- 229940005348 bacillus firmus Drugs 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 6
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 6
- 238000002703 mutagenesis Methods 0.000 description 6
- 231100000350 mutagenesis Toxicity 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 238000006116 polymerization reaction Methods 0.000 description 6
- 235000019419 proteases Nutrition 0.000 description 6
- 239000001509 sodium citrate Substances 0.000 description 6
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 5
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 5
- 241000178569 Bacillus circulans subsp. alkalophilus Species 0.000 description 5
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 5
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 5
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 125000004122 cyclic group Chemical group 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 229930027917 kanamycin Natural products 0.000 description 5
- 229960000318 kanamycin Drugs 0.000 description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 5
- 229930182823 kanamycin A Natural products 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 4
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 4
- 241000228245 Aspergillus niger Species 0.000 description 4
- 240000006439 Aspergillus oryzae Species 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 241000186146 Brevibacterium Species 0.000 description 4
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 4
- 241000193403 Clostridium Species 0.000 description 4
- 108091033380 Coding strand Proteins 0.000 description 4
- 241000186216 Corynebacterium Species 0.000 description 4
- 239000003155 DNA primer Substances 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 4
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 4
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 4
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 4
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 4
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 4
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 4
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 4
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 4
- 101000843289 Homo sapiens Putative histone H2B type 2-C Proteins 0.000 description 4
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 4
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 4
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 4
- 102000008300 Mutant Proteins Human genes 0.000 description 4
- 108010021466 Mutant Proteins Proteins 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 4
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 4
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 4
- 102100030989 Putative histone H2B type 2-C Human genes 0.000 description 4
- 241001468159 Thermoanaerobacterium thermosulfurigenes Species 0.000 description 4
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 4
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 4
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 4
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 239000001110 calcium chloride Substances 0.000 description 4
- 229910001628 calcium chloride Inorganic materials 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 230000009881 electrostatic interaction Effects 0.000 description 4
- 238000001952 enzyme assay Methods 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000009776 industrial production Methods 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 238000000302 molecular modelling Methods 0.000 description 4
- 230000036963 noncompetitive effect Effects 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 4
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 102220206418 rs777639154 Human genes 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- XUFXOAAUWZOOIT-SXARVLRPSA-N (2R,3R,4R,5S,6R)-5-[[(2R,3R,4R,5S,6R)-5-[[(2R,3R,4S,5S,6R)-3,4-dihydroxy-6-methyl-5-[[(1S,4R,5S,6S)-4,5,6-trihydroxy-3-(hydroxymethyl)-1-cyclohex-2-enyl]amino]-2-oxanyl]oxy]-3,4-dihydroxy-6-(hydroxymethyl)-2-oxanyl]oxy]-6-(hydroxymethyl)oxane-2,3,4-triol Chemical compound O([C@H]1O[C@H](CO)[C@H]([C@@H]([C@H]1O)O)O[C@H]1O[C@@H]([C@H]([C@H](O)[C@H]1O)N[C@@H]1[C@@H]([C@@H](O)[C@H](O)C(CO)=C1)O)C)[C@@H]1[C@@H](CO)O[C@@H](O)[C@H](O)[C@H]1O XUFXOAAUWZOOIT-SXARVLRPSA-N 0.000 description 3
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 3
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 3
- 229920000945 Amylopectin Polymers 0.000 description 3
- 229920000856 Amylose Polymers 0.000 description 3
- 235000019890 Amylum Nutrition 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 3
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 3
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 3
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 3
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 3
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 3
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 3
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- FRPHFZCDPYBUAU-UHFFFAOYSA-N Bromocresolgreen Chemical compound CC1=C(Br)C(O)=C(Br)C=C1C1(C=2C(=C(Br)C(O)=C(Br)C=2)C)C2=CC=CC=C2S(=O)(=O)O1 FRPHFZCDPYBUAU-UHFFFAOYSA-N 0.000 description 3
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 3
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 3
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 3
- WCNXUTNLSRWWQN-DCAQKATOSA-N His-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WCNXUTNLSRWWQN-DCAQKATOSA-N 0.000 description 3
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 3
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 3
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 3
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 3
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 3
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 3
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 3
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 3
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- 241000588748 Klebsiella Species 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 3
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 3
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 3
- 241000192041 Micrococcus Species 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 3
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 3
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 3
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 3
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 3
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 3
- 241001137870 Thermoanaerobacterium Species 0.000 description 3
- 101000865057 Thermococcus litoralis DNA polymerase Proteins 0.000 description 3
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 3
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 3
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 3
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 3
- GDPDVIBHJDFRFD-RNXOBYDBSA-N Trp-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GDPDVIBHJDFRFD-RNXOBYDBSA-N 0.000 description 3
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 3
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 3
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 3
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 3
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 3
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 3
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 3
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 3
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 3
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 3
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 229960002632 acarbose Drugs 0.000 description 3
- XUFXOAAUWZOOIT-UHFFFAOYSA-N acarviostatin I01 Natural products OC1C(O)C(NC2C(C(O)C(O)C(CO)=C2)O)C(C)OC1OC(C(C1O)O)C(CO)OC1OC1C(CO)OC(O)C(O)C1O XUFXOAAUWZOOIT-UHFFFAOYSA-N 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 230000003625 amylolytic effect Effects 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 229940098773 bovine serum albumin Drugs 0.000 description 3
- 239000012928 buffer substance Substances 0.000 description 3
- 239000011575 calcium Substances 0.000 description 3
- 229910052791 calcium Inorganic materials 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000002860 competitive effect Effects 0.000 description 3
- 229960003276 erythromycin Drugs 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 239000011630 iodine Substances 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010034507 methionyltryptophan Proteins 0.000 description 3
- STZCRXQWRGQSJD-GEEYTBSJSA-M methyl orange Chemical compound [Na+].C1=CC(N(C)C)=CC=C1\N=N\C1=CC=C(S([O-])(=O)=O)C=C1 STZCRXQWRGQSJD-GEEYTBSJSA-M 0.000 description 3
- 229940012189 methyl orange Drugs 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 210000001322 periplasm Anatomy 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000002864 sequence alignment Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000005918 transglycosylation reaction Methods 0.000 description 3
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- GSCHIGXDTVYEEM-UHFFFAOYSA-N 2-[2-[[3-[6-[[4,5-dihydroxy-3-[3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxymethyl]-3,4-dihydroxy-5-[3,4,5-trihydroxy-6-[(3,4,5-trihydroxyoxan-2-yl)oxymethyl]oxan-2-yl]oxyoxan-2-yl]oxy-4,5-dihydroxy-6-[4,5,6-trihydroxy-2-(hydroxymethyl)oxan Chemical compound OC1C(O)C(O)C(CO)OC1OC1C(OCC2C(C(O)C(O)C(OC3C(OC(O)C(O)C3O)CO)O2)OC2C(C(O)C(OC3C(C(O)C(O)C(COC4C(C(O)C(O)CO4)O)O3)O)C(COC3C(C(O)C(O)CO3)OC3C(C(O)C(O)C(CO)O3)O)O2)O)OCC(O)C1O GSCHIGXDTVYEEM-UHFFFAOYSA-N 0.000 description 2
- SNBCLPGEMZEWLU-QXFUBDJGSA-N 2-chloro-n-[[(2r,3s,5r)-3-hydroxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methyl]acetamide Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CNC(=O)CCl)[C@@H](O)C1 SNBCLPGEMZEWLU-QXFUBDJGSA-N 0.000 description 2
- CNNSWSHYGANWBM-UHFFFAOYSA-N 6-chloro-2,3-dimethylquinoxaline Chemical compound C1=C(Cl)C=C2N=C(C)C(C)=NC2=C1 CNNSWSHYGANWBM-UHFFFAOYSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 2
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 2
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 2
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 2
- 241000194103 Bacillus pumilus Species 0.000 description 2
- 108091005658 Basic proteases Proteins 0.000 description 2
- 241001509321 Clostridium thermoamylolyticum Species 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 2
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 2
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 2
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 2
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- XOEKMEAOMXMURD-JYJNAYRXSA-N Glu-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O XOEKMEAOMXMURD-JYJNAYRXSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- JYGYNWYVKXENNE-OALUTQOASA-N Gly-Tyr-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYGYNWYVKXENNE-OALUTQOASA-N 0.000 description 2
- 101100175482 Glycine max CG-3 gene Proteins 0.000 description 2
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 2
- 201000008225 Klebsiella pneumonia Diseases 0.000 description 2
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 102220623105 NT-3 growth factor receptor_V21F_mutation Human genes 0.000 description 2
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 2
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 2
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- 206010035717 Pneumonia klebsiella Diseases 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 2
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- 102220619541 Protein-L-histidine N-pros-methyltransferase_K47R_mutation Human genes 0.000 description 2
- 108010019653 Pwo polymerase Proteins 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241000203775 Thermoactinomyces Species 0.000 description 2
- 241001147777 Thermoanaerobacter brockii subsp. finnii Species 0.000 description 2
- 241000186337 Thermoanaerobacter ethanolicus Species 0.000 description 2
- 241000193446 Thermoanaerobacterium thermosaccharolyticum Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 2
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 2
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- -1 ammonium sulphate Chemical class 0.000 description 2
- 239000001166 ammonium sulphate Substances 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 230000001808 coupling effect Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 102000004419 dihydrofolate reductase Human genes 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 101150021650 gluA gene Proteins 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 230000001376 precipitating effect Effects 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 102200017393 rs104894299 Human genes 0.000 description 2
- 102200121669 rs104894915 Human genes 0.000 description 2
- 102220219953 rs756843283 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 239000013605 shuttle vector Substances 0.000 description 2
- 239000007974 sodium acetate buffer Substances 0.000 description 2
- 239000004307 sodium orthophenyl phenol Substances 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 230000006098 transglycosylation Effects 0.000 description 2
- 150000004043 trisaccharides Chemical class 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- 239000004101 4-Hexylresorcinol Substances 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 102100034042 Alcohol dehydrogenase 1C Human genes 0.000 description 1
- 102220621821 Apelin receptor_S94F_mutation Human genes 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- KAZKWIKPEPABOO-IHRRRGAJSA-N Asn-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N KAZKWIKPEPABOO-IHRRRGAJSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 1
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 1
- 241001513093 Aspergillus awamori Species 0.000 description 1
- 241000193422 Bacillus lentus Species 0.000 description 1
- 101000911986 Bacillus licheniformis Cyclomaltodextrin glucanotransferase Proteins 0.000 description 1
- 101000911982 Bacillus ohbensis Cyclomaltodextrin glucanotransferase Proteins 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 241000149420 Bothrometopus brevis Species 0.000 description 1
- 101100280051 Brucella abortus biovar 1 (strain 9-941) eryH gene Proteins 0.000 description 1
- 241000819038 Chichester Species 0.000 description 1
- 101000796894 Coturnix japonica Alcohol dehydrogenase 1 Proteins 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 241000223221 Fusarium oxysporum Species 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- MINZLORERLNSPP-ACZMJKKPSA-N Gln-Asn-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N MINZLORERLNSPP-ACZMJKKPSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- NJMYZEJORPYOTO-BQBZGAKWSA-N Gln-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O NJMYZEJORPYOTO-BQBZGAKWSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 1
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 102100022624 Glucoamylase Human genes 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 1
- 241000125500 Hedypnois rhagadioloides Species 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- QPSCMXDWVKWVOW-BZSNNMDCSA-N His-His-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QPSCMXDWVKWVOW-BZSNNMDCSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- 101000780463 Homo sapiens Alcohol dehydrogenase 1C Proteins 0.000 description 1
- 101000741885 Homo sapiens Protection of telomeres protein 1 Proteins 0.000 description 1
- 101000801742 Homo sapiens Triosephosphate isomerase Proteins 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- 239000004395 L-leucine Substances 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 241000235087 Lachancea kluyveri Species 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 101100235161 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) lerI gene Proteins 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 241000194109 Paenibacillus lautus Species 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- 102100038745 Protection of telomeres protein 1 Human genes 0.000 description 1
- 102220619455 Protein-L-histidine N-pros-methyltransferase_K47A_mutation Human genes 0.000 description 1
- 102220619456 Protein-L-histidine N-pros-methyltransferase_K47L_mutation Human genes 0.000 description 1
- 102220465296 Purine nucleoside phosphorylase LACC1_Y89A_mutation Human genes 0.000 description 1
- 241000235403 Rhizomucor miehei Species 0.000 description 1
- 101000968489 Rhizomucor miehei Lipase Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241001292348 Salipaludibacillus agaradhaerens Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- 241001468239 Streptomyces murinus Species 0.000 description 1
- 239000004102 Synthetic calcium aluminate Substances 0.000 description 1
- 101150033985 TPI gene Proteins 0.000 description 1
- 101000911976 Thermoanaerobacterium thermosulfurigenes Cyclomaltodextrin glucanotransferase Proteins 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 108010048241 acetamidase Proteins 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000003905 agrochemical Substances 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- FYGDTMLNYKFZSV-DZOUCCHMSA-N alpha-D-Glcp-(1->4)-alpha-D-Glcp-(1->4)-D-Glcp Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)O[C@H](O[C@@H]2[C@H](OC(O)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O FYGDTMLNYKFZSV-DZOUCCHMSA-N 0.000 description 1
- BNABBHGYYMZMOA-AHIHXIOASA-N alpha-maltoheptaose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)O[C@H](O[C@@H]2[C@H](O[C@H](O[C@@H]3[C@H](O[C@H](O[C@@H]4[C@H](O[C@H](O[C@@H]5[C@H](O[C@H](O[C@@H]6[C@H](O[C@H](O)[C@H](O)[C@H]6O)CO)[C@H](O)[C@H]5O)CO)[C@H](O)[C@H]4O)CO)[C@H](O)[C@H]3O)CO)[C@H](O)[C@H]2O)CO)[C@H](O)[C@H]1O BNABBHGYYMZMOA-AHIHXIOASA-N 0.000 description 1
- 101150069003 amdS gene Proteins 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 101150008194 argB gene Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 229920000704 biodegradable plastic Polymers 0.000 description 1
- 235000008429 bread Nutrition 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 235000014121 butter Nutrition 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000004196 calcium 5'-ribonucleotide Substances 0.000 description 1
- 235000013894 calcium 5'-ribonucleotide Nutrition 0.000 description 1
- 239000004221 calcium diglutamate Substances 0.000 description 1
- 235000013921 calcium diglutamate Nutrition 0.000 description 1
- 239000004244 calcium guanylate Substances 0.000 description 1
- 235000013900 calcium guanylate Nutrition 0.000 description 1
- 239000004197 calcium inosinate Substances 0.000 description 1
- 235000013893 calcium inosinate Nutrition 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000006957 competitive inhibition Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000009146 cooperative binding Effects 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 239000004192 dipotassium guanylate Substances 0.000 description 1
- 235000013898 dipotassium guanylate Nutrition 0.000 description 1
- 239000004195 dipotassium inosinate Substances 0.000 description 1
- 235000013892 dipotassium inosinate Nutrition 0.000 description 1
- 239000004193 disodium 5'-ribonucleotide Substances 0.000 description 1
- 235000013888 disodium 5'-ribonucleotide Nutrition 0.000 description 1
- 239000004198 disodium guanylate Substances 0.000 description 1
- 235000013896 disodium guanylate Nutrition 0.000 description 1
- 239000004194 disodium inosinate Substances 0.000 description 1
- 235000013890 disodium inosinate Nutrition 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 235000013345 egg yolk Nutrition 0.000 description 1
- 210000002969 egg yolk Anatomy 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000004225 ferrous lactate Substances 0.000 description 1
- 239000010408 film Substances 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 235000013312 flour Nutrition 0.000 description 1
- 235000010037 flour treatment agent Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 239000000417 fungicide Substances 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 1
- 239000004247 glycine and its sodium salt Substances 0.000 description 1
- 235000013905 glycine and its sodium salt Nutrition 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000004226 guanylic acid Substances 0.000 description 1
- 235000013928 guanylic acid Nutrition 0.000 description 1
- 125000001475 halogen functional group Chemical group 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 125000004435 hydrogen atom Chemical class [H]* 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 125000001041 indolyl group Chemical group 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000004245 inosinic acid Substances 0.000 description 1
- 235000013902 inosinic acid Nutrition 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 239000002650 laminated plastic Substances 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 101150039489 lysZ gene Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 239000004240 magnesium diglutamate Substances 0.000 description 1
- 235000013918 magnesium diglutamate Nutrition 0.000 description 1
- 125000003071 maltose group Chemical group 0.000 description 1
- 150000002692 maltoses Chemical class 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000013048 microbiological method Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000004238 monoammonium glutamate Substances 0.000 description 1
- 235000013917 monoammonium glutamate Nutrition 0.000 description 1
- 239000004239 monopotassium glutamate Substances 0.000 description 1
- 235000013919 monopotassium glutamate Nutrition 0.000 description 1
- 239000004223 monosodium glutamate Substances 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 101150095344 niaD gene Proteins 0.000 description 1
- 230000009965 odorless effect Effects 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 239000010451 perlite Substances 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 101150054232 pyrG gene Proteins 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 102220209728 rs1057523598 Human genes 0.000 description 1
- 102220061201 rs786201790 Human genes 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000002791 soaking Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- OFVLGDICTFRJMM-WESIUVDSSA-N tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 1
- 150000004044 tetrasaccharides Chemical class 0.000 description 1
- 101150080369 tpiA gene Proteins 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 239000012137 tryptone Substances 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000004246 zinc acetate Substances 0.000 description 1
- 235000013904 zinc acetate Nutrition 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
- C12N9/1074—Cyclomaltodextrin glucanotransferase (2.4.1.19)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2408—Glucanases acting on alpha -1,4-glucosidic bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/18—Preparation of compounds containing saccharide radicals produced by the action of a glycosyl transferase, e.g. alpha-, beta- or gamma-cyclodextrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01019—Cyclomaltodextrin glucanotransferase (2.4.1.19)
Definitions
- the present invention relates to variants of cyclomaltodextrin glucanotransferase. More specifically the invention relates to a method of modifying the substrate binding and/or product selectivity of a precursor CGTase enzyme, and CGTase variants derived from a precursor CGTase enzyme by substitution, insertion and/or deletion of one or more amino acid residue(s), which amino acid residue(s) holds a position close to the substrate. Moreover, the invention relates to DNA constructs encoding the CGTase variants, expression vectors, host cells and methods of producing the CGTase variants of the invention.
- Cyclomaltodextrin glucanotransferase (E.C. 2.4.1.19), also designated cyclodextrin glucanotransferase or cyclodextrin glycosyltransferase, in the following termed CGTase, catalyses the conversion of starch and similar substrates into cyclomaltodextrins via an intramolecular transglycosylation reaction, thereby forming cyclomaltodextrins, in the following termed cyclodextrins (or CD), of various sizes.
- CGTase Cyclomaltodextrin glucanotransferase (E.C. 2.4.1.19)
- CGTase catalyses the conversion of starch and similar substrates into cyclomaltodextrins via an intramolecular transglycosylation reaction, thereby forming cyclomaltodextrins, in the following termed cyclodextrins (
- cyclodextrins of 6, 7 and 8 glucose units which are termed ⁇ -, ⁇ - and ⁇ -cyclodextrins, respectively.
- cyclodextrins of 9, 10, and 11 glucose units which are termed ⁇ -, ⁇ -, and ⁇ -cyclodextrins, respectively.
- Cyclodextrins are thus cyclic glucose oligomers with a hydrophobic internal cavity. They are able to form inclusion complexes with many small hydrophobic molecules in aqueous solutions, resulting in changes in physical properties, e.g. increased solubility and stability and decreased chemical reactivity and volatility. Cyclodextrins find applications particularly in the food, cosmetic, chemical and pharmaceutical industries.
- CGTases have both starch-degrading activity and transglycosylation activity. Although some CGTases produce mainly ⁇ -cyclodextrins and some CGTases produce mainly ⁇ -cyclodextrins, CGTases usually form a mixture of ⁇ -, ⁇ - and ⁇ -cyclodextrins. Selective precipitation steps with organic solvents may be used for the isolation of separate ⁇ -, ⁇ - and ⁇ -cyclodextrins. To avoid expensive and environmentally harmful procedures, the availability of CGTases capable of producing an increased ratio of one particular type of cyclodextrin is desirable.
- CGTases from different bacterial sources including CGTases obtained from Bacillus, Brevibacterium, Clostridium, Corynebacterium, Klebsiella, Micrococcus, Thermoanaerobacter and Thermoanaerobacterium have been described in the literature.
- EP 614971 describes a Brevibacterium CGTase, Haeckel & Bahl [Haeckel K, Bahl H; FEMS Microbiol. Lett. 1989 60 333-338] describe Clostridium thermosulfurogenes CGTase, Podkovyrov & Zeikus [Podkovyrov S M, Zeikus J G; J. Bacteriol. 1992 174 5400-5405] describe a Clostridium thermohydrosulfricum CGTase, JP 7000183 describes a Corynebacterium CGTase, Binder et al.
- thermophilic Actinomycetes have been reported [Abelian V A, Afyan K B, Avakian Z G, Melkumyan A G and Afrikian E G; Biochemistry (Moscow) 1995 60 (10) 1223-1229].
- CGTases are functionally related to ⁇ -amylases. CGTases and ⁇ -amylases both degrade starch by hydrolysis of the ⁇ -(1,4)-glycosidic bonds, but produce virtually exclusively cyclic and linear products, respectively.
- CGTases and ⁇ -amylases possess a high overall amino acid sequence identity, more than 60%.
- CGTases and ⁇ -amylases share about 30% amino acid sequence identity.
- the active site clefts of CGTases and ⁇ -amylases, located between the A and B domain are rather similar.
- the catalytic site of CGTases is located in the A domain, and has three catalytic residues (in Bacillus circulans strain 251 these are Asp229, Glu257 and Asp328, respectively, cf. Strokopytov et al. 1995, op cit.).
- a central amino acid residue is located in the B domain, around which residue the cyclodextrins are formed, i.e. the cyclization axis. Substitution of this central residue, e.g.
- Nakamura et al. [Nakamura A, Haga K and Yamane K; Biochemistry 1994 33 9929-9936] describe the effects on substrate binding and cyclization characteristics by replacements carried out at four residues in the active center of a Bacillus sp. Strain 1011 CGTase.
- a phenylalanine at position 183 has been replaced by leucine
- a tyrosine at position 195 has been replaced by alanine
- phenylalanine, leucine, threonine, valine, and tryptophan respectively
- a phenylalanine at position 259 has been replaced by leucine
- a phenylalanine at position 283 has been replaced by leucine.
- Penninga et al. [Penninga D, Strokopytov B, Rozeboom H J, Lawson C L, Dijkstra B W, Bergsma J and Dijkhuizen L; Biochemistry 1995 34 3368-3376] describe the effect on activity and product selectivity of site-directed mutations in tyrosine at position 195 of a Bacillus circulans Strain 251 CGTase.
- four CGTase variants have been produced, in which variants the tyrosine at position 195 have been replaced by phenylalanine, tryptophan, leucine and glycine, respectively.
- Fujiware et al. [Fujiwara S, Kakihara H, Sakaguchi K and Imanaka T; J. Bacteriol. 1992 174 (22) 7478-7481] describe CGTase variants derived from Bacillus stearothermophilus, in which a tyrosine residue at position 191 (corresponding to position 195 CGTase numbering) has been replaced by phenylalanine, a tryptophan residue at position 254 (corresponding to position 258, CGTase numbering) has been replaced by valine, a phenylalanine at position 255 (corresponding to position 259, CGTase numbering) has been replaced by phenylalanine and isoleucine, respectively, a threonine residue at position 591 (corresponding to position 598, CGTase numbering) has been replaced by phenylalanine, and a tryptophan residue at position 629 (corresponding to position 636, CGTase numbering)
- JP-A-7023781 describes CGTase variants derived from Bacillus sp. 1011, in which a tyrosine residue at position 195 has been replaced by leucine, valine, phenylalanine and isoleucine, respectively.
- JP-A-5244945 describes CGTase variants derived from Bacillus stearothermophilus TC-91, in which tyrosine residues at positions 222 and 286 (corresponding to positions 195 and 259, CGTase numbering) have been replaced by phenylalanine in order to increase the relative production of ⁇ -cyclodextrin to ⁇ -cyclodextrin.
- JP-A-5041985 describes CGTase variants derived from Bacillus sp. #1011, in which histidine at residue 140 in region A, histidine at residue 233 in region B, and histidine at residue 327 in region C, respectively, have been replaced by arginine and asparagine residues, respectively.
- EP 630,967 describes CGTase variants in which a tyrosine residue at position 211 of a Bacillus sp. 290-3 CGTase (corresponding to position 195, CGTase numbering), at position 217 of a Bacillus sp. 1-1 CGTase (corresponding to position 195, CGTase numbering), and at position 229 of a Bacillus circulans CGTase (corresponding to position 195, CGTase numbering), have been substituted for tryptophan and serine.
- the invention provides a method of modifying the substrate binding and/or product selectivity of a precursor CGTase enzyme, which method comprises substitution, insertion and/or deletion of one or more amino acid residue(s) of the precursor enzyme, which amino acid residue(s) holds a position close to the substrate.
- the invention provides a CGTase variant derived from a precursor CGTase enzyme by substitution, insertion and/or deletion of one or more amino acid residue(s), which amino acid residue(s) holds a position close to the substrate.
- the invention provides a DNA construct encoding a CGTase variant of the invention.
- the invention provides a recombinant expression vector comprising the DNA construct of the invention.
- the invention provides a host cell comprising the DNA construct of the invention, or the recombinant expression vector of the invention.
- the invention provides a method of producing a CGTase variant of the invention, which method comprises culturing the host cell of the invention under conditions permitting the production of the CGTase variant, and recovering the enzyme from the culture.
- the invention provides CGTase variants for use in processes for the manufacture of cyclodextrins, in processes for the manufacture of linear oligosaccharides, and in processes for in situ generation of cyclodextrins.
- a CGTase variant of this invention is a CGTase variant or mutated CGTase, having an amino acid sequence not found in nature.
- a CGTase variant or mutated CGTase of this invention is a functional derivative of a precursor CGTase enzyme (i.e. the native, parental, or wild-type enzyme), and may be obtained by alteration of a DNA nucleotide sequence of a precursor gene or its derivatives, encoding the precursor enzyme.
- the CGTase variant or mutated CGTase may be expressed and produced when the DNA nucleotide sequence encoding the CGTase variant is inserted into a suitable vector in a suitable host organism.
- the host organism is not necessarily identical to the organism from which the precursor gene originated.
- enzyme variants have also been referred to as mutants or muteins.
- substitution of serine with alanine in position 145 is designated as S145A.
- Amino acid residues which represent insertions in relation to the amino acid sequence of the CGTase from Bacillus circulans Strain 251 are numbered by the addition of letters in alphabetical order to the preceding CGTase number, such as e.g. position 91aF for the "insert" Phe between Thr at position 91 and Gly at position 92 of the amino acid sequence of the CGTase from Thermoanaerobacter sp. ATCC 53627, cf. Table 1 (j).
- P149* Deletion of a proline at position 149 is indicated as P149*, and an insertion between position 147 and 148 where no amino acid residue is present, is indicated as *147aD for insertion of an aspartic acid in position 147a.
- slash marks e.g. S145A/D147L, representing mutations in positions 145 and 147 substituting serine with alanine and aspartic acid with leucine, respectively.
- FIG. 1 shows a model of the structure of the active site cleft (domains A and B) of a CGTase from Bacillus circulans Strain 251, which has been complexed with a linear starch molecule, and residues involved in the enzyme-substrate interactions;
- FIG. 2 shows the formation (% cyclodextrin) of ⁇ - ( ⁇ ), ⁇ - ( ⁇ ), and ⁇ -cyclodextrin ( ⁇ ) from 10% PaselliTM WA4 (pre-gelatinized drum-dried starch) during a 50 hour incubation at 50° C. catalyzed by (A) wild-type enzyme (Bacillus circulans Strain 251 CGTase), (B) the Y89D CGTase variant, (C) the S146P CGTase variant, and (D) the Y89D/S146P CGTase variant;
- A wild-type enzyme
- B Bacillus circulans Strain 251 CGTase
- B the Y89D CGTase variant
- C the S146P CGTase variant
- D the Y89D/S146P CGTase variant
- FIG. 3 shows the construction of plasmid pDP66K, subcloning steps are indicated adjacent to the arrows;
- FIG. 4 shows the results of starch binding experiments (% of protein bound to raw starch) at starch concentrations of from 0 to 8% raw starch, ( ⁇ ) without ⁇ -cyclodextrin, and ( ⁇ ) with 0.1 mM ⁇ -cyclodextrin; (a) wild-type enzyme (Bacillus circulans Strain 251 CGTase), (b) the W616A/W662A variant, and (c) the Y633A variant;
- FIG. 5 shows the results of reaction kinetic experiments (activity, U/mg) on PaselliTM SA2 (i.e. partially hydrolysed potato starch) at concentrations of from 0 to 5% PaselliTM, ( ⁇ ) without ⁇ -cyclodextrin, ( ⁇ ) with 0.1 mM ⁇ -cyclodextrin, and ( ⁇ ) with 0.2 mM ⁇ -cyclodextrin; (a) wild-type enzyme (Bacillus circulans Strain 251 CGTase), (b) the W616A/W662A variant, and (c) the Y633A variant;
- PaselliTM SA2 i.e. partially hydrolysed potato starch
- FIG. 6 shows the results of reaction kinetic experiments (activity, U/mg) on raw starch at starch concentration of from 0 to 60% raw starch, ( ⁇ ) wild-type enzyme (Bacillus circulans Strain 251 CGTase), ( ⁇ ) the W616A/W662A variant, and ( ⁇ ) the Y633A variant; the dotted line indicates the modelled curve resulting from the supposed interaction between MBS2 on the E domain and MBS3 on the C domain;
- FIG. 7 shows the product formation ( ⁇ ⁇ -cyclodextrin formation; ⁇ ⁇ -cyclodextrin formation, and ⁇ ⁇ -cyclodextrin formation) of two CGTase variants of the invention (N193G, FIG. 7B, and Y89G, FIG. 7C) compared to the wild-type enzyme (from Bacillus circulans Strain 251, FIG. 7A) during incubation for 0 to 45 hours;
- FIG. 8 shows the product formation ( ⁇ ⁇ -cyclodextrin formation; ⁇ ⁇ -cyclodextrin formation, and ⁇ ⁇ -cyclodextrin formation) of two CGTase variants of the invention (*145aI, FIG. 8B, and D371G, FIG. 8C) compared to the wild-type enzyme (from Bacillus circulans Strain 251, FIG. 8A) during incubation for 0 to 45 hours;
- FIG. 9 shows the product formation ( ⁇ ⁇ -cyclodextrin formation; ⁇ ⁇ -cyclodextrin formation, and ⁇ ⁇ -cyclodextrin formation) of two CGTase variants of the invention (N193G, FIG. 9B, and Y89G, FIG. 9C) compared to the wild-type enzyme (from Bacillus circulans Strain 251, FIG. 9A) during incubation for 0 to 10 hours; and
- FIG. 10 shows the product formation ( ⁇ ⁇ -cyclodextrin formation; ⁇ ⁇ -cyclodextrin formation, and ⁇ ⁇ -cyclodextrin formation) of two CGTase variants of the invention (145aI, FIG. 10B, and D371G, FIG. 10C) compared to the wild-type enzyme (from Bacillus circulans Strain 251, FIG. 10A) during incubation for 0 to 10 hours.
- the present invention provides a method of modifying the substrate binding and/or increasing the product selectivity of a CGTase enzyme, thereby obtaining a CGTase variant having a modified substrate binding capability and/or an increased product selectivity, as compared to the precursor enzyme.
- a CGTase variant of modified substrate binding capability is meant to describe a CGTase variant that is able to more efficiently act on its substrate, and/or a CGTase variant that is less affected by product inhibition.
- product inhibition is meant to describe the phenomenon that increasing amounts of product reduce or even inhibit the substrate conversion. It is desirable to obtain CGTase variants that are less affected by product inhibition (i.e. variants of reduced product inhibition).
- a CGTase variant of increased product selectivity is meant to describe a CGTase variant that is able to more selectively produce any of the various cyclodextrins thereby increasing the ratio of the desired product, as compared to the precursor enzyme.
- the present invention is based on the concept of removing and/or introducing "obstacles" in the subsites of the active site cleft, the substrate binding cleft, or the groove leading to these clefts, thereby facilitating introduction of the substrate and its disposition in such a way that products of a predetermined size are obtained, and in such a way that substrate binding is not inhibited by the product.
- CGTase enzyme By modifying the substrate binding of a CGTase enzyme, its product selectivity can be modified in order that the CGTase variant is able to more selectively produce any of the various cyclodextrins, ⁇ -, ⁇ - and ⁇ -cyclodextrins. Even CGTases capable of producing ⁇ -, ⁇ -, and ⁇ -cyclodextrins with 9, 10 and 11 glucose units, respectively, may be obtained. Modification of the substrate binding of a CGTase may also reduce the tendency of product inhibition, thereby increasing the cyclodextrin yield of the CGTase variant.
- the concept of the invention may be expressed differently as the modification of enzyme-substrate side chain intermolecular interactions.
- the intermolecular interactions between substrate and CGTase can be changed in order to direct the substrate to a specific location in the active site cleft, thereby obtaining a cyclic or linear product of predefined size, preferably ⁇ -, a ⁇ - or a ⁇ -cyclodextrin, or ⁇ -, ⁇ , and ⁇ -cyclodextrins, or a linear oligosaccharide of similar size, preferably of 2-12 glucose units, more preferred 2-9 glucose units.
- the introduction of more intermolecular interactions in the region around glucose units C to I, preferably C to H, of FIG. 1, will lock the substrate in a position 6 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (6 glucose units).
- the formation of larger cyclodextrins and/or larger linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit I to J of FIG. 1.
- the introduction of more intermolecular interactions in the region around glucose units F to J, preferably H and I, of FIG. 1, will lock the substrate in a position 7 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (7 glucose units).
- the formation of e.g. ⁇ -cyclodextrins and/or small linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit C to G of FIG. 1.
- the introduction of more intermolecular interactions in the region around glucose units H to K, preferably I and J, of FIG. 1, will lock the substrate in a position 8 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (8 glucose units).
- the formation of smaller cyclodextrins and/or linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit C to H of FIG. 1.
- the introduction of more intermolecular interactions in the region around glucose units J to M, preferably K and L, of FIG. 1, will lock the substrate in a position 9 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (9 glucose units).
- cyclodextrins and/or linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit C to H of FIG. 1.
- the introduction of more intermolecular interactions in the region around glucose units K to N, preferably L and M, of FIG. 1, will lock the substrate in a position 10 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (10 glucose units).
- the formation of smaller cyclodextrins and/or linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit C to H of FIG. 1.
- the introduction of more intermolecular interactions in the region around glucose units L to O, preferably M and N, of FIG. 1, will lock the substrate in a position 11 glucose units from the catalytic site (between glucose units B and C of FIG. 1), and lead to increased product selectivity for ⁇ -cyclodextrins (11 glucose units).
- the formation of smaller cyclodextrins and/or linear oligosaccharides may simultaneously be reduced by reducing potential intermolecular interactions of glucose unit C to H of FIG. 1.
- the formation of linear oligosaccharides of desired length may be increased by combining the above conditions with substitution at the cyclization axis, corresponding to position 195, CGTase numbering.
- the CGTase enzyme subjected to the method of the invention may be any CGTase found in nature.
- the CGTase preferably is a microbial enzyme, preferably a bacterial enzyme, and preferably the CGTase is derived from a strain of Bacillus, a strain of Brevibacterium, a strain of Clostridium, a strain of Corynebacterium, a strain of Klebsiella, a strain of Micrococcus, a strain of Thermoanaerobium, a strain of Thermoanaerobacter, a strain of Thermoanaerobacterium, or a strain of Thermoactinomyces.
- the CGTase is derived from a strain of Bacillus autolyticus, a strain of Bacillus cereus, a strain of Bacillus circulans, a strain of Bacillus circulans var. alkalophilus, a strain of Bacillus coagulans, a strain of Bacillus firmus, a strain of Bacillus halophilus, a strain of Bacillus macerans, a strain of Bacillus megaterium, a strain of Bacillus ohbensis, a strain of Bacillus stearothermophilus, a strain of Bacillus subtilis, a strain of Klebsiella pneumonia, a strain of Thermoanaerobacter ethanolicus, a strain of Thermoanaerobacter finnii, a strain of Clostridium thermoamylolyticum, a strain of Clostridium thermosaccharolyticum, or a strain of Thermoanaerobacterium thermos
- the CGTase is derived from the strain Bacillus sp. Strain 1011, the strain Bacillus sp. Strain 38-2, the strain Bacillus sp. Strain 17-1, the strain Bacillus sp. 1-1, the strain Bacillus sp. Strain B1018, the strain Bacillus circulans Strain 8, the strain Thermoanaerobacter sp. ATCC 53627, or the strain Bacillus circulans Strain 251, or a mutant or a variant thereof.
- the strain Thermoanaerobacter sp. ATCC 53627 was deposited according to the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure at the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Md. 20852, USA, on Jun. 3, 1987.
- the strain Bacillus circulans Strain 251 has been deposited in the open collection at Rijksinstituut voor Kingsgezondheid (RIV), Bilthoven, The Netherlands, and allotted the accession number RIV 11115, and thus is publicly available.
- the method of the invention comprises substitution, insertion and/or deletion of one or more amino acid residue(s) of the enzyme, which residue(s) hold a position close to the substrate, when the substrate has bound to the CGTase enzyme at its substrate binding sites.
- the method of the invention comprises substitution, insertion and/or deletion of two or more amino acid residue(s), preferably of three or more amino acid residue(s).
- a CGTase amino acid residue holding a position close to the substrate indicates an amino acid residue located within the enzyme in a way that it is within a potential intermolecular (i.e. enzyme-substrate) interactive distance from a glucose unit of the substrate (i.e. a polysaccharide).
- potential intermolecular interactions include, but are not limited to hydrogen bonding, salt bridge formation, polar interactions, hydrophobic interactions, and aromatic interactions.
- an amino acid position close to the substrate indicates a distance less than 8 ⁇ (angstrom), preferably less than 5 ⁇ , more preferred less than 3 ⁇ , from the substrate.
- these distances are calculated using the CGTase from Bacillus circulans Strain 251 [cf. Lawson C L, van Montfort R, Strokopytov B, Rozeboom H J, Kalk K H, de Vries G E, Penninga D, Dijkhuizen L, and Dijkstra B W; J. Mol. Biol. 1994 236 590-600], complexed with a derivative of maltonanose, the coordinates of which have been deposited with the Protein Data Bank, Biology Department, Bldg. 463, Brookhaven National Laboratory, P.O. Box 5000, Upton, N.Y. 11973-5000, USA, under the entry code 1DIJ.
- Knowledge of this structure makes it possible to identify similar positions in other CGTases, having a known primary structure, which positions corresponds to the positions stated in e.g. Table 2, cf. also Table 1.
- the method of the invention comprises substituting one or more amino acid residue(s) of the CGTase enzyme, which residue(s) are located in one or more of the A, B, C and/or E domains, cf. Table 1.
- sequence alignment and molecular modelling of a CGTase enzyme found in nature amino acid residues located close to the substrate can be identified.
- sequence alignment the tertiary structure of any homologous CGTase can be modelled based on known three-dimensional CGTase structures.
- Table 2 presents a list of CGTase amino acid positions located within 8 ⁇ from the substrate, and therefore to be considered in the context of this invention.
- the amino acid residues are identified by CGTase numbering, which allows identification of the corresponding amino acid positions in any CGTase enzyme.
- the method of the invention comprises substitution, insertion and/or deletion at one or more amino acid residue(s) identified in Table 2, below.
- the method of the invention comprises substitution, insertion and/or deletion at one or more amino acid residue(s) identified in Tables 3-5, below.
- the method of the invention comprises substitution, insertion and/or deletion at one or more amino acid residue(s) identified in Tables 6-8, below.
- the substrate binding and product selectivity of a CGTase variant of the invention can be designed by removing existing and/or introducing potential intermolecular interactions between the CGTase variant and its substrate.
- intermolecular interactions include, but are not limited to hydrogen bonding, salt bridge formation, polar interactions, hydrophobic interactions, and aromatic interactions.
- Amino acid residues having side chains with hydrogen bonding potentials are generally the following:
- amino acids do not in general possess the potential ability to form side chain hydrogen bonds (i.e. no H-bonding capability):
- Amino acid residues having side chains with salt bridge formation potentials are generally the following:
- Amino acid residues having side chains with polar interaction potentials are generally the following:
- Amino acid residues having side chains with hydrophobic interaction potentials are generally the following:
- Amino acid residues having side chains with aromatic interaction potentials are generally the following:
- the present invention provides novel CGTase variants, having an amino acid sequence not found in nature.
- the CGTase variant of the invention is regarded a derivative of a precursor CGTase enzyme (i.e. the native, parental, or wild-type enzyme).
- the substrate binding and/or product selectivity has been modified, as compared to the precursor CGTase enzyme, by replacement, insertion and/or deletion of one or more amino acid residue(s) holding a position close to the substrate.
- the CGTase variant of the invention may be derived from any CGTase enzyme found in nature.
- the CGTase variant of the invention preferably is derived from a microbial enzyme, preferably a bacterial enzyme, and preferably the CGTase variant is derived from a strain of Bacillus, a strain of Brevibacterium, a strain of Clostridium, a strain of Corynebacterium, a strain of Klebsiella, a strain of Micrococcus, a strain of Thermoanaerobium, a strain of Thermoanaerobacter, a strain of Thermoanaerobacterium, or a strain of Thermoactinomyces.
- the CGTase variant of the invention is derived from a strain of Bacillus autolyticus, a strain of Bacillus cereus, a strain of Bacillus circulans, a strain of Bacillus circulans var. alkalophilus, a strain of Bacillus coagulans, a strain of Bacillus firmus, a strain of Bacillus halophilus, a strain of Bacillus macerans, a strain of Bacillus megaterium, a strain of Bacillus ohbensis, a strain of Bacillus stearothermophilus, a strain of Bacillus subtilis, a strain of Klebsiella pneumonia, a strain of Thermoanaerobacter ethanolicus, a strain of Thermoanaerobacter finnii, a strain of Clostridium thermoamylolyticum, a strain of Clostridium thermosaccharolyticum, or a strain of Thermoanaero
- the CGTase variant of the invention is derived from the strain Bacillus sp. Strain 1011, the strain Bacillus sp. Strain 38-2, the strain Bacillus sp. Strain 17-1, the strain Bacillus sp. 1-1, the strain Bacillus sp. Strain B1018, the strain Bacillus circulans Strain 8, the strain Bacillus circulans Strain 251, or the strain Thermoanaerobacter sp. ATCC 53627, or a mutant or a variant thereof.
- an amino acid residue holding a position close to the substrate indicates an amino acid residue located within the enzyme in such a way that it is within a potential intermolecular (i.e. enzyme-substrate) interactive distance from a glucose unit of the substrate (i.e. a polysaccharide).
- a potential intermolecular i.e. enzyme-substrate
- a glucose unit of the substrate i.e. a polysaccharide
- Examples of potential intermolecular interactions include, but are not limited to hydrogen bonding, salt bridge formation, polar interactions, hydrophobic interactions, and aromatic interactions.
- an amino acid position close to the substrate indicates a distance less than 8 ⁇ (angstrom), preferably less than 5 ⁇ , more preferred less than 3 ⁇ , from the substrate.
- CGTases have substrate binding regions located at the A domain, at the B domain, at the C domain and at the E domain. Consequently, in a preferred embodiment, the invention provides a CGTase variant, in which variant a substitution, an insertion and/or a deletion have been introduced at one or more of the amino acid residue(s) located in one or more of the A, B, C and E domains.
- the invention provides a CGTase variant, in which variant a substitution, an insertion and/or a deletion have been introduced at one or more of the amino acid positions corresponding to the positions stated in Table 2.
- CGTase numbering a substitutions at positions 195 and 198 (CGTase numbering) have been accomplished, the CGTase is not contemplated a CGTase variant of the invention unless additional substitution, insertion and/or deletion at one or more amino acid residue(s) has been introduced.
- a CGTase comprising any of the following specific mutations: H140R, H140N, F183L, H233R, H233N, W258V, F259L, F259I, F259Y, F283L, H327R, H327N, T598F and/or W636F, is not contemplated a CGTase variant of the invention, unless additional substitution, insertion and/or deletion of amino acid residue(s) at one or more positions not stated here has been introduced.
- CGTase comprising any of the following specific mutations: F195Y/F259Y, W258V/F259I, T598F/W636F, and F183L/F259L, is not contemplated a CGTase variant of the invention, unless additional substitution, insertion and/or deletion of amino acid residue(s) at one or more positions has been introduced. Therefore such CGTase variants are disclaimed according to the present invention.
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Bacillus, which enzyme has been modified by substitution, insertion and/or deletion at one or more amino acid positions corresponding to the positions stated in Tables 3-5.
- the CGTase variant is derived from a strain of Bacillus autolyticus, a strain of Bacillus cereus, a strain of Bacillus circulans, a strain of Bacillus circulans var.
- the CGTase variant is derived from the strain Bacillus sp. Strain 1011, the strain Bacillus sp. Strain 38-2, the strain Bacillus sp. Strain 17-1, the strain Bacillus sp. 1-1, the strain Bacillus sp. Strain B1018, the strain Bacillus circulans Strain 8, or the strain Bacillus circulans Strain 251, or a mutant or a variant thereof.
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Thermoanaerobacter, which enzyme has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Tables 6-8.
- the CGTase variant is derived from the strain Thermoanaerobacter sp. ATCC 53627, or a mutant or a variant thereof.
- intermolecular enzyme/substrate interactions have been modified, as compared to the precursor enzyme.
- potential intermolecular interactions include, but are not limited to hydrogen bonding, salt bridge formation, polar interactions, hydrophobic interactions, and aromatic interactions. Such modifications may be accomplished by substitution, insertion and/or deletion at one or more of the above described positions, according to the following guidance.
- Amino acid residues having side chains with hydrogen bonding potentials are generally the following:
- amino acids do not in general possess the potential ability to form side chain hydrogen bonds (i.e. no H-bonding capability):
- Amino acid residues having side chains with salt bridge formation potentials are generally the following:
- Amino acid residues having side chains with polar interaction potentials are generally the following:
- Amino acid residues having side chains with hydrophobic interaction potentials are generally the following:
- Amino acid residues having side chains with aromatic interaction potentials are generally the following:
- variants are obtained, which possess an altered number of hydrogen bonds or other interactions in the subsites of the active cleft or in the groove leading to this cleft or on the maltose binding sites.
- interactions on or before subsites C-I of the substrate should be increased, and interactions on subsites I and higher should be decreased.
- sterical hindrance could be applied to prevent binding on subsites I and higher.
- Mutations which increase hydrogen bonding relative to the substrate are e.g.:
- H233Q relative to subsite B of the substrate
- L197D or L197E subsite D
- N94Q or N94K or N94R or N94W or N94F subsite E
- D371N or D371G subsite E+F
- Y89D subsite E
- A144K or A144R or A144D subsite H
- N193D or N193E subsite H
- Y167F in order to release the residue at position 193 for H-bonding to subsite H
- T185R or T185E or T185D on maltose binding site 2, cf. below.
- Mutations which alter the conformation of the substrate binding cleft, and thus make new enzyme-substrate interactions are e.g.:
- Mutations which decrease hydrogen bonding relative to the substrate are e.g.:
- a mutation which increases the hydrogen bonding relative to subsite H is e.g. A144R.
- a mutation which increases hydrogen bonding relative to the substrate is e.g. N88K.
- S145W or S145Y or S145F, and S146W or S146I or S146R or S146P prevent binding on subsite I of the substrate.
- L600W or L600F or L600Y (of maltose binding site 2, cf. below).
- an ⁇ -cyclodextrin forming CGTase variant of the invention may be a variant, which at positions 87-94 comprises the partial amino acid sequence IKYSGVNN (SEQ ID NO:11), and/or at positions 143-151 comprises the partial amino acid sequence GRAGTNPGF (SEQ ID NO:12), or at positions 143-145 comprises the partial amino acid sequence GRW.
- Mutations which alter the conformation of the substrate binding cleft close to the active site and thus create space for larger cyclodextrins are e.g.:
- N88P Y89* (a deletion), 91aY (an insertion), V92* or N92*, and N94*.
- a mutation which increases hydrogen bonding relative to the substrate is e.g. S146E.
- Mutations which decrease hydrogen bonding relative to the substrate are e.g.
- Mutations which remove hydrogen bonds from subsites D, E, F, H, I and J of the substrate are e.g.:
- a mutation which introduce sterical hindrance between subsites I and J of the substrate, designed to shift the product ratio towards the production of smaller cyclodextrins is e.g. D147W.
- a ⁇ -cyclodextrin forming CGTase variant of the invention may be a variant, which at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5).
- a ⁇ -cyclodextrin forming CGTase variant of the invention may be a variant, which at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5), and which variant at position 195 holds a leucine residue (X195L).
- a CGTase variant of the invention capable of forming linear oligosaccharides may be a variant, which at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5), and which variant at position 195 holds a glycine residue (X195G).
- Mutations which alter the conformation of the substrate binding cleft close to the active site and thus create space for larger cyclodextrins are e.g.:
- N88P Y89* (a deletion), 91aY (an insertion), V92* or N92*, and N94*.
- a mutation which increases hydrogen bonding relative to the substrate is e.g. S146E.
- Mutations which decrease hydrogen bonding relative to the substrate are e.g.
- Mutations which remove hydrogen bonds from subsites D, E, F and H of the substrate are e.g.:
- Mutations which change the binding properties at subsites I and J of the substrate are e.g.:
- X145aI or *145aI via insertion
- S145A, and Q148E in particular S145A/X145aI or A145A/*145aI, and X145aI/Q148E or *145aI/Q148E.
- Mutations which reduce the coupling activity at subsites A, D and E are e.g.:
- Mutations reducing the coupling activity by changing specific binding of cyclodextrins is e.g. R47Q.
- V21F or V21Y are V21F or V21Y.
- a ⁇ -cyclodextrin forming CGTase variant of the invention may be a variant, which at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6).
- a ⁇ -cyclodextrin forming CGTase variant of the invention may be a variant, which at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6), and which variant at position 195 holds a leucine residue (X195W).
- the variants of the invention may be combined with a substitution at the central amino acid residue forming the cyclization axis, corresponding to position 195, CGTase numbering.
- a substitution at the central amino acid residue forming the cyclization axis corresponding to position 195, CGTase numbering.
- tyrosine and phenylalanine are predominant in wild-type CGTases (cf. Table 1).
- glycine is introduced at this position (X195G).
- a CGTase variant of the invention is an enzyme which has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Table 9, below. As indicated in this table, the introduction of one or more of these substitutions/insertions/deletions lead to CGTase variants of increased product selectivity in respect of ⁇ -, ⁇ - or ⁇ -cyclodextrins, respectively.
- the E domain of the Bacillus circulans Strain 251 CGTase has now been identified as a raw starch binding domain.
- MBS maltose binding sites
- a CGTase variant of the invention is an enzyme which has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Table 10, below. Such modifications lead to CGTase variants of reduced product inhibition.
- Mutations which reduces non-competitive product inhibition are e.g.:
- Y633A (takes place on MBS2, this mutation completely removes non-competitive product inhibition), 599aP or 599aR or 599aH, and L600R.
- Residues 595-605 form a loop next to MBS2. Insertion enlarges the loop, thereby preventing binding of a cyclodextrin to MBS2 by sterical hindrance, while the role of MBS2 in guidance of the substrate chain is preserved. Mutations at position 600 and adjacent residues could reduce the binding of cyclic products to MBS2, while the binding of linear substrates remains unaffected. Substitution of leucine at position 600 with aspartate, alanine or glycine has minor effects on product inhibition. Substitution with arginine, due to its large size and charged nature, affect binding of cyclodextrins, thereby reducing product inhibition.
- Mutations that decrease electrostatic interactions around MBS1, leading to decreased product affinity are e.g. W616A and/or W662A.
- Mutations that decrease electrostatic interactions around MBS2, leading to decreased product affinity are e.g. L600A or L600S, and/or Y663A.
- a mutations that decreases electrostatic interactions around MBS3, leading to decreased product affinity is e.g. W413A.
- Second (malto-oligosaccharide) substrate is contemplated caused by coupling reactions. Reduction of this coupling reaction may be achieved by reducing the binding of the first (cyclodextrin) and second (malto-oligosaccharide) substrate.
- Mutations reducing competitive product inhibition by reducing cyclodextrin binding are e.g.:
- R47A or R47Q or R47L Y89G, D196A or D196L, D371G or D371N or D371A or D371L, and R375G or R375Q or R375N or R375A or R375L.
- Mutations reducing competitive product inhibition by reducing binding of the second substrate are e.g.:
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Bacillus, which enzyme has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Table 11, below. Such modifications lead to CGTase variants of increased product selectivity, as indicated in the table.
- the CGTase variant is derived from a strain of a strain of Bacillus autolyticus, a strain of Bacillus cereus, a strain of Bacillus circulans, a strain of Bacillus circulans var. alkalophilus, a strain of Bacillus coagulans, a strain of Bacillus firmus, a strain of Bacillus halophilus, a strain of Bacillus macerans, a strain of Bacillus megaterium, a strain of Bacillus ohbensis, a strain of Bacillus stearothermophilus, or a strain of Bacillus subtilis.
- the CGTase variant is derived from the strain Bacillus sp. Strain 1011, the strain Bacillus sp. Strain 38-2, the strain Bacillus sp. Strain 17-1, the strain Bacillus sp. 1-1, the strain Bacillus sp. Strain B1018, the strain Bacillus circulans Strain 8, or the strain Bacillus circulans Strain 251, or a mutant or a variant thereof.
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Bacillus, which enzyme has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Table 12, below. Such modifications lead to CGTase variants of reduced product inhibition.
- the invention provides the following CGTase variants:
- a CGTase variant which variant at position 21 holds a tyrosine residue (F21Y).
- a CGTase variant which variant at position 47 holds a glutamine residue (R47Q), or an alanine residue (R47A), or a leucine residue (R47L), or a histidine residue (R47H).
- a CGTase variant which variant at position 88 holds a proline residue (N88P) or a lysine residue (N88K).
- a CGTase variant which variant at position 89 holds an aspartic acid residue (Y89D), or an alanine residue (Y89A), or a glycine residue (Y89G).
- a CGTase variant which variant at position 91a (via insertion) holds an alanine residue (*91aA), or a tyrosine residue (*91aY).
- a CGTase variant which variant at position 94 holds a glutamine residue (N94Q), or a lysine residue (N94K), or an arginine residue (N94R), or a tryptophan residue (N94W), or a phenylalanine residue (N94F), or in which variant position 94 has been deleted (N94*).
- a CGTase variant which variant at position 135 holds a leucine residue (D135L).
- a CGTase variant which variant at position 143 holds a natural amino acid residue different from that of the wild-type enzyme (P143X).
- a CGTase variant which variant at position 143 holds an alanine residue (P143A), or a glycine residue (P143G).
- a CGTase variant which variant at position 144 holds a natural amino acid residue different from that of the wild-type enzyme (A144X).
- a CGTase variant which variant at position 144 holds an arginine residue (A144R), or a lysine residue (A144K), or an aspartic acid residue (A144D).
- a CGTase variant which variant at position 145 holds a natural amino acid residue different from that of the wild-type enzyme (S145X).
- a CGTase variant which variant at position 145 holds an alanine residue (S145A), or a glutamic acid (S145E), or a tryptophan residue (S145W), or a glycine residue (S145G), or a phenylalanine residue (S145F), or a tyrosine residue (S145Y), or a leucine residue (S145L).
- S145A alanine residue
- S145E glutamic acid
- S145W tryptophan residue
- S145G glycine residue
- S145F phenylalanine residue
- S145Y tyrosine residue
- leucine residue S145L
- a CGTase variant which variant at position 145a (via insertion) holds a natural amino acid residue (*145aX).
- a CGTase variant which variant at position 145a (via insertion) holds an isoleucine residue (*145aI).
- a CGTase variant which variant at position 146 holds a natural amino acid residue different from that of the wild-type enzyme (S146X).
- a CGTase variant which variant at position 146 holds a proline residue (S146P), or an isoleucine residue (S146I), or a glutamine residue (S146Q), or a tryptophan residue (S146W), or an arginine residue (S146R), or a glutamic acid residue (S146E).
- S146P proline residue
- S146I isoleucine residue
- S146Q glutamine residue
- S146W tryptophan residue
- S146R arginine residue
- S146E glutamic acid residue
- a CGTase variant which variant at position 147 holds an isoleucine residue (D147I), or a leucine residue (D147L), or an alanine residue (D147A), or a serine residue (D147S), or a tryptophan residue (D147W).
- a CGTase variant which variant at position 147a (via insertion) holds an alanine residue (*147aA).
- a CGTase variant which variant at position 147a (via insertion) holds a natural amino acid residue (*147aX).
- a CGTase variant which variant at position 148 holds a natural amino acid residue different from that of the wild-type enzyme (Q148X).
- a CGTase variant which variant at position 148 holds an alanine residue (Q148A), or a glycine residue (Q148G), or a glutamic acid residue (Q148E), or an asparagine residue (Q148N).
- a CGTase variant, which variant at position 149 holds a natural amino acid residue different from that of the wild-type enzyme (P149X).
- a CGTase variant which variant at position 179 holds a serine residue (G179S), an asparagine residue (G179N), or an aspartic acid residue (G179D).
- a CGTase variant which variant at position 180 holds a serine residue (G180S), an asparagine residue (G180N), or an aspartic acid residue (G180D).
- a CGTase variant which variant at position 185 holds an arginine residue (T185R), or a glutamic acid residue (T185E), or an aspartic acid residue (T185D).
- a CGTase variant which variant at position 193 holds a glycine residue (N193G), or an alanine residue (N193A), or an aspartic acid residue (N193D), or a glutamic acid residue (N193E).
- a CGTase variant, which variant at position 196 holds a natural amino acid residue different from that of the wild-type enzyme (D196X).
- a CGTase variant which variant at position 196 holds an alanine residue (D196A), a serine residue (D196S), or a leucine residue (D196L).
- a CGTase variant which variant at position 197 holds an aspartic acid residue (L197D), or a glutamic acid residue (L197E).
- a CGTase variant which variant at position 232 holds a glutamine residue (K232Q), or an asparagine residue (K232N), or an alanine residue (K232A), or a leucine residue (K232L).
- a CGTase variant which variant at position 264 holds a glutamine residue (E264Q), or an alanine residue (E264A), or an asparagine residue (E264N), or a leucine residue (E264L).
- a CGTase variant which variant at position 371 holds a natural amino acid residue different from that of the wild-type enzyme (D371X).
- a CGTase variant which variant at position 371 holds a glycine residue (D371G), or an asparagine residue (D371N), or an alanine residue (D371A), or a leucine residue (D371L).
- a CGTase variant which variant at position 375 holds a natural amino acid residue different from that of the wild-type enzyme (R375X).
- a CGTase variant which variant at position 375 holds a proline residue (R375P), or a glycine residue (R375G), or a glutamine residue (R375Q), or an asparagine residue (R375N), or an alanine residue (R375A), or a leucine residue (R375L).
- a CGTase variant which variant at position 599a (via insertion) holds a proline residue (*599aP), or an arginine residue (*599aR), or a histidine residue (*599aH).
- a CGTase variant which variant position 600 has been substituted for a different naturally occurring amino acid residue, in particular a tryptophan residue (L600W), a phenylalanine residue (L600F), a tyrosine residue (L600Y), an arginine residue (L600R), a proline residue (L600P), or an asparagine residue (L600N).
- a tryptophan residue L600W
- a phenylalanine residue L600F
- a tyrosine residue L600Y
- an arginine residue L600R
- L600P proline residue
- asparagine residue L600N
- a CGTase variant which variant at position 616 holds an alanine residue (W616A).
- a CGTase variant which variant at position 633 holds an alanine residue (Y633A).
- a CGTase variant which variant at position 662 holds an alanine residue (W662A).
- a CGTase variant which variant at position 47 holds a histidine residue, and at position 135 holds a leucine residue (R47H/D135L).
- a CGTase variant which variant at position 88 holds a proline residue, and at position 143 holds a glycine residue (N88P/P143G).
- a CGTase variant which variant at position 89 holds an aspartic acid residue, and at position 146 holds a proline residue (Y89D/S146P).
- a CGTase variant which variant at position 89 holds a glycine residue, and at position 193 holds a glycine residue (Y89G/N193G).
- a CGTase variant in which variant positions 92 and 94 have been deleted (V92*/N94*).
- a CGTase variant which variant at position 143 holds an alanine residue, and at position 144 holds an arginine residue (P143A/A144R).
- a CGTase variant which variant at position 143 holds a glycine residue, and at position 144 holds an arginine residue, and at position 145 holds a tryptophan residue (P143G/A144R/S145W).
- a CGTase variant which variant at position 143 holds a glycine residue, and at position 144 holds an arginine residue, and at position 145 holds a tryptophan residue (P143G/A144R/S145W), and which variant at position 179 holds a serine residue (G179S), an asparagine residue (G179N), or an aspartic acid residue (G179D).
- a CGTase variant which variant at positions 143-148 comprises the partial amino acid sequence GRA**A (SEQ ID NO:7), the partial amino acid sequence GRAAAA (SEQ ID NO:8), the partial amino acid sequence GRAPAA (SEQ ID NO:9), or the partial amino acid sequence GRGPAA (SEQ ID NO:10).
- a CGTase variant which variant at position 144 holds an arginine residue, at position 145 holds an alanine residue, and at position 146 holds a proline residue (A144R/S145A/S146P).
- a CGTase variant which variant at position 145 holds an alanine residue, and at position 145a (via insertion) holds an isoleucine residue (S145A/*145aI).
- a CGTase variant which variant at position 145 holds an alanine residue, and at position 146 holds a glycine residue (S145A/S146G).
- a CGTase variant which variant at position 145 holds a leucine residue, and at position 148 holds an asparagine residue (S145L/Q148N).
- a CGTase variant which variant at position 145 holds a glutamic acid residue, and in position 146 holds a proline residue or a glutamine residue (S145E/S146P or S145E/S146Q).
- a CGTase variant which variant at position 145 holds a tryptophan residue, and in position 146 holds a tryptophan residue, or an isoleucine residue, or an arginine residue (S145W/S146W or S145W/S146I or S145W/S146R).
- a CGTase variant which variant at position 145 holds an alanine residue, at position 145a (via insertion) holds an isoleucine residue, and at position 148 holds a glutamic acid residue (S145A/*145aI/Q148E).
- a CGTase variant which variant at position 145a (via insertion) holds an isoleucine residue, and at position 148 holds a glutamic acid residue (*145aI/Q148E).
- a CGTase variant which variant at position 148 holds a glutamic acid residue, and at position 193 holds a glutamine residue.
- a CGTase variant which variant at position 616 holds an alanine residue, and at position 662 holds an alanine residue (W616A/W662A).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence IKYSGVNN (SEQ ID NO:11), and/or at positions 143-151 comprises the partial amino acid sequence GRAGTNPGF (SEQ ID NO:12), or at positions 143-145 comprises the partial amino acid sequence GRW.
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5), and which variant at position 195 holds a leucine residue (Y195L).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6), and which variant at position 195 holds a leucine residue (Y195W).
- the above CGTase variants are derived from a strain of Bacillus autolyticus, a strain of Bacillus cereus, a strain of Bacillus circulans, a strain of Bacillus circulans var. alkalophilus, a strain of Bacillus coagulans, a strain of Bacillus firmus, a strain of Bacillus halophilus, a strain of Bacillus macerans, a strain of Bacillus megaterium, a strain of Bacillus ohbensis, a strain of Bacillus stearothermophilus, or a strain of Bacillus subtilis.
- the above CGTase variants are derived from the strain Bacillus sp. Strain 1011, the strain Bacillus sp. Strain 38-2, the strain Bacillus sp. Strain 17-1, the strain Bacillus sp. 1-1, the strain Bacillus sp. Strain B1018, the strain Bacillus circulans Strain 8, or the strain Bacillus circulans Strain 251, or a mutant or a variant thereof.
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Thermoanaerobacter, which enzyme has been modified by substitution, insertion and/or deletion at one or more of the amino acid positions corresponding to the positions stated in Table 13, below. Such modification lead to CGTase variants of increased product selectivity, as indicated in the table.
- the CGTase variant is derived from a strain of Thermoanaerobacter sp. ATCC 53627, or a mutant or a variant thereof.
- the CGTase variant of the invention is a CGTase variant derived from an enzyme obtainable from a strain of Thermoanaerobacter, which enzyme has been modified by substitution, insertion and/or deletion at one or more of the amino acid residues corresponding to the positions stated in Table 14, below. Such modifications lead to CGTase variants of reduced product inhibition.
- the CGTase variant is derived from a strain of Thermoanaerobacter sp. ATCC 53627, or a mutant or a variant thereof.
- the invention provides the following CGTase variants, derived from a strain of Thermoanaerobacter sp., preferably the strain of Thermoanaerobacter ATCC 53627, or a mutant or a variant thereof:
- a CGTase variant which variant at position 21 holds a phenylalanine residue (V21F) or a tyrosine residue (V21Y).
- a CGTase variant which variant at position 47 holds a glutamine residue (K47Q), or an alanine residue (K47A), or a leucine residue (K47L), or a histidine residue (K47H), or an arginine residue (K47R).
- a CGTase variant which variant at position 88 holds a lysine residue (P88K).
- a CGTase variant which variant at position 89 holds an alanine residue (D89A), or a glycine residue (D89G).
- a CGTase variant which variant at position 91a holds an alanine residue (F91aA) or a tyrosine residue (F91aY), or in which variant position 91a has been deleted (F91a*).
- a CGTase variant which variant at position 94 holds a glutamine residue (S94Q), or a lysine residue (S94K), or an arginine residue (S94R), or a tryptophan residue (S94W), or a phenylalanine residue (S94F), or in which variant position 94 has been deleted (S94*).
- a CGTase variant which variant at position 135 holds a leucine residue (D135L).
- a CGTase variant which variant at position 143 holds a natural amino acid residue different from that of the wild-type enzyme (p143X).
- a CGTase variant which variant at position 143 holds an alanine residue (P143A), or holds a glycine residue (P143G).
- a CGTase variant which variant at position 144 holds a natural amino acid residue different from that of the wild-type enzyme (A145X).
- a CGTase variant which variant at position 144 holds an arginine residue (A144R), or a lysine residue (A144K), or an aspartic acid residue (A144D).
- a CGTase variant which variant at position 145 holds a natural amino acid residue different from that of the wild-type enzyme (S145X).
- a CGTase variant which variant at position 145 holds an alanine residue (S145A), or a glutamic acid (S145E), or a tryptophan residue (S145W), or a glycine residue (S145G), or a phenylalanine residue (S145F), or a tyrosine residue (S145Y), or a leucine residue (S145L).
- S145A alanine residue
- S145E glutamic acid
- S145W tryptophan residue
- S145G glycine residue
- S145F phenylalanine residue
- S145Y tyrosine residue
- leucine residue S145L
- a CGTase variant which variant at position 145a (via insertion) holds a natural amino acid residue (*145aX).
- a CGTase variant which variant at position 145a (via insertion) holds an isoleucine residue (*145aI).
- a CGTase variant which variant at position 146 holds a natural amino acid residue different from that of the wild-type enzyme (E145X).
- a CGTase variant which variant at position 146 holds a proline residue (E146P), or a serine residue (E146S), or an isoleucine residue (E146I), or a glutamine residue (E146Q), or a tryptophan residue (E146W), or an arginine residue (E146R).
- E146P proline residue
- E146S serine residue
- E146I isoleucine residue
- E146Q glutamine residue
- E146W tryptophan residue
- E146R arginine residue
- a CGTase variant which variant at position 147 holds an isoleucine residue (T147I), or a leucine residue (T147L), or an alanine residue (T147A), or a serine residue (T147S), or a tryptophan residue (T147W).
- a CGTase variant which variant at position 147a (via insertion) holds a natural amino acid residue (*147aX).
- a CGTase variant which variant at position 147a (via insertion) holds an alanine residue (*147aA).
- a CGTase variant which variant at position 148 holds a natural amino acid residue different from that of the wild-type enzyme (D148X).
- a CGTase variant which variant at position 148 holds an alanine residue (D148A), or a glycine residue (D148G), or a glutamic acid residue (D148E), or an asparagine residue (D148N).
- a CGTase variant, which variant at position 149 holds a natural amino acid residue different from that of the wild-type enzyme (P149X).
- a CGTase variant which variant at position 179 holds a serine residue (G179S), an asparagine residue (G179N), or an aspartic acid residue (G179D).
- a CGTase variant which variant at position 180 holds a serine residue (G180S), an asparagine residue (G180N), or an aspartic acid residue (G180D).
- a CGTase variant which variant at position 185 holds an arginine residue (S185R), or a glutamic acid residue (S185E), or an aspartic acid residue (S185D).
- a CGTase variant which variant at position 186 holds an alanine residue (Y186A).
- a CGTase variant which variant at position 193 holds a glycine residue (N193G), or an alanine residue (N193A), or an aspartic acid residue (N193D), or a glutamic acid residue (N193E).
- a CGTase variant, which variant at position 196 holds a natural amino acid residue different from that of the wild-type enzyme (D196X).
- a CGTase variant which variant at position 196 holds an alanine residue (D196A), a serine residue (D196S), or a leucine residue (D196L).
- a CGTase variant which variant at position 197 holds an aspartic acid residue (L197D), or a glutamic acid residue (L197E).
- a CGTase variant which variant at position 232 holds a glutamine residue (K232Q), or an asparagine residue (K232N), or an alanine residue (K232A), or a leucine residue (K232L).
- a CGTase variant which variant at position 259 holds a phenylalanine residue (Y259F).
- a CGTase variant which variant at position 264 holds a glutamine residue (E264Q), or an alanine residue (E264A), or an asparagine residue (E264N), or a leucine residue (E264L).
- a CGTase variant which variant at position 268 holds an alanine residue (N268A).
- a CGTase variant which variant at position 371 holds a natural amino acid residue different from that of the wild-type enzyme (D371X).
- a CGTase variant which variant at position 371 holds a glycine residue (D371G), or an asparagine residue (D371N), or an alanine residue (D371A), or a leucine residue (D371L), or a glutamic acid residue (D371E).
- a CGTase variant which variant at position 375 holds a natural amino acid residue different from that of the wild-type enzyme (R375X).
- a CGTase variant which variant at position 375 holds a proline residue (R375P), or a glycine residue (R375G), or a glutamine residue (R375Q), or an asparagine residue (R375N), or an alanine residue (R375A), or a leucine residue (R375L).
- a CGTase variant which variant at position 599a (via insertion) holds a proline residue (*599aP), or an arginine residue (*599aR), or a histidine residue (*599aH).
- a CGTase variant which variant position 600 has been substituted for a different amino acid residue, in particular a phenylalanine residue (W600F), a tyrosine residue (W600Y), an arginine residue (W600R), a proline residue (W600P), a leucine residue (W600L), or an asparagine residue (W600N).
- W600F phenylalanine residue
- W600Y tyrosine residue
- W600R arginine residue
- W600P proline residue
- W600L leucine residue
- W600N asparagine residue
- a CGTase variant which variant at position 616 holds an alanine residue (W616A).
- a CGTase variant which variant at position 633 holds an alanine residue (Y633A).
- a CGTase variant which variant at position 662 holds an alanine residue (W662A).
- a CGTase variant which variant at position 47 holds a histidine residue or an arginine residue, and/or at position 135 holds a leucine residue (K47H/D135L or K47R/D135L).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence IKYSGVNN (SEQ ID NO:11), or the partial amino acid sequence INDSGVNN (SEQ ID NO:58), and/or at positions 143-151 comprises the partial amino acid sequence GRAGTNPGF (SEQ ID NO:12), or at positions 143-145 comprises the partial amino acid sequence GRW, and/or at position 195 holds a tyrosine residue (F195Y).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence INDSGVNN (SEQ ID NO:58), and/or at positions 146-150 comprises the partial amino acid sequence SDQPS (SEQ ID NO:59).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAETWPAF (SEQ ID NO:5), and which variant at position 195 holds a leucine residue (F195L).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6).
- a CGTase variant which variant at positions 87-94 comprises the partial amino acid sequence HP*SGY** (SEQ ID NO:3), and/or at positions 143-151 comprises the partial amino acid sequence PALETNPNF (SEQ ID NO:4), or at positions 143-151 comprises the partial amino acid sequence PAAEADPNF (SEQ ID NO:6), and which variant at position 195 holds a leucine residue (F195W).
- a CGTase variant in which variant positions 92 and 94 have been deleted (G92*/S94*).
- a CGTase variant which variant at position 143 holds an alanine residue, and at position 144 holds an arginine residue (P143A/A144R).
- a CGTase variant which variant at position 143 holds a glycine residue, and at position 144 holds an arginine residue, and at position 145 holds a tryptophan residue (P143G/A144R/S145W).
- a CGTase variant which variant at position 143 holds a glycine residue, and at position 144 holds an arginine residue, and at position 145 holds a tryptophan residue (P143G/A144R/S145W), and which variant at position 179 holds a serine residue (G179S), an asparagine residue (G179N), or an aspartic acid residue (G179D), and/or at position 180 holds an asparagine residue (G180N), or an aspartic acid residue (G180D).
- a CGTase variant which variant at positions 143-148 comprises the partial amino acid sequence GRA**A (SEQ ID NO:7), the partial amino acid sequence GRAAAA (SEQ ID NO:8), the partial amino acid sequence GRPAAA (SEQ ID NO:64), the partial amino acid sequence GRAPAA (SEQ ID NO:9), or the partial amino acid sequence GRGPAA (SEQ ID NO:10).
- a CGTase variant which variant at positions 143-151 comprises the partial amino acid sequence GRAGTNPG (SEQ ID NO:12).
- a CGTase variant which variant at positions 143-151 comprises the partial amino acid sequence GRAGTNPG (SEQ ID NO:12), and at position 195 holds a tyrosine residue (F195Y).
- a CGTase variant which variant at position 144 holds an arginine residue, at position 145 holds an alanine residue, and at position 146 holds a proline residue (A144R/S145A/E146P).
- a CGTase variant which variant at position 145 holds an alanine residue, and at position 145a (via insertion) holds an isoleucine residue (S145A/*145aI).
- a CGTase variant which variant at position 145 holds an alanine residue, and at position 146 holds a glycine residue (S145A/E146G).
- a CGTase variant which variant at position 145 holds a leucine residue, and at position 148 holds an asparagine residue (S145L/D148N).
- a CGTase variant which variant at position 145 holds a glutamic acid residue, and in position 146 holds a proline residue or a glutamine residue (S145E/E146P or S145E/E146Q).
- a CGTase variant which variant at position 145 holds a tryptophan residue, and in position 146 holds a tryptophan residue, or an isoleucine residue, or an arginine residue (S145W/E146W or S145W/E146I or S145W/E146R).
- a CGTase variant which variant at position 145 holds an alanine residue, at position 145a (via insertion) holds an isoleucine residue, and at position 148 holds a glutamic acid residue (S145A/*145aI/D148E).
- a CGTase variant which variant at position 145a (via insertion) holds an isoleucine residue, and at position 148 holds a glutamic acid residue (*145aI/D148E).
- a CGTase variant which variant at position 616 holds an alanine residue, and at position 662 holds an alanine residue (W616A/W662A).
- CGTase variants of the invention follows the general principles of recombinant DNA technology, e.g. as described by Sambrook et al. [Sambrook J, Fritsch E F, Maniatis T; Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989, New York], and known to the person skilled in the art.
- the production takes rise in the provision of a DNA construct encoding CGTase variant of the invention.
- the invention provides a DNA construct encoding a CGTase variant of the invention.
- DNA construct is intended to indicate any nucleic acid molecule of cDNA, genomic DNA, synthetic DNA or RNA origin.
- construct is intended to indicate a nucleic acid segment which may be single- or double-stranded, and which may be based on a complete or partial naturally occurring nucleotide sequence encoding the CGTase variant of interest.
- the construct may optionally contain other nucleic acid segments.
- the DNA construct of the invention may be prepared by suitably modifying a DNA sequence encoding the precursor CGTase, which modification may bring about:
- the modification of the DNA sequence may be performed by site-directed mutagenesis or by random mutagenesis, or by a combination of these techniques in accordance with well-known procedures, e.g. as described by Sambrook et al., op cit.
- the DNA construct of the invention comprises one or more of the partial oligonucleotide sequences (primers) described in the examples below. These partial oligonucleotide sequences are in particular
- the CGTase variant may be obtained by combining the DNA construct encoding the CGTase variant of the invention with an appropriate expression signal in an appropriate expression vector.
- the expression vector of the invention may be any expression vector that is conveniently subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced.
- the vector may be an autonomously replicating vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g. a plasmid.
- the vector may be one which, when introduced into a host cell, is integrated into the host cell genome and replicated together with the chromosome(s) into which it has been integrated.
- the DNA sequence encoding the CGTase variant preferably is operably linked to additional segments required for transcription of the DNA.
- the expression vector is derived from plasmid or viral DNA, or may contain elements of both.
- operably linked indicates that the segments are arranged so that they function in concert for their intended purposes, e.g. transcription initiates in a promoter and proceeds through the DNA sequence coding for the CGTase variant.
- the DNA sequence encoding the CGTase variant preferably should be operably connected to a suitable promoter and terminator sequence.
- the promoter may be any DNA sequence which shows transcriptional activity in the host cell of choice and may be derived from genes encoding proteins either homologous or heterologous to the host cell.
- the procedures used to ligate the DNA sequences coding for the CGTase variant, the promoter and the terminator, respectively, and to insert them into suitable vectors are well known to persons skilled in the art (cf., for instance, Sambrook et al., op cit)
- the promoter may be any DNA sequence which shows transcriptional activity in the host cell of choice and may be derived from genes encoding proteins either homologous or heterologous to the host cell.
- suitable promoters for directing the transcription of the DNA encoding the CGTase variant of the invention in bacterial host cells include the promoter of the Bacillus stearothermophilus maltogenic amylase gene, the Bacillus licheniformis alpha-amylase gene, the Bacillus amyloliquefaciens BAN amylase gene, the Bacillus subtilis alkaline protease gen, or the Bacillus pumilus xylanase or xylosidase gene, or by the phage Lambda P R or P L promoters or the E. coli lac, trp or tac promoters.
- promoters for use in yeast host cells include promoters from yeast glycolytic genes [Hitzeman et al., J. Biol. Chem. 1980 255 12073-12080; Alber and Kawasaki, J. Mol Appl. Gen. 1982 1 419-434] or alcohol dehydrogenase genes [Young et al., in Genetic Engineering of Microorganisms for Chemicals (Hollaender et al, Eds.), Plenum Press, New York, 1982], or the TPI1 [U.S. Pat. No. 4,599,311] or ADH2-4c [Russell et al., Nature 1983 304 652-654] promoters.
- suitable promoters for use in filamentous fungus host cells are, for instance, the ADH3 promoter [McKnight et al., EMBO J. 1985 4 2093-2099] or the tpiA promoter.
- suitable promoters are those derived from the gene encoding A. oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, A. niger neutral ⁇ -amylase, A. niger acid stable ⁇ -amylase, A. niger or A. awamori glucoamylase (gluA), Rhizomucor miehei lipase, A. oryzae alkaline protease, A. oryzae triose phosphate isomerase or A. nidulans acetamidase.
- Preferred are the TAKA-amylase and gluA promoters.
- the expression vector of the invention may further comprise a DNA sequence enabling the vector to replicate in the host cell in question.
- the expression vector may also comprise a selectable marker, e.g. a gene the product of which complements a defect in the host cell, such as the gene coding for dihydrofolate reductase (DHFR) or the Schizosaccharomyces pombe TPI gene [Russell P R; Gene 1985 40 125-130], or one which confers resistance to a drug, e.g. ampicillin, kanamycin, tetracyclin, chloramphenicol, neomycin, hygromycin or methotrexate.
- selectable markers include amdS, pyrG, argB, niaD and sC.
- a secretory signal sequence (also known as a leader sequence, prepro sequence or pre sequence) may be provided in the expression vector.
- the secretory signal sequence is joined to the DNA sequence encoding the CGTase in the correct reading frame.
- Secretory signal sequences are commonly positioned 5' to the DNA sequence encoding the CGTase variant.
- the secretory signal sequence may be that normally associated with the CGTase or may be from a gene encoding another secreted protein.
- the expression vector of the invention may comprise a secretory signal sequence substantially identical to the secretory signal encoding sequence of the Bacillus licheniformis ⁇ -amylase gene, e.g. as described in WO 86/05812.
- measures for amplification of the expression may be taken, e.g. by tandem amplification techniques, involving single or double crossing-over, or by multicopy techniques, e.g. as described in U.S. Pat. No. 4,959,316 or WO 91/09129.
- the expression vector may include a temperature sensitive origin of replication, e.g. as described in EP 283,075.
- Procedures for ligating DNA sequences encoding the CGTase variant, the promoter and optionally the terminator and/or secretory signal sequence, respectively, and to insert them into suitable vectors containing the information necessary for replication, are well known to persons skilled in the art (cf., for instance, Sambrook et al., op cit).
- the invention provides a host cell comprising the DNA construct of the invention and/or the recombinant expression vector of the invention.
- the host cell of the invention into which the DNA construct or the recombinant expression vector of the invention is to be introduced, may be any cell, preferably a non-pathogenic cell, which is capable of producing the CGTase variant and includes bacteria, yeast, fungi and higher eukaryotic cells.
- Examples of bacterial host cells which, on cultivation, are capable of producing the CGTase variant of the invention are grampositive bacteria such as strains of Bacillus, in particular a strain of B. subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus, B. megatherium, B. pumilus, B. thuringiensis or B. agaradherens, or strains of Streptomyces, in particular a strain of S. lividans or S. murinus, or gramnegative bacteria such as Echerichia coli.
- the transformation of the bacteria may be effected by protoplast transformation or by using competent cells in a manner known per se (cf. Sambrook et al., op cit).
- the CGTase When expressing the CGTase variant in bacteria such as E. coli, the CGTase may be retained in the cytoplasm, typically as insoluble granules (known as inclusion bodies), or may be directed to the periplasmic space by a bacterial secretion sequence. In the former case, the cells are lysed and the granules are recovered and denatured after which the CGTase is refolded by diluting the denaturing agent. In the latter case, the CGTase may be recovered from the periplasmic space by disrupting the cells, e.g. by sonication or osmotic shock, to release the contents of the periplasmic space and recovering the CGTase variant.
- yeasts cells include cells of Saccharomyces spp. or Schizosaccharomyces spp., in particular strains of Saccharomyces cerevisiae or Saccharomyces reteyveri.
- Methods for transforming yeast cells with heterologous DNA and producing heterologous polypeptides therefrom are described, e.g. in U.S. Pat. No. 4,599,311, U.S. Pat. No. 4,931,373, U.S. Pat. No. 4,870,008, 5,037,743, and U.S. Pat. No. 4,845,075, all of which are hereby incorporated by reference.
- Transformed cells are selected by a phenotype determined by a selectable marker, commonly drug resistance or the ability to grow in the absence of a particular nutrient, e.g. leucine.
- a preferred vector for use in yeast is the POT1 vector disclosed in U.S. Pat. No. 4,931,373.
- the DNA sequence encoding the CGTase variant of the invention may be preceded by a signal sequence and optionally a leader sequence , e.g. as described above.
- suitable yeast cells are strains of Kluyveromyces, such as K. lactis, Hansenula, e.g. H. polymorpha, or Pichia, e.g. P. pastoris [Gleeson et al., J. Gen. Microbiol. 1986 132 3459-3465; U.S. Pat. No. 4,882,279].
- Examples of other fungal cells are cells of filamentous fungi, e.g. Aspergillus spp., Neurospora spp., Fusarium spp. or Trichoderna spp., in particular strains of A. oryzae, A. nidulans or A. niger.
- Aspergillus spp. for the expression of proteins have been described in e.g., EP 272,277 and EP 230,023.
- the transformation of F. oxysporum may, for instance, be carried out as described by Malardier et al., Gene 1989 78 147-156.
- the transformed or transfected host cell described above is then cultured in a suitable nutrient medium under conditions permitting the expression of the CGTase, after which the resulting CGTase variant is recovered from the culture.
- the medium used to culture the cells may be any conventional medium suitable for growing the host cells, such as minimal or complex media containing appropriate supplements. Suitable media are available from commercial suppliers or may be prepared according to published recipes (e.g. in catalogues of the American Type Culture Collection).
- the CGTase variant produced by the cells may then be recovered from the culture medium by conventional procedures including separating the host cells from the medium by centrifugation or filtration, precipitating the proteinaceous components of the supernatant or filtrate by means of a salt, e.g. ammonium sulphate, purification by a variety of chromatographic procedures, e. g. ion exchange chromatography, gelfiltration chromatography, affinity chromatography, or the like, dependent on the type of CGTase in question.
- a salt e.g. ammonium sulphate
- the present invention provides a method of producing the CGTase variant of the invention, wherein a suitable host cell, which has been transformed with a DNA sequence encoding the CGTase, is cultured under conditions permitting the production of the enzyme, and the resulting enzyme is recovered from the culture.
- the medium used to culture the transformed host cells may be any conventional medium suitable for growing the host cells in question.
- the expressed CGTase may conveniently be secreted into the culture medium and may be recovered therefrom by well-known procedures including separating the cells from the medium by centrifugation or filtration, precipitating proteinaceous components of the medium by means of a salt such as ammonium sulphate, followed by chromatographic procedures such as ion exchange chromatography, affinity chromatography, or the like.
- the CGTase variant of the invention find application in processes for the manufacture of cyclodextrins for various industrial applications, particularly in the food, cosmetic, chemical, agrochemical and pharmaceutical industries.
- the invention provides CGTase variants for use in a process for the manufacture of cyclodextrins, in particular ⁇ -, ⁇ -, ⁇ -, ⁇ -, and/or ⁇ -cyclodextrins.
- the invention provides CGTase variants for use in a process for the manufacture of ⁇ -, ⁇ - and ⁇ -cyclodextrins, or mixtures hereof.
- the invention provides CGTase variants for use in a process for the manufacture of ⁇ -, ⁇ -, and ⁇ -cyclodextrins, or mixtures hereof.
- the CGTase variants of the invention may also be used in a process for the manufacture of linear oligosaccharides, in particular linear oligosaccharides of 2 to 12 glucose units, preferably linear oligosaccharides of 2 to 9 glucose units.
- the CGTase variants of the invention may be used for in situ generation of cyclodextrins.
- the CGTase variants of the invention may be added to a substrate containing medium in which the enzyme variants are capable of forming the desired cyclodextrins.
- This application is particularly well suited for being implemented in methods of producing baked products, in methods for stabilizing chemical products during their manufacture, and in detergent compositions.
- CGTase variants of the invention therefore also may be used for implementation into bread-improving additives, e.g. dough compositions, dough additives, dough conditioners, pre-mixes, and similar preparations conventionally used for adding to the flour and/or the dough during processes for making bread or other baked products.
- bread-improving additives e.g. dough compositions, dough additives, dough conditioners, pre-mixes, and similar preparations conventionally used for adding to the flour and/or the dough during processes for making bread or other baked products.
- Cyclodextrins have an inclusion ability useful for stabilization, solubilization, etc.
- cyclodextrins can make oxidizing and photolytic substances stable, volatile substances non-volatile, poorly-soluble substances soluble, and odoriferous substances odorless, etc. and thus are useful to encapsulate perfumes, vitamins, dyes, pharmaceuticals, pesticides and fungicides.
- Cyclodextrins are also capable of binding lipophilic substances such as cholesterol, to remove them from egg yolk, butter, etc.
- Cyclodextrins also find utilization in products and processes relating to plastics and rubber, where they have been used for different purposes in plastic laminates, films, membranes, etc. Also cyclodextrins have been used for the manufacture of biodegradable plastics.
- the CGTase from Bacillus circulans Strain 251 [cf. Lawson C L, van Montfort R, Strokopytov B, Rozeboom H J, Kalk K H, de Vries G E, Penninga D, Dijkhuizen L, and Dijkstra B W; J. Mol. Biol. 1994 236 590-600] was soaked in a buffer solution containing the non-hydrolyzable tetrasaccharide acarbose, and an X-ray structure of the CGTase including the pseudo-tetrasaccharide located in the catalytic site was obtained, cf. Strokopytov et al.
- a nonasaccharide (A-I) was formed in an enzyme-substrate-complex structure. Coordinates of this structure have been deposited with the Protein Data Bank, Biology Department, Bldg. 463, Brookhaven National Laboratory, P.O. Box 5000, Upton, N.Y. 11973-5000, USA, under the entry code 1DIJ.
- J-L trisaccharide
- the residues listed in FIG. 1 are referring to Bacillus circulans Strain 251 CGTase and comprise only the closest contacts between the substrate and the enzyme. By changing the number of hydrogen-bonds and other interactions between the enzyme and the substrate, the product selectivity can be altered. Normally, cleavage of the starch takes place between glucose unit B and C in the model.
- the structure of a Thermoanaerobacter CGTase was modelled based on the known structure of Bacillus circulans CGTase. Again the computer program InsightTM from Biosym was employed, using the homology module, according to the manufacturers instructions. The substrate found in Bacillus circulans was docked into the Thermoanaerobacter model, and the positions stated in Tables 5-7 identified.
- This example describes the construction of three ⁇ -cyclodextrin producing CGTase variants, in which site-directed mutagenesis have lead to an altered number of hydrogen bonds in the subsites of the active site cleft.
- the variants are derived from a Bacillus circulans Strain 251 CGTase (i.e. the wild-type enzyme), obtained as described by Lawson et al. [Lawson C L, van Montfort R, Strokopytov B, Rozeboom H J, Kalk K H, de Vries G E, Penninga D, Dijkhuizen L, and Dijkstra B W; J. Mol. Biol. 1994 236 590-600].
- Mutant proteins were produced by the use of an amylase and protease negative Bacillus subtilis strain, and purified using affinity chromatography.
- CGTase activity was determined by incubating appropriately diluted enzyme solutions with substrate in 10 mM sodium citrate, pH 6.0, for 5-10 minutes at 50° C.
- Cyclodextrin forming activity was determined using 5% PaselliTM SA2 (i.e. partially hydrolysed potato starch with an average degree of polymerization of 50, available from AVEBE, Foxhol, The Netherlands) as substrate.
- the ⁇ -cyclodextrin formed was determined with phenolphthalein.
- One unit of activity is defined as the amount of enzyme able to produce 1 ⁇ mol of ⁇ -cyclodextrin per minute.
- ⁇ - and ⁇ -cyclodextrin formation was subsequently determined by use of HPLC (cf. below).
- Cyclodextrin formation was also determined under industrial production process conditions. For this purpose 0.1 U/ml CGTase was incubated with 10% PaselliTM WA4 (i.e. jet-cooked, pre-gelatinized drum-dried starch) in a 10 mM sodium citrate buffer (pH 6.0) at 50° C. for 45 hours. Samples were collected at regular intervals of time, boiled for 5 minutes, and the products formed analyzed by HPLC using a 25 cm Econosil-NH 2 10 micron column (Alltech Associates Inc., USA) eluted with acetonitril/water (60/40% v/v) at a flow rate of 1 ml per minute.
- PaselliTM WA4 i.e. jet-cooked, pre-gelatinized drum-dried starch
- Variants were designed in order to increase ⁇ -cyclodextrin formation.
- a tyrosine residue at position 89 was changed into an aspartic acid residue (Y89D), which introduces an additional hydrogen bond with subsite F of the substrate, cf. FIG. 1.
- This gives rise to stronger binding of the amylose chain in the active site cleft, with the formation of smaller cyclodextrins.
- an increase in ⁇ -cyclodextrin forming activities was detected, with a simultaneous decrease in the ⁇ -cyclodextrin forming activity, as seen from the ratio of cyclodextrins produced from PaselliTM WA4, cf. Table 16, below, and in the cyclodextrin formation profiles, cf. FIG. 2B.
- This example describes the construction of two CGTase variants, holding mutations in the E domain cleft.
- the variants are derived from a Bacillus circulans Strain 251 CGTase (i.e. the wild-type enzyme), obtained as described by Lawson et al. [Lawson C L, van Montfort R, Strokopytov B, Rozeboom H J, Kalk K H, de Vries G E, Penninga D, Dijkhuizen L, and Dijkstra B W; J. Mol. Biol. 1994 236 590-600].
- MBS maltose binding sites
- Mutations were introduced by a method based on two PCR reactions using VENT-DNA polymerase. For each mutation specific oligonucleotides were developed. The mutations were confirmed by restriction analysis and sequencing. Variants were obtained from an amylase and protease negative Bacillus subtilis strain and were purified using affinity chromatography.
- Escherichia coli MC1061 [Meissner P S, Sisk W P, Berman M L; Proc. Natl. Acad. Sci. USA 1987 84 4171-4175] was used for recombinant DNA manipulations and site-directed mutagenesis.
- E. coli DH5 ⁇ [Hanahan D; J. Mol Biol. 1983 166 557] was used for the production of monomeric supercoiled plasmid DNA for sequencing.
- CGTases variants were produced with the ⁇ -amylase and protease negative Bacillus subtilis Strain DB104A [Smith H, de Jong A, Bron S, Venema G; Gene 1988 70 351-361].
- the fragment containing the kanamycin-resistance marker was ligated with the largest fragment from plasmid pDP66S [Penninga D, Strokopytov B, Rozeboom H J, Lawson C L, Dijkstra B W, Bergsma J, Dijkhuizen L; Biochemistry 1995 34 3368-3376] containing the Bacillus circulans CGTase gene, digested with HIndIII and XbaI (made blunt with Klenow polymerase).
- the resulting CGTase protein expression shuttle vector pDP66K with the CGTase gene under control of the erthromycin-inducible p32 promotor [van der Vossen J M B M, Kodde J, Haandrikman A J, Venema G, Kok J; Appl. Environ. Microbiol. 1992 58 3142-3149], was transformed to E. coli MC1061 under selection for erythromycin and kanamycin resistance, cf. FIG. 3.
- CGTase Variants As only relatively low stability with plasmid pDP66S (8.5 kb) [Saenger W; Angew. Chem. 1980 19 344-362] was found, pDP66K (7.7 kb) was constructed, cf. FIG. 3, with the CGTase gene under the control of the strong p32 promotor [van der Vossen J M B M, Kodde J, Haandrikman A J, Venema G, Kok J; Appl. Environ. Microbiol. 1992 58 3142-3149]. Plasmid pDP66K containing the additional antibiotic resistance marker for kanamycin appeared to be considerably more stable in E. coli as well as in B.
- subtilis cells than plasmid pDP66S containing the streptomycin/spectinomycin resistance cassette.
- a shuttle vector Using this shuttle vector, a high extracellular production of wild-type enzyme and CGTase variants was obtained reproducibly in batch fermentations with the ⁇ -amylase and protease negative B. subtilis Strain DB104A.
- a single 51 erlenmeyer flask with 11 B. subtilis Strain DB104A culture allowed purification to homogeneity of up to 25 mg of the CGTase variants. Mutations were constructed via site-directed (PCR) mutagenesis. Using specific oligonucleotide primers a mutation frequency close to 70% was observed. All mutations were confirmed by restriction analysis and DNA sequencing.
- Plasmid carrying bacterial strains were grown on LB medium in presence of the antibiotics erythromycin and kanamycin, at concentrations of 100 and 5 ⁇ g/ml for E. coli and Bacillus subtilis, respectively [Sambrook et al., op cit]. When appropriate, agar plates contained 1% starch to screen for halo formation. Bacillus subtilis Strain DB 104A was grown in a 51 flask, containing 11 medium with 2% tryptone, 0.5% yeast extract, 1% sodium chloride and 1% casamino acids (pH 7.0) with 10 ⁇ g/ml erythromycin and 5 ⁇ g/ml kanamycin.
- Plasmid pDP66K carrying the right restriction site was transformed to E. coli DH5 ⁇ cells. DNA sequence determination was performed on supercoiled plasmid DNA using the dideoxy-chain termination method [Sanger F, Coulson A R; J. Mol Biol. 1975 94 441-448] and the T7-sequencing kit from Pharmacia LKB Biotechnology, Sweden.
- Plasmid pDP66K carrying positively characterized mutant CGtase genes, was transformed to Bacillus subtilis Strain DB104A. The organism was grown to an optical density of 4.5 determined at 600 nm in a 51 flask (for approx. 36 hours). Under these conditions high extracellular CGTase levels were produced. The culture was centrifuged ( ⁇ 10,000 g) at 4° C. for 30 minutes. The (mutant) CGTases were further purified to homogeneity by affinity chromatography using a 30 ML ⁇ -cyclodextrin-Sepharose-6FF column (Pharmacia, Sweden) [Sundberg L, Porath J; J. Chromatogr.
- ⁇ -cyclodextrin Forming Activity ⁇ -cyclodextrin forming activity was determined using 5% PaselliTM SA2 (i.e. partially hydrolysed potato starch with an average degree of polymerization of 50, available from AVEBE, Foxhol, The Netherlands) as substrate and after incubation for 3 minutes at 50° C. 0.1-0.1 units of activity were used. The ⁇ -cyclodextrin formed was determined based on its ability to form a stable colorless inclusion complex with phenolphthalein. One unit of activity is defined as the amount of enzyme able to form 1 ⁇ mol of ⁇ -cyclodextrin per minute.
- Raw Starch Binding Properties were studied by incubating 6 ⁇ g/ml of enzyme with increasing amounts (0-10%) of granular potato starch (PaselliTM SA2, available from AVEBE, Foxhol, The Netherlands) for 1 hour at 4° C., with and without 0.1 mM of ⁇ -cyclodextrin (equilibrium was reached within 10 minutes). After incubation, protein bound to the starch granules was spun down for 1 minute at 4° C. and at 10,000 ⁇ g, and the remaining ⁇ -cyclodextrin forming activity of the supernatant was determined as described above.
- PaselliTM SA2 available from AVEBE, Foxhol, The Netherlands
- maltose binding site 1 (MBS1) includes two tryptophan residues
- the double mutation W616A/W662A was constructed. In this way we created comparable changes in the two binding sites, which were designed to completely remove the hydrophobic interactions of the aromatic residues with the glucose units of the substrate.
- the two separate CGTase variants, W616A and W662A gave intermediate results compared to the double mutant, W616A/W662A.
- ⁇ -cyclodextrin on raw starch binding indicates that it can inhibit the binding by competition with a starch chain for the binding sites of the enzyme. This effect is more pronounced for the variants produced as compared to the wild-type, indicating that when one MBS is deleted, competition of ⁇ -cyclodextrin with raw starch for the remaining site is stronger. This also indicates a form of cooperativity between MBS's.
- the Y633A variant has the same "n” value as the wild-type enzyme. This suggests that sites other than MBS2 cooperate with MBS1 in binding.
- This example describes the construction of several ⁇ -and ⁇ -cyclodextrin producing CGTase variants, in which site directed mutagenesis has lead to an altered number of hydrogen bonds in the active site cleft.
- the variants are derived from a Bacillus circulans Strain 251 CGTase (i.e. the wild-type enzyme), obtained as described by Lawson et al. [Lawson C L, van MOntfort R, Strokopytov B, Rozeboom H J, Kalk K H, de Vries G E, Penninga D, Dijkhuizen L and Dijkstra B W; J. Mol Biol. 1994 236 590-600].
- Mutations were introduced with a PCR method using VENT-DNA polymerase (New-England Biolabs, Beverly, Mass., USA).
- a first PCR reaction was carried out with a mutagenesis primer for the coding strand, plus a primer downstream on the template strand.
- the reaction product was subsequently used as primer in a second PCR reaction together with a primer upstream on the coding strand.
- the product of the last reaction was cut with PvuII and SalI and exchanged with the corresponding fragment (1200 bp) from the vector pDP66K (cf. FIG. 3).
- the resulting (mutant) plasmid was transformed to E. coli MC1061 cells [Meissner P S, Sisk W P, Berman M L; Proc. Natl. Acad. Sci. USA 1987 84 4171-4175].
- Plasmid pDP66K carrying the right restriction site was transformed to E. coli DH5 ⁇ cells [Hanahan D; J. Mol Biol. 1983 166 557]. DNA sequence determination was performed on supercoiled plasmid DNA using the dideoxy-chain termination method [Sanger F, Coulson A R; J. Mol Biol. 1975 94 441-448] and the T7-sequencing kit from Pharmacia-LKB Biotechnology, Sweden.
- Plasmid pDP66K carrying positively characterized mutant cgt genes, was transformed to B. subtilis strain DB104A [Smith H, de Jong A, Bron S, Venema G; Gene 1988 70 351-361]. The organism was grown to an optical density at 600 nm of 4.5 in a 51 flask (for approx. 36 hours). Under these conditions high extracellular CGTase levels were produced.
- the culture was centrifuged at 4° C. for 30 minutes at 10,000 ⁇ g.
- the CGTases variant in the culture supernatants were further purified to homogeneity by affinity chromatography, using a 30 ml ⁇ -cyclodextrin-Sepharose-6FF column (Pharmacia, Sweden) [Sundberg L, Porath J; J. Chromatogr. 1974 90 87-98] with a maximal capacity of 3.5 mg protein per ml. After washing with 10 mM sodium acetate buffer (pH 5.5), bound CGTase was eluted with the same buffer containing 10 mg/ml ⁇ -cyclodextrin.
- ⁇ -cyclodextrin forming activity was determined by incubating an appropriately diluted enzyme sample (0.1-0.2 units of activity) for 3 minutes at 50° C. PaselliTM SA2 (5% solution partially hydrolysed potato starch with an average degree of polymerization of 50 (AVEBE, Foxhol, The Netherlands), was used as a substrate. The ⁇ -cyclodextrin formed was determined based on its ability to form a stable colorless inclusion complex with phenolphthalein. One unit of activity is defined as the amount of enzyme able to produce 1 ⁇ mol of ⁇ -cyclodextrin per minute.
- Cyclodextrin forming activity was also measured under production process conditions.
- 0.1 U/ml CGTase was incubated with 10% PaselliTM WA4 (i.e. jet-cooked, pregelatinized drum-dried starch) in a 10 mM sodium citrate buffer (pH 6.0) at 50° C. for 45 hours.
- Samples were collected at regular time intervals, diluted 10 times, boiled for 8 min. and the products formed analyzed by HPLC using a 25 cm Econosphere-NH 2 5 micron column (Alltech Associates Inc., USA) eluted with acetonitrile/water (60/40 v/v) at 1 ml per min.
- the variants of this example were designed in order to increase ⁇ -and ⁇ -cyclodextrin formation.
- the N193G, Y89G, D371G, D371N and the Y89G/N193G CGTase variants were all designed with the intention to decrease the interactions between the amylose chain and the first part of the active site cleft (Subsites C-G). As a result, the amylose chain would be able to move further into the active site cleft, thereby changing the ratio of cyclodextrins towards the ⁇ -and ⁇ -cyclodextrins.
- the N193G CGTase variant demonstrates a rapid increase in ⁇ -cyclodextrin (FIGS. 7 and 9).
- the ratio is changed already dramatically after 5 hours of incubation (Table 20) towards ⁇ -and ⁇ -cyclodextrin.
- Table 21 the ratio has changed towards ⁇ -cyclodextrin formation only.
- This mutation seems particularly well suited for combination with other mutations, e.g. D371G or D371N.
- the Y89G CGTase variant results in a small change towards ⁇ -cyclodextrin after 45 hours of incubation at the expense of ⁇ -cyclodextrin (cf. FIG. 7 and Table 21).
- the D371N and D371G CGTase variants both show a shift towards formation of the larger cyclodextrins (cf. FIG. 8 and Table 21). Both ⁇ -and ⁇ -cyclodextrin increased at the expense of ⁇ -cyclodextrin. This shift is more pronounced at early incubation times (cf. Table 20 and FIG. 10).
- the Y89G/N193G CGTase double mutant resulted in a shift from ⁇ -cyclodextrin to both ⁇ -and ⁇ -cyclodextrin (cf. Table 21).
- this mutation could give rise to a single shift to ⁇ -cyclodextrin.
- the *145aI CGTase variant was constructed on the basis of alignment studies. This insertion mutation seems especially advantageous for obtaining ⁇ -cyclodextrin producing CGTase variants. Both short incubation times (cf. FIG. 10 and Table 20) and long incubation times (cf. FIG. 8 and Table 21) gave a shift from ⁇ -cyclodextrin to both ⁇ -and ⁇ -cyclodextrin. Also, in order to obtain a single shift to ⁇ -cyclodextrin, this mutation seems particularly well suited for combination with other mutations, e.g. D371G or D371N.
- the N326Q CGTase variant was constructed and shown to cause a shift from ⁇ -cyclodextrin to ⁇ - and ⁇ -cyclodextrin formation (cf. Table 21).
- This example describes the construction of 24 ⁇ -cyclodextrin producing CGTase variants (A1-A24), in which site-directed mutagenesis either has lead to an altered number of hydrogen bonds in the subsites of the active cleft or, alternatively, to sterical hindrance in parts of the substrate binding left.
- the variants are derived from a Thermoanaerobacter sp. CGTase obtained according to WO 89/03421, and having the nucleotide and amino acid sequences presented as SEQ ID NOS:1-2 (i.e. the wild-type enzyme).
- Mutations were introduced by a method based on PCR by the use of PWO polymerase. For each mutation, specific oligonucleotides (primers) were developed. The mutations were confirmed by restriction analysis whenever possible, and by sequencing. Mutant proteins were expressed in either Escherichia coli MC1061 [Meissner P S, Sisk W P, Berman M L; Proc. Natl. Acad. Sci. USA 1987 84 4171-4175], or in the ⁇ -amylase and protease negative Bacillus subtilis Strain DB104A [Smith H, de Jong A, Bron S, Venema G; Gene 1988 70 351-361]. Proteins were purified from the media using affinity chromatography (AfC) and/or anion-exchange chromatography (AEC).
- AfC affinity chromatography
- AEC anion-exchange chromatography
- Phadebas amylase test Phadebas tablets (PhadebasTM Amylase Test, Pharmacia) are used as substrate.
- This substrate is a cross-linked insoluble blue-colored starch polymer, which is mixed with bovine serum albumin and a buffer substance. After suspension in water, starch is hydrolyzed by the enzyme, thereby yielding blue fragments. The determination is carried out after incubation at 60° C., pH 6.2, in 0.15 nM calcium for 15 minutes. The absorbance of the resulting blue solution, determined at 620 nm, corresponds the enzymatic activity.
- the enzyme activity is compared to that of an enzyme standard, and the activity is expressed in the same unit as that of the enzyme standard.
- the enzyme standard was TermamylTM (Novo Nordisk A/D, Denmark), the amylolytic activity of which has been be determined using potato starch as substrate. This method is based on the break-down of modified potato starch by the enzyme, and the reaction is followed by mixing samples of the starch/enzyme solution with an iodine solution. Initially, a blackish-blue color is formed, but during the break-down of the starch the blue color gets weaker and gradually turns into a reddish-brown, which is compared to a colored glass standard.
- KNU One Kilo Novo alfa Amylase Unit
- CGTase activity was determined by incubating diluted enzyme with substrate in 10 mM sodium citrate, pH 6.0 for 4-10 minutes at 60° C.
- Cyclodextrin forming activity was determined using 5% PaselliTM SA2 (i.e. partially hydrolysed potato starch with an average degree of polymerization of 50, available from AVEBE, Foxhol, The Netherlands) as substrate.
- the ⁇ -cyclodextrin formed was determined with Methyl-orange, the ⁇ -cyclodextrin formed was determined with phenolphthalein, and the ⁇ -cyclodextrin formed was determined with bromo cresol green.
- the activity is expressed in units per mg (U/mg).
- One unit of enzyme activity is defined as the amount of enzyme able to produce one ⁇ mol of the specific cyclodextrin per minute.
- Cyclodextrin formation was also determined under conventional industrial production process conditions.
- a precooked 10% amylopectin solution in 0.5 mM CaCl 2 at pH 5.5 was incubated with 50 NU of CGTase per gram of substrate, at 60° C. and for 24 hours. Samples are regularly withdrawn and boiled for 10 minutes at a pH of 2.5-3 prior to analysis by HPLC.
- oligonucleotides were synthesized in order to initiate the site-directed mutagenesis (the numbers indicate positions according to the CGTase numbering):
- A3 F195Y+143-151(GRAGTNPG) (SEQ ID NO:12);
- A4 F195Y+87-94(IKYSG-VNN) (SEQ ID NO:11)+143-151(GRAGTNPG) (SEQ ID NO:12);
- the Spe I--Bst X I fragment of A2 was ligated into the CGTase gene holding the F195Y mutation.
- the F195Y was introduced by the use of the A3 primer;
- the 146-150(SDQPS) (SEQ ID NO:59) mutations were introduced using the primer 5'-CTCCTGCATCATCTGATCAACCGTCCTTTGGGGAAAATGG-3' (SEQ ID NO:52);
- the G180N mutation was introduced using the primer 5'-CCATCATTACGGAAACACTAATTTTTCATC-3' (SEQ ID NO:33);
- the G180D mutation was introduced using the primer 5'-CCATCATTACGGAGACACTAATTTTTCATC-3' (SEQ ID NO:34);
- the G179N mutation was introduced using the primer 5'-CCATCATTATAATGGAACTAATTTTTCATC-3' (SEQ ID NO:35);
- the G179S mutation was introduced using the primer 5'-CCATCATTATAGTGGAACTAATTTTTCATC-3'(SEQ ID NO:36);
- the G179D mutation was introduced using the primer 5'-CCATCATTATGATGGAACTAATTTTTCATC-3'(SEQ ID NO:37).
- experiment A1 the loop at positions 143 to 151 was replaced by (GRAGTNPG (SEQ ID NO:12)) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J (cf. FIG. 1).
- the initial rate of both ⁇ -CD formation and of ⁇ -CD formation has decreased.
- the ratio of ⁇ -CD has increased, whereas the ⁇ -CD ratio has decreased.
- the loop at positions 87-94 was replaced by (IKYSG*VNN (SEQ ID NO:11)), and simultaneously the loop at positions 143 to 151 was replaced by (GRAGTNPG (SEQ ID NO:12)) in order to increase the interactions between the enzyme and glucose units E, F and H, and in order to decrease the interactions between the enzyme and glucose units I and J (cf. FIG. 1).
- the F195 was replaced by 195Y in order to decrease the contact between enzyme and substrate.
- the initial rate of ⁇ -CD formation has decreased.
- the ⁇ -CD ratio has decreased.
- experiment A5 the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- the initial rate of ⁇ -CD formation has increased, whereas the initial rate of both ⁇ -CD formation and of ⁇ -CD formation has decreased.
- the ratio of ⁇ -CD has increased whereas the ⁇ -CD ratio has decreased.
- G180 was replaced by 180N in order to increase the interactions between the enzyme and glucose unit H (cf. FIG. 1).
- the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- G180 was replaced by 180D in order to increase the interactions between the enzyme and glucose unit H (cf. FIG. 1).
- the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- G179 was replaced by 179N in order to increase the interactions between the enzyme and glucose unit H (cf. FIG. 1).
- the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- G179 was replaced by 179S in order to increase the interactions between the enzyme and glucose unit H (cf. FIG. 1).
- the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- G179 was replaced by 179D in order to increase the interactions between the enzyme and glucose unit H (cf. FIG. 1).
- the region at positions 143 to 145 was replaced by (GRW) in order to increase the interactions between the enzyme and glucose unit H, and in order to decrease the interactions between the enzyme and glucose units I and J by making a sterical hindrance (cf. FIG. 1).
- This example describes the construction of 15 ⁇ -cyclodextrin producing CGTase variants (B1-B9), in which site-directed mutagenesis either has lead to an altered number of hydrogen bonds in the subsites of the active cleft or, alternatively, to sterical hindrance in parts of the substrate binding left.
- the variants are derived from a Thermoanaerobacter sp. CGTase obtained according to WO 89/03421, and having the nucleotide and amino acid sequences presented as SEQ ID NOS:1-2 (i.e. the wild-type enzyme).
- Mutations were introduced by a method based on PCR by the use of PWO polymerase. For each mutation, specific oligonucleotides (primers) were developed. The mutations were confirmed by restriction analysis whenever possible, and by sequencing. Mutant proteins were expressed in either Escherichia coli MC1061 [Meissner P S, Sisk W P, Berman M L; Proc. Natl. Acad. Sci. USA 1987 84 4171-4175], or in the ⁇ -amylase and protease negative Bacillus subtilis Strain DB104A [Smith H, de Jong A, Bron S, Venema G; Gene 1988 70 351-361]. Proteins were purified from the media using affinity chromatography (AfC) and/or anion-exchange chromatography (AEC).
- AfC affinity chromatography
- AEC anion-exchange chromatography
- Phadebas amylase test Phadebas tablets (PhadebasTM Amylase Test, Pharmacia) are used as substrate.
- This substrate is a cross-linked insoluble blue-colored starch polymer, which is mixed with bovine serum albumin and a buffer substance. After suspension in water, starch is hydrolyzed by the enzyme, thereby yielding blue fragments. The determination is carried out after incubation at 60° C., pH 6.2, in 0.15 nM calcium for 15 minutes. The absorbance of the resulting blue solution, determined at 620 nm, corresponds the enzymatic activity.
- the enzyme activity is compared to that of an enzyme standard, and the activity is expressed in the same unit as that of the enzyme standard.
- the enzyme standard was TermamylTM (Novo Nordisk A/D, Denmark), the amylolytic activity of which has been be determined using potato starch as substrate. This method is based on the break-down of modified potato starch by the enzyme, and the reaction is followed by mixing samples of the starch/enzyme solution with an iodine solution. Initially, a blackish-blue color is formed, but during the break-down of the starch the blue color gets weaker and gradually turns into a reddish-brown, which is compared to a colored glass standard.
- KNU One Kilo Novo alfa Amylase Unit
- CGTase activity was determined by incubating diluted enzyme with substrate in 10 mM sodium citrate, pH 6.0 for 4-10 minutes at 85° C.
- Cyclodextrin forming activity was determined using 5% PaselliTM SA2 (i.e. partially hydrolysed potato starch with an average degree of polymerization of 50, available from AVEBE, Foxhol, The Netherlands) as substrate.
- the ⁇ -cyclodextrin formed was determined with Methyl-orange, the ⁇ -cyclodextrin formed was determined with phenolphthalein, and the ⁇ -cyclodextrin formed was determined with bromo cresol green.
- the activity is expressed in units per mg (U/mg).
- One unit of enzyme activity is defined as the amount of enzyme able to produce one ⁇ mol of the specific cyclodextrin per minute.
- Cyclodextrin formation was also determined under conventional industrial production process conditions.
- a precooked 10% amylopectin solution in 0.5 mM CaCl 2 at pH 5.5 was incubated with 50 NU of CGTase per gram of substrate, at 85° C. and for 24 hours. Samples are regularly withdrawn and boiled for 10 minutes at a pH of 2.5-3 prior to analysis by HPLC.
- oligonucleotides were synthesized in order to initiate the site-directed mutagenesis (the numbers indicate positions according to the CGTase numbering):
- experiment B1 S145 was replaced by 145A in order to decrease the interactions between the enzyme and glucose unit J (cf. FIG. 1).
- the initial rate of both ⁇ -CD formation and of ⁇ -CD formation has increased.
- the ratio of ⁇ -CD has decreased whereas the ⁇ -CD ratio has increased.
- experiment B2 E146 was replaced by 146S in order to increase the interactions between the enzyme and glucose unit I (cf. FIG. 1).
- the initial rate of both ⁇ -CD formation and of ⁇ -CD formation has increased.
- the ratio of ⁇ -CD has decreased.
- T147 was replaced by 147A in order to decrease the interactions between the enzyme and glucose unit J (cf. FIG. 1).
- the ratio of ⁇ -CD has decreased, whereas the ⁇ -CD ratio has increased.
- experiment B4 T147 was replaced by 147L in order to decrease the interactions between the enzyme and glucose unit J (cf. FIG. 1).
- the ratio of ⁇ -CD has decreased, whereas the ⁇ -CD ratio has increased.
- experiment B9 the loop at positions 87 to 94 was replaced by (IKYSG*VNN) (SEQ ID NO:11) in order to increase the contacts between the enzyme and glucose units E and F (cf. FIG. 1).
- experiment B 10 5'-TTACCGTAATTTATATGACTTAGCAG-3' (SEQ ID NO:67) was used to introduce the F195Y mutation.
- the 87-94(IKYSG-VNN) (SEQ ID NO:11) mutations was introduced using primer B9. Simultaneously, the F195 was replaced by 195Y in order to decrease the contact between enzyme and substrate.
- This example describes the construction of 9 ⁇ -cyclodextrin producing CGTase variants (C1-C9), in which site-directed mutagenesis either has lead to an altered number of hydrogen bonds in the subsites of the active cleft or, alternatively, to sterical hindrance in parts of the substrate binding left.
- the variants are derived from a Thermoanaerobacter sp. CGTase obtained according to WO 89/03421, and having the nucleotide and amino acid sequences presented as SEQ ID NOS:1-2 (i.e. the wild-type enzyme).
- Variants were introduced by a method based on Unique Site Elimination (USE), following the protocol from the supplier (Stratagene®).
- the unique restriction site BsaMI at the plasmid opposite to the CGTase gene was removed by the use of the 5'P-CACTGTTCCTTCGAACGCGTAACCTTAAATACC-3' (SEQ ID NO:69) oligonucleotide.
- "P” indicates a 5' phosphorylation necessary for the procedure.
- For each mutation specific oligonucleotides were developed. The mutations were confirmed by restriction analysis whenever possible, and by sequencing.
- Phadebas amylase test Phadebas tablets (PhadebasTM Amylase Test, Pharmacia) are used as substrate.
- This substrate is a cross-linked insoluble blue-colored starch polymer, which is mixed with bovine serum albumin and a buffer substance, After suspension in water, starch is hydrolyzed by the enzyme, thereby yielding blue fragments. The determination is carried out after incubation at 60° C., pH 6.2, in 0.15 nM calcium for 15 minutes. The absorbance of the resulting blue solution, determined at 620 nm, corresponds the enzymatic activity.
- the enzyme activity is compared to that of an enzyme standard, and the activity is expressed in the same unit as that of the enzyme standard.
- the enzyme standard was TermamylTM (Novo Nordisk A/D, Denmark), the amylolytic activity of which has been be determined using potato starch as substrate. This method is based on the break-down of modified potato starch by the enzyme, and the reaction is followed by mixing samples of the starch/enzyme solution with an iodine solution. Initially, a blackish-blue color is formed, but during the break-down of the starch the blue color gets weaker and gradually turns into a reddish-brown, which is compared to a colored glass standard.
- KNU One Kilo Novo alfa Amylase Unit
- CGTase activity was determined by incubating diluted enzyme with substrate in 10 mM sodium citrate, pH 6.0 for 4-10 minutes at 85° C.
- Cyclodextrin forming activity was determined using 5% PaselliTM SA2 (i.e. partially hydrolysed potato starch with an average degree of polymerization of 50, available from AVEBE, Foxhol, The Netherlands) as substrate.
- the ⁇ -cyclodextrin formed was determined with Methyl-orange, the ⁇ -cyclodextrin formed was determined with phenolphthalein and the ⁇ -cyclodextrin formed was determined with bromo cresol green.
- the activity is expressed in units per mg (U/mg). One unit of enzyme activity is defined as the amount of enzyme able to produce one ⁇ mol of the specific cyclodextrin per minute.
- Cyclodextrin formation was also determined under conventional industrial production process conditions.
- a precooked 10% amylopectin solution in 0.5 mM CaCl 2 at pH 5.5 was incubated with 50 NU of CGTase per gram of substrate, at 60° C. and for 24 hours. Samples are regularly withdrawn and boiled for 10 minutes at a pH of 2.5-3 prior to analysis by HPLC.
- oligonucleotides were synthesized in order to initiate the site-directed mutagenesis (the numbers indicate positions according to the CGTase numbering):
- experiment C1 N193 were replaced by 193A in order to decrease the interactions between the enzyme and glucose unit H.
- the ratio of ⁇ -CD has decreased, and the ratio of ⁇ -CD has increased.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
______________________________________ A = Ala = Alanine C = Cys = Cysteine D = Asp = Aspartic acid E = Glu = Glutamic acid F = Phe = Phenylalanine G = Gly = Glycine H = His = Histidine I = Ile = Isoleucine K = Lys = Lysine L = Leu = Leucine M = Met = Methionine N = Asn = Asparagine P = Pro = Proline Q = Gln = Glutamine R = Arg = Arginine S = Ser = Serine T = Thr = Threonine V = Val = Valine W = Trp = Tryptophan Y = Tyr = Tyrosine B = Asx = Asp or Asn Z = Glx = Glu or Gln X = Xaa = Any amino acid * = Deletion or absent amino acid ______________________________________
TABLE 1
______________________________________
Amino Acid Sequence Alignment, CGTase Numbering and Domains
of Selected CGTases of Different Bacterial Origin
a Bacillus circulans 251 (SEQ ID NO: 70); b Bacillus sp. 1-1 (SEQ ID
NO: 71); c Bacillus sp. 38-2 (SEQ ID NO: 72); d Bacillus sp. 1011 (SEQ
ID NO: 73); e Bacillus licheniformis (SEQ ID NO: 74); f Bacillus
macerans (SEQ ID NO: 75); g Bacillus ohbensis (SEQ ID NO: 76);
h Bacillus stearothermophilus (SEQ ID NO: 77); i Klebsiella pneumoniae
(SEQ ID NO: 78); j Thermoanaerobacter ATCC 53627 (SEQ ID NO: 2).
No a b c d e f g h i j
Domain
______________________________________
1 A E A A D S D A E A A
2 P A P P A P * G P P A
3 D D D D D D * N B D A
4 T * T T T T * * E T A
5 S * S S * S * * T S A
6 V V V V V V V * Y V A
7 S T S S T D T L * S A
8 N N N N N N N N * N A
9 K K K K K K K K * V A
10 Q V Q Q Q V V V L V A
11 N N N N N N N N D N A
12 F Y F F F F Y F F Y A
13 S S S S S S T T R S A
14 T D T T T T R S K T A
14a * K * * * * R * K * A
15 D D D D D D D D E D A
16 V V V V V V V V T V A
17 I I I I I I I V I I A
18 Y Y Y Y Y Y Y Y Y Y A
19 Q Q Q Q Q Q Q Q F Q A
20 I I I I V I I I L I A
21 F V F F F V V V F V A
22 T T T T T T T V L T A
23 D D D D D D D D D D A
24 R R R R R R R R R R A
25 F F F F F F F F F F A
26 S S S S L A S V S L A
27 D D D D D D D D D D A
28 G G G G G G G G G G A
29 N N N N N D D N D N A
30 P P P P P R P T P P A
31 A G A A S T S S S S A
32 N N N N N N N N N N A
33 N N N N N N N N N N A
34 P P P P P P P P A P A
35 T S T T T A T S G T A
36 G G G G G G G G F G A
37 A A A A A D A A N D A
38 A I A A A A I L S L A
39 F F F F F F Y F A Y A
40 D S D D D S S S T D A
41 G Q G G G G Q S Y P A
42 T N S S T D D G D T A
43 C C C C C R C C P H A
44 T I T T S S S T N T A
45 N D N N N N D N N S A
46 L L L L L L L L L L A
47 R H R R K K H R K K A
48 L K L L L L K K K K A
49 Y Y Y Y Y Y Y Y Y Y A
50 C C C C C F C C T F A
51 G G G G G G G G G G A
52 G G G G G G G G G G A
53 D D D D D D D D D D A
54 W W W W W W W W L W A
55 Q Q Q Q Q Q Q Q R Q A
56 G G G G G G G G G G A
57 I I I I L I I I L I A
58 I I I I V I I I L I A
59 N D N N N D D N N N A
60 K K K K K K K K K K A
61 I I I I I I I I L I A
62 N N N N N N N N * N A
63 D D D D D D D D * D A
64 G G G G N G G G P G A
65 Y Y Y Y Y Y Y Y Y Y A
66 L L L L F L L L L L A
67 T T T T S T T T K T A
68 G D G G D G D D S G A
69 M L M M L M L M L M A
70 G G G G G G G G G G A
71 V I I I V V I V V I A
72 T T T T T T T T T T A
73 A A A A A A A A S A A
74 I L I I L L I I I I A
75 W W W W W W W W W W A
76 I I I I I I I I I I A
77 S S S S S S S S T S A
78 Q Q Q Q Q Q Q Q P Q A
79 P P P P P P P P P P A
80 V V V V V V V V I V A
81 B B E E E E E E D E A
82 N N N N N N N N N N A
83 I V I I I I V V V I A
84 Y Y Y Y F T Y F N Y A
85 S A S S A S A S N A A
86 I * V V T V * V T V A
87 I L I I I I L M D L A
88 N H N N N K H N * P A
89 Y P Y Y Y Y P D * D A
90 S S S S S S S A A S A
91 G G G G G G G S A T A
91a * Y * * * * Y * * F A
92 V * V V V V * * * G A
93 N * H N T N * G G G A
94 N * N N N N * S N S A
95 T T T T T T T A T T A
96 A S A A A S S S G S A
97 Y Y Y Y Y Y Y Y Y Y A
98 H H H H H H H H H H A
99 G G G G G G G G G G A
100 Y Y Y Y Y Y Y Y Y Y A
101 W W W W W W W W W W A
102 A A A A A A A A G A A
103 R R R R R R R R R R A
104 D D D D D D D D D D A
105 F Y F F F F Y F Y F A
106 K K K K K K K K F K A
107 K K K K K Q R K R K A
108 T T T T T T T P I T A
109 N N N N N N N N D N A
110 P P P P P D P P E P A
111 A Y A A Y A F F H F A
112 Y Y Y Y F F Y F F F A
113 G G G G G G G G G G A
114 T N T T T D D T N S A
115 I F M M M F F L L F A
116 A D Q Q T A S S D T A
117 D D D D D D D D D D A
118 F F F F F F F F F F A
119 Q D K K Q Q D Q K Q A
120 N R N N N N R R E N A
121 L L L L L L L L L L A
122 I M I I V I M V T I A
123 A S D D T D D D S A A
124 A T T T T T T A L T A
125 A A A A A L A A M A A
126 H H H H H T H H H H A
127 A S A A A L S A S A A
128 K N H H K I N K P H A
129 N G N N G T G G D N A
130 I I I I I S I I Y I A
131 K K K K K R K K N K A
132 V V V V I S V V M V A
133 I I I I I D I I K I A
134 I M I I I R M I L I A
135 D D D D D L D D V D A
136 F F F F F R F F L F A
137 A T A A A P T A D A A
138 P P P P P Q P P Y P A
139 N N N N N P N N A N A
140 H H H H H H H H P H A
141 T S T T T V S T N T A
142 S S S S S S S S H S A
143 P P P P P G P P S P A
144 A A A A A R A A N A B
145 S L S S M A L S A S B
146 S E S S E G E E N E B
147 D T D D T T T T D T B
148 Q N D D D N D N E D B
149 P P P P T P P P N P B
150 S N S S S G S S E T B
151 F Y F F F F Y Y P Y B
152 A V A A A A A M G G B
153 E E E E E E B E A E B
154 N N N N N N N N L N B
155 G G G G G G G G Y G B
156 R A R R K A A R R R B
157 L I L L L L V L D L B
158 Y Y Y Y Y Y Y Y G Y B
159 D D D D D D N D V D B
160 N N N N N N D N F N B
161 G G G G G G G G I G B
162 T A N N N S V T T V B
163 L L L L L L L L D L B
164 L L L L V L I L Y L B
165 G G G G G G G G P G B
166 G N G G G A N G T G B
167 Y Y Y Y Y Y Y Y N Y B
168 T S T T T S S T V T B
169 N N N N N N N N A N B
170 D D D D D D D D A D B
171 T Q T T T T P A N T B
172 Q Q Q Q N A N N T N B
173 N N N N G G N M G G B
174 L L L L Y L L Y W Y B
175 F F F F F F F F Y F B
176 H H H H H H H H H H B
177 H H H H H H H H H H B
178 N N Y Y N N N N N Y B
179 G G G G G G G G G G B
180 G G G G G G G G G G B
181 T T T T S T T T V T B
182 D D D D D D D T T N B
182a * * * * * * * * N * B
183 F F F F F F F F W F B
184 S S S S S S S S N S B
185 T S T T T T S S D S B
186 T Y I I L I Y L F Y B
187 E E E E E E E E F E B
188 N D N N N D D D Q D B
189 G S G G G G S G V G B
190 I I I I I I I I K I B
191 Y Y Y Y Y Y Y Y N Y B
192 K R K K K K R R H R B
193 N N N N N N N N N N B
194 L L L L L L L L L L B
195 Y Y Y Y Y Y Y F F F B
196 D D D D D D D D N D B
197 L L L L L L L L L L B
198 A A A A A A A A S A B
199 D D D D D D D D D D B
200 L Y L L L I Y L L L B
201 N D N N N N D N N D B
202 H L H H H H L H Q Q B
203 N N N N N N N Q S Q B
204 N N N N N N N N N N B
205 S T S S S N T P T S A
206 T V S S T A V V D T A
207 V M V V J M M I V I A
208 D D D D D D D D Y D A
209 V Q V V T A Q R Q S A
210 Y Y Y Y Y Y Y Y Y Y A
211 L L L L F F L L L L A
212 K K K K K K K K L K A
213 D E D D D S E D D A A
214 A S A A A A S A G A A
215 I I I I I I I V S I A
216 K K K K K D K K K K A
217 M F M M L L L M F L A
218 W W W W W W W W W W A
219 L L L L L L L I I L A
220 D D D D D G D D D D A
221 L K L L M M K M A M A
222 G G G G G G G G G G A
223 I I V V V V I I V I A
224 D D D D D D D D D D A
225 G G G G G G G G A G A
226 I I J I I I I I I I A
227 R R R R R R R R R R A
228 M V V V V F V M I M A
229 D D D D D D D D D D A
230 A A A A A A A A A A A
231 V V V V V V V V I V A
232 K K K K K K K K K K A
233 H H H H H Q H H H H A
234 M M M M M Y M M M M A
235 P S P P P P S P D A A
236 F E F F Q F E F K F A
237 G G G G G G G G S G A
238 W W W W W W W W F W A
239 Q Q Q Q Q Q Q Q I Q A
240 K T K K K K T K Q K A
241 S S S S N S S S K N A
242 F L F F W F L L W F A
243 M M M M M V M M T M A
244 A S S A S S S D S D A
245 A B T T S S D E D S A
246 V I I I I I I I I I A
247 N Y N N Y Y Y D Y L A
248 N S N N A G A N D S A
249 Y H Y Y H G H Y Y Y A
249a * * * * * D * * S * A
250 K K K K K H E R K R A
251 P P P P P P P P S P A
252 V V V V V V V V I V A
253 F F F F F F F F G F A
254 T T N T T T T T R T A
255 F F F F F F F F E F A
256 G G G G G G G G G G A
257 E E E E E E E E F E A
258 W W W W W W W W F W A
259 F F F F F Y F F F Y A
260 L L L L L L L L F L A
261 G G G G G G G S G G A
262 V S V V S A S E E T A
263 N G N N A D G N W N A
264 E E E E A Q E E F E A
265 V V I I P T V V G V A
266 S D S S D D D D A D A
267 P P P P A G P A S P A
268 E Q E E D D Q N A N A
269 N N Y Y N N N N N N A
270 H H H H T I H H T T A
271 K H Q Q D K H Y T Y A
272 F F F F F F F F T F A
273 A A A A A A A A G A A
274 N N N N N N N N V N A
275 E E E E E E E E D E A
276 S S S S S S S S G S A
277 G G G G G G G G N G A
278 M M M M M M M M A M A
279 S N S S S N S S I S A
280 L L L L L L L L D L A
281 L L L L L L L L Y L A
282 D D D D D D D D A D A
283 F F F F F F F F N F A
284 R Q P R R E Q R T R A
285 F F F F F Y F F S F A
286 A G A A N A G G G A A
287 Q Q Q Q S Q Q Q S Q A
288 K T K K A E T K A K A
289 V I A A V V I L L V A
290 R R R R R R R R L R A
291 Q N Q Q N E D Q D Q A
292 V V V V V V V V F V A
293 F L F F F F L L G F A
294 R K R R R R M R F R A
295 D D D D D D D N R D A
296 N R N N N K G N D N A
297 T T T T T T S S T T A
298 D S D D S E S D L D A
299 N N N N N T N N E T A
300 M W M M M M W W R M A
301 Y Y Y Y Y K Y Y V Y A
302 G D G G A D D G L G A
303 L F L L L L F F V L A
304 K N K K D Y N N G D A
305 A E A A S E E Q R S A
306 M M M M M V M M S M A
307 L I L L L L I I G I A
308 E T E E T A A Q N Q A
309 G S G G A S S D T S A
310 S T S S T T T T M T A
311 A E E B A B B A K A A
312 A K V V A S E S T A A
313 D E D D D Q D A L D A
314 Y Y Y Y Y Y Y Y N Y A
315 A N A A N D D D S N A
316 Q E Q Q Q Y E B Y F A
317 V V V V V I V V L I A
318 D I N N N N I L I N A
319 D D D D D N D D K D A
320 Q Q Q Q Q M Q Q R M A
321 V V V V V V V V Q V A
322 T T T T T T T T T T A
323 F F F F F F F F V F A
324 I I I I I I I I F I A
325 D D D D D D D D T D A
326 N N N N N N N N S N A
327 H H H H H H H H D H A
328 D D D D D D D D D D A
329 M M M M M M M M W M A
330 E S E E D D S D Q D A
331 R R R R R R R R V R A
332 F F F F F F F F V F A
333 H S H H K Q S M F Y A
334 A V T T T V F I M T A
335 S G S S S A B D D G A
336 N S N N A G Q G N * A
337 A S G G V S S G H G A
338 N S D D N G S D D S A
339 R N R R N T N P M T A
340 R R R R R R R R A R A
341 K Q K K R A H K R P A
342 L T L L L T T V I V A
343 E D E E E E D D G E A
344 Q M Q Q Q Q I M T Q A
345 A A A A A A A A A A A
346 L L L L L L L L L L A
347 A A A A A A A A R A A
348 F V F F F L V V S F A
349 T L T T T T L L N T A
350 L L L L L L L L A L A
351 T T T T T T T T T T A
352 S S S S S S S S T S A
353 R R R R R R R R F R A
354 G G G G G G G G G G A
355 V V V V V V V V P V A
356 P P P P P P P P G P A
357 A T A A A A T N N A A
358 I I I I I I I I N I A
359 Y Y Y Y Y Y Y Y E Y A
360 Y Y Y Y Y Y Y Y T Y A
361 G G G G G G G G G G A
362 T T S S T T T T G T A
363 E E E E E E E E S E A
364 Q Q Q Q Q Q Q Q Q Q A
365 Y Y Y Y Y Y Y Y S Y A
366 M V M M L M L M E M A
367 S T S S T T T T A T A
368 G G G G G G G G F G A
369 G G G G N D G N A N A
370 T N N N G G N G Q G A
371 D D D D D D D D K D A
372 P P P P P P P P R P A
373 D E D D D N E N I Y A
374 N N N N N N N N D N A
375 R R R R R R R R L R A
376 A K A A G A K K G A A
377 R P R R K M P M L M A
378 I L I L M M M M V M A
379 P K P P P T S S A T A
380 S T S S S S D S T S A
381 F F F F F F F F M F A
382 S D S S S N D N T D A
383 T R T T K T R K V T A
384 S S T T S G T N R T A
385 T T T T T T T T G T A
386 T N T T T T N R I T A
387 A S A A A A S A P A A
388 Y Y Y Y F Y Y Y A Y A
389 Q Q Q Q N K Q Q I N A
390 V I V V V V I V Y V A
391 I I I I I I I I Y I A
392 Q S Q Q S Q S Q G K A
393 K K K K K A T K T K A
394 L L L L L L L L E L A
395 A A A A A A A S H A A
396 P S P P P P S S Y P A
397 L L L L L L L L A L A
398 R R R R R R R R A R A
399 K Q K K K K Q R N K A
400 C T S S S S N N F S A
401 N N N N N N N N T N A
402 P S P P P P P P S P A
403 A A A A A A A A N A A
404 I L I I I I L L S I A
405 A G A A A A G A F A A
406 Y Y Y Y Y Y Y Y G Y A
407 G G G G G G G G Q G C
408 S T S S S T N D V T C
409 T T T T T T T T G Q C
410 Q T Q H Q T S E S K C
411 E E E E Q E E Q D Q C
412 R R R R R R R R P R C
413 W W W W W W W W Y W C
414 I L I I I V I I N I C
415 N N N N N N N N R N C
416 N E N N N N S G E N C
417 D D D D D D D D k D C
418 V I V V V V V V M V C
419 L Y I I Y L Y Y P Y C
420 I I I I I I I V G I C
421 Y Y Y Y Y I Y Y F Y C
422 E E E E E E E E D E C
423 R R R R R R R R T R C
424 K T K K K K S Q E Q C
425 F F F F F F F F S F C
426 G G G G G G G G E G C
427 S N N N K S D K A N C
428 N S N N S S S D F N C
429 V I V V V A V V S V C
430 A V A A A A V V I A C
431 V L V V V L L L I L C
432 V T V V V V T V K V C
433 A A A A A A A R T A C
434 V V i I V I V V L I C
435 N N N N N N N N G N C
436 R S R R R R S R D R C
437 N * N N N N * S L N C
438 L S M M L S G S R L C
439 N N N N T S D S K S C
440 A S T T T A T S S T C
441 P N P P P A S N S S C
442 A Q A A T Y Y Y P Y C
443 S T S S S P T S A Y C
444 I I I I I I I I I I C
445 S T T T T S N T Q T C
446 G N G G N G N G N G C
447 L L L L L L L L G L C
448 V N V V N L N F T Y C
449 T T T T T S T T Y T C
450 S S S S S S S A T A C
451 L L L L L L L L E L C
452 P P P R P P P P L P C
453 Q Q Q R S A Q A W A C
454 G G G A G G G G V G C
455 S N S S T T Q T N T C
456 Y Y Y Y Y Y Y Y D Y C
457 N T N N T S T T D S C
458 D D D D D D D D I D C
459 V E V V V V E Q L M C
460 L L L L L L L L V L C
461 G Q G G G N Q G F G C
462 G Q G G G G Q G E G C
463 L R I I V L L L R L C
464 L L L L L L L L R L C
465 N D N N N N D D S N C
466 G G G G G G G G G G C
467 N N N N N N N N N S C
468 T T T T N S E T D S C
469 L I L L I I I I I I C
470 S T T T T T T Q V T C
471 V V V V S V V V I V C
472 G N G G S G N G V S C
473 S A A A G S S S A S C
474 G N G G G G N N L N C
475 G G G G N G G G N G C
476 A A A A I A A S R S C
477 A V A A * V V V G V C
478 S N S S S T D N E T C
479 N S N N S N S A A P C
480 F F F F F F F F N F C
481 T Q T T T T Q D T T C
482 L L L L L L L L I L C
483 A R A A A A S G N A C
484 N A P P A A A P V P C
485 G N G G G G N G K G C
486 G S G G A G G B N E C
487 T V T T T T V V I V C
488 A A A A A A S G A A C
489 V V V V V V V V V V C
490 W W W W W W W W P W C
491 Q Q Q Q Q Q Q A N Q C
492 Y V Y Y Y Y I Y G Y C
493 T S T T T T T S V V C
494 A N T T A A E A Y S C
495 A W D D S P B T P T C
496 T S A A E E H E S T D
497 A T T T T T A S L N D
498 T S A T T S S T I P D
499 P P P P P P P P G P D
500 T L I I T A L I N L D
501 I I N I I I I I N I D
502 G G G G G G G G S G D
503 H Q N N H N H H V H D
504 V V V V V V V V S V D
505 G G G G G G G G V G D
506 P P P P P P P P A P D
507 M M M M V T M M N T D
508 M M M M M M M M K M D
509 A G A A G G G G R T D
510 K K K K K Q K Q T K D
511 P A A P P P H V T A D
512 G G G G G G G G L G D
513 V N V V N N N H T Q D
514 T T T T V I T Q L T D
515 I I I I V V V V M I D
516 T T T T T T T T Q T D
517 I V I I I I I I N I D
518 D S D D D D T D E D D
519 G G G G G G G G A G D
520 R E R R R R B E V R D
521 G G A G G G G G V G D
522 F F * F F F F F I F D
523 G G S G G G G G R G D
524 S D A S S G D T S T D
525 S E R G A T N N Q T D
526 K R Q K K A E T S A D
527 G G G G G G G G D G D
528 T S T T T T S T D Q D
529 V V V V V V V V A V D
530 Y L Y Y Y Y L K E L D
531 F F F F F F F F N F D
532 G D G G G G D G P G D
533 T S T T T T S T T T D
534 T T T T T T D T V T D
535 A S A A A A F A Q P D
536 V S V V V V S A S A D
537 S E T T T T D N I T D
538 G * G G G G * * N * D
539 A * A A S S * * F * D
540 D * D D A G * * T * D
541 I I I I I I V V C I D
542 T I V V T V L V N V D
543 S S A A S S S S N S D
544 W W W W W W W W G W D
545 E S E E E E S S Y E D
546 D N D D D D D N T D D
547 T T T T T T T N I T D
548 Q K Q Q Q Q K Q S E D
549 I I I I I I I I G V D
550 K S Q Q K K E V Q K D
551 V V V V V A V V S V D
552 K K K K T V S A V K D
553 I V I I I I V V Y V D
554 P P L P P P P P I P D
555 A N R A P K D N I A D
556 V V V V V V V V G L D
557 A A P P A A T S N T D
558 G G G G G A A P I P D
559 G G G G G G G G P G D
560 N Y I I D K H K Q K D
561 Y Y Y Y Y T Y Y L Y D
562 N D D D A G D N G N D
563 I L I I V V I I G I D
564 K S R R K S S T W T D
565 V V V V V V V V D L D
566 A V A A A K V Q L K D
567 N T N N A T N S T T D
568 A A A A * S A S K A D
569 A A A A N S G S A S D
570 G N G G G G D G V G D
571 T I A A V T S Q K V D
572 A K A A N A Q T I T D
573 S S S S S S S S S S D
574 N P N N N N P A P N D
575 V T I I A T T A T S D
576 Y Y Y Y Y F Y Y Q Y D
577 D K D D N K D D Y N D
578 N E N N D S K N P N D
579 F F F F F F F F Q I D
580 B E E E T N B E W N D
581 V V V V I V V V S V D
582 L L L L L L L L A L D
583 S S T T S T T T S T D
584 G G G G G G G N L G E
585 D N D D D D D D E N E
586 Q Q Q Q Q Q Q Q L Q E
587 V V V V V V V V P V E
588 S S T T S T S S S C E
589 V V V V V V I V D V E
590 R R R R R R R R L R E
591 F F F F F F F F N F E
592 V G V V V L A V V V E
593 V V I I I V V V E V E
594 N N N N N N N N W N E
595 N N N N N Q N N K N E
596 A A A A A A A A C A E
597 T T T T T N T T V T E
598 T T T T T T T T K T E
599 A S A A A N S N R V E
600 L P L L L Y L L N W E
601 G G G G G G G G E G E
602 Q T Q Q E T T Q T E E
603 N N N N N N N N N N E
604 V L V V I V L I P V E
605 Y Y F F Y Y Y Y T Y E
606 L I L L L L M I A L E
607 T V T T T V V V N T E
608 G G G G G G G G V G E
609 S N N N N N N N E N E
610 V V V V V A V V W V E
611 S N S S S A N Y Q A E
612 E E E E E E E E S E E
613 L L L L L L L L G L E
614 G G G G G G G G A G E
615 N N N N N T N N N N E
616 W W W W W W W W N W E
617 D D D D T D D D Q D E
618 P A P P T P P T F T E
619 A D N N G N D S N S E
620 K K N N A K Q K S K E
621 A A A A A A A A N A E
621a * * * * S * * * * * E
622 I I I I I I I I D I E
623 G G G G G G G G T G E
624 P P P P P P P P Q P E
625 M M M M A M M M T M E
626 Y F Y Y F Y P F T F E
627 N N N N N N N N N N E
628 Q Q Q Q Q Q Q Q G Q E
629 V V V V V V V V S V E
630 V M V V I I M V F V E
631 Y Y Y Y H A Y Y F Y E
632 Q Q Q Q A K Q S Q E
633 Y Y Y Y Y Y Y Y Y E
634 P P P P P P P P P E
635 N T T T T S T T T E
636 W W W W W W W W W E
637 Y Y Y Y Y Y Y Y Y E
638 Y Y Y Y Y Y Y I Y E
639 D D D D D D D D D E
640 V I V V V V I V V E
641 S S S S S S S S S E
642 V V V V V V V V V E
643 P P P P P P P P P E
644 A A A A A A A E A E
645 G G G G G G E G G E
646 K K Q Q K T E K T E
647 T N T T Q K N T T E
648 I L I I L L L I I E
649 E E E E E D E E E E
650 F Y F F F F Y F F E
651 K K K K K K K K K E
652 F Y F F F F F F F E
653 L I L L F I I I I E
654 K K K K K K K K K E
655 K K K K K K K K K E
656 Q D Q Q N G D D N E
657 G Q G G G G S S G E
658 S N S S A G S Q S E
658a * G * * * * G G * E
659 T N T T T T N N T E
660 V V V V I V V V V E
661 T V T T T T V T T E
662 W W W W W W W W W E
663 E Q E B B B B B E E
664 G S G G G G S S G E
665 G G G G G G G G G E
666 S N A A S G N S Y E
667 N N N N N N N N N E
668 H R R R H H H H H E
669 T T T T T T T V V E
670 F Y P P P Y Y Y Y E
671 T T T T T T T T T E
672 A S T T T T T T T E
673 P P P P P P P P P E
674 S T T T T A A T T E
675 S T S S S S T N S E
676 G G G G G G G T G E
677 T T T T T V T T T E
678 A D A A A G D G A E
679 T T T T T T T K T E
680 I V V V V V V I V E
681 N M N N T T L I I E
682 V I V V I V V V V E
683 N N N N N D D D D E
684 W W W W W W W W W E
685 Q Q Q Q Q Q Q Q E
686 P P P N N P E
______________________________________
*Amino acid residue absent in this position
TABLE 2 ______________________________________ CGTase Amino Acid Residues less than 8 Å from the Substrate Positions Identified by CGTase Numbering ______________________________________ 19 142 192 301 476 634 21 143 193 304 596 635 46 144 194 326 597 636 47 145 195 327 598 649 75 146 196 328 599 650 78 147 197 329 600 651 82 148 198 370 601 652 87 149 199 371 602 653 88 150 227 372 603 655 89 151 228 374 604 656 90 154 229 375 605 660 94 167 230 410 607 661 95 168 231 411 608 662 96 176 232 412 609 663 97 177 233 413 615 664 98 178 257 414 616 665 99 179 258 415 617 666 100 180 259 416 618 667 101 181 260 418 624 668 102 182 261 420 625 685 135 183 262 443 626 686 136 184 264 444 627 137 185 266 445 628 138 186 268 446 629 139 187 281 447 631 140 188 283 448 632 141 189 287 449 633 ______________________________________
TABLE 3
______________________________________
CGTase Amino Acid Residues less than 8 Å from the Substrate
Positions Identified in B. circulans Strain 251 (CGTase Numbering)
______________________________________
Gln-19 Ser-142 Gly-189 Phe-0283
Leu-447 Val-629
Phe-21 Pro-143 Lys-192 Gln-287 Val-448 Tyr-631
Arg-47 Ala-144 Asn-193 Tyr-301 Thr-449 Gln-632
Trp-75 Ser-145 Leu-194 Lys-304 Ala-476 Tyr-633
Gln-78 Ser-146 Phe-195 Asn-326 Ala-596 Pro-634
Asn-82 Asp-147 Asp-196 His-327 Thr-597 Asn-635
Ile-87 Gln-148 Leu-197 Asp-328 Thr-598 Trp-636
Asn-88 Pro-149 Ala-198 Met-329 Ala-599 Glu-649
Tyr-89 Ser-150 Asp-199 Thr-370 Leu-600 Phe-650
Ser-90 Phe-151 Arg-227 Asp-371 Gly-601 Lys-651
Asn-94 Asn-154 Met-228 Pro-372 Gln-602 Phe-652
Thr-95 Tyr-167 Asp-229 Asn-374 Asn-603 Lys-655
Ala-96 Thr-168 Ala-230 Arg-375 Val-604 Gln-656
Tyr-97 His-176 Val-231 Gln-410 Tyr-605 Val-660
His-98 His-177 Lys-232 Glu-411 Thr-607 Thr-661
Gly-99 Asn-178 His-233 Arg-412 Gly-608 Trp-662
Tyr-100 Gly-179 Glu-257 Trp-413 Ser-609 Glu-663
Trp-101 Gly-180 Trp-258 Ile-414 Asn-615 Gly-664
Ala-102 Thr-181 Phe-259 Asn-415 Trp-616 Gly-665
Asp-135 Asp-182 Leu-260 Asn-416 Asp-617 Ser-666
Phe-136 Phe-183 Gly-261 Val-418 Pro-618 Asn-667
Ala-137 Ser-184 Val-262 Ile-420 Pro-624 His-668
Pro-138 Thr-185 Glu-264 Ser-443 Met-625 Gln-685
Asn-139 Thr-186 Ser-266 Ile-444 Tyr-626 Pro-686
His-140 Glu-187 Glu-268 Ser-445 Asn-627
Thr-141 Asn-188 Leu-281 Gly-446 Gln-628
______________________________________
TABLE 4
______________________________________
CGTase Amino Acid Residues less than 5 Å from the Substrate
Positions Identified in B circulans Strain 251 (CGTase Numbering)
______________________________________
Tyr-89 Pro-149 Asn-193 Leu-260
Asn-415 Tyr-626
His-98 Ser-150 Leu-194 Gly-261 Gly-446 Asn-627
Tyr-100 Tyr-167 Phe-195 Glu-264 Leu-447 Gln-628
Trp-101 Gly-179 Asp-196 Tyr-301 Val-448 Tyr-633
Ala-137 Gly-180 Leu-197 His-327 Thr-598 Trp-636
His-140 Thr-181 Arg-227 Asp-328 Ala-599 Glu-649
Pro-143 Asp-182 Asp-229 Asp-371 Leu-600 Lys-651
Ala-144 Phe-183 Ala-230 Arg-375 Gly-601 Trp-662
Ser-145 Ser-184 Lys-232 Glu-411 Gln-602 Glu-663
Ser-146 Thr-185 His-233 Arg-412 Asn-603 Asn-667
Asp-147 Glu-187 Glu-257 Trp-413 Trp-616
Gln-148 Asn-188 Phe-259 Ile-414 Met-625
______________________________________
TABLE 5
______________________________________
CGTase Amino Acid Residues less than 3 Å from the Substrate
Positions Identified in B. circulans Strain 251 (CGTase Numbering)
______________________________________
Tyr-89 Asp-147 Asn-193 Phe-259
Thr-598 Gln-628
His-98 Gln-148 Phe-195 His-327 Ala-599 Tyr-633
Tyr-100 Gly-180 Asp-196 Asp-328 Leu-600 Trp-636
Trp-101 Asp-182 Asp-229 Asp-371 Gly-601 Lys-651
His-140 Phe-183 Lys-232 Glu-411 Gln-602 Asn-667
Ser-145 Ser-184 His-233 Ile-414 Asn-603
Ser-146 Thr-185 Glu-257 Gly-446 Asn-627
______________________________________
TABLE 6
______________________________________
CGTase Amino Acid Residues less than 8 Å from the Substrate
Positions Identified in Thermoanaerobacter sp. (CGTase Numbering)
______________________________________
Gln-19 His-140 Tyr-186 Glu-264
Tyr-443
Met-625
Val-21 Thr-141 Glu-187 Asp-266 Ile-444 Phe-626
Leu-46 Ser-142 Asp-188 Asn-268 Thr-445 Asn-627
Lys-47 Pro-143 Gly-189 Leu-281 Gly-446 Gln-628
Trp-75 Ala-144 Arg-192 Phe-283 Leu-447 Gln-632
Gln-78 Ser-145 Asn-193 Gln-287 Tyr-448 Tyr-633
Asn-82 Glu-146 Leu-194 Tyr-301 Ser-476 Pro-634
Leu-87 Thr-147 Phe-195 Asn-326 Ala-596 Thr-635
Pro-88 Asp-148 Asp-196 His-327 Thr-597 Trp-636
Asp-89 Pro-149 Leu-197 Asp-328 Thr-598 Glu-649
Phe-91a Thr-150 Ala-198 Met-329 Val-599 Phe-650
Ser-94 Tyr-151 Asp-199 Gly-370 Trp-600 Lys-651
Thr-95 Asn-154 Arg-227 Asp-371 Gly-601 Phe-652
Ser-96 Tyr-167 Met-228 Pro-372 Glu-602 Ile-653
Tyr-97 Thr-168 Asp-229 Asn-374 Asn-603 Lys-655
His-98 His-176 Ala-230 Arg-375 Val-604 Asn-656
Gly-99 His-177 Val-231 Lys-410 Tyr-605 Thr-661
Tyr-100 Tyr-178 Lys-232 Gln-411 Thr-607 Trp-662
Trp-101 Gly-179 His-233 Arg-412 Gly-608 Glu-663
Ala-102 Gly-180 Glu-257 Trp-413 Asn-609 Gly-664
Asp-135 Thr-181 Trp-258 Ile-414 Asn-615 Gly-665
Phe-136 Asn-182 Tyr-259 Asn-415 Trp-616 Tyr-666
Ala-137 Phe-183 Leu-260 Asn-416 Asp-617 Asn-667
Pro-138 Ser-184 Gly-261 Val-418 Thr-618 His-668
Asn-139 Ser-185 Thr-262 Ile-420 Pro-624 Gln-685
______________________________________
TABLE 7
______________________________________
CGTase Amino Acid Residues less than 5 Å from the Substrate
Positions Identified in Thermoanaerobacter sp. (CGTase Numbering)
______________________________________
Lys-47
Lys-148 Asn-193 Gly-261
Thr-445 Gln-628
Ser-94 Pro-149 Leu-194 Glu-264 Gly-446 Tyr-633
Tyr-97 Thr-150 Phe-195 Asp-266 Leu-447 Trp-636
His-98 Tyr-151 Asp-196 Tyr-301 Tyr-448 Glu-649
Tyr-100 Tyr-167 Leu-197 His-327 Thr-598 Lys-651
Trp-101 Gly-179 Arg-227 Asp-328 Val-599 Trp-662
Ala-137 Gly-180 Asp-229 Asp-371 Trp-600 Glu-663
His-140 Thr-181 Ala-230 Arg-375 Gly-601 Gly-665
Pro-143 Asn-182 Lys-232 Gln-411 Glu-602 Asn-667
Ala-144 Phe-183 His-233 Arg-412 Asn-603
Ser-145 Ser-184 Glu-257 Trp-413 Trp-616
Glu-146 Ser-185 Tyr-259 Ile-414 Met-625
Thr-147 Tyr-186 Leu-260 Asn-415 Asn-627
______________________________________
TABLE 8
______________________________________
CGTase Amino Acid Residues less than 3 Å from the Substrate
Positions Identified in Thermoanaerobacter sp. (CGTase Numbering)
______________________________________
His-98
Thr-147 Phe-195 Asp-328
Thr-598 Asn-627
Tyr-100 Gly-180 Asp-229 Asp-371 Val-599 Tyr-633
Trp-101 Phe-183 His-233 Arg-375 Trp-600 Lys-651
His-140 Ser-184 Glu-257 Gln-411 Gly-601 Asn-667
Ser-145 Ser-185 Tyr-259 Ile-414 Glu-602
Glu-146 Asn-193 His-327 Gly-446 Asn-603
______________________________________
TABLE 9
__________________________________________________________________________
CGTase Variants of Increased Product Selectivity
Positions Identified by CGTase Numbering
Position
α-cyclodextrin
β-cyclodextrin
γ-cyclodextrin
__________________________________________________________________________
21 F,Y F,Y F,Y
47 Q,L A,Q,H,R,L A,Q,H,R,L
87 I,H I,H I,H
88 P,N,K,H P,N,K,H P,N,K,H
89 D,G,A,Y,E,* D,G,A,E,K,R,Y,P,* D,G,A,Y,P,*
90 S G,A,S G,A,S
91 A,V,D,G,T A,V,G,S,T A,V,G,S
91a A,V,G,Y,* A,V,G,Y,F,* A,V,G,Y,F,*
92 G,V,* G,V,* G,V,*
93 G,N,* G,N,H,T,* G,N,H,T,*
94 Q,K,R,W,F,N,S,* Q,K,R,W, F, N,S,* Q,K,R,W,F,N,S,*
98 H G,A G,A
101 W G,A G,A,F,Y
135 L,D L,D L,D
140 A,R,N A,R,N A,R,N
143 G,S,A P P
144 K,R,D,A,N,E,Q A A
145 A,E,W,P,G,F,Y,P,R,K A,E,L,W A,E,L,W
145a P,A,F,Q,S,W,I,R,* P,A,I,Q,S I,A,Q,P,S
146 P,A,F,Q,S,W,I,R,G,E,* P,A,I,Q,S,E,K,D,N,R,F,W,* I,A,Q,P,S,E
147 A,L,I,F,T,* A,L,I,F,W,G,Y,R,D,T,*
S,T,A,D
147a * * D,N,E,Q,T
148 G,A,N G,A,N,Q D,E,R,K,Y,F,N,Q
149 P W,P L,I,F,W,P
150 A,G A,S A,S,N
167 P,F,Y A,F,Y A,F,P,Y
168 S,T S,T S,T
178 N,Y N,Y N,Y
179 S,N,D G,S,N,D G,S,N,D
180 S,N,D G,S,N,D G,S,N,D
183 F,W,Y,A F,W,Y,A F,W,Y,A
185 P,H,R,E,D P,H,R,E,D P,H,R,E,D
192 K,R K,R K,R
193 G,D,E,N,Q G,A,N G,A,N
195 Y,F L,I,W,Y,F L,I,W,F,Y
196 A,D,N,S A,D,N,S A,D,N,S
197 D,E,L D,E,L D,E,L
232 K,Q,L K,Q,L K,Q,L
233 H,Q,N,I H,Q,N,I H,Q,N,I
259 F,W,Y,A F,W,Y,A F,W,Y,A
264 Q Q Q
326 Q,F,L Q,F,L Q,F,L
370 G T,N T,N
371 A,D,S,N,G,E,Q A,G,N,D,S A,G,N,V,L,I,D,S
373 D,N,Y D,E,Y D,E,Y
375 R,K A,P,G,R,K A,P,G,R,K
600 X X X
__________________________________________________________________________
X = any natural amino acid residue
*deleted or absent residue
TABLE 10
______________________________________
CGTase Variants of Reduced Product Inhibition
Positions Identified by CGTase Numbering
______________________________________
47 A,Q,L
89 G
100 A,I,L,F,Y
185 R,E,D
186 A
196 A,L,D
232 K,Q,N,A,L
264 A,N,L
268 A
339 A
371 G,N,A,L,D,S,E,Q
375 G,Q,N,A,L,R,K
382 A,L,V
384 A,L,V
413 A,V,G,W
598 A,V,G,P,T
599a P,R,H
600 X
603 A,V,L,G,N
616 A,I,L,G,W
626 A,I,V,L,G
627 A,V,L,G,N
628 A,V,L,G,Q
633 A,V,L,I,G,Y
636 I,L,A,G,W
649 A,G
651 A,G,V,K
662 A,L,I,G,W
667 A,N
______________________________________
X = any natural amino acid residue
TABLE 11
__________________________________________________________________________
Bacillus Derived CGTase Variants of Increased Product Selectivity
Positions Identified by CGTase Numbering
Position
α-cyclodextrin
β-cyclodextrin
γ-cyclodextrin
__________________________________________________________________________
21 F,Y F,Y F,Y
47 Q,L A,Q,H,R,L A,Q,H,R,L
87 H H H
88 P,N,K,H P,N,K,H P,N,K,H
89 D,G,A,E,* D,G,A,E,K,R,P,* D,G,A,P,*
90 -- G,A G,A
91 A,V,D,T A,V,S,T A,V,S
91a A,V,G,Y,* A,V,G,Y,F,* A,V,G,Y,F,*
92 G,* G,* G,*
93 G,* G,H,T,* G,H,T,*
94 Q,K,R,W,F,S,* Q,K,R,W,F,S,* Q,K,R,W,F,S,*
98 -- G,A G,A
101 -- G,A G,A,F,Y
135 L L L
140 A,R,N A,R,N A,R,N
143 G,S P P
144 K,R,D,A,N,E,Q A A
145 A,E,W,P,G,F,Y,P,R,K A,E,L,W A,E,L,W
145a P,A,F,Q,S,W,I,R,* P,A,I,Q,S I,A,Q,P,S
146 P,A,F,Q,S,W,I,R,G,E,* P,A,I,Q,S,E,K,D,N,R,F,W,* I,A,Q,P,S,E
147 A,L,I,F,* A,L,I,F,W,G,Y,R,D,T,* S,T,A,D
147a * * D,N,E,Q,T
148 G,A,N G,A,N D,E,R,K,Y,F,N
149 -- W L,I,F,W
150 A,G A A,S,N
167 P,F A,F A,F,P
168 S,T S,T S,T
178 N,Y N,Y N,Y
179 S,N,D S,N,D S,N,D
180 S,N,D S,N,D S,N,D
183 W,Y,A W,Y,A W,Y,A
185 P,H,R,E,D P,H,R,E,D P,H,R,E,D
192 K,R K,R K,R
193 G,D,E,Q G,A G,A
195 F L,I,W,F L,I,W,F
196 A,S,N,G A,S,N,G A,S,N,G
197 D,E D,E D,E
232 Q,L Q,L Q,L
233 Q,N,I Q,N,I Q,N,I
259 F,W,A F,W,A F,W,A
264 Q Q Q
326 Q,F,L Q,F,L Q,F,L
370 G T,N T,N
371 A,S,N,G,E,Q A,G,N,S A,G,N,V,L,I,S
373 D,N,Y D,E,Y D,E,Y
375 -- A,P,G,K A,P,G,K
600 X X X
__________________________________________________________________________
X = any natural amino acid residue
-- conserved residue
* deleted or absent residue
TABLE 12
______________________________________
Bacillus Derived CGTase Variants of Reduced Product Inhibition
Positions Identified by CGTase Numbering
______________________________________
47 A,Q,L
89 G
100 A,I,L,F
185 R,E,D
186 A
196 A,L
232 Q,N,A,L
264 A,N,L
268 A
339 A
371 G,N,A,L,S,E,Q
375 G,Q,N,A,L,K
382 A,L,V
384 A,L,V
413 A,V,G
598 A,V,G,P
599a P,R,H
600 X
603 A,V,L,G
616 A,I,L,G
626 A,I,V,L,G
627 A,V,L,G
628 A,V,L,G
633 A,V,L,I,G
636 I,L,A,G
649 A,G
651 A,G,V
662 A,L,I,G
667 A
______________________________________
X = any natural amino acid residue
TABLE 13
__________________________________________________________________________
Thermoanaerobacter Derived CGTase Variants of Increased Product Selectivit
Positions Identified by CGTase Numbering
Position
α-cyclodextrin
β-cyclodextrin
γ-cyclodextrin
__________________________________________________________________________
21 F,Y F,Y F,Y
47 Q,L A,Q,H,R,L A,Q,H,R,L
87 I,H I,H I,H
88 N,K,H N,K,H N,K,H
89 G,A,Y,E,* G,A,E,K,R,Y,P,* G,A,Y,P,*
90 -- G,A G,A
91 A,V,D,G A,V,G,S A,V,G,S
91a A,V,G,Y,* A,V,G,Y,* A,V,G,Y,*
92 V,* V,* V,*
93 N,* N,H,T,* N,H,T,*
94 Q,K,R,W,F,N,* Q,K,R,W,F,N,* Q,K,R,W,F,N,*
98 -- G,A G,A
101 -- G,A G,A,F,Y
135 L L L
140 A,R,N A,R,N A,R,N
143 G,S -- --
144 K,R,D,N,E,Q -- --
145 A,E,W,P,G,F,Y,P,R,K A,E,L,W A,E,L,W
145a P,A,F,Q,S,W,I,R,* P,A,I,Q,S I,A,Q,P,S
146 P,A,F,Q,S,W,I,R,G,* P,A,I,Q,S,K,D,N,R,F,W,* I,A,Q,P,S
147 A,L,I,F,* A,L,I,F,W,G,Y,R,D,* S,A,D
147a * * D,N,E,Q,T
148 G,A,N G,A,N,Q E,R,K,Y,F,N,Q
149 -- W L,I,F,W
150 A,G A,S A,S,N
167 P,F A,F A,F,P
168 S S S
178 N N N
179 S,N,D S,N,D S,N,D
180 S,N,D S,N,D S,N,D
183 W,Y,A W,Y,A W,Y,A
185 P,H,R,E,D P,H,R,E,D P,H,R,E,D
192 K K K
193 G,D,E,Q G,A G,A
195 Y L,I,W,Y L,I,W,Y
196 A,S,N,G A,S,N,G A,S,N,G
197 D,E D,E D,E
232 Q,L Q,L Q,L
233 Q,N.I Q,N,I Q,N,I
259 F,W,A F,W,A F,W,A
264 Q Q Q
326 Q,F,L Q,F,L Q,F,L
370 -- T,N T,N
371 A,S,N,G,E,Q A,G,N,S A,G,N,V,L,I,S
373 D,N D,E D,E
375 -- A,P,G,K A,P,G,K
600 X X X
__________________________________________________________________________
X = any natural amino acid residue
-- conserved residue
* deleted or absent residue
TABLE 14
______________________________________
Thermoanaerobacter Derived CGTase Variants of Reduced
Product Inhibition Positions Identified by CGTase Numbering
______________________________________
47 A,Q,L
89 G
100 A,I,L,F
185 R,E,D
186 A
196 A,L
232 Q,N,A,L
264 A,N,L
268 A
339 A
371 G,N,A,L,S,E,Q
375 G,Q,N,A,L,K
382 A,L,V
384 A,L,V
413 A,V,G
598 A,V,G,P
599a P,R,H
600 X
603 A,V,L,G
616 A,I,L,G
626 A,I,V,L,G
627 A,V,L,G
628 A,V,L,G
633 A,V,L,I,G
636 I,L,A,G
649 A,G
651 A,G,V
662 A,L,I,G
667 A
______________________________________
X = any natural amino acid residue
______________________________________ 5'-G GTC GTT TAC CAG GCG CCG AAC TGG-3' (Y633A) (SEQ ID NO: 13); 5'-GC GAG CTC GGG AAC GCG GAC CCG-3' (W616A:) (SEQ ID NO: 14); 5'-CC GTC ACC GCG GAA GGC GGC-3' (W662A) (SEQ ID NO: 15); 5'-GC ATC TAC AAG GGC CTG TACGAT CTC G-3' (N193G) (SEQ ID NO: 16); 5'-GCA TCA TCA ATG GAT CCG GCG TAA AC-3' (Y89G) (SEQ ID NO: 17); 5'-CAT ACG TCG CCC GCT AGC ATT TCC GAC CAG CCT TCC-3' (145aI) (SEQ ID NO: 18); 5'-CG GGC GGG ACC GGT CCG GAC AAC CG-3' (D371G) (SEQ ID NO: 19); 5'-G TCG GGC GGT ACC AAT CCG GAC AAC C-3' (D371N) (SEQ ID NO: 20); 5'-CG TTC ATC GAT CAG CAT GAC ATG G-3' (N326Q) (SEQ ID NO: 21); 5'-GC ATC ATC AAT GAT TCC GGA GTA AAC AAC ACG GC-3' (Y89D); (SEQ ID NO: 65) and 5'-G CCC GCC TCT CCG GAC CAG CCT TC-3' (S146P) (SEQ ID NO: 22); and the the partial oligonucleotide sequences (primers) described as primers A1-A24, primers B1-B15, and C1-C9, of Examples 5, 6 and 7. ______________________________________
______________________________________
Y89G: 5'-GCA TCA TCA ATG GAT CCG GCG TAA AC-3' (Bam HI)
(SEQ ID NO: 17); and
S146P: 5'-G CCC GCC TCT CCG GAC CAG CCT TC-3' (BspE I)
(SEQ ID NO: 22).
______________________________________
TABLE 15
______________________________________
Specific Activities of α- β- AND γ-CD Forming CGTases
Cyclization Activity (U/mg)
Enzyme α β γ
______________________________________
Wild-type 2 280 80
Y89D 5 270 47
S146P 25 104 82
Y89D/S146P 35 109 79
______________________________________
TABLE 16
______________________________________
Ratio of Cyclodextrin Formation from 10% Paselli ™ WA4
(at 50° C. and for 50 hours)
Cyclodextrin Produced (%)
Enzyme α β γ
______________________________________
Wild-type 14 63 23
Y89D 17 63 20
S146P 26 55 19
Y89D/S146P 31 51 18
______________________________________
______________________________________ Y633A: 5'-G GTC GTT TAC CAG GCG CCG AAC TGG-3' (SEQ ID NO: 13) W616A: 5'-GC GAG CTC GGG AAC GCG GAC CCG-3' (SEQ ID NO: 14) W662A: 5'-CC GTC ACC GCG GAA GGC GGC-3' (SEQ ID NO: 15) ______________________________________
TABLE 17
______________________________________
Binding Properties on Raw Starch
Y.sub.max K.sub.50 (% RS)
n
0 mM 0.1 mM 0 mM 0.1 mM
0 mM 0.1 mM
Enzyme β-CD β-CD β-CD β-CD β-CD β-CD
______________________________________
Wild-type
96.2 ±
76.4 ±
0.70 ±
0.89 ±
1.71 ±
1.25 ±
3.3 1.3 0.05 0.04 0.19 0.06
W616A/ 48.7 ± 33.4 ± 2.36 ± 2.27 ± 1.19 ± 1.07 ±
W662A 1.3 2.0 0.13 0.30 0.06
0.07
Y633A 90.8 ± 58.4 ± 0.99 ± 2.70 ± 1.73 ± 1.34 ±
2.2 5.9 0.05 0.58 0.12
______________________________________
0.20
TABLE 18
__________________________________________________________________________
Kinetic Properties Determined on Paselli ™ SA2
V.sub.max K.sub.50
(units/mg) (% Paselli ™ SA2)
0 mM
0.1 mM
0.2 mM
0 mM
0.1 mM
0.2 mM K.sub.i
Enzyme β-CD β-CD β-CD β-CD β-CD β-CD n
(nM)
__________________________________________________________________________
Wild type
280.4
221.0
184.5
0.098
0.077
0.071
1.40
0.38
±2.6 ±3.7 ±1.6 ±0.003 ±0.005 ±0.002 ±0.12
±0.02
W616A/W662A 247.2 224.5 212.6 0.115 0.104 0.109 1.26 1.11
±3.0 ±1.6 ±1.2 ±0.005 ±0.002 ±0.002 ±0.12
±0.14
Y633A 316.2 316.5 317.2 0.23 0.35 0.44 0.98 0.21
±3.6 ±6.1 ±5.5 ±0.01 ±0.02 ±0.03 ±0.15 ±0.04
__________________________________________________________________________
TABLE 19
______________________________________
Kinetic Properties Determined on Raw Starch
V.sub.max K.sub.50
Enzyme (units/mg) (% RS)
______________________________________
Wild-type 153.1 ± 4.8
18.4 ± 1.3
W616A/W662A (153.1) (42.5)
Y633A 159.5 ± 5.0 42.5 ± 2.7
______________________________________
N193G:
5'-GC ATC TAC AAG GGC CTG TACGAT CTC G-3'
(Dra II)
(SEQ ID NO: 16);
- Y89G: 5'-GCA TCA TCA ATG GAT CCG GCG TAA AC-3' (Bam HI) (SEQ ID NO:
17);
- *145aI: 5'-CAT ACG TCG CCC GCT AGC ATT TCC GAC CAG CCT TCC-3' (Nhe
I) (SEQ ID NO:
18);
- D371G: 5'-CG GGC GGG ACC GGT CCG GAC AAC CG-3' (Pin AI) (SEQ ID NO:
19);
- D371N: 5'-G TCG GGC GGT ACC AAT CCG GAC AAC C-3' (Kpn I) (SEQ ID NO:
20); and
- N326Q: 5'-CG TTC ATC GAT CAG CAT GAC ATG G-3' (Cla I) (SEQ ID NO:
21).
TABLE 20
______________________________________
Ratio of Cyclodextrin Formation from 10% Paselli ™ WA4
(at 50° C. for 5 hours)
Cyclodextrins Produced (%)
Enzyme α β γ
______________________________________
Wild-type
7.4 ± 1.4 70.8 ± 1.8
21.8 ± 0.5
N193G 13.5 ± 0.2 57.5 ± 0.7 29.0 ± 0.8
Y89G 9.5 ± 1.2 72.4 ± 2.1 18.0 ± 1.0
*145aI 18.8 ± 1.2 46.3 ± 1.3 34.3 ± 1.5
D371G 2.4 ± 0.2 71.8 ± 1.6 25.9 ± 1.4
______________________________________
TABLE 21
______________________________________
Ratio of Cyclodextrin Formation from 10% Paselli ™ WA4
(at 50° C. for 45 hours)
Cyclodextrins Produced (%)
Enzyme α β γ
______________________________________
Wild-type 14.4 ± 1.0
67.7 ± 0.7
18.0 ± 0.8
N193G 25.5 ± 0.1 60.4 ± 0.2 14.1 ± 0.4
Y89G 12.7 ± 0.3 69.3 ± 0.6 18.0 ± 0.3
*145aI 24.6 ± 0.5 53.5 ± 0.8 21.9 ± 0.7
D371G 4.4 ± 0.1 73.9 ± 0.1 21.7 ± 0.1
D371N 6.5 ± 0.5 73.4 ± 0.5 20.1 ± 0.4
N326Q 5.0 ± 0.1 75.4 ± 0.1 19.6 ± 0.2
Y89G/N193G 18.8 ± 0.6 53.8 ± 0.7 27.4 ± 0.3
N193G/Q148E 17.5 ± 0.5 60.9 ± 0.7 21.6 ± 0.8
______________________________________
TABLE 22
______________________________________
Specific Activities of Initial Cyclization
Cyclization Activity (U/mg)
Enzyme α β γ
______________________________________
Wild-type 2 280 53
*145aI 111 59
N193G 132 66
N326Q 3 63 14
D371N 25 108 30
D371G 7 81 29
______________________________________
A1:
143-151(GRAGTNPG) (SEQ ID NO: 12);
5'-AATCATACATCTGGACGAGCAGGTACCAACCCGACTTTGGGGAAAA
TGGTAC-3' (SEQ ID NO: 23);
A2: 87-94(IKYSG-VNN) (SEQ ID NO: 11) + 143-151(GRAGTNPG) (SEQ ID NO:
12);
__________________________________________________________________________
A5:
P143G-A144R-S145W;
5'-ATCATACATCCGGACGATGGGAGACAGACCCTACC-3' (SEQ ID NO:24);
- A6: 87-94 (INDSG-VNN) (SEQ ID NO:58);
- 5'-CATTTACGCAGTTATCAATGATTCCGGAGTTAACAATACATCCTATCATGG-3' (SEQ
ID NO:25);
- A7: 87-94 (INDSG-VNN) (SEQ ID NO:58) + 146-150 (SDQPS) (SEQ ID
NO:59);
__________________________________________________________________________
__________________________________________________________________________
A8:
143-148 (GRGPAA) (SEQ ID NO:10);
- 5'-CAAATCATACATCTGGACGAGGACCGGCCGCACCTACCTATGGGG-3' (SEQ ID
NO:26);
- A9: 143-148 (GRAPAA) (SEQ ID NO:9);
- 5'-CAAATCATACATCTGGACGAGCACCGGCCGCACCTACCTATGGGG-3' (SEQ ID
NO:27);
- A10: 143-148 (GRA**A) (SEQ ID NO:7);
- 5'-CAAATCATACATCTGGACGAGCAGCACCTACCTATGGGG-3' (SEQ ID NO:28);
- A11: 143-148 (GRPAAA) (SEQ ID NO:64);
- 5'-CAAATCATACATCTGGACGACCTGCAGCAGCTCCTACCTATGGGG-3' (SEQ ID
NO:29);
- A12: G180S;
- 5'-CCATCATTACGGATCCACTAATTTTTCATC-3' (SEQ ID NO:30);
- A13: A144R;
- 5'-CATACATCTCCTCGATCGGAGACAGACCC-3' (SEQ ID NO:31);
- A14: P143A-A144R;
- 5'-CATACATCTGCTCGATCGGAGACAGACCC-3' (SEQ ID NO:32);
- A15: G180N;
- 5'-CCATCATTACGGAAACACTAATTTTTCATC-3' (SEQ ID NO:33);
- A16: G180D;
- 5'-CCATCATTACGGAGACACTAATTTTTCATC-3' (SEQ ID NO:34);
- A17: G180N + P143G-A144R-S145W;
__________________________________________________________________________
__________________________________________________________________________
A19:
G179N;
5'-CCATCATTATAATGGAACTAATTTTTCATC-3' (SEQ ID NO:35);
- A20: G179S;
- 5'-CCATCATTATAGTGGAACTAATTTTTCATC-3' (SEQ ID NO:36);
- A21: G179D;
- 5'-CCATCATTATGATGGAACTAATTTTTCATC-3' (SEQ ID NO:37);
- A22: G179N + P143G-A144R-S145W;
__________________________________________________________________________
TABLE 23
______________________________________
Production, Purification and Enzyme Activities of CGTases
Enzyme activity
Enzyme Host Purification method (NU/mg)
______________________________________
Wild-type
E. coli AfC 1513
A1 E. coli AfC 432
A2 E. coli
AfC 1009
A E. coli AfC 1404
A4 E. coli
AfC 1082
A5 E. coli AfC + AEC 2100
A9 E. coli
A10 E. coli
A11 E. coli AfC + AEC 2200
______________________________________
TABLE 24
______________________________________
Specific Activities of α-, β- and γ-CD Forming CGTases
Cyclization Activity (U/mg)
Enzyme α β
γ
______________________________________
Wild-type 39 49 40
A1 26 29
A2
32 36
A 24 26
A4
32 39
A5 43 27 27
A11 20 6 13
______________________________________
TABLE 25
______________________________________
Ratio of Cyclodextrin Formation at Optimum CD Formation
Cyclodextrin produced (%)
Enzyme α β
γ
______________________________________
Wild-type 39 45 17
A1 42 38 20
A2 38 45 17
A 42 38 20
A4 39 41 20
A5 42 37 20
______________________________________
__________________________________________________________________________
B1:
S145A;
5'-CTCCTGCAGCTGAGACAGACCC-3' (SEQ ID NO:38);
- B2: E146S;
- 5'-CTCCTGCATCGTCGACAGACCC-3' (SEQ ID NO:39);
- B3: T147A;
- 5'-TCAGAGGCGGATCCTACCTATGG-3' (SEQ ID NO:40);
- B4: T147L;
- 5'-TCAGAGCTCGACCCTACCTATGG-3' (SEQ ID NO:41);
- B5: D148A;
- 5'-CAGAGACGGCGCCTACCTATGGGG-3' (SEQ ID NO:42);
- B6: D89A;
- 5'-CGCAGTTTTGCCGGCTTCCAC-3' (SEQ ID NO:43);
- B7: F91aA;
- 5'-TCCACTGCCGGCGGAAGCAC-3' (SEQ ID NO:44);
- B8: F91a*;
- 5'-AGATTCTACCGGTGGAAGCAC-3' (SEQ ID NO:45);
- B9: 87-94 (IKYSG-VNN) (SEQ ID NO:11);
- 5'-TTTACGCAGTTATTAAATATTCCGGCGTTAACAACACATCCTATCATGG-3'
__________________________________________________________________________
__________________________________________________________________________
B11:
D196S;
5'-CGTAATTTATTCTCGCTAGCAGATTTAG-3' (SEQ ID NO:68);
- B12: D196A;
- 5'-CGTAATTTATTCGCGCTAGCAGATTTAG-3' (SEQ ID NO:47);
- B13: D371N;
- 5'-CAGGTAATGGTAACCCTTATAATAGAGC-3' (SBQ ID NO:48);
- B14: D371G;
- 5'-CAGGTAATGGAGGGCCTTATAATAGAGC-3' (SEQ ID NO:49); and
- B15: D371A;
- 5'-CAGGTAATGGAGCGCCTTATAATAGAGC-3' (SEQ ID NO:50).
__________________________________________________________________________
TABLE 26
______________________________________
Production, Purification and Enzyme Activities of CGTases
Enzyme activity
Enzyme Host Purification method (NU/ml)
______________________________________
Wild-type
Bacillus AfC 1513
B1 Bacillus AfC 1925
B2 Bacillus
AfC 2290
B3 Bacillus AfC 1636
B4 Bacillus
AfC 1949
B5 Bacillus AfC 1839
B6 E. coli
AfC 1908
B7 E. coli AfC 1686
B8 E. coli
AfC 1212
B9 E. coli AfC 1862
______________________________________
TABLE 27
______________________________________
Specific Activities of α-, β- and γ-CD Forming CGTases
Cyclization Activity (U/mg)
Enzyme α β γ
______________________________________
Wild-type 39 131 96
B1 150 140
B2
140 120
B3 120 84
B4
110 97
B5 120 82
B6
101 84
B7 107 80
B8
118 97
B9 131
______________________________________
63
TABLE 28
______________________________________
Ratio of Cyclodextrin Formation at Optimum CD Formation
Cyclodextrin produced (%)
Enzyme α β
γ
______________________________________
Wild-type 39 45 17
B1 35 49 16
B2 35 46 18
B3 35 48 16
B4 35 49 16
B5 35 49 16
B9 35 48 17
______________________________________
__________________________________________________________________________
C1:
N193A;
5'P-TTACCGTGCACTATTTGACTTAGC-3' (SEQ ID NO:51);
- C2: 146-150(SDQPS) (SEQ ID NO:59);
- 5'P-CTCCTGCATCATCTGATCAACCGTCCTTTGGGGAAAATGG-3' (SEQ ID
NO:52);
- C3: 145-148 (AELA) (SEQ ID NO:60);
- 5'P-CATCTCCTGCAGCAGAGCTCGCACCTACCTATGGG-3' (SEQ ID NO:53);
- C4: 145-148(AEWA) (SEQ ID NO:61);
- 5'P-CATCTCCTGCAGCAGAGTGGGCACCTACCTATGGG-3' (SEQ ID NO:54);
- C5: 87-94 (INYSG*VNN) (SEQ ID NO:62);
- 5'P-CATTTACGCAGTTATCAATTATTCCGGAGTTAACAATACATCCTATCATGG-3'
(SEQ ID NO:55);
- C6: 87-94(HP*SGY***) (SEQ ID NO:3);
- 5'P-CATTTACGCAGTTCATCCTTCCGGGTATACATCCTATCATGG-3' (SEQ ID
NO:56);
- C7: 145-148 (LETN) (SEQ ID NO:63);
- 5'P-TACATCTCCTGCACTCGAGACAAATCCTACCTATGG-3' (SEQ ID NO:57);
- C8: 87-94(HP*SGY***) (SEQ ID NO:3) + 145-148(LETN) (SEQ ID NO:63);
Both primers listed as C6 and C7 were used simultaneously;
- C9: 87-94(INYSG*VNN) (SEQ ID NO:62) + 146-150(SDQPS) (SEQ ID NO:59);
Both primers listed C2 and C5 were used simultaneously.
__________________________________________________________________________
TABLE 29
______________________________________
Production, Purification and Enzyme Activities of CGTases
Enzyme activity
Enzyme Host Purification method (NU/ml)
______________________________________
Wild-type
E. coli AfC 1513
C1 E. coli AfC 1643
______________________________________
TABLE 30
______________________________________
Specific Activities of α-, β- and γ-CD Forming CGTases
Cyclization Activity (U/mg)
Enzyme α β γ
______________________________________
Wild-type 39 131 96
C1 102 90
______________________________________
TABLE 31
______________________________________
Ratio of Cyclodextrin Formation at Optimum CD Formation
Cyclodextrin produced (%)
Enzyme α β γ
______________________________________
Wild-type 39 45 17
C1 35 49 16
______________________________________
__________________________________________________________________________
# SEQUENCE LISTING
- - - - <160> NUMBER OF SEQ ID NOS: 78
- - <210> SEQ ID NO 1
<211> LENGTH: 2133
<212> TYPE: DNA
<213> ORGANISM: Thermoanaerobacter sp.
<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (82)...(2130)
- - <400> SEQUENCE: 1
- - atgaagaaaa cgcttaaact tctgtcgatt ctgttgataa ccattgctct tc -
#ttttcagc 60
- - tcaattccat ccgtaccggc a gca ccg gat act tca gtt - #tcc aat gtt gtc
111
- # Ala Pro Asp Thr Ser Val Ser - #Asn Val Val
- # 1 - # 5 - # 10
- - aat tat tca aca gat gta atc tac cag ata gt - #c aca gac cgt ttt tta
159
Asn Tyr Ser Thr Asp Val Ile Tyr Gln Ile Va - #l Thr Asp Arg Phe Leu
15 - # 20 - # 25
- - gat ggg aat ccc agt aat aat cca aca ggc ga - #c tta tat gac cct acc
207
Asp Gly Asn Pro Ser Asn Asn Pro Thr Gly As - #p Leu Tyr Asp Pro Thr
30 - # 35 - # 40
- - cat act agt tta aag aaa tat ttt ggt ggc ga - #t tgg cag ggt att att
255
His Thr Ser Leu Lys Lys Tyr Phe Gly Gly As - #p Trp Gln Gly Ile Ile
45 - # 50 - # 55
- - aac aaa att aat gat ggt tat ctt act ggt at - #g gga att aca gct ata
303
Asn Lys Ile Asn Asp Gly Tyr Leu Thr Gly Me - #t Gly Ile Thr Ala Ile
60 - # 65 - # 70
- - tgg att tcg caa cct gta gaa aac att tac gc - #a gtt ttg cca gat tcc
351
Trp Ile Ser Gln Pro Val Glu Asn Ile Tyr Al - #a Val Leu Pro Asp Ser
75 - # 80 - # 85 - # 90
- - act ttt ggc gga agc aca tcc tat cat ggt ta - #c tgg gca cga gac ttc
399
Thr Phe Gly Gly Ser Thr Ser Tyr His Gly Ty - #r Trp Ala Arg Asp Phe
95 - # 100 - # 105
- - aaa aag aca aat ccc ttt ttt gga agc ttt ac - #a gat ttt caa aat ctc
447
Lys Lys Thr Asn Pro Phe Phe Gly Ser Phe Th - #r Asp Phe Gln Asn Leu
110 - # 115 - # 120
- - ata gca aca gct cat gct cac aat ata aaa gt - #t ata ata gac ttt gca
495
Ile Ala Thr Ala His Ala His Asn Ile Lys Va - #l Ile Ile Asp Phe Ala
125 - # 130 - # 135
- - cca aat cat aca tct cct gca tca gag aca ga - #c cct acc tat ggg gaa
543
Pro Asn His Thr Ser Pro Ala Ser Glu Thr As - #p Pro Thr Tyr Gly Glu
140 - # 145 - # 150
- - aat ggt aga tta tat gac aat gga gta tta ct - #t ggt ggt tat acc aat
591
Asn Gly Arg Leu Tyr Asp Asn Gly Val Leu Le - #u Gly Gly Tyr Thr Asn
155 1 - #60 1 - #65 1 -
#70
- - gat aca aat gga tat ttc cat cat tat gga gg - #a act aat ttt tca
tca 639
Asp Thr Asn Gly Tyr Phe His His Tyr Gly Gl - #y Thr Asn Phe Ser Ser
175 - # 180 - # 185
- - tat gaa gat gga att tac cgt aat tta ttt ga - #c tta gca gat tta gat
687
Tyr Glu Asp Gly Ile Tyr Arg Asn Leu Phe As - #p Leu Ala Asp Leu Asp
190 - # 195 - # 200
- - cag cag aat agc act att gat tca tat tta aa - #a gcg gca att aaa cta
735
Gln Gln Asn Ser Thr Ile Asp Ser Tyr Leu Ly - #s Ala Ala Ile Lys Leu
205 - # 210 - # 215
- - tgg tta gac atg ggg att gat ggt ata cgc at - #g gat gca gtc aaa cac
783
Trp Leu Asp Met Gly Ile Asp Gly Ile Arg Me - #t Asp Ala Val Lys His
220 - # 225 - # 230
- - atg gca ttt gga tgg caa aag aac ttt atg ga - #t tct att tta agt tat
831
Met Ala Phe Gly Trp Gln Lys Asn Phe Met As - #p Ser Ile Leu Ser Tyr
235 2 - #40 2 - #45 2 -
#50
- - aga cca gtt ttt aca ttt ggc gag tgg tac ct - #t gga acc aat gaa
gta 879
Arg Pro Val Phe Thr Phe Gly Glu Trp Tyr Le - #u Gly Thr Asn Glu Val
255 - # 260 - # 265
- - gat cct aat aat acg tat ttt gca aat gaa ag - #t ggt atg agc ctt ctt
927
Asp Pro Asn Asn Thr Tyr Phe Ala Asn Glu Se - #r Gly Met Ser Leu Leu
270 - # 275 - # 280
- - gat ttt aga ttt gct caa aaa gtt cgt caa gt - #a ttc aga gac aat aca
975
Asp Phe Arg Phe Ala Gln Lys Val Arg Gln Va - #l Phe Arg Asp Asn Thr
285 - # 290 - # 295
- - gac act atg tat gga ctt gat tcg atg att ca - #g tct act gca gca gat
1023
Asp Thr Met Tyr Gly Leu Asp Ser Met Ile Gl - #n Ser Thr Ala Ala Asp
300 - # 305 - # 310
- - tat aat ttc ata aat gat atg gtt aca ttt at - #a gat aat cat gac atg
1071
Tyr Asn Phe Ile Asn Asp Met Val Thr Phe Il - #e Asp Asn His Asp Met
315 3 - #20 3 - #25 3 -
#30
- - gac aga ttt tat aca gga ggc agt aca cgg cc - #t gtt gag caa gcg
tta 1119
Asp Arg Phe Tyr Thr Gly Gly Ser Thr Arg Pr - #o Val Glu Gln Ala Leu
335 - # 340 - # 345
- - gca ttt act tta act tct cgc ggt gta cct gc - #t ata tat tac ggt aca
1167
Ala Phe Thr Leu Thr Ser Arg Gly Val Pro Al - #a Ile Tyr Tyr Gly Thr
350 - # 355 - # 360
- - gag caa tat atg aca ggt aat gga gac cct ta - #t aat aga gct atg atg
1215
Glu Gln Tyr Met Thr Gly Asn Gly Asp Pro Ty - #r Asn Arg Ala Met Met
365 - # 370 - # 375
- - acg tca ttt gac acc aca acg acg gca tat aa - #t gtg ata aaa aag ctt
1263
Thr Ser Phe Asp Thr Thr Thr Thr Ala Tyr As - #n Val Ile Lys Lys Leu
380 - # 385 - # 390
- - gct cca ctg cgt aaa tct aac cct gca att gc - #t tac ggt aca caa aaa
1311
Ala Pro Leu Arg Lys Ser Asn Pro Ala Ile Al - #a Tyr Gly Thr Gln Lys
395 4 - #00 4 - #05 4 -
#10
- - cag cga tgg ata aat aat gat gtt tac att ta - #t gaa aga caa ttt
ggt 1359
Gln Arg Trp Ile Asn Asn Asp Val Tyr Ile Ty - #r Glu Arg Gln Phe Gly
415 - # 420 - # 425
- - aat aac gtt gct ctt gtt gct att aat cgt aa - #t ctt tca acg agc tat
1407
Asn Asn Val Ala Leu Val Ala Ile Asn Arg As - #n Leu Ser Thr Ser Tyr
430 - # 435 - # 440
- - tac att acc ggc ttg tac acc gca ttg cct gc - #g gga aca tat tct gac
1455
Tyr Ile Thr Gly Leu Tyr Thr Ala Leu Pro Al - #a Gly Thr Tyr Ser Asp
445 - # 450 - # 455
- - atg ctt ggc gga tta tta aat ggc agt agt at - #t aca gta tct agt aat
1503
Met Leu Gly Gly Leu Leu Asn Gly Ser Ser Il - #e Thr Val Ser Ser Asn
460 - # 465 - # 470
- - ggt tct gta aca ccg ttt acc ctt gcg cct gg - #t gaa gtt gca gta tgg
1551
Gly Ser Val Thr Pro Phe Thr Leu Ala Pro Gl - #y Glu Val Ala Val Trp
475 4 - #80 4 - #85 4 -
#90
- - cag tat gtc agt aca act aat cct cca ttg at - #a gga cat gta gga
ccg 1599
Gln Tyr Val Ser Thr Thr Asn Pro Pro Leu Il - #e Gly His Val Gly Pro
495 - # 500 - # 505
- - aca atg aca aag gca ggg cag act ata acc at - #a gat gga agg gga ttt
1647
Thr Met Thr Lys Ala Gly Gln Thr Ile Thr Il - #e Asp Gly Arg Gly Phe
510 - # 515 - # 520
- - ggc aca aca gca ggt caa gta tta ttt ggg ac - #a act cct gca act att
1695
Gly Thr Thr Ala Gly Gln Val Leu Phe Gly Th - #r Thr Pro Ala Thr Ile
525 - # 530 - # 535
- - gtg tca tgg gaa gat act gaa gta aaa gta aa - #a gtt cct gct tta act
1743
Val Ser Trp Glu Asp Thr Glu Val Lys Val Ly - #s Val Pro Ala Leu Thr
540 - # 545 - # 550
- - cct gga aaa tat aac att aca tta aaa aca gc - #a tca gga gtt aca agc
1791
Pro Gly Lys Tyr Asn Ile Thr Leu Lys Thr Al - #a Ser Gly Val Thr Ser
555 5 - #60 5 - #65 5 -
#70
- - aat agc tat aac aat atc aat gtt tta acg gg - #a aat cag gta tgt
gtt 1839
Asn Ser Tyr Asn Asn Ile Asn Val Leu Thr Gl - #y Asn Gln Val Cys Val
575 - # 580 - # 585
- - aga ttt gta gta aat aat gct aca acc gtg tg - #g gga gaa aat gta tat
1887
Arg Phe Val Val Asn Asn Ala Thr Thr Val Tr - #p Gly Glu Asn Val Tyr
590 - # 595 - # 600
- - ctt acg ggc aat gta gct gaa ctt ggc aac tg - #g gat aca tcg aag gca
1935
Leu Thr Gly Asn Val Ala Glu Leu Gly Asn Tr - #p Asp Thr Ser Lys Ala
605 - # 610 - # 615
- - ata gga cca atg ttt aac cag gtt gtg tat ca - #a tat cct acg tgg tat
1983
Ile Gly Pro Met Phe Asn Gln Val Val Tyr Gl - #n Tyr Pro Thr Trp Tyr
620 - # 625 - # 630
- - tac gat gta agt gtg cct gct ggt act act at - #a gag ttt aag ttt ata
2031
Tyr Asp Val Ser Val Pro Ala Gly Thr Thr Il - #e Glu Phe Lys Phe Ile
635 6 - #40 6 - #45 6 -
#50
- - aag aaa aat ggt agt act gta acc tgg gaa gg - #t gga tac aac cac
gta 2079
Lys Lys Asn Gly Ser Thr Val Thr Trp Glu Gl - #y Gly Tyr Asn His Val
655 - # 660 - # 665
- - tat act aca ccc act tct ggt aca gct act gt - #a att gta gac tgg caa
2127
Tyr Thr Thr Pro Thr Ser Gly Thr Ala Thr Va - #l Ile Val Asp Trp Gln
670 - # 675 - # 680
- - ccg tga - # - # -
# 2133
Pro
- - - - <210> SEQ ID NO 2
<211> LENGTH: 683
<212> TYPE: PRT
<213> ORGANISM: Thermoanaerobacter sp.
- - <400> SEQUENCE: 2
- - Ala Pro Asp Thr Ser Val Ser Asn Val Val As - #n Tyr Ser Thr Asp
Val
1 5 - # 10 - # 15
- - Ile Tyr Gln Ile Val Thr Asp Arg Phe Leu As - #p Gly Asn Pro Ser Asn
20 - # 25 - # 30
- - Asn Pro Thr Gly Asp Leu Tyr Asp Pro Thr Hi - #s Thr Ser Leu Lys Lys
35 - # 40 - # 45
- - Tyr Phe Gly Gly Asp Trp Gln Gly Ile Ile As - #n Lys Ile Asn Asp Gly
50 - # 55 - # 60
- - Tyr Leu Thr Gly Met Gly Ile Thr Ala Ile Tr - #p Ile Ser Gln Pro Val
65 - #70 - #75 - #80
- - Glu Asn Ile Tyr Ala Val Leu Pro Asp Ser Th - #r Phe Gly Gly Ser Thr
85 - # 90 - # 95
- - Ser Tyr His Gly Tyr Trp Ala Arg Asp Phe Ly - #s Lys Thr Asn Pro Phe
100 - # 105 - # 110
- - Phe Gly Ser Phe Thr Asp Phe Gln Asn Leu Il - #e Ala Thr Ala His Ala
115 - # 120 - # 125
- - His Asn Ile Lys Val Ile Ile Asp Phe Ala Pr - #o Asn His Thr Ser Pro
130 - # 135 - # 140
- - Ala Ser Glu Thr Asp Pro Thr Tyr Gly Glu As - #n Gly Arg Leu Tyr Asp
145 1 - #50 1 - #55 1 -
#60
- - Asn Gly Val Leu Leu Gly Gly Tyr Thr Asn As - #p Thr Asn Gly Tyr
Phe
165 - # 170 - # 175
- - His His Tyr Gly Gly Thr Asn Phe Ser Ser Ty - #r Glu Asp Gly Ile Tyr
180 - # 185 - # 190
- - Arg Asn Leu Phe Asp Leu Ala Asp Leu Asp Gl - #n Gln Asn Ser Thr Ile
195 - # 200 - # 205
- - Asp Ser Tyr Leu Lys Ala Ala Ile Lys Leu Tr - #p Leu Asp Met Gly Ile
210 - # 215 - # 220
- - Asp Gly Ile Arg Met Asp Ala Val Lys His Me - #t Ala Phe Gly Trp Gln
225 2 - #30 2 - #35 2 -
#40
- - Lys Asn Phe Met Asp Ser Ile Leu Ser Tyr Ar - #g Pro Val Phe Thr
Phe
245 - # 250 - # 255
- - Gly Glu Trp Tyr Leu Gly Thr Asn Glu Val As - #p Pro Asn Asn Thr Tyr
260 - # 265 - # 270
- - Phe Ala Asn Glu Ser Gly Met Ser Leu Leu As - #p Phe Arg Phe Ala Gln
275 - # 280 - # 285
- - Lys Val Arg Gln Val Phe Arg Asp Asn Thr As - #p Thr Met Tyr Gly Leu
290 - # 295 - # 300
- - Asp Ser Met Ile Gln Ser Thr Ala Ala Asp Ty - #r Asn Phe Ile Asn Asp
305 3 - #10 3 - #15 3 -
#20
- - Met Val Thr Phe Ile Asp Asn His Asp Met As - #p Arg Phe Tyr Thr
Gly
325 - # 330 - # 335
- - Gly Ser Thr Arg Pro Val Glu Gln Ala Leu Al - #a Phe Thr Leu Thr Ser
340 - # 345 - # 350
- - Arg Gly Val Pro Ala Ile Tyr Tyr Gly Thr Gl - #u Gln Tyr Met Thr Gly
355 - # 360 - # 365
- - Asn Gly Asp Pro Tyr Asn Arg Ala Met Met Th - #r Ser Phe Asp Thr Thr
370 - # 375 - # 380
- - Thr Thr Ala Tyr Asn Val Ile Lys Lys Leu Al - #a Pro Leu Arg Lys Ser
385 3 - #90 3 - #95 4 -
#00
- - Asn Pro Ala Ile Ala Tyr Gly Thr Gln Lys Gl - #n Arg Trp Ile Asn
Asn
405 - # 410 - # 415
- - Asp Val Tyr Ile Tyr Glu Arg Gln Phe Gly As - #n Asn Val Ala Leu Val
420 - # 425 - # 430
- - Ala Ile Asn Arg Asn Leu Ser Thr Ser Tyr Ty - #r Ile Thr Gly Leu Tyr
435 - # 440 - # 445
- - Thr Ala Leu Pro Ala Gly Thr Tyr Ser Asp Me - #t Leu Gly Gly Leu Leu
450 - # 455 - # 460
- - Asn Gly Ser Ser Ile Thr Val Ser Ser Asn Gl - #y Ser Val Thr Pro Phe
465 4 - #70 4 - #75 4 -
#80
- - Thr Leu Ala Pro Gly Glu Val Ala Val Trp Gl - #n Tyr Val Ser Thr
Thr
485 - # 490 - # 495
- - Asn Pro Pro Leu Ile Gly His Val Gly Pro Th - #r Met Thr Lys Ala Gly
500 - # 505 - # 510
- - Gln Thr Ile Thr Ile Asp Gly Arg Gly Phe Gl - #y Thr Thr Ala Gly Gln
515 - # 520 - # 525
- - Val Leu Phe Gly Thr Thr Pro Ala Thr Ile Va - #l Ser Trp Glu Asp Thr
530 - # 535 - # 540
- - Glu Val Lys Val Lys Val Pro Ala Leu Thr Pr - #o Gly Lys Tyr Asn Ile
545 5 - #50 5 - #55 5 -
#60
- - Thr Leu Lys Thr Ala Ser Gly Val Thr Ser As - #n Ser Tyr Asn Asn
Ile
565 - # 570 - # 575
- - Asn Val Leu Thr Gly Asn Gln Val Cys Val Ar - #g Phe Val Val Asn Asn
580 - # 585 - # 590
- - Ala Thr Thr Val Trp Gly Glu Asn Val Tyr Le - #u Thr Gly Asn Val Ala
595 - # 600 - # 605
- - Glu Leu Gly Asn Trp Asp Thr Ser Lys Ala Il - #e Gly Pro Met Phe Asn
610 - # 615 - # 620
- - Gln Val Val Tyr Gln Tyr Pro Thr Trp Tyr Ty - #r Asp Val Ser Val Pro
625 6 - #30 6 - #35 6 -
#40
- - Ala Gly Thr Thr Ile Glu Phe Lys Phe Ile Ly - #s Lys Asn Gly Ser
Thr
645 - # 650 - # 655
- - Val Thr Trp Glu Gly Gly Tyr Asn His Val Ty - #r Thr Thr Pro Thr Ser
660 - # 665 - # 670
- - Gly Thr Ala Thr Val Ile Val Asp Trp Gln Pr - #o
675 - # 680
- - - - <210> SEQ ID NO 3
<211> LENGTH: 5
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (87)...(94)
- - <400> SEQUENCE: 3
- - His Pro Ser Gly Tyr
1 5
- - - - <210> SEQ ID NO 4
<211> LENGTH: 9
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(151)
- - <400> SEQUENCE: 4
- - Pro Ala Leu Glu Thr Asn Pro Asn Phe
1 5
- - - - <210> SEQ ID NO 5
<211> LENGTH: 9
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(151)
- - <400> SEQUENCE: 5
- - Pro Ala Ala Glu Thr Trp Pro Ala Phe
1 5
- - - - <210> SEQ ID NO 6
<211> LENGTH: 9
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(151)
- - <400> SEQUENCE: 6
- - Pro Ala Ala Glu Ala Asp Pro Asn Phe
1 5
- - - - <210> SEQ ID NO 7
<211> LENGTH: 4
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(148)
- - <400> SEQUENCE: 7
- - Gly Arg Ala Ala
1
- - - - <210> SEQ ID NO 8
<211> LENGTH: 6
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(148)
- - <400> SEQUENCE: 8
- - Gly Arg Ala Ala Ala Ala
1 5
- - - - <210> SEQ ID NO 9
<211> LENGTH: 6
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(148)
- - <400> SEQUENCE: 9
- - Gly Arg Ala Pro Ala Ala
1 5
- - - - <210> SEQ ID NO 10
<211> LENGTH: 6
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(148)
- - <400> SEQUENCE: 10
- - Gly Arg Gly Pro Ala Ala
1 5
- - - - <210> SEQ ID NO 11
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (87)...(94)
- - <400> SEQUENCE: 11
- - Ile Lys Tyr Ser Gly Val Asn Asn
1 5
- - - - <210> SEQ ID NO 12
<211> LENGTH: 9
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(151)
- - <400> SEQUENCE: 12
- - Gly Arg Ala Gly Thr Asn Pro Gly Phe
1 5
- - - - <210> SEQ ID NO 13
<211> LENGTH: 25
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 13
- - ggtcgtttac caggcgccga actgg - # - #
25
- - - - <210> SEQ ID NO 14
<211> LENGTH: 23
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 14
- - gcgagctcgg gaacgcggac ccg - # - #
23
- - - - <210> SEQ ID NO 15
<211> LENGTH: 20
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Nucleotide
- - <400> SEQUENCE: 15
- - ccgtcaccgc ggaaggcggc - # - #
- # 20
- - - - <210> SEQ ID NO 16
<211> LENGTH: 27
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 16
- - gcatctacaa gggcctgtac gatctcg - # - #
27
- - - - <210> SEQ ID NO 17
<211> LENGTH: 26
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 17
- - gcatcatcaa tggatccggc gtaaac - # - #
26
- - - - <210> SEQ ID NO 18
<211> LENGTH: 36
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 18
- - catacgtcgc ccgctagcat ttccgaccag ccttcc - # -
# 36
- - - - <210> SEQ ID NO 19
<211> LENGTH: 25
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 19
- - cgggcgggac cggtccggac aaccg - # - #
25
- - - - <210> SEQ ID NO 20
<211> LENGTH: 26
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 20
- - gtcgggcggt accaatccgg acaacc - # - #
26
- - - - <210> SEQ ID NO 21
<211> LENGTH: 24
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 21
- - cgttcatcga tcagcatgac atgg - # - #
24
- - - - <210> SEQ ID NO 22
<211> LENGTH: 24
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 22
- - gcccgcctct ccggaccagc cttc - # - #
24
- - - - <210> SEQ ID NO 23
<211> LENGTH: 52
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 23
- - aatcatacat ctggacgagc aggtaccaac ccgactttgg ggaaaatggt ac - #
52
- - - - <210> SEQ ID NO 24
<211> LENGTH: 35
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 24
- - atcatacatc cggacgatgg gagacagacc ctacc - # -
# 35
- - - - <210> SEQ ID NO 25
<211> LENGTH: 51
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 25
- - catttacgca gttatcaatg attccggagt taacaataca tcctatcatg g - #
51
- - - - <210> SEQ ID NO 26
<211> LENGTH: 45
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 26
- - caaatcatac atctggacga ggaccggccg cacctaccta tgggg - #
- #45
- - - - <210> SEQ ID NO 27
<211> LENGTH: 45
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 27
- - caaatcatac atctggacga gcaccggccg cacctaccta tgggg - #
- #45
- - - - <210> SEQ ID NO 28
<211> LENGTH: 39
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 28
- - caaatcatac atctggacga gcagcaccta cctatgggg - #
- # 39
- - - - <210> SEQ ID NO 29
<211> LENGTH: 45
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 29
- - caaatcatac atctggacga cctgcagcag ctcctaccta tgggg - #
- #45
- - - - <210> SEQ ID NO 30
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 30
- - ccatcattac ggatccacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 31
<211> LENGTH: 29
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 31
- - catacatctc ctcgatcgga gacagaccc - # - #
29
- - - - <210> SEQ ID NO 32
<211> LENGTH: 29
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 32
- - catacatctg ctcgatcgga gacagaccc - # - #
29
- - - - <210> SEQ ID NO 33
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 33
- - ccatcattac ggaaacacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 34
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 34
- - ccatcattac ggagacacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 35
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 35
- - ccatcattat aatggaacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 36
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 36
- - ccatcattat agtggaacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 37
<211> LENGTH: 30
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 37
- - ccatcattat gatggaacta atttttcatc - # - #
30
- - - - <210> SEQ ID NO 38
<211> LENGTH: 22
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 38
- - ctcctgcagc tgagacagac cc - # - #
22
- - - - <210> SEQ ID NO 39
<211> LENGTH: 22
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 39
- - ctcctgcatc gtcgacagac cc - # - #
22
- - - - <210> SEQ ID NO 40
<211> LENGTH: 23
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 40
- - tcagaggcgg atcctaccta tgg - # - #
23
- - - - <210> SEQ ID NO 41
<211> LENGTH: 23
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 41
- - tcagagctcg accctaccta tgg - # - #
23
- - - - <210> SEQ ID NO 42
<211> LENGTH: 24
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 42
- - cagagacggc gcctacctat gggg - # - #
24
- - - - <210> SEQ ID NO 43
<211> LENGTH: 21
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 43
- - cgcagttttg ccggcttcca c - # - #
- #21
- - - - <210> SEQ ID NO 44
<211> LENGTH: 20
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 44
- - tccactgccg gcggaagcac - # - #
- #20
- - - - <210> SEQ ID NO 45
<211> LENGTH: 21
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 45
- - agattctacc ggtggaagca c - # - #
21
- - - - <210> SEQ ID NO 46
<211> LENGTH: 49
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 46
- - tttacgcagt tattaaatat tccggcgtta acaacacatc ctatcatgg - #
49
- - - - <210> SEQ ID NO 47
<211> LENGTH: 28
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 47
- - cgtaatttat tcgcgctagc agatttag - # - #
28
- - - - <210> SEQ ID NO 48
<211> LENGTH: 28
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 48
- - caggtaatgg taacccttat aatagagc - # - #
28
- - - - <210> SEQ ID NO 49
<211> LENGTH: 28
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 49
- - caggtaatgg agggccttat aatagagc - # - #
28
- - - - <210> SEQ ID NO 50
<211> LENGTH: 28
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 50
- - caggtaatgg agcgccttat aatagagc - # - #
28
- - - - <210> SEQ ID NO 51
<211> LENGTH: 24
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 51
- - ttaccgtgca ctatttgact tagc - # - #
24
- - - - <210> SEQ ID NO 52
<211> LENGTH: 40
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 52
- - ctcctgcatc atctgatcaa ccgtcctttg gggaaaatgg - #
- # 40
- - - - <210> SEQ ID NO 53
<211> LENGTH: 35
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 53
- - catctcctgc agcagagctc gcacctacct atggg - # -
# 35
- - - - <210> SEQ ID NO 54
<211> LENGTH: 35
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 54
- - catctcctgc agcagagtgg gcacctacct atggg - # -
# 35
- - - - <210> SEQ ID NO 55
<211> LENGTH: 51
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 55
- - catttacgca gttatcaatt attccggagt taacaataca tcctatcatg g - #
51
- - - - <210> SEQ ID NO 56
<211> LENGTH: 42
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 56
- - catttacgca gttcatcctt ccgggtatac atcctatcat gg - #
- # 42
- - - - <210> SEQ ID NO 57
<211> LENGTH: 36
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 57
- - tacatctcct gcactcgaga caaatcctac ctatgg - # -
# 36
- - - - <210> SEQ ID NO 58
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (87)...(94)
- - <400> SEQUENCE: 58
- - Ile Asn Asp Ser Gly Val Asn Asn
1 5
- - - - <210> SEQ ID NO 59
<211> LENGTH: 5
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (146)...(150)
- - <400> SEQUENCE: 59
- - Ser Asp Gln Pro Ser
1 5
- - - - <210> SEQ ID NO 60
<211> LENGTH: 4
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (145)...(148)
- - <400> SEQUENCE: 60
- - Ala Glu Leu Ala
1
- - - - <210> SEQ ID NO 61
<211> LENGTH: 4
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (145)...(148)
- - <400> SEQUENCE: 61
- - Ala Glu Trp Ala
1
- - - - <210> SEQ ID NO 62
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (87)...(94)
- - <400> SEQUENCE: 62
- - Ile Asn Tyr Ser Gly Val Asn Asn
1 5
- - - - <210> SEQ ID NO 63
<211> LENGTH: 4
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (145)...(148)
- - <400> SEQUENCE: 63
- - Leu Glu Thr Asn
1
- - - - <210> SEQ ID NO 64
<211> LENGTH: 6
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (143)...(148)
- - <400> SEQUENCE: 64
- - Gly Arg Pro Ala Ala Ala
1 5
- - - - <210> SEQ ID NO 65
<211> LENGTH: 34
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 65
- - gcatcatcaa tgattccgga gtaaacaaca cggc - # -
# 34
- - - - <210> SEQ ID NO 66
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: Unknown
<220> FEATURE:
<221> NAME/KEY: VARIANT
<222> LOCATION: (87)...(94)
- - <400> SEQUENCE: 66
- - Ile Lys Asp Ser Gly Val Asn Asn
1 5
- - - - <210> SEQ ID NO 67
<211> LENGTH: 26
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 67
- - ttaccgtaat ttatatgact tagcag - # - #
26
- - - - <210> SEQ ID NO 68
<211> LENGTH: 28
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 68
- - cgtaatttat tctcgctagc agatttag - # - #
28
- - - - <210> SEQ ID NO 69
<211> LENGTH: 33
<212> TYPE: DNA
<213> ORGANISM: Artificial sequence
<220> FEATURE:
<223> OTHER INFORMATION: nucleotide
- - <400> SEQUENCE: 69
- - cactgttcct tcgaacgcgt aaccttaaat acc - # - #
33
- - - - <210> SEQ ID NO 70
<211> LENGTH: 686
<212> TYPE: PRT
<213> ORGANISM: Bacillus circulans
- - <400> SEQUENCE: 70
- - Ala Pro Asp Thr Ser Val Ser Asn Lys Gln As - #n Phe Ser Thr Asp Val
1 5 - # 10 - # 15
- - Ile Tyr Gln Ile Phe Thr Asp Arg Phe Ser As - #p Gly Asn Pro Ala Asn
20 - # 25 - # 30
- - Asn Pro Thr Gly Ala Ala Phe Asp Gly Thr Cy - #s Thr Asn Leu Arg Leu
35 - # 40 - # 45
- - Tyr Cys Gly Gly Asp Trp Gln Gly Ile Ile As - #n Lys Ile Asn Asp Gly
50 - # 55 - # 60
- - Tyr Leu Thr Gly Met Gly Val Thr Ala Ile Tr - #p Ile Ser Gln Pro Val
65 - #70 - #75 - #80
- - Glu Asn Ile Tyr Ser Ile Ile Asn Tyr Ser Gl - #y Val Asn Asn Thr Ala
85 - # 90 - # 95
- - Tyr His Gly Tyr Trp Ala Arg Asp Phe Lys Ly - #s Thr Asn Pro Ala Tyr
100 - # 105 - # 110
- - Gly Thr Ile Ala Asp Phe Gln Asn Leu Ile Al - #a Ala Ala His Ala Lys
115 - # 120 - # 125
- - Asn Ile Lys Val Ile Ile Asp Phe Ala Pro As - #n His Thr Ser Pro Ala
130 - # 135 - # 140
- - Ser Ser Asp Gln Pro Ser Phe Ala Glu Asn Gl - #y Arg Leu Tyr Asp Asn
145 1 - #50 1 - #55 1 -
#60
- - Gly Thr Leu Leu Gly Gly Tyr Thr Asn Asp Th - #r Gln Asn Leu Phe
His
165 - # 170 - # 175
- - His Asn Gly Gly Thr Asp Phe Ser Thr Thr Gl - #u Asn Gly Ile Tyr Lys
180 - # 185 - # 190
- - Asn Leu Tyr Asp Leu Ala Asp Leu Asn His As - #n Asn Ser Thr Val Asp
195 - # 200 - # 205
- - Val Tyr Leu Lys Asp Ala Ile Lys Met Trp Le - #u Asp Leu Gly Ile Asp
210 - # 215 - # 220
- - Gly Ile Arg Met Asp Ala Val Lys His Met Pr - #o Phe Gly Trp Gln Lys
225 2 - #30 2 - #35 2 -
#40
- - Ser Phe Met Ala Ala Val Asn Asn Tyr Lys Pr - #o Val Phe Thr Phe
Gly
245 - # 250 - # 255
- - Glu Trp Phe Leu Gly Val Asn Glu Val Ser Pr - #o Glu Asn His Lys Phe
260 - # 265 - # 270
- - Ala Asn Glu Ser Gly Met Ser Leu Leu Asp Ph - #e Arg Phe Ala Gln Lys
275 - # 280 - # 285
- - Val Arg Gln Val Phe Arg Asp Asn Thr Asp As - #n Met Tyr Gly Leu Lys
290 - # 295 - # 300
- - Ala Met Leu Glu Gly Ser Ala Ala Asp Tyr Al - #a Gln Val Asp Asp Gln
305 3 - #10 3 - #15 3 -
#20
- - Val Thr Phe Ile Asp Asn His Asp Met Glu Ar - #g Phe His Ala Ser
Asn
325 - # 330 - # 335
- - Ala Asn Arg Arg Lys Leu Glu Gln Ala Leu Al - #a Phe Thr Leu Thr Ser
340 - # 345 - # 350
- - Arg Gly Val Pro Ala Ile Tyr Tyr Gly Thr Gl - #u Gln Tyr Met Ser Gly
355 - # 360 - # 365
- - Gly Thr Asp Pro Asp Asn Arg Ala Arg Ile Pr - #o Ser Phe Ser Thr Ser
370 - # 375 - # 380
- - Thr Thr Ala Tyr Gln Val Ile Gln Lys Leu Al - #a Pro Leu Arg Lys Cys
385 3 - #90 3 - #95 4 -
#00
- - Asn Pro Ala Ile Ala Tyr Gly Ser Thr Gln Gl - #u Arg Trp Ile Asn
Asn
405 - # 410 - # 415
- - Asp Val Leu Ile Tyr Glu Arg Lys Phe Gly Se - #r Asn Val Ala Val Val
420 - # 425 - # 430
- - Ala Val Asn Arg Asn Leu Asn Ala Pro Ala Se - #r Ile Ser Gly Leu Val
435 - # 440 - # 445
- - Thr Ser Leu Pro Gln Gly Ser Tyr Asn Asp Va - #l Leu Gly Gly Leu Leu
450 - # 455 - # 460
- - Asn Gly Asn Thr Leu Ser Val Gly Ser Gly Gl - #y Ala Ala Ser Asn Phe
465 4 - #70 4 - #75 4 -
#80
- - Thr Leu Ala Ala Gly Gly Thr Ala Val Trp Gl - #n Tyr Thr Ala Ala
Thr
485 - # 490 - # 495
- - Ala Thr Pro Thr Ile Gly His Val Gly Pro Me - #t Met Ala Lys Pro Gly
500 - # 505 - # 510
- - Val Thr Ile Thr Ile Asp Gly Arg Gly Phe Gl - #y Ser Ser Lys Gly Thr
515 - # 520 - # 525
- - Val Tyr Phe Gly Thr Thr Ala Val Ser Gly Al - #a Asp Ile Thr Ser Trp
530 - # 535 - # 540
- - Glu Asp Thr Gln Ile Lys Val Lys Ile Pro Al - #a Val Ala Gly Gly Asn
545 5 - #50 5 - #55 5 -
#60
- - Tyr Asn Ile Lys Val Ala Asn Ala Ala Gly Th - #r Ala Ser Asn Val
Tyr
565 - # 570 - # 575
- - Asp Asn Phe Glu Val Leu Ser Gly Asp Gln Va - #l Ser Val Arg Phe Val
580 - # 585 - # 590
- - Val Asn Asn Ala Thr Thr Ala Leu Gly Gln As - #n Val Tyr Leu Thr Gly
595 - # 600 - # 605
- - Ser Val Ser Glu Leu Gly Asn Trp Asp Pro Al - #a Lys Ala Ile Gly Pro
610 - # 615 - # 620
- - Met Tyr Asn Gln Val Val Tyr Gln Tyr Pro As - #n Trp Tyr Tyr Asp Val
625 6 - #30 6 - #35 6 -
#40
- - Ser Val Pro Ala Gly Lys Thr Ile Glu Phe Ly - #s Phe Leu Lys Lys
Gln
645 - # 650 - # 655
- - Gly Ser Thr Val Thr Trp Glu Gly Gly Ser As - #n His Thr Phe Thr Ala
660 - # 665 - # 670
- - Pro Ser Ser Gly Thr Ala Thr Ile Asn Val As - #n Trp Gln Pro
675 - # 680 - # 685
- - - - <210> SEQ ID NO 71
<211> LENGTH: 676
<212> TYPE: PRT
<213> ORGANISM: Bacillus sp.
- - <400> SEQUENCE: 71
- - Glu Ala Asp Val Thr Asn Lys Val Asn Tyr Se - #r Lys Asp Val Ile Tyr
1 5 - # 10 - # 15
- - Gln Ile Val Thr Asp Arg Phe Ser Asp Gly As - #n Pro Gly Asn Asn Pro
20 - # 25 - # 30
- - Ser Gly Ala Ile Phe Ser Gln Asn Cys Ile As - #p Leu His Lys Tyr Cys
35 - # 40 - # 45
- - Gly Gly Asp Trp Gln Gly Ile Ile Asp Lys Il - #e Asn Asp Gly Tyr Leu
50 - # 55 - # 60
- - Thr Asp Leu Gly Ile Thr Ala Leu Trp Ile Se - #r Gln Pro Val Glu Asn
65 - #70 - #75 - #80
- - Val Tyr Ala Leu His Pro Ser Gly Tyr Thr Se - #r Tyr His Gly Tyr Trp
85 - # 90 - # 95
- - Ala Arg Asp Tyr Lys Lys Thr Asn Pro Tyr Ty - #r Gly Asn Phe Asp Asp
100 - # 105 - # 110
- - Phe Asp Arg Leu Met Ser Thr Ala His Ser As - #n Gly Ile Lys Val Ile
115 - # 120 - # 125
- - Met Asp Phe Thr Pro Asn His Ser Ser Pro Al - #a Leu Glu Thr Asn Pro
130 - # 135 - # 140
- - Asn Tyr Val Glu Asn Gly Ala Ile Tyr Asp As - #n Gly Ala Leu Leu Gly
145 1 - #50 1 - #55 1 -
#60
- - Asn Tyr Ser Asn Asp Gln Gln Asn Leu Phe Hi - #s His Asn Gly Gly
Thr
165 - # 170 - # 175
- - Asp Phe Ser Ser Tyr Glu Asp Ser Ile Tyr Ar - #g Asn Leu Tyr Asp Leu
180 - # 185 - # 190
- - Ala Asp Tyr Asp Leu Asn Asn Thr Val Met As - #p Gln Tyr Leu Lys Glu
195 - # 200 - # 205
- - Ser Ile Lys Phe Trp Leu Asp Lys Gly Ile As - #p Gly Ile Arg Val Asp
210 - # 215 - # 220
- - Ala Val Lys His Met Ser Glu Gly Trp Gln Th - #r Ser Leu Met Ser Glu
225 2 - #30 2 - #35 2 -
#40
- - Ile Tyr Ser His Lys Pro Val Phe Thr Phe Gl - #y Glu Trp Phe Leu
Gly
245 - # 250 - # 255
- - Ser Gly Glu Val Asp Pro Gln Asn His His Ph - #e Ala Asn Glu Ser Gly
260 - # 265 - # 270
- - Met Ser Leu Leu Asp Phe Gln Phe Gly Gln Th - #r Ile Arg Asn Val Leu
275 - # 280 - # 285
- - Lys Asp Arg Thr Ser Asn Trp Tyr Asp Phe As - #n Glu Met Ile Thr Ser
290 - # 295 - # 300
- - Thr Glu Lys Glu Tyr Asn Glu Val Ile Asp Gl - #n Val Thr Phe Ile Asp
305 3 - #10 3 - #15 3 -
#20
- - Asn His Asp Met Ser Arg Phe Ser Val Gly Se - #r Ser Ser Asn Arg
Gln
325 - # 330 - # 335
- - Thr Asp Met Ala Leu Ala Val Leu Leu Thr Se - #r Arg Gly Val Pro Thr
340 - # 345 - # 350
- - Ile Tyr Tyr Gly Thr Glu Gln Tyr Val Thr Gl - #y Gly Asn Asp Pro Glu
355 - # 360 - # 365
- - Asn Arg Lys Pro Leu Lys Thr Phe Asp Arg Se - #r Thr Asn Ser Tyr Gln
370 - # 375 - # 380
- - Ile Ile Ser Lys Leu Ala Ser Leu Arg Gln Th - #r Asn Ser Ala Leu Gly
385 3 - #90 3 - #95 4 -
#00
- - Tyr Gly Thr Thr Thr Glu Arg Trp Leu Asn Gl - #u Asp Ile Tyr Ile
Tyr
405 - # 410 - # 415
- - Glu Arg Thr Phe Gly Asn Ser Ile Val Leu Th - #r Ala Val Asn Ser Ser
420 - # 425 - # 430
- - Asn Ser Asn Gln Thr Ile Thr Asn Leu Asn Th - #r Ser Leu Pro Gln Gly
435 - # 440 - # 445
- - Asn Tyr Thr Asp Glu Leu Gln Gln Arg Leu As - #p Gly Asn Thr Ile Thr
450 - # 455 - # 460
- - Val Asn Ala Asn Gly Ala Val Asn Ser Phe Gl - #n Leu Arg Ala Asn Ser
465 4 - #70 4 - #75 4 -
#80
- - Val Ala Val Trp Gln Val Ser Asn Pro Ser Th - #r Ser Pro Leu Ile
Gly
485 - # 490 - # 495
- - Gln Val Gly Pro Met Met Gly Lys Ala Gly As - #n Thr Ile Thr Val Ser
500 - # 505 - # 510
- - Gly Glu Gly Phe Gly Asp Glu Arg Gly Ser Va - #l Leu Phe Asp Ser Thr
515 - # 520 - # 525
- - Ser Ser Glu Ile Ile Ser Trp Ser Asn Thr Ly - #s Ile Ser Val Lys Val
530 - # 535 - # 540
- - Pro Asn Val Ala Gly Gly Tyr Tyr Asp Leu Se - #r Val Val Thr Ala Ala
545 5 - #50 5 - #55 5 -
#60
- - Asn Ile Lys Ser Pro Thr Tyr Lys Glu Phe Gl - #u Val Leu Ser Gly
Asn
565 - # 570 - # 575
- - Gln Val Ser Val Arg Phe Gly Val Asn Asn Al - #a Thr Thr Ser Pro Gly
580 - # 585 - # 590
- - Thr Asn Leu Tyr Ile Val Gly Asn Val Asn Gl - #u Leu Gly Asn Trp Asp
595 - # 600 - # 605
- - Ala Asp Lys Ala Ile Gly Pro Met Phe Asn Gl - #n Val Met Tyr Gln Tyr
610 - # 615 - # 620
- - Pro Thr Trp Tyr Tyr Asp Ile Ser Val Pro Al - #a Gly Lys Asn Leu Glu
625 6 - #30 6 - #35 6 -
#40
- - Tyr Lys Tyr Ile Lys Lys Asp Gln Asn Gly As - #n Val Val Trp Gln
Ser
645 - # 650 - # 655
- - Gly Asn Asn Arg Thr Tyr Thr Ser Pro Thr Th - #r Gly Thr Asp Thr Val
660 - # 665 - # 670
- - Met Ile Asn Trp
675
- - - - <210> SEQ ID NO 72
<211> LENGTH: 685
<212> TYPE: PRT
<213> ORGANISM: Bacillus sp.
- - <400> SEQUENCE: 72
- - Ala Pro Asp Thr Ser Val Ser Asn Lys Gln As - #n Phe Ser Thr Asp Val
1 5 - # 10 - # 15
- - Ile Tyr Gln Ile Phe Thr Asp Arg Phe Ser As - #p Gly Asn Pro Ala Asn
20 - # 25 - # 30
- - Asn Pro Thr Gly Ala Ala Phe Asp Gly Ser Cy - #s Thr Asn Leu Arg Leu
35 - # 40 - # 45
- - Tyr Cys Gly Gly Asp Trp Gln Gly Ile Ile As - #n Lys Ile Asn Asp Gly
50 - # 55 - # 60
- - Tyr Leu Thr Gly Met Gly Ile Thr Ala Ile Tr - #p Ile Ser Gln Pro Val
65 - #70 - #75 - #80
- - Glu Asn Ile Tyr Ser Val Ile Asn Tyr Ser Gl - #y Val His Asn Thr Ala
85 - # 90 - # 95
- - Tyr His Gly Tyr Trp Ala Arg Asp Phe Lys Ly - #s Thr Asn Pro Ala Tyr
100 - # 105 - # 110
- - Gly Thr Met Gln Asp Phe Lys Asn Leu Ile As - #p Thr Ala His Ala His
115 - # 120 - # 125
- - Asn Ile Lys Val Ile Ile Asp Phe Ala Pro As - #n His Thr Ser Pro Ala
130 - # 135 - # 140
- - Ser Ser Asp Asp Pro Ser Phe Ala Glu Asn Gl - #y Arg Leu Tyr Asp Asn
145 1 - #50 1 - #55 1 -
#60
- - Gly Asn Leu Leu Gly Gly Tyr Thr Asn Asp Th - #r Gln Asn Leu Phe
His
165 - # 170 - # 175
- - His Tyr Gly Gly Thr Asp Phe Ser Thr Ile Gl - #u Asn Gly Ile Tyr Lys
180 - # 185 - # 190
- - Asn Leu Tyr Asp Leu Ala Asp Leu Asn His As - #n Asn Ser Ser Val Asp
195 - # 200 - # 205
- - Val Tyr Leu Lys Asp Ala Ile Lys Met Trp Le - #u Asp Leu Gly Val Asp
210 - # 215 - # 220
- - Gly Ile Arg Val Asp Ala Val Lys His Met Pr - #o Phe Gly Trp Gln Lys
225 2 - #30 2 - #35 2 -
#40
- - Ser Phe Met Ser Thr Ile Asn Asn Tyr Lys Pr - #o Val Phe Asn Phe
Gly
245 - # 250 - # 255
- - Glu Trp Phe Leu Gly Val Asn Glu Ile Ser Pr - #o Glu Tyr His Gln Phe
260 - # 265 - # 270
- - Ala Asn Glu Ser Gly Met Ser Leu Leu Asp Ph - #e Pro Phe Ala Gln Lys
275 - # 280 - # 285
- - Ala Arg Gln Val Phe Arg Asp Asn Thr Asp As - #n Met Tyr Gly Leu Lys
290 - # 295 - # 300
- - Ala Met Leu Glu Gly Ser Glu Val Asp Tyr Al - #a Gln Val Asn Asp Gln
305 3 - #10 3 - #15 3 -
#20
- - Val Thr Phe Ile Asp Asn His Asp Met Glu Ar - #g Phe His Thr Ser
Asn
325 - # 330 - # 335
- - Gly Asp Arg Arg Lys Leu Glu Gln Ala Leu Al - #a Phe Thr Leu Thr Ser
340 - # 345 - # 350
- - Arg Gly Val Pro Ala Ile Tyr Tyr Gly Ser Gl - #u Gln Tyr Met Ser Gly
355 - # 360 - # 365
- - Gly Asn Asp Pro Asp Asn Arg Ala Arg Ile Pr - #o Ser Phe Ser Thr Thr
370 - # 375 - # 380
- - Thr Thr Ala Tyr Gln Val Ile Gln Lys Leu Al - #a Pro Leu Arg Lys Ser
385 3 - #90 3 - #95 4 -
#00
- - Asn Pro Ala Ile Ala Tyr Gly Ser Thr Gln Gl - #u Arg Trp Ile Asn
Asn
405 - # 410 - # 415
- - Asp Val Ile Ile Tyr Glu Arg Lys Phe Gly As - #n Asn Val Ala Val Val
420 - # 425 - # 430
- - Ala Ile Asn Arg Asn Met Asn Thr Pro Ala Se - #r Ile Thr Gly Leu Val
435 - # 440 - # 445
- - Thr Ser Leu Pro Gln Gly Ser Tyr Asn Asp Va - #l Leu Gly Gly Ile Leu
450 - # 455 - # 460
- - Asn Gly Asn Thr Leu Thr Val Gly Ala Gly Gl - #y Ala Ala Ser Asn Phe
465 4 - #70 4 - #75 4 -
#80
- - Thr Leu Ala Pro Gly Gly Thr Ala Val Trp Gl - #n Tyr Thr Thr Asp
Ala
485 - # 490 - # 495
- - Thr Ala Pro Ile Asn Gly Asn Val Gly Pro Me - #t Met Ala Lys Ala Gly
500 - # 505 - # 510
- - Val Thr Ile Thr Ile Asp Gly Arg Ala Ser Al - #a Arg Gln Gly Thr Val
515 - # 520 - # 525
- - Tyr Phe Gly Thr Thr Ala Val Thr Gly Ala As - #p Ile Val Ala Trp Glu
530 - # 535 - # 540
- - Asp Thr Gln Ile Gln Val Lys Ile Leu Arg Va - #l Pro Gly Gly Ile Tyr
545 5 - #50 5 - #55 5 -
#60
- - Asp Ile Arg Val Ala Asn Ala Ala Gly Ala Al - #a Ser Asn Ile Tyr
Asp
565 - # 570 - # 575
- - Asn Phe Glu Val Leu Thr Gly Asp Gln Val Th - #r Val Arg Phe Val Ile
580 - # 585 - # 590
- - Asn Asn Ala Thr Thr Ala Leu Gly Gln Asn Va - #l Phe Leu Thr Gly Asn
595 - # 600 - # 605
- - Val Ser Glu Leu Gly Asn Trp Asp Pro Asn As - #n Ala Ile Gly Pro Met
610 - # 615 - # 620
- - Tyr Asn Gln Val Val Tyr Gln Tyr Pro Thr Tr - #p Tyr Tyr Asp Val Ser
625 6 - #30 6 - #35 6 -
#40
- - Val Pro Ala Gly Gln Thr Ile Glu Phe Lys Ph - #e Leu Lys Lys Gln
Gly
645 - # 650 - # 655
- - Ser Thr Val Thr Trp Glu Gly Gly Ala Asn Ar - #g Thr Phe Thr Thr Pro
660 - # 665 - # 670
- - Thr Ser Gly Thr Ala Thr Val Asn Val Asn Tr - #p Gln Pro
675 - # 680 - # 685
- - - - <210> SEQ ID NO 73
<211> LENGTH: 686
<212> TYPE: PRT
<213> ORGANISM: Bacillus sp.
- - <400> SEQUENCE: 73
- - Ala Pro Asp Thr Ser Val Ser Asn Lys Gln As - #n Phe Ser Thr Asp Val
1 5 - # 10 - # 15
- - Ile Tyr Gln Ile Phe Thr Asp Arg Phe Ser As - #p Gly Asn Pro Ala Asn
20 - # 25 - # 30
- - Asn Pro Thr Gly Ala Ala Phe Asp Gly Ser Cy - #s Thr Asn Leu Arg Leu
35 - # 40 - # 45
- - Tyr Cys Gly Gly Asp Trp Gln Gly Ile Ile As - #n Lys Ile Asn Asp Gly
50 - # 55 - # 60
- - Tyr Leu Thr Gly Met Gly Ile Thr Ala Ile Tr - #p Ile Ser Gln Pro Val
65 - #70 - #75 - #80
- - Glu Asn Ile Tyr Ser Val Ile Asn Tyr Ser Gl - #y Val Asn Asn Thr Ala
85 - # 90 - # 95
- - Tyr His Gly Tyr Trp Ala Arg Asp Phe Lys Ly - #s Thr Asn Pro Ala Tyr
100 - # 105 - # 110
- - Gly Thr Met Gln Asp Phe Lys Asn Leu Ile As - #p Thr Ala His Ala His
115 - # 120 - # 125
- - Asn Ile Lys Val Ile Ile Asp Phe Ala Pro As - #n His Thr Ser Pro Ala
130 - # 135 - # 140
- - Ser Ser Asp Asp Pro Ser Phe Ala Glu Asn Gl - #y Arg Leu Tyr Asp Asn
145 1 - #50 1 - #55 1 -
#60
- - Gly Asn Leu Leu Gly Gly Tyr Thr Asn Asp Th - #r Gln Asn Leu Phe
His
165 - # 170 - # 175
- - His Tyr Gly Gly Thr Asp Phe Ser Thr Ile Gl - #u Asn Gly Ile Tyr Lys
180 - # 185 - # 190
- - Asn Leu Tyr Asp Leu Ala Asp Leu Asn His As - #n Asn Ser Ser Val Asp
195 - # 200 - # 205
- - Val Tyr Leu Lys Asp Ala Ile Lys Met Trp Le - #u Asp Leu Gly Val Asp
210 - # 215 - # 220
- - Gly Ile Arg Val Asp Ala Val Lys His Met Pr - #o Phe Gly Trp Gln Lys
225 2 - #30 2 - #35 2 -
#40
- - Ser Phe Met Ala Thr Ile Asn Asn Tyr Lys Pr - #o Val Phe Thr Phe
Gly
245 - # 250 - # 255
- - Glu Trp Phe Leu Gly Val Asn Glu Ile Ser Pr - #o Glu Tyr His Gln Phe
260 - # 265 - # 270
- - Ala Asn Glu Ser Gly Met Ser Leu Leu Asp Ph - #e Arg Phe Ala Gln Lys
275 - # 280 - # 285
- - Ala Arg Gln Val Phe Arg Asp Asn Thr Asp As - #n Met Tyr Gly Leu Lys
290 - # 295 - # 300
- - Ala Met Leu Glu Gly Ser Glu Val Asp Tyr Al - #a Gln Val Asn Asp Gln
305 3 - #10 3 - #15 3 -
#20
- - Val Thr Phe Ile Asp Asn His Asp Met Glu Ar - #g Phe His Thr Ser
Asn
325 - # 330 - # 335
- - Gly Asp Arg Arg Lys Leu Glu Gln Ala Leu Al - #a Phe Thr Leu Thr Ser
340 - # 345 - # 350
- - Arg Gly Val Pro Ala Ile Tyr Tyr Gly Ser Gl - #u Gln Tyr Met Ser Gly
355 - # 360 - # 365
- - Gly Asn Asp Pro Asp Asn Arg Ala Arg Leu Pr - #o Ser Phe Ser Thr Thr
370 - # 375 - # 380
- - Thr Thr Ala Tyr Gln Val Ile Gln Lys Leu Al - #a Pro Leu Arg Lys Ser
385 3 - #90 3 - #95 4 -
#00
- - Asn Pro Ala Ile Ala Tyr Gly Ser Thr His Gl - #u Arg Trp Ile Asn
Asn
405 - # 410 - # 415
- - Asp Val Ile Ile Tyr Glu Arg Lys Phe Gly As - #n Asn Val Ala Val Val
420 - # 425 - # 430
- - Ala Ile Asn Arg Asn Met Asn Thr Pro Ala Se - #r Ile Thr Gly Leu Val
435 - # 440 - # 445
- - Thr Ser Leu Arg Arg Ala Ser Tyr Asn Asp Va - #l Leu Gly Gly Ile Leu
450 - # 455 - # 460
- - Asn Gly Asn Thr Leu Thr Val Gly Ala Gly Gl - #y Ala Ala Ser Asn Phe
465 4 - #70 4 - #75 4 -
#80
- - Thr Leu Ala Pro Gly Gly Thr Ala Val Trp Gl - #n Tyr Thr Thr Asp
Ala
485 - # 490 - # 495
- - Thr Thr Pro Ile Ile Gly Asn Val Gly Pro Me - #t Met Ala Lys Pro Gly
500 - # 505 - # 510
- - Val Thr Ile Thr Ile Asp Gly Arg Gly Phe Gl - #y Ser Gly Lys Gly Thr
515 - # 520 - # 525
- - Val Tyr Phe Gly Thr Thr Ala Val Thr Gly Al - #a Asp Ile Val Ala Trp
530 - # 535 - # 540
- - Glu Asp Thr Gln Ile Gln Val Lys Ile Pro Al - #a Val Pro Gly Gly Ile
545 5 - #50 5 - #55 5 -
#60
- - Tyr Asp Ile Arg Val Ala Asn Ala Ala Gly Al - #a Ala Ser Asn Ile
Tyr
565 - # 570 - # 575
- - Asp Asn Phe Glu Val Leu Thr Gly Asp Gln Va - #l Thr Val Arg Phe Val
580 - # 585 - # 590
- - Ile Asn Asn Ala Thr Thr Ala Leu Gly Gln As - #n Val Phe Leu Thr Gly
595 - # 600 - # 605
- - Asn Val Ser Glu Leu Gly Asn Trp Asp Pro As - #n Asn Ala Ile Gly Pro
610 - # 615 - # 620
- - Met Tyr Asn Gln Val Val Tyr Gln Tyr Pro Th - #r Trp Tyr Tyr Asp Val
625 6 - #30 6 - #35 6 -
#40
- - Ser Val Pro Ala Gly Gln Thr Ile Glu Phe Ly - #s Phe Leu Lys Lys
Gln
645 - # 650 - # 655
- - Gly Ser Thr Val Thr Trp Glu Gly Gly Ala As - #n Arg Thr Phe Thr Thr
660 - # 665 - # 670
- - Pro Thr Ser Gly Thr Ala Thr Val Asn Val As - #n Trp Gln Pro
675 - # 680 - # 685
- - - - <210> SEQ ID NO 74
<211> LENGTH: 685
<212> TYPE: PRT
<213> ORGANISM: Bacillus licheniformis
- - <400> SEQUENCE: 74
- - Ala Asp Ala Asp Thr Ala Val Thr Asn Lys Gl - #n Asn Phe Ser Thr Asp
1 5 - # 10 - # 15
- - Val Ile Tyr Gln Val Phe Thr Asp Arg Phe Le - #u Asp Gly Asn Pro Ser
20 - # 25 - # 30
- - Asn Asn Pro Thr Gly Ala Ala Phe Asp Gly Th - #r Cys Ser Asn Leu Lys
35 - # 40 - # 45
- - Leu Tyr Cys Gly Gly Asp Trp Gln Gly Leu Va - #l Asn Lys Ile Asn Asp
50 - # 55 - # 60
- - Asn Tyr Phe Ser Asp Leu Gly Val Thr Ala Le - #u Trp Ile Ser Gln Pro
65 - #70 - #75 - #80
- - Val Glu Asn Ile Phe Ala Thr Ile Asn Tyr Se - #r Gly Val Thr Asn Thr
85 - # 90 - # 95
- - Ala Tyr His Gly Tyr Trp Ala Arg Asp Phe Ly - #s Lys Thr Asn Pro Tyr
100 - # 105 - # 110
- - Phe Gly Thr Met Thr Asp Phe Gln Asn Leu Va - #l Thr Thr Ala His Ala
115 - # 120 - # 125
- - Lys Gly Ile Lys Ile Ile Ile Asp Phe Ala Pr - #o Asn His Thr Ser Pro
130 - # 135 - # 140
- - Ala Met Glu Thr Asp Thr Ser Phe Ala Glu As - #n Gly Lys Leu Tyr Asp
145 1 - #50 1 - #55 1 -
#60
- - Asn Gly Asn Leu Val Gly Gly Tyr Thr Asn As - #p Thr Asn Gly Tyr
Phe
165 - # 170 - # 175
- - His His Asn Gly Gly Ser Asp Phe Ser Thr Le - #u Glu Asn Gly Ile Tyr
180 - # 185 - # 190
- - Lys Asn Leu Tyr Asp Leu Ala Asp Leu Asn Hi - #s Asn Asn Ser Thr Ile
195 - # 200 - # 205
- - Asp Thr Tyr Phe Lys Asp Ala Ile Lys Leu Tr - #p Leu Asp Met Gly Val
210 - # 215 - # 220
- - Asp Gly Ile Arg Val Asp Ala Val Lys His Me - #t Pro Gln Gly Trp Gln
225 2 - #30 2 - #35 2 -
#40
- - Lys Asn Trp Met Ser Ser Ile Tyr Ala His Ly - #s Pro Val Phe Thr
Phe
245 - # 250 - # 255
- - Gly Glu Trp Phe Leu Gly Ser Ala Ala Pro As - #p Ala Asp Asn Thr Asp
260 - # 265 - # 270
- - Phe Ala Asn Glu Ser Gly Met Ser Leu Leu As - #p Phe Arg Phe Asn Ser
275 - # 280 - # 285
- - Ala Val Arg Asn Val Phe Arg Asp Asn Thr Se - #r Asn Met Tyr Ala Leu
290 - # 295 - # 300
- - Asp Ser Met Leu Thr Ala Thr Ala Ala Asp Ty - #r Asn Gln Val Asn Asp
305 3 - #10 3 - #15 3 -
#20
- - Gln Val Thr Phe Ile Asp Asn His Asp Met As - #p Arg Phe Lys Thr
Ser
325 - # 330 - # 335
- - Ala Val Asn Asn Arg Arg Leu Glu Gln Ala Le - #u Ala Phe Thr Leu Thr
340 - # 345 - # 350
- - Ser Arg Gly Val Pro Ala Ile Tyr Tyr Gly Th - #r Glu Gln Tyr Leu Thr
355 - # 360 - # 365
- - Gly Asn Gly Asp Pro Asp Asn Arg Gly Lys Me - #t Pro Ser Phe Ser Lys
370 - # 375 - # 380
- - Ser Thr Thr Ala Phe Asn Val Ile Ser Lys Le - #u Ala Pro Leu Arg Lys
385 3 - #90 3 - #95 4 -
#00
- - Ser Asn Pro Ala Ile Ala Tyr Gly Ser Thr Gl - #n Gln Arg Trp Ile
Asn
405 - # 410 - # 415
- - Asn Asp Val Tyr Ile Tyr Glu Arg Lys Phe Gl - #y Lys Ser Val Ala Val
420 - # 425 - # 430
- - Val Ala Val Asn Arg Asn Leu Thr Thr Pro Th - #r Ser Ile Thr Asn Leu
435 - # 440 - # 445
- - Asn Thr Ser Leu Pro Ser Gly Thr Tyr Thr As - #p Val Leu Gly Gly Val
450 - # 455 - # 460
- - Leu Asn Gly Asn Asn Ile Thr Ser Ser Gly Gl - #y Asn Ile Ser Ser Phe
465 4 - #70 4 - #75 4 -
#80
- - Thr Leu Ala Ala Gly Ala Thr Ala Val Trp Gl - #n Tyr Thr Ala Ser
Glu
485 - # 490 - # 495
- - Thr Thr Pro Thr Ile Gly His Val Gly Pro Va - #l Met Gly Lys Pro Gly
500 - # 505 - # 510
- - Asn Val Val Thr Ile Asp Gly Arg Gly Phe Gl - #y Ser Ala Lys Gly Thr
515 - # 520 - # 525
- - Val Tyr Phe Gly Thr Thr Ala Val Thr Gly Se - #r Ala Ile Thr Ser Trp
530 - # 535 - # 540
- - Glu Asp Thr Gln Ile Lys Val Thr Ile Pro Pr - #o Val Ala Gly Gly Asp
545 5 - #50 5 - #55 5 -
#60
- - Tyr Ala Val Lys Val Ala Ala Asn Gly Val As - #n Ser Asn Ala Tyr
Asn
565 - # 570 - # 575
- - Asp Phe Thr Ile Leu Ser Gly Asp Gln Val Se - #r Val Arg Phe Val Ile
580 - # 585 - # 590
- - Asn Asn Ala Thr Thr Ala Leu Gly Glu Asn Il - #e Tyr Leu Thr Gly Asn
595 - # 600 - # 605
- - Val Ser Glu Leu Gly Asn Trp Thr Thr Gly Al - #a Ala Ser Ile Gly Pro
610 - # 615 - # 620
- - Ala Phe Asn Gln Val Ile His Ala Tyr Pro Th - #r Trp Tyr Tyr Asp Val
625 6 - #30 6 - #35 6 -
#40
- - Ser Val Pro Ala Gly Lys Gln Leu Glu Phe Ly - #s Phe Phe Lys Lys
Asn
645 - # 650 - # 655
- - Gly Ala Thr Ile Thr Trp Glu Gly Gly Ser As - #n His Thr Phe Thr Thr
660 - # 665 - # 670
- - Pro Thr Ser Gly Thr Ala Thr Val Thr Ile As - #n Trp Gln
675 - # 680 - # 685
- - - - <210> SEQ ID NO 75
<211> LENGTH: 687
<212> TYPE: PRT
<213> ORGANISM: Bacillus macerans
- - <400> SEQUENCE: 75
- - Ser Pro Asp Thr Ser Val Asp Asn Lys Val As - #n Phe Ser Thr Asp Val
1 5 - # 10 - # 15
- - Ile Tyr Gln Ile Val Thr Asp Arg Phe Ala As - #p Gly Asp Arg Thr Asn
20 - # 25 - # 30
- - Asn Pro Ala Gly Asp Ala Phe Ser Gly Asp Ar - #g Ser Asn Leu Lys Leu
35 - # 40 - # 45
- - Tyr Phe Gly Gly Asp Trp Gln Gly Ile Ile As - #p Lys Ile Asn Asp Gly
50 - # 55 - # 60
- - Tyr Leu Thr Gly Met Gly Val Thr Ala Leu Tr - #p Ile Ser Gln Pro Val
65 - #70 - #75 - #80
- - Glu Asn Ile Thr Ser Val Ile Lys Tyr Ser Gl - #y Val Asn Asn Thr Ser
85 - # 90 - # 95
- - Tyr His Gly Tyr Trp Ala Arg Asp Phe Lys Gl - #n Thr Asn Asp Ala Phe
100 - # 105 - # 110
- - Gly Asp Phe Ala Asp Phe Gln Asn Leu Ile As - #p Thr Ala His Ala His
115 - # 120 - # 125
- - Asn Ile Lys Val Val Ile Asp Phe Ala Pro As - #n His Thr Ser Pro Ala
130 - # 135 - # 140
- - Asp Arg Asp Asn Pro Gly Phe Ala Glu Asn Gl - #y Gly Met Tyr Asp Asn
145 1 - #50 1 - #55 1 -
#60
- - Gly Ser Leu Leu Gly Ala Tyr Ser Asn Asp Th - #r Ala Gly Leu Phe
His
165 - # 170 - # 175
- - His Asn Gly Gly Thr Asp Phe Ser Thr Ile Gl - #u Asp Gly Ile Tyr Lys
180 - # 185 - # 190
- - Asn Leu Tyr Asp Leu Ala Asp Ile Asn His As - #n Asn Asn Ala Met Asp
195 - # 200 - # 205
- - Ala Tyr Phe Lys Ser Ala Ile Asp Leu Trp Le - #u Gly Met Gly Val Asp
210 - # 215 - # 220
- - Gly Ile Arg Phe Asp Ala Val Lys His Met Pr - #o Phe Gly Trp Gln Lys
225 2 - #30 2 - #35 2 -
#40
- - Ser Phe Val Ser Ser Ile Tyr Gly Gly Asp Hi - #s Pro Val Phe Thr
Phe
245 - # 250 - # 255
- - Gly Glu Trp Tyr Leu Gly Ala Asp Gln Thr As - #p Gly Asp Asn Ile Lys
260 - # 265 - # 270
- - Phe Ala Asn Glu Ser Gly Met Asn Leu Leu As - #p Phe Glu Tyr Ala Gln
275 - # 280 - # 285
- - Glu Val Arg Glu Val Phe Arg Asp Lys Thr Gl - #u Thr Met Lys Asp Leu
290 - # 295 - # 300
- - Tyr Glu Val Leu Ala Ser Thr Glu Ser Gln Ty - #r Asp Tyr Ile Asn Asn
305 3 - #10 3 - #15 3 -
#20
- - Met Val Thr Phe Ile Asp Asn His Asp Met As - #p Arg Phe Gln Val
Ala
325 - # 330 - # 335
- - Gly Ser Gly Thr Arg Ala Thr Glu Gln Ala Le - #u Ala Leu Thr Leu Thr
340 - # 345 - # 350
- - Ser Arg Gly Val Pro Ala Ile Tyr Tyr Gly Th - #r Glu Gln Tyr Met Thr
355 - # 360 - # 365
- - Gly Asp Gly Asp Pro Asn Asn Arg Ala Met Me - #t Thr Ser Phe Asn Thr
370 - # 375 - # 380
- - Gly Thr Thr Ala Tyr Lys Val Ile Gln Ala Le - #u Ala Pro Leu Arg Lys
385 3 - #90 3 - #95 4 -
#00
- - Ser Asn Pro Ala Ile Ala Tyr Gly Thr Thr Th - #r Glu Arg Trp Val
Asn
405 - # 410 - # 415
- - Asn Asp Val Leu Ile Ile Glu Arg Lys Phe Gl - #y Ser Ser Ala Ala Leu
420 - # 425 - # 430
- - Val Ala Ile Asn Arg Asn Ser Ser Ala Ala Ty - #r Pro Ile Ser Gly Leu
435 - # 440 - # 445
- - Leu Ser Ser Leu Pro Ala Gly Thr Tyr Ser As - #p Val Leu Asn Gly Leu
450 - # 455 - # 460
- - Leu Asn Gly Asn Ser Ile Thr Val Gly Ser Gl - #y Gly Ala Val Thr Asn
465 4 - #70 4 - #75 4 -
#80
- - Phe Thr Leu Ala Ala Gly Gly Thr Ala Val Tr - #p Gln Tyr Thr Ala
Pro
485 - # 490 - # 495
- - Glu Thr Ser Pro Ala Ile Gly Asn Val Gly Pr - #o Thr Met Gly Gln Pro
500 - # 505 - # 510
- - Gly Asn Ile Val Thr Ile Asp Gly Arg Gly Ph - #e Gly Gly Thr Ala Gly
515 - # 520 - # 525
- - Thr Val Tyr Phe Gly Thr Thr Ala Val Thr Gl - #y Ser Gly Ile Val Ser
530 - # 535 - # 540
- - Trp Glu Asp Thr Gln Ile Lys Ala Val Ile Pr - #o Lys Val Ala Ala Gly
545 5 - #50 5 - #55 5 -
#60
- - Lys Thr Gly Val Ser Val Lys Thr Ser Ser Gl - #y Thr Ala Ser Asn
Thr
565 - # 570 - # 575
- - Phe Lys Ser Phe Asn Val Leu Thr Gly Asp Gl - #n Val Thr Val Arg Phe
580 - # 585 - # 590
- - Leu Val Asn Gln Ala Asn Thr Asn Tyr Gly Th - #r Asn Val Tyr Leu Val
595 - # 600 - # 605
- - Gly Asn Ala Ala Glu Leu Gly Ser Trp Asp Pr - #o Asn Lys Ala Ile Gly
610 - # 615 - # 620
- - Pro Met Tyr Asn Gln Val Ile Ala Lys Tyr Pr - #o Ser Trp Tyr Tyr Asp
625 6 - #30 6 - #35 6 -
#40
- - Val Ser Val Pro Ala Gly Thr Lys Leu Asp Ph - #e Lys Phe Ile Lys
Lys
645 - # 650 - # 655
- - Gly Gly Gly Thr Val Thr Trp Glu Gly Gly Gl - #y Asn His Thr Tyr Thr
660 - # 665 - # 670
- - Thr Pro Ala Ser Gly Val Gly Thr Val Thr Va - #l Asp Trp Gln Asn
675 - # 680 - # 685
- - - - <210> SEQ ID NO 76
<211> LENGTH: 675
<212> TYPE: PRT
<213> ORGANISM: Bacillus ohbensis
- - <400> SEQUENCE: 76
- - Asp Val Thr Asn Lys Val Asn Tyr Thr Arg As - #p Val Ile Tyr Gln Ile
1 5 - # 10 - # 15
- - Val Thr Asp Arg Phe Ser Asp Gly Asp Pro Se - #r Asn Asn Pro Thr Gly
20 - # 25 - # 30
- - Ala Ile Tyr Ser Gln Asp Cys Ser Asp Leu Hi - #s Lys Tyr Cys Gly Gly
35 - # 40 - # 45
- - Asp Trp Gln Gly Ile Ile Asp Lys Ile Asn As - #p Gly Tyr Leu Thr Asp
50 - # 55 - # 60
- - Leu Gly Ile Thr Ala Ile Trp Ile Ser Gln Pr - #o Val Glu Asn Val Tyr
65 - #70 - #75 - #80
- - Ala Leu His Pro Ser Gly Tyr Thr Ser Tyr Hi - #s Gly Tyr Trp Ala Arg
85 - # 90 - # 95
- - Asp Tyr Lys Arg Thr Asn Pro Phe Tyr Gly As - #p Phe Ser Asp Phe Asp
100 - # 105 - # 110
- - Arg Leu Met Asp Thr Ala His Ser Asn Gly Il - #e Lys Val Ile Met Asp
115 - # 120 - # 125
- - Phe Thr Pro Asn His Ser Ser Pro Ala Leu Gl - #u Thr Asp Pro Ser Tyr
130 - # 135 - # 140
- - Ala Glu Asn Gly Ala Val Tyr Asn Asp Gly Va - #l Leu Ile Gly Asn Tyr
145 1 - #50 1 - #55 1 -
#60
- - Ser Asn Asp Pro Asn Asn Leu Phe His His As - #n Gly Gly Thr Asp
Phe
165 - # 170 - # 175
- - Ser Ser Tyr Glu Asp Ser Ile Tyr Arg Asn Le - #u Tyr Asp Leu Ala Asp
180 - # 185 - # 190
- - Tyr Asp Leu Asn Asn Thr Val Met Asp Gln Ty - #r Leu Lys Glu Ser Ile
195 - # 200 - # 205
- - Lys Leu Trp Leu Asp Lys Gly Ile Asp Gly Il - #e Arg Val Asp Ala Val
210 - # 215 - # 220
- - Lys His Met Ser Glu Gly Trp Gln Thr Ser Le - #u Met Ser Asp Ile Tyr
225 2 - #30 2 - #35 2 -
#40
- - Ala His Glu Pro Val Phe Thr Phe Gly Glu Tr - #p Phe Leu Gly Ser
Gly
245 - # 250 - # 255
- - Glu Val Asp Pro Gln Asn His His Phe Ala As - #n Glu Ser Gly Met Ser
260 - # 265 - # 270
- - Leu Leu Asp Phe Gln Phe Gly Gln Thr Ile Ar - #g Asp Val Leu Met Asp
275 - # 280 - # 285
- - Gly Ser Ser Asn Trp Tyr Asp Phe Asn Glu Me - #t Ile Ala Ser Thr Glu
290 - # 295 - # 300
- - Glu Asp Tyr Asp Glu Val Ile Asp Gln Val Th - #r Phe Ile Asp Asn His
305 3 - #10 3 - #15 3 -
#20
- - Asp Met Ser Arg Phe Ser Phe Glu Gln Ser Se - #r Asn Arg His Thr
Asp
325 - # 330 - # 335
- - Ile Ala Leu Ala Val Leu Leu Thr Ser Arg Gl - #y Val Pro Thr Ile Tyr
340 - # 345 - # 350
- - Tyr Gly Thr Glu Gln Tyr Leu Thr Gly Gly As - #n Asp Pro Glu Asn Arg
355 - # 360 - # 365
- - Lys Pro Met Ser Asp Phe Asp Arg Thr Thr As - #n Ser Tyr Gln Ile Ile
370 - # 375 - # 380
- - Ser Thr Leu Ala Ser Leu Arg Gln Asn Asn Pr - #o Ala Leu Gly Tyr Gly
385 3 - #90 3 - #95 4 -
#00
- - Asn Thr Ser Glu Arg Trp Ile Asn Ser Asp Va - #l Tyr Ile Tyr Glu
Arg
405 - # 410 - # 415
- - Ser Phe Gly Asp Ser Val Val Leu Thr Ala Va - #l Asn Ser Gly Asp Thr
420 - # 425 - # 430
- - Ser Tyr Thr Ile Asn Asn Leu Asn Thr Ser Le - #u Pro Gln Gly Gln Tyr
435 - # 440 - # 445
- - Thr Asp Glu Leu Gln Gln Leu Leu Asp Gly As - #n Glu Ile Thr Val Asn
450 - # 455 - # 460
- - Ser Asn Gly Ala Val Asp Ser Phe Gln Leu Se - #r Ala Asn Gly Val Ser
465 4 - #70 4 - #75 4 -
#80
- - Val Trp Gln Ile Thr Glu Glu His Ala Ser Pr - #o Leu Ile Gly His
Val
485 - # 490 - # 495
- - Gly Pro Met Met Gly Lys His Gly Asn Thr Va - #l Thr Ile Thr Gly Glu
500 - # 505 - # 510
- - Gly Phe Gly Asp Asn Glu Gly Ser Val Leu Ph - #e Asp Ser Asp Phe Ser
515 - # 520 - # 525
- - Asp Val Leu Ser Trp Ser Asp Thr Lys Ile Gl - #u Val Ser Val Pro Asp
530 - # 535 - # 540
- - Val Thr Ala Gly His Tyr Asp Ile Ser Val Va - #l Asn Ala Gly Asp Ser
545 5 - #50 5 - #55 5 -
#60
- - Gln Ser Pro Thr Tyr Asp Lys Phe Glu Val Le - #u Thr Gly Asp Gln
Val
565 - # 570 - # 575
- - Ser Ile Arg Phe Ala Val Asn Asn Ala Thr Th - #r Ser Leu Gly Thr Asn
580 - # 585 - # 590
- - Leu Tyr Met Val Gly Asn Val Asn Glu Leu Gl - #y Asn Trp Asp Pro Asp
595 - # 600 - # 605
- - Gln Ala Ile Gly Pro Met Phe Asn Gln Val Me - #t Tyr Gln Tyr Pro Thr
610 - # 615 - # 620
- - Trp Tyr Tyr Asp Ile Ser Val Pro Ala Glu Gl - #u Asn Leu Glu Tyr Lys
625 6 - #30 6 - #35 6 -
#40
- - Phe Ile Lys Lys Asp Ser Ser Gly Asn Val Va - #l Trp Glu Ser Gly
Asn
645 - # 650 - # 655
- - Asn His Thr Tyr Thr Thr Pro Ala Thr Gly Th - #r Asp Thr Val Leu Val
660 - # 665 - # 670
- - Asp Trp Gln
675
- - - - <210> SEQ ID NO 77
<211> LENGTH: 680
<212> TYPE: PRT
<213> ORGANISM: Bacillus stearothermophilus
- - <400> SEQUENCE: 77
- - Ala Gly Asn Leu Asn Lys Val Asn Phe Thr Se - #r Asp Val Val Tyr Gln
1 5 - # 10 - # 15
- - Ile Val Val Asp Arg Phe Val Asp Gly Asn Th - #r Ser Asn Asn Pro Ser
20 - # 25 - # 30
- - Gly Ala Leu Phe Ser Ser Gly Cys Thr Asn Le - #u Arg Lys Tyr Cys Gly
35 - # 40 - # 45
- - Gly Asp Trp Gln Gly Ile Ile Asn Lys Ile As - #n Asp Gly Tyr Leu Thr
50 - # 55 - # 60
- - Asp Met Gly Val Thr Ala Ile Trp Ile Ser Gl - #n Pro Val Glu Asn Val
65 - #70 - #75 - #80
- - Phe Ser Val Met Asn Asp Ala Ser Gly Ser Al - #a Ser Tyr His Gly Tyr
85 - # 90 - # 95
- - Trp Ala Arg Asp Phe Lys Lys Pro Asn Pro Ph - #e Phe Gly Thr Leu Ser
100 - # 105 - # 110
- - Asp Phe Gln Arg Leu Val Asp Ala Ala His Al - #a Lys Gly Ile Lys Val
115 - # 120 - # 125
- - Ile Ile Asp Phe Ala Pro Asn His Thr Ser Pr - #o Ala Ser Glu Thr Asn
130 - # 135 - # 140
- - Pro Ser Tyr Met Glu Asn Gly Arg Leu Tyr As - #p Asn Gly Thr Leu Leu
145 1 - #50 1 - #55 1 -
#60
- - Gly Gly Tyr Thr Asn Asp Ala Asn Met Tyr Ph - #e His His Asn Gly
Gly
165 - # 170 - # 175
- - Thr Thr Phe Ser Ser Leu Glu Asp Gly Ile Ty - #r Arg Asn Leu Phe Asp
180 - # 185 - # 190
- - Leu Ala Asp Leu Asn His Gln Asn Pro Val Il - #e Asp Arg Tyr Leu Lys
195 - # 200 - # 205
- - Asp Ala Val Lys Met Trp Ile Asp Met Gly Il - #e Asp Gly Ile Arg Met
210 - # 215 - # 220
- - Asp Ala Val Lys His Met Pro Phe Gly Trp Gl - #n Lys Ser Leu Met Asp
225 2 - #30 2 - #35 2 -
#40
- - Glu Ile Asp Asn Tyr Arg Pro Val Phe Thr Ph - #e Gly Glu Trp Phe
Leu
245 - # 250 - # 255
- - Ser Glu Asn Glu Val Asp Ala Asn Asn His Ty - #r Phe Ala Asn Glu Ser
260 - # 265 - # 270
- - Gly Met Ser Leu Leu Asp Phe Arg Phe Gly Gl - #n Lys Leu Arg Gln Val
275 - # 280 - # 285
- - Leu Arg Asn Asn Ser Asp Asn Trp Tyr Gly Ph - #e Asn Gln Met Ile Gln
290 - # 295 - # 300
- - Asp Thr Ala Ser Ala Tyr Asp Glu Val Leu As - #p Gln Val Thr Phe Ile
305 3 - #10 3 - #15 3 -
#20
- - Asp Asn His Asp Met Asp Arg Phe Met Ile As - #p Gly Gly Asp Pro
Arg
325 - # 330 - # 335
- - Lys Val Asp Met Ala Leu Ala Val Leu Leu Th - #r Ser Arg Gly Val Pro
340 - # 345 - # 350
- - Asn Ile Tyr Tyr Gly Thr Glu Gln Tyr Met Th - #r Gly Asn Gly Asp Pro
355 - # 360 - # 365
- - Asn Asn Arg Lys Met Met Ser Ser Phe Asn Ly - #s Asn Thr Arg Ala Tyr
370 - # 375 - # 380
- - Gln Val Ile Gln Lys Leu Ser Ser Leu Arg Ar - #g Asn Asn Pro Ala Leu
385 3 - #90 3 - #95 4 -
#00
- - Ala Tyr Gly Asp Thr Glu Gln Arg Trp Ile As - #n Gly Asp Val Tyr
Val
405 - # 410 - # 415
- - Tyr Glu Arg Gln Phe Gly Lys Asp Val Val Le - #u Val Ala Val Asn Arg
420 - # 425 - # 430
- - Ser Ser Ser Ser Asn Tyr Ser Ile Thr Gly Le - #u Phe Thr Ala Leu Pro
435 - # 440 - # 445
- - Ala Gly Thr Tyr Thr Asp Gln Leu Gly Gly Le - #u Leu Asp Gly Asn Thr
450 - # 455 - # 460
- - Ile Gln Val Gly Ser Asn Gly Ser Val Asn Al - #a Phe Asp Leu Gly Pro
465 4 - #70 4 - #75 4 -
#80
- - Gly Glu Val Gly Val Trp Ala Tyr Ser Ala Th - #r Glu Ser Thr Pro
Ile
485 - # 490 - # 495
- - Ile Gly His Val Gly Pro Met Met Gly Gln Va - #l Gly His Gln Val Thr
500 - # 505 - # 510
- - Ile Asp Gly Glu Gly Phe Gly Thr Asn Thr Gl - #y Thr Val Lys Phe Gly
515 - # 520 - # 525
- - Thr Thr Ala Ala Asn Val Val Ser Trp Ser As - #n Asn Gln Ile Val Val
530 - # 535 - # 540
- - Ala Val Pro Asn Val Ser Pro Gly Lys Tyr As - #n Ile Thr Val Gln Ser
545 5 - #50 5 - #55 5 -
#60
- - Ser Ser Gly Gln Thr Ser Ala Ala Tyr Asp As - #n Phe Glu Val Leu
Thr
565 - # 570 - # 575
- - Asn Asp Gln Val Ser Val Arg Phe Val Val As - #n Asn Ala Thr Thr Asn
580 - # 585 - # 590
- - Leu Gly Gln Asn Ile Tyr Ile Val Gly Asn Va - #l Tyr Glu Leu Gly Asn
595 - # 600 - # 605
- - Trp Asp Thr Ser Lys Ala Ile Gly Pro Met Ph - #e Asn Gln Val Val Tyr
610 - # 615 - # 620
- - Ser Tyr Pro Thr Trp Tyr Ile Asp Val Ser Va - #l Pro Glu Gly Lys Thr
625 6 - #30 6 - #35 6 -
#40
- - Ile Glu Phe Lys Phe Ile Lys Lys Asp Ser Gl - #n Gly Asn Val Thr
Trp
645 - # 650 - # 655
- - Glu Ser Gly Ser Asn His Val Tyr Thr Thr Pr - #o Thr Asn Thr Thr Gly
660 - # 665 - # 670
- - Lys Ile Ile Val Asp Trp Gln Asn
675 - # 680
- - - - <210> SEQ ID NO 78
<211> LENGTH: 624
<212> TYPE: PRT
<213> ORGANISM: Klebsiella pneumoniae
- - <400> SEQUENCE: 78
- - Glu Pro Glu Glu Thr Tyr Leu Asp Phe Arg Ly - #s Glu Thr Ile Tyr Phe
1 5 - # 10 - # 15
- - Leu Phe Leu Asp Arg Phe Ser Asp Gly Asp Pr - #o Ser Asn Asn Ala Gly
20 - # 25 - # 30
- - Phe Asn Ser Ala Thr Tyr Asp Pro Asn Asn Le - #u Lys Lys Tyr Thr Gly
35 - # 40 - # 45
- - Gly Asp Leu Arg Gly Leu Ile Asn Lys Leu Pr - #o Tyr Leu Lys Ser Leu
50 - # 55 - # 60
- - Gly Val Thr Ser Ile Trp Ile Thr Pro Pro Il - #e Asp Asn Val Asn Asn
65 - #70 - #75 - #80
- - Thr Asp Ala Ala Gly Asn Thr Gly Tyr His Gl - #y Tyr Trp Gly Arg Asp
85 - # 90 - # 95
- - Tyr Phe Arg Ile Asp Glu His Phe Gly Asn Le - #u Asp Asp Phe Lys Glu
100 - # 105 - # 110
- - Leu Thr Ser Leu Met His Ser Pro Asp Tyr As - #n Met Lys Leu Val Leu
115 - # 120 - # 125
- - Asp Tyr Ala Pro Asn His Ser Asn Ala Asn As - #p Glu Asn Glu Phe Gly
130 - # 135 - # 140
- - Ala Leu Tyr Arg Asp Gly Val Phe Ile Thr As - #p Tyr Pro Thr Asn Val
145 1 - #50 1 - #55 1 -
#60
- - Ala Ala Asn Thr Gly Trp Tyr His His Asn Gl - #y Gly Val Thr Asn
Trp
165 - # 170 - # 175
- - Asn Asp Phe Phe Gln Val Lys Asn His Asn Le - #u Phe Asn Leu Ser Asp
180 - # 185 - # 190
- - Leu Asn Gln Ser Asn Thr Asp Val Tyr Gln Ty - #r Leu Leu Asp Gly Ser
195 - # 200 - # 205
- - Lys Phe Trp Ile Asp Ala Gly Val Asp Ala Il - #e Arg Ile Asp Ala Ile
210 - # 215 - # 220
- - Lys His Met Asp Lys Ser Phe Ile Gln Lys Tr - #p Thr Ser Asp Ile Tyr
225 2 - #30 2 - #35 2 -
#40
- - Asp Tyr Ser Lys Ser Ile Gly Arg Glu Gly Ph - #e Phe Phe Phe Gly
Glu
245 - # 250 - # 255
- - Trp Phe Gly Ala Ser Ala Asn Thr Thr Thr Gl - #y Val Asp Gly Asn Ala
260 - # 265 - # 270
- - Ile Asp Tyr Ala Asn Thr Ser Gly Ser Ala Le - #u Leu Asp Phe Gly Phe
275 - # 280 - # 285
- - Arg Asp Thr Leu Glu Arg Val Leu Val Gly Ar - #g Ser Gly Asn Thr Met
290 - # 295 - # 300
- - Lys Thr Leu Asn Ser Tyr Leu Ile Lys Arg Gl - #n Thr Val Phe Thr Ser
305 3 - #10 3 - #15 3 -
#20
- - Asp Asp Trp Gln Val Val Phe Met Asp Asn Hi - #s Asp Met Ala Arg
Ile
325 - # 330 - # 335
- - Gly Thr Ala Leu Arg Ser Asn Ala Thr Thr Ph - #e Gly Pro Gly Asn Asn
340 - # 345 - # 350
- - Glu Thr Gly Gly Ser Gln Ser Glu Ala Phe Al - #a Gln Lys Arg Ile Asp
355 - # 360 - # 365
- - Leu Gly Leu Val Ala Thr Met Thr Val Arg Gl - #y Ile Pro Ala Ile Tyr
370 - # 375 - # 380
- - Tyr Gly Thr Glu His Tyr Ala Ala Asn Phe Th - #r Ser Asn Ser Phe Gly
385 3 - #90 3 - #95 4 -
#00
- - Gln Val Gly Ser Asp Pro Tyr Asn Arg Glu Ly - #s Met Pro Gly Phe
Asp
405 - # 410 - # 415
- - Thr Glu Ser Glu Ala Phe Ser Ile Ile Lys Th - #r Leu Gly Asp Leu Arg
420 - # 425 - # 430
- - Lys Ser Ser Pro Ala Ile Gln Asn Gly Thr Ty - #r Thr Glu Leu Trp Val
435 - # 440 - # 445
- - Asn Asp Asp Ile Leu Val Phe Glu Arg Arg Se - #r Gly Asn Asp Ile Val
450 - # 455 - # 460
- - Ile Val Ala Leu Asn Arg Gly Glu Ala Asn Th - #r Ile Asn Val Lys Asn
465 4 - #70 4 - #75 4 -
#80
- - Ile Ala Val Pro Asn Gly Val Tyr Pro Ser Le - #u Ile Gly Asn Asn
Ser
485 - # 490 - # 495
- - Val Ser Val Ala Asn Lys Arg Thr Thr Leu Th - #r Leu Met Gln Asn Glu
500 - # 505 - # 510
- - Ala Val Val Ile Arg Ser Gln Ser Asp Asp Al - #a Glu Asn Pro Thr Val
515 - # 520 - # 525
- - Gln Ser Ile Asn Phe Thr Cys Asn Asn Gly Ty - #r Thr Ile Ser Gly Gln
530 - # 535 - # 540
- - Ser Val Tyr Ile Ile Gly Asn Ile Pro Gln Le - #u Gly Gly Trp Asp Leu
545 5 - #50 5 - #55 5 -
#60
- - Thr Lys Ala Val Lys Ile Ser Pro Thr Gln Ty - #r Pro Gln Trp Ser
Ala
565 - # 570 - # 575
- - Ser Leu Glu Leu Pro Ser Asp Leu Asn Val Gl - #u Trp Lys Cys Val Lys
580 - # 585 - # 590
- - Arg Asn Glu Thr Asn Pro Thr Ala Asn Val Gl - #u Trp Gln Ser Gly Ala
595 - # 600 - # 605
- - Asn Asn Gln Phe Asn Ser Asn Asp Thr Gln Th - #r Thr Asn Gly Ser Phe
610 - # 615 - # 620
__________________________________________________________________________
Claims (62)
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DK0477/95 | 1995-04-21 | ||
| DK47795 | 1995-04-21 | ||
| DK1173/95 | 1995-10-17 | ||
| DK117395 | 1995-10-17 | ||
| DK1281/95 | 1995-11-16 | ||
| DK128195 | 1995-11-16 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/DK1996/000179 Continuation WO1996033267A1 (en) | 1995-04-21 | 1996-04-22 | Cyclomaltodextrin glucanotransferase variants |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US6004790A true US6004790A (en) | 1999-12-21 |
Family
ID=27220709
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US08/947,965 Expired - Fee Related US6004790A (en) | 1995-04-21 | 1997-10-09 | Cyclomaltodextrin glucanotransferase variants |
Country Status (11)
| Country | Link |
|---|---|
| US (1) | US6004790A (en) |
| EP (2) | EP0822982B1 (en) |
| JP (1) | JP4057055B2 (en) |
| KR (1) | KR19990007930A (en) |
| AR (1) | AR001678A1 (en) |
| AT (1) | ATE311439T1 (en) |
| AU (1) | AU5396896A (en) |
| CA (1) | CA2217876A1 (en) |
| DE (1) | DE69635515T2 (en) |
| DK (1) | DK0822982T3 (en) |
| WO (1) | WO1996033267A1 (en) |
Cited By (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20020042120A1 (en) * | 1997-09-24 | 2002-04-11 | Novozymes A/S | Novel cyclomaltodextrin glucanotransferase variants |
| WO2002006508A3 (en) * | 2000-07-19 | 2002-05-02 | Novozymes As | CYCLOMALTODEXTRIN GLUCANOTRANSFERASE (CGTase9, DNA SEQUENCE ENCODING IT AND USES THEREOF |
| US20030194796A1 (en) * | 2000-05-23 | 2003-10-16 | Masayasu Takada | Novel cyclodextrin glucanotransferase, process for producing the same and process for producing cyclodextrin by using this enzyme |
| US20030207408A1 (en) * | 1998-02-27 | 2003-11-06 | Novozymes A/S | Amylolytic enzyme variants |
| WO2004081171A3 (en) * | 2003-03-12 | 2005-03-24 | Danisco | Variants of enzymes of the alpha-amylase family |
| US20090142817A1 (en) * | 2002-02-14 | 2009-06-04 | Novozymes A/S | Process for hydrolysis of starch |
| US20110287142A1 (en) * | 2004-08-02 | 2011-11-24 | Novozymes A/S | Creation of diversity in polypeptides |
| US20120165290A1 (en) * | 2009-05-08 | 2012-06-28 | Lubbert Dijkhuizen | Glucooligosaccharides comprising (alpha 1->4) and (alpha 1->6) glycosidic bonds, use thereof, and methods for providing them |
| CN103602641A (en) * | 2013-11-20 | 2014-02-26 | 广西大学 | Beta-cyclodextrin glucosyl tranferase enzyme mutant for producing gamma-cyclodextrin and use thereof |
| US20190218530A1 (en) * | 2018-01-15 | 2019-07-18 | Jiangnan University | Preparation and Application of Cyclodextrin Glucosyltransferase Mutant |
| US20200181585A1 (en) * | 2018-12-07 | 2020-06-11 | Jiangnan University | Mutant of Cyclodextrin Glycosyltransferase |
| CN111534498A (en) * | 2020-05-28 | 2020-08-14 | 江南大学 | Cyclodextrin glucosyltransferase mutants with increased disproportionation specific activity and AA-2G production |
| CN113493747A (en) * | 2020-04-02 | 2021-10-12 | 青岛蔚蓝生物集团有限公司 | Mutant strain of high-yield cyclodextrin glucosyltransferase and application thereof |
Families Citing this family (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE19615336A1 (en) * | 1996-04-18 | 1997-10-23 | Consortium Elektrochem Ind | Cyclodextrin glycosyltransferases for the production of gamma-cyclodextrin |
| US6472192B1 (en) | 1996-04-18 | 2002-10-29 | Consortium für elektrochemische Industrie GmbH | Cyclodextrin glycosyl transferases for producing γ-cyclodextrin |
| DE19718978A1 (en) * | 1997-05-05 | 1998-11-12 | Henkel Kgaa | Process for inhibiting color transfer in textiles during washing |
| US6876932B1 (en) | 1998-02-27 | 2005-04-05 | Novozymes A/S | Maltogenic alpha-amylase variants |
| CN103352033B (en) | 1998-02-27 | 2016-05-11 | 诺维信公司 | Maltogenic alpha-amylase variants |
| DE69931615T2 (en) * | 1998-02-27 | 2007-05-10 | Novozymes A/S | VARIANTS OF AMYLOLYTIC ENZYMES |
| WO2001016349A1 (en) * | 1999-09-01 | 2001-03-08 | Novozymes A/S | Method for production of maltose and/or enzymatically modified starch |
| AU2002233186A1 (en) | 2000-11-28 | 2002-06-11 | Henkel Kommanditgesellschaft Auf Aktien | Cyclodextrin glucanotransferase (cgtase), obtained from bacillus agaradherens (dsm 9948) and detergents and cleaning agents containing said novel cyclodextrin glucanotransferase |
| US6960461B2 (en) | 2001-07-11 | 2005-11-01 | Nihon Shokuhin Kako Co., Ltd. | Gene coding for cyclodextrin glucanotransferase chiefly producing γ-cyclodextrin and use thereof |
| ATE500749T1 (en) | 2001-09-18 | 2011-03-15 | Novozymes As | ENZYMATIC TREATMENT OF STARCH-CONTAINED FOOD PRODUCTS TO REDUCE THE TEMPERING TIME |
| EP1644495A1 (en) * | 2003-07-01 | 2006-04-12 | Novozymes A/S | Cgtase variants |
| BRPI0511171A (en) | 2004-05-17 | 2007-12-04 | Procter & Gamble | bleach composition comprising a carbohydrate oxidase |
| JP4989922B2 (en) | 2006-06-08 | 2012-08-01 | 日本食品化工株式会社 | Mutant and gene encoding the same |
| EP2435547A1 (en) | 2009-05-26 | 2012-04-04 | The Procter & Gamble Company | Aqueous liquid composition for pre-treating soiled dishware |
| US20110009307A1 (en) | 2009-07-09 | 2011-01-13 | Alan Thomas Brooker | Laundry Detergent Composition Comprising Low Level of Sulphate |
| WO2011005623A1 (en) | 2009-07-09 | 2011-01-13 | The Procter & Gamble Company | Laundry detergent composition comprising low level of bleach |
| BR112013022939B1 (en) | 2011-03-07 | 2022-01-11 | Hayashibara Co., Ltd | PROCESS TO PRODUCE A PARTICULATE COMPOSITION |
| ES2707894T3 (en) | 2012-10-04 | 2019-04-05 | Ecolab Usa Inc | Pre-soaking technology for washing clothes and cleaning other hard surfaces |
| CN103740669A (en) * | 2013-04-26 | 2014-04-23 | 江南大学 | Method for improving beta-cyclodextrin production capability of cyclodextrin glycosyltransferase by calcium ion binding site amino acid residue mutation |
| CA2956850C (en) | 2014-08-01 | 2021-07-06 | Ecolab Usa Inc. | A method of manual surface cleaning using cleaning textiles and of washing said cleaning textiles |
| CN104293743B (en) * | 2014-09-26 | 2016-09-07 | 江南大学 | A kind of cyclodextrin glycosyltransferase mutant weakened by Product inhibiton |
| CA2972679A1 (en) * | 2015-01-16 | 2016-07-21 | Novozymes A/S | Method to improve sliceability of baked goods |
| US10577571B2 (en) | 2016-11-08 | 2020-03-03 | Ecolab Usa Inc. | Non-aqueous cleaner for vegetable oil soils |
| JP7199358B2 (en) | 2017-09-04 | 2023-01-05 | 昭和電工株式会社 | Method for producing inositol derivative |
| CN111051519B (en) | 2017-09-04 | 2024-04-26 | 株式会社力森诺科 | Mixtures of inositol derivatives |
| US11873465B2 (en) | 2019-08-14 | 2024-01-16 | Ecolab Usa Inc. | Methods of cleaning and soil release of highly oil absorbing substrates employing optimized extended chain nonionic surfactants |
| JP2023534927A (en) | 2020-07-06 | 2023-08-15 | エコラボ ユーエスエー インコーポレイティド | Foaming alcohol/water blend composition containing structured alkoxylated siloxane |
| JP7630600B2 (en) | 2020-07-06 | 2025-02-17 | エコラボ ユーエスエー インコーポレイティド | Foaming mixed alcohol/water compositions containing alkylsiloxane and hydrotrope/solubilizer combinations - Patent Application 20070229933 |
| US12152220B2 (en) | 2020-07-06 | 2024-11-26 | Ecolab Usa Inc. | PEG-modified castor oil based compositions for microemulsifying and removing multiple oily soils |
| US20240026248A1 (en) | 2022-07-20 | 2024-01-25 | Ecolab Usa Inc. | Novel nonionic extended surfactants, compositions and methods of use thereof |
| CN120366258B (en) * | 2025-06-26 | 2025-09-02 | 山东奔月生物科技股份有限公司 | Cyclodextrin glucosyltransferase mutant and application thereof |
Citations (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1989003421A1 (en) * | 1987-10-15 | 1989-04-20 | Novo Industri A/S | Thermostable cyclodextrin glycosyl transferase, its production and use |
| WO1991014770A2 (en) * | 1990-03-27 | 1991-10-03 | Consortium für elektrochemische Industrie GmbH | η-CGTase |
| US5162210A (en) * | 1990-06-29 | 1992-11-10 | Iowa State University Research Foundation | Process for enzymatic hydrolysis of starch to glucose |
| JPH0541985A (en) * | 1991-03-04 | 1993-02-23 | Oji Koonsutaac Kk | Mutant cyclomaltodextrin glucanotransferase |
| JPH05219948A (en) * | 1992-02-04 | 1993-08-31 | Takeshi Uozumi | Cyclodextrin glucanotransferase, its production and gene coding the same enzyme |
| JPH05244945A (en) * | 1992-03-04 | 1993-09-24 | Nippon Shokuhin Kako Co Ltd | Method for producing α-cyclodextrin |
| US5278059A (en) * | 1985-12-04 | 1994-01-11 | Kabushiki Kaisha Hayashibara Seibutsu Kagaki Kenkyujo | Polypeptide possessing cyclomaltodextrin glucanotransferase activity |
| EP0614971A2 (en) * | 1993-02-09 | 1994-09-14 | Amano Pharmaceutical Co., Ltd. | Novel cyclodextrin glucanotransferase, process for producing the same and process for producing cyclodextrin using the same |
| EP0630967A1 (en) * | 1993-06-24 | 1994-12-28 | Consortium für elektrochemische Industrie GmbH | Cyclodextrin glycosyltransferase for the production of gamma-cyclodextrin |
| JPH0723781A (en) * | 1993-06-25 | 1995-01-27 | Oji Koonsutaac Kk | Modified cgtase, its production and use |
| US5538882A (en) * | 1993-03-04 | 1996-07-23 | Director-General Of Agency Of Industrial Science And Techology | Variant-type carbohydrate hydrolase, variant gene of the enzyme and method for producing oligosaccharide using the enzyme |
-
1996
- 1996-04-22 AU AU53968/96A patent/AU5396896A/en not_active Abandoned
- 1996-04-22 DE DE69635515T patent/DE69635515T2/en not_active Expired - Fee Related
- 1996-04-22 AR AR33624396A patent/AR001678A1/en unknown
- 1996-04-22 WO PCT/DK1996/000179 patent/WO1996033267A1/en not_active Ceased
- 1996-04-22 EP EP96910915A patent/EP0822982B1/en not_active Expired - Lifetime
- 1996-04-22 JP JP53141396A patent/JP4057055B2/en not_active Expired - Fee Related
- 1996-04-22 CA CA002217876A patent/CA2217876A1/en not_active Abandoned
- 1996-04-22 AT AT96910915T patent/ATE311439T1/en not_active IP Right Cessation
- 1996-04-22 EP EP05110409A patent/EP1632566A3/en not_active Withdrawn
- 1996-04-22 KR KR1019970707453A patent/KR19990007930A/en not_active Ceased
- 1996-04-22 DK DK96910915A patent/DK0822982T3/en active
-
1997
- 1997-10-09 US US08/947,965 patent/US6004790A/en not_active Expired - Fee Related
Patent Citations (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5278059A (en) * | 1985-12-04 | 1994-01-11 | Kabushiki Kaisha Hayashibara Seibutsu Kagaki Kenkyujo | Polypeptide possessing cyclomaltodextrin glucanotransferase activity |
| WO1989003421A1 (en) * | 1987-10-15 | 1989-04-20 | Novo Industri A/S | Thermostable cyclodextrin glycosyl transferase, its production and use |
| WO1991014770A2 (en) * | 1990-03-27 | 1991-10-03 | Consortium für elektrochemische Industrie GmbH | η-CGTase |
| US5162210A (en) * | 1990-06-29 | 1992-11-10 | Iowa State University Research Foundation | Process for enzymatic hydrolysis of starch to glucose |
| JPH0541985A (en) * | 1991-03-04 | 1993-02-23 | Oji Koonsutaac Kk | Mutant cyclomaltodextrin glucanotransferase |
| JPH05219948A (en) * | 1992-02-04 | 1993-08-31 | Takeshi Uozumi | Cyclodextrin glucanotransferase, its production and gene coding the same enzyme |
| JPH05244945A (en) * | 1992-03-04 | 1993-09-24 | Nippon Shokuhin Kako Co Ltd | Method for producing α-cyclodextrin |
| EP0614971A2 (en) * | 1993-02-09 | 1994-09-14 | Amano Pharmaceutical Co., Ltd. | Novel cyclodextrin glucanotransferase, process for producing the same and process for producing cyclodextrin using the same |
| US5538882A (en) * | 1993-03-04 | 1996-07-23 | Director-General Of Agency Of Industrial Science And Techology | Variant-type carbohydrate hydrolase, variant gene of the enzyme and method for producing oligosaccharide using the enzyme |
| US5635378A (en) * | 1993-03-04 | 1997-06-03 | Director-General Of Agency Of Industrial Science And Technology | Variant-type carbohydrate hydrolase, variant gene of the enzyme and method for producing oligosaccharide using the enzyme |
| EP0630967A1 (en) * | 1993-06-24 | 1994-12-28 | Consortium für elektrochemische Industrie GmbH | Cyclodextrin glycosyltransferase for the production of gamma-cyclodextrin |
| US5474917A (en) * | 1993-06-24 | 1995-12-12 | Consortium Fur Elektrochemische Industrie Gmbh | Modified cyclodextrin glycosyltransferases for producing γ-cyclodextrins |
| JPH0723781A (en) * | 1993-06-25 | 1995-01-27 | Oji Koonsutaac Kk | Modified cgtase, its production and use |
Non-Patent Citations (30)
| Title |
|---|
| B. Strokopytov, Biochemistry, vol. 34 (7), pp. 2234 2240 (Feb. 21, 1995). * |
| B. Strokopytov, Biochemistry, vol. 34 (7), pp. 2234-2240 (Feb. 21, 1995). |
| Fujiwara, et al., Journal of Bacteriology, vol. 174, No. 22, pp. 7478 7481 (Nov. 1992). * |
| Fujiwara, et al., Journal of Bacteriology, vol. 174, No. 22, pp. 7478-7481 (Nov. 1992). |
| Kimura et al. (Jun. 1989) Functions of the COOH terminal region of cyclodextrin glucanotransferase of alkalophilic Bacillus sp. 1011: Relation to catalyzing activity and pH stability. Biochem. Biophys. Res. Commun. 161(3): 1273 1279. * |
| Kimura et al. (Jun. 1989) Functions of the COOH-terminal region of cyclodextrin glucanotransferase of alkalophilic Bacillus sp. #1011: Relation to catalyzing activity and pH stability. Biochem. Biophys. Res. Commun. 161(3): 1273-1279. |
| Klein et al., J. Mol. Biol., vol. 217, pp. 737 750 (1991). * |
| Klein et al., J. Mol. Biol., vol. 217, pp. 737-750 (1991). |
| Knegtel et al. (Dec. 1995) Crystallographic studies of the interaction of cyclodextrin glycosyltransferase from Bacillus circulans Strain 251 with natural substrates and products. J. Biol. Chem. 270(49): 29256 29264. * |
| Knegtel et al. (Dec. 1995) Crystallographic studies of the interaction of cyclodextrin glycosyltransferase from Bacillus circulans Strain 251 with natural substrates and products. J. Biol. Chem. 270(49): 29256-29264. |
| Mattsson et al. (Feb. 1995) The role of histidine residues in the catalytic act of cyclomaltodextrin glucanotransferase from Bacillus circulans var. alkalophilus. Biochimica et. Biophysica Acta 1247(1): 97 103. * |
| Mattsson et al. (Feb. 1995) The role of histidine residues in the catalytic act of cyclomaltodextrin glucanotransferase from Bacillus circulans var. alkalophilus. Biochimica et. Biophysica Acta 1247(1): 97-103. |
| Nakamura et al., Biochemistry, vol. 32, pp. 6624 6631 (1993). * |
| Nakamura et al., Biochemistry, vol. 32, pp. 6624-6631 (1993). |
| Nakamura et al., Biochemistry, vol. 33, pp.9929 9936 (1994). * |
| Nakamura et al., Biochemistry, vol. 33, pp.9929-9936 (1994). |
| Nakamura et al., FEBS Lett, vol. 296 (1), pp. 37 40 (Jan. 13, 1992). * |
| Nakamura et al., FEBS Lett, vol. 296 (1), pp. 37-40 (Jan. 13, 1992). |
| Ngo et al. (Jan. 1994) Computational complexity, protein structure prediction, and the ILevinthal paradox. In: The Protein Folding Problem and Tertiary Structure Prediction. Eds. Merz et al. Birkhauser et al. Boston, MA. pp. 491 495. * |
| Ngo et al. (Jan. 1994) Computational complexity, protein structure prediction, and the ILevinthal paradox. In: The Protein Folding Problem and Tertiary Structure Prediction. Eds. Merz et al. Birkhauser et al. Boston, MA. pp. 491-495. |
| Penninga et al., Biochemistry, vol. 34, pp. 3368 3376 (1995). * |
| Penninga et al., Biochemistry, vol. 34, pp. 3368-3376 (1995). |
| Rudinger (Jun. 1976) Characteristics of the amino acids as components of a peptide hormone sequence. In: Peptide Hormones. Ed. J. A. Parsons. University Park Press, Baltimore, MD. pp. 1 7. * |
| Rudinger (Jun. 1976) Characteristics of the amino acids as components of a peptide hormone sequence. In: Peptide Hormones. Ed. J. A. Parsons. University Park Press, Baltimore, MD. pp. 1-7. |
| Sin et al., Journal of Biotechnology, vol. 32, pp. 283 288 (1994). * |
| Sin et al., Journal of Biotechnology, vol. 32, pp. 283-288 (1994). |
| Thorton et al. (Aug. 1995) Protein Engineering: Editorial Overview. Current Opinion in Biotechnology 6(4): 367 369. * |
| Thorton et al. (Aug. 1995) Protein Engineering: Editorial Overview. Current Opinion in Biotechnology 6(4): 367-369. |
| Wallace (Apr. 1993) Understanding cytochrome c function: engineering protein structure by semisynthesis. The FASEB Journal 7: 505 515. * |
| Wallace (Apr. 1993) Understanding cytochrome c function: engineering protein structure by semisynthesis. The FASEB Journal 7: 505-515. |
Cited By (34)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6777215B2 (en) * | 1997-09-24 | 2004-08-17 | Novozymes A/S | Cyclomaltodextrin glucanotransferase variants |
| US20020042120A1 (en) * | 1997-09-24 | 2002-04-11 | Novozymes A/S | Novel cyclomaltodextrin glucanotransferase variants |
| US20100098804A1 (en) * | 1998-02-27 | 2010-04-22 | Novozymes A/S | Amylolytic enzyme variants |
| US20090142803A1 (en) * | 1998-02-27 | 2009-06-04 | Novozymes A/S | Amylolytic Enzyme Variants |
| US20030207408A1 (en) * | 1998-02-27 | 2003-11-06 | Novozymes A/S | Amylolytic enzyme variants |
| US20030215928A1 (en) * | 1998-02-27 | 2003-11-20 | Novozymes A/S | Amylolytic enzyme variants |
| US6924136B2 (en) * | 2000-05-23 | 2005-08-02 | Nihon Shokuhin Kako Co. Ltd. | Cyclodextrin glucanotransferase and its method of manufacture |
| US20030194796A1 (en) * | 2000-05-23 | 2003-10-16 | Masayasu Takada | Novel cyclodextrin glucanotransferase, process for producing the same and process for producing cyclodextrin by using this enzyme |
| US7700338B2 (en) | 2000-07-19 | 2010-04-20 | Novozymes A/S | CGTase and DNA sequence encoding same |
| US20030170696A1 (en) * | 2000-07-19 | 2003-09-11 | Jorgensen Per Lina | Cgtase and dna sequence encoding same |
| WO2002006508A3 (en) * | 2000-07-19 | 2002-05-02 | Novozymes As | CYCLOMALTODEXTRIN GLUCANOTRANSFERASE (CGTase9, DNA SEQUENCE ENCODING IT AND USES THEREOF |
| US20090142817A1 (en) * | 2002-02-14 | 2009-06-04 | Novozymes A/S | Process for hydrolysis of starch |
| WO2004081171A3 (en) * | 2003-03-12 | 2005-03-24 | Danisco | Variants of enzymes of the alpha-amylase family |
| US20060134266A1 (en) * | 2003-03-12 | 2006-06-22 | Kragh Karsten M | Enzyme |
| US7803598B2 (en) * | 2003-03-12 | 2010-09-28 | Danisco A/S | Cyclodextrin glycosyltransferase (CGTase) polypeptides with modified hydrolysis activity |
| US8309337B2 (en) * | 2004-08-02 | 2012-11-13 | Novozymes A/S | Hybrid polypeptide of a maltogenic alpha-amylase and a cyclodextrin glucanotransferase |
| US20110287142A1 (en) * | 2004-08-02 | 2011-11-24 | Novozymes A/S | Creation of diversity in polypeptides |
| US20120165290A1 (en) * | 2009-05-08 | 2012-06-28 | Lubbert Dijkhuizen | Glucooligosaccharides comprising (alpha 1->4) and (alpha 1->6) glycosidic bonds, use thereof, and methods for providing them |
| US9657322B2 (en) * | 2009-05-08 | 2017-05-23 | Rijksuniversiteit Groningen | Glucooligosaccharides comprising (alpha 1->4) and (alpha 1->6) glycosidic bonds, use thereof, and methods for providing them |
| CN103602641A (en) * | 2013-11-20 | 2014-02-26 | 广西大学 | Beta-cyclodextrin glucosyl tranferase enzyme mutant for producing gamma-cyclodextrin and use thereof |
| CN103602641B (en) * | 2013-11-20 | 2015-05-20 | 广西大学 | Beta-cyclodextrin glucosyl tranferase enzyme mutant for producing gamma-cyclodextrin and use thereof |
| US20190218530A1 (en) * | 2018-01-15 | 2019-07-18 | Jiangnan University | Preparation and Application of Cyclodextrin Glucosyltransferase Mutant |
| US10876099B2 (en) * | 2018-01-15 | 2020-12-29 | Jiangnan University | Preparation and application of cyclodextrin glucosyltransferase mutant |
| CN111560360A (en) * | 2018-12-07 | 2020-08-21 | 江南大学 | Cyclodextrin glucosyltransferase mutant and application thereof |
| CN111411097A (en) * | 2018-12-07 | 2020-07-14 | 江南大学 | A cyclodextrin glucosyltransferase mutant S90G |
| CN111394330A (en) * | 2018-12-07 | 2020-07-10 | 江南大学 | Cyclodextrin glucosyltransferase mutant T168A and preparation method thereof |
| US20200181585A1 (en) * | 2018-12-07 | 2020-06-11 | Jiangnan University | Mutant of Cyclodextrin Glycosyltransferase |
| CN111411097B (en) * | 2018-12-07 | 2022-02-15 | 江南大学 | Cyclodextrin glucosyltransferase mutant S90G |
| CN111560360B (en) * | 2018-12-07 | 2022-03-04 | 江南大学 | Cyclodextrin glucosyltransferase mutant and its application |
| CN111394330B (en) * | 2018-12-07 | 2022-03-04 | 江南大学 | Cyclodextrin glucosyltransferase mutant T168A and preparation method thereof |
| US11566235B2 (en) * | 2018-12-07 | 2023-01-31 | Jiangnan University | Mutant of cyclodextrin glycosyltransferase |
| CN113493747A (en) * | 2020-04-02 | 2021-10-12 | 青岛蔚蓝生物集团有限公司 | Mutant strain of high-yield cyclodextrin glucosyltransferase and application thereof |
| CN111534498A (en) * | 2020-05-28 | 2020-08-14 | 江南大学 | Cyclodextrin glucosyltransferase mutants with increased disproportionation specific activity and AA-2G production |
| CN111534498B (en) * | 2020-05-28 | 2022-03-25 | 江南大学 | Cyclodextrin glucosyltransferase mutant with improved disproportionation specific activity and AA-2G yield |
Also Published As
| Publication number | Publication date |
|---|---|
| EP0822982B1 (en) | 2005-11-30 |
| CA2217876A1 (en) | 1996-10-24 |
| JP4057055B2 (en) | 2008-03-05 |
| JPH11503906A (en) | 1999-04-06 |
| EP0822982A1 (en) | 1998-02-11 |
| DK0822982T3 (en) | 2006-02-13 |
| AU5396896A (en) | 1996-11-07 |
| EP1632566A2 (en) | 2006-03-08 |
| KR19990007930A (en) | 1999-01-25 |
| EP1632566A3 (en) | 2006-08-16 |
| ATE311439T1 (en) | 2005-12-15 |
| DE69635515T2 (en) | 2006-08-10 |
| WO1996033267A1 (en) | 1996-10-24 |
| DE69635515D1 (en) | 2006-01-05 |
| AR001678A1 (en) | 1997-11-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6004790A (en) | Cyclomaltodextrin glucanotransferase variants | |
| Penninga et al. | Site-directed mutations in tyrosine 195 of cyclodextrin glycosyltransferase from Bacillus circulans strain 251 affect activity and product specificity | |
| EP0605040B1 (en) | Pullulanase, micro-organisms producing the same, method for preparation thereof as well as its use | |
| JP3249514B2 (en) | Hydrolysis of starch to glucose by genetically engineered enzymes | |
| Hirano et al. | Molecular cloning and characterization of a novel γ-CGTase from alkalophilic Bacillus sp. | |
| JP2010504082A (en) | Method for producing 2-O-glyceryl-α-D-glucopyranoside | |
| CA2342124A1 (en) | Nucleic acid molecules encoding an amylosucrase | |
| CN108034645A (en) | The preparation and its application of a kind of yclodextrin glycosyltransferase mutant | |
| Kim et al. | Analysis of the Gene Encoding Cyclomaltodextrinase from AlkalophilicBacillussp. I-5 and Characterization of Enzymatic Properties | |
| JPH04228071A (en) | Amylase a-180 for producing maltopentaose, method of production thereof, dna structure and alkalophilic isolation body | |
| Saini et al. | Microbial cyclodextrin glycosyltransferases: sources, production, and application in cyclodextrin synthesis | |
| CN108018268A (en) | A kind of yclodextrin glycosyltransferase mutant of raising AA-2G yield | |
| WO1995034642A1 (en) | Novel transferase and amylase, process for producing the enzymes,use thereof, and gene coding for the same | |
| Abdalla et al. | One-pot production of maltoheptaose (DP7) from starch by sequential addition of cyclodextrin glucotransferase and cyclomaltodextrinase | |
| Nakapong et al. | Heterologous expression of 4α-glucanotransferase: Overproduction and properties for industrial applications | |
| Rahman et al. | Molecular cloning of a cyclodextrin glucanotransferase gene from alkalophilic Bacillus sp. TS1-1 and characterization of the recombinant enzyme | |
| EP1017793B1 (en) | Novel cyclomaltodextrin glucanotransferase variants | |
| Charoensakdi et al. | Cloning and expression of cyclodextrin glycosyltransferase gene from Paenibacillus sp. T16 isolated from hot spring soil in northern Thailand | |
| Keevil et al. | Inhibition of the synthesis and secretion of extracellular glucosyl-and fructosyltransferase in Streptococcus sanguis by sodium ions | |
| Faridmoayer et al. | An improved purification procedure for soluble processing α-glucosidase I from Saccharomyces cerevisiae overexpressing CWH41 | |
| Abelyan et al. | Characteristics of cyclodextrin production using cyclodextrin glucanotransferases from various groups of microorganisms | |
| Kanpiengjai et al. | Expression and comparative characterization of complete and C-terminally truncated forms of saccharifying α-amylase from Lactobacillus plantarum S21 | |
| Argüello-Morales et al. | Proteolytic modification of Leuconostoc mesenteroides B-512F dextransucrase | |
| Ljubijankić et al. | Synthesis and secretion of Providencia rettgeri and Escherichia coli heterodimeric penicillin amidases in Saccharomyces cerevisiae | |
| Jiang et al. | Cyclodextrin glycosyltransferases for food innovation: from molecular mechanism to industrial application |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: NOVO NORDISK A/S, DENMARK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DIJKHUIZEN, LUBBERT;DIJKSTRA, BAUKE W.;ANDERSEN, CARSTEN;AND OTHERS;REEL/FRAME:008966/0397;SIGNING DATES FROM 19971017 TO 19971103 |
|
| CC | Certificate of correction | ||
| AS | Assignment |
Owner name: NOVOZYMES A/S, DENMARK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NOVO NORDISK A/S;REEL/FRAME:012463/0868 Effective date: 20011029 |
|
| FPAY | Fee payment |
Year of fee payment: 4 |
|
| FPAY | Fee payment |
Year of fee payment: 8 |
|
| REMI | Maintenance fee reminder mailed | ||
| LAPS | Lapse for failure to pay maintenance fees | ||
| STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20111221 |