CN114703168A - Heparinase III, coding nucleotide sequence thereof, recombinant vector and host cell comprising nucleotide sequence and application - Google Patents
Heparinase III, coding nucleotide sequence thereof, recombinant vector and host cell comprising nucleotide sequence and application Download PDFInfo
- Publication number
- CN114703168A CN114703168A CN202210203224.7A CN202210203224A CN114703168A CN 114703168 A CN114703168 A CN 114703168A CN 202210203224 A CN202210203224 A CN 202210203224A CN 114703168 A CN114703168 A CN 114703168A
- Authority
- CN
- China
- Prior art keywords
- ala
- lys
- asn
- leu
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000013598 vector Substances 0.000 title claims abstract description 71
- 108010083213 heparitinsulfate lyase Proteins 0.000 title claims abstract description 70
- 239000002773 nucleotide Substances 0.000 title claims abstract description 53
- 125000003729 nucleotide group Chemical group 0.000 title claims abstract description 53
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 2
- 241000235058 Komagataella pastoris Species 0.000 claims description 90
- 210000004027 cell Anatomy 0.000 claims description 53
- 239000013604 expression vector Substances 0.000 claims description 20
- 238000000034 method Methods 0.000 claims description 16
- 238000003259 recombinant expression Methods 0.000 claims description 10
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 8
- 230000008569 process Effects 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 2
- 230000001939 inductive effect Effects 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 230000002194 synthesizing effect Effects 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 11
- 238000000855 fermentation Methods 0.000 abstract description 7
- 230000004151 fermentation Effects 0.000 abstract description 7
- 238000010353 genetic engineering Methods 0.000 abstract description 4
- 150000001413 amino acids Chemical group 0.000 description 76
- 229940024606 amino acid Drugs 0.000 description 51
- 235000001014 amino acid Nutrition 0.000 description 49
- 108090000623 proteins and genes Proteins 0.000 description 43
- 108020004705 Codon Proteins 0.000 description 39
- 102000004169 proteins and genes Human genes 0.000 description 37
- 235000018102 proteins Nutrition 0.000 description 31
- 230000014509 gene expression Effects 0.000 description 28
- 108020004414 DNA Proteins 0.000 description 27
- 102000004190 Enzymes Human genes 0.000 description 25
- 108090000790 Enzymes Proteins 0.000 description 25
- 238000005457 optimization Methods 0.000 description 18
- 239000012634 fragment Substances 0.000 description 17
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 16
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 16
- 239000000047 product Substances 0.000 description 15
- 241000235648 Pichia Species 0.000 description 12
- 108700010070 Codon Usage Proteins 0.000 description 10
- 238000003752 polymerase chain reaction Methods 0.000 description 10
- 108090000765 processed proteins & peptides Proteins 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 9
- 238000013519 translation Methods 0.000 description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 8
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 8
- 235000004279 alanine Nutrition 0.000 description 8
- 235000004554 glutamine Nutrition 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 229920001184 polypeptide Polymers 0.000 description 8
- 102000004196 processed proteins & peptides Human genes 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 108010022901 Heparin Lyase Proteins 0.000 description 7
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 6
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 6
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 6
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 239000004474 valine Substances 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 241000605114 Pedobacter heparinus Species 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 235000020958 biotin Nutrition 0.000 description 4
- 239000011616 biotin Substances 0.000 description 4
- 229960002685 biotin Drugs 0.000 description 4
- 230000013595 glycosylation Effects 0.000 description 4
- 238000006206 glycosylation reaction Methods 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- -1 heparin small molecules Chemical class 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 3
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 3
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 3
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 3
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 3
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 3
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 3
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 3
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 3
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 3
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 3
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 3
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 3
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 3
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 3
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 3
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 3
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 3
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 3
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 3
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 3
- TZQWZQSMHDVLQL-QEJZJMRPSA-N Asn-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N TZQWZQSMHDVLQL-QEJZJMRPSA-N 0.000 description 3
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 3
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 3
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 3
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 3
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 3
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 3
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 3
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 3
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 3
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 3
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 3
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 3
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 3
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 3
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 3
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 3
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 3
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 3
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 3
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 3
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 3
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 229920002971 Heparan sulfate Polymers 0.000 description 3
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 3
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 3
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 3
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 3
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 3
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 3
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 3
- UXZMINKIEWBEQU-SZMVWBNQSA-N His-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N UXZMINKIEWBEQU-SZMVWBNQSA-N 0.000 description 3
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 3
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 3
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 3
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 3
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 3
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 3
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 3
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 3
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 3
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 3
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 3
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 3
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 3
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 3
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 3
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 3
- OPJRECCCQSDDCZ-TUSQITKMSA-N Lys-Trp-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OPJRECCCQSDDCZ-TUSQITKMSA-N 0.000 description 3
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 3
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 3
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 3
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 3
- QZUCCDSNETVAIS-RYQLBKOJSA-N Met-Trp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N QZUCCDSNETVAIS-RYQLBKOJSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 241000320412 Ogataea angusta Species 0.000 description 3
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 3
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 3
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 3
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 3
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 3
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 3
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 3
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 3
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 3
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 3
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 3
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 3
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 3
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 3
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 3
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 3
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 3
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 3
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 3
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 3
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 3
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 3
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 3
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 3
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 3
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 3
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 3
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 3
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 3
- 108020004566 Transfer RNA Proteins 0.000 description 3
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 3
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 3
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 3
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 3
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 3
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 3
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 3
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 3
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 3
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 3
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 3
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 3
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 3
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 3
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 3
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 3
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 3
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 3
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 229920000669 heparin Polymers 0.000 description 3
- 229960002897 heparin Drugs 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 235000002639 sodium chloride Nutrition 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 2
- LZNVQOJCDNJNNK-UFLZEWODSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoic acid;sulfuric acid Chemical compound OS(O)(=O)=O.N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 LZNVQOJCDNJNNK-UFLZEWODSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241000228197 Aspergillus flavus Species 0.000 description 2
- 241000351920 Aspergillus nidulans Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 241000235646 Cyberlindnera jadinii Species 0.000 description 2
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 241000235036 Debaryomyces hansenii Species 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241001465328 Eremothecium gossypii Species 0.000 description 2
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 2
- OAKJQQAXSVQMHS-UHFFFAOYSA-N Hydrazine Chemical compound NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241001489174 Ogataea minuta Species 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 2
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 240000005384 Rhizopus oryzae Species 0.000 description 2
- 235000013752 Rhizopus oryzae Nutrition 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 241000222124 [Candida] boidinii Species 0.000 description 2
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000003480 eluent Substances 0.000 description 2
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 2
- 150000002309 glutamines Chemical group 0.000 description 2
- 239000013315 hypercross-linked polymer Substances 0.000 description 2
- 239000005457 ice water Substances 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000035699 permeability Effects 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000010453 quartz Substances 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 2
- 235000016491 selenocysteine Nutrition 0.000 description 2
- 229940055619 selenocysteine Drugs 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N silicon dioxide Inorganic materials O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- LDCYZAJDBXYCGN-VIFPVBQESA-N 5-hydroxy-L-tryptophan Chemical compound C1=C(O)C=C2C(C[C@H](N)C(O)=O)=CNC2=C1 LDCYZAJDBXYCGN-VIFPVBQESA-N 0.000 description 1
- 229940000681 5-hydroxytryptophan Drugs 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 description 1
- 108020005098 Anticodon Proteins 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000980809 Aspergillus aureus Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 241000228260 Aspergillus wentii Species 0.000 description 1
- 241000223651 Aureobasidium Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000680806 Blastobotrys adeninivorans Species 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- 241001260012 Bursa Species 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229920001499 Heparinoid Polymers 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 241001562081 Ikeda Species 0.000 description 1
- 229930194542 Keto Natural products 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108700020482 Maltose-Binding protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 241000863391 Methylophilus Species 0.000 description 1
- 241000863393 Methylophilus methylotrophus Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 241000826199 Ogataea wickerhamii Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000235060 Scheffersomyces stipitis Species 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 125000000304 alkynyl group Chemical group 0.000 description 1
- 150000001371 alpha-amino acids Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 150000001615 biotins Chemical class 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 235000008429 bread Nutrition 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 125000004093 cyano group Chemical group *C#N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000012444 downstream purification process Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 125000002485 formyl group Chemical class [H]C(*)=O 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- ZFGMDIBRIDKWMY-PASTXAENSA-N heparin Chemical compound CC(O)=N[C@@H]1[C@@H](O)[C@H](O)[C@@H](COS(O)(=O)=O)O[C@@H]1O[C@@H]1[C@@H](C(O)=O)O[C@@H](O[C@H]2[C@@H]([C@@H](OS(O)(=O)=O)[C@@H](O[C@@H]3[C@@H](OC(O)[C@H](OS(O)(=O)=O)[C@H]3O)C(O)=O)O[C@@H]2O)CS(O)(=O)=O)[C@H](O)[C@H]1O ZFGMDIBRIDKWMY-PASTXAENSA-N 0.000 description 1
- 229960001008 heparin sodium Drugs 0.000 description 1
- 108010006406 heparinase II Proteins 0.000 description 1
- 239000002554 heparinoid Substances 0.000 description 1
- 229940025770 heparinoids Drugs 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 150000002466 imines Chemical class 0.000 description 1
- 230000002998 immunogenetic effect Effects 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 230000001320 lysogenic effect Effects 0.000 description 1
- 125000000311 mannosyl group Chemical group C1([C@@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 150000002482 oligosaccharides Polymers 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- PSWJVKKJYCAPTI-UHFFFAOYSA-N oxido-oxo-phosphonophosphanylphosphanium Chemical compound OP(O)(=O)PP(=O)=O PSWJVKKJYCAPTI-UHFFFAOYSA-N 0.000 description 1
- LDCYZAJDBXYCGN-UHFFFAOYSA-N oxitriptan Natural products C1=C(O)C=C2C(CC(N)C(O)=O)=CNC2=C1 LDCYZAJDBXYCGN-UHFFFAOYSA-N 0.000 description 1
- 230000007918 pathogenicity Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 229920000570 polyether Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 108091022901 polysaccharide lyase Proteins 0.000 description 1
- 102000020244 polysaccharide lyase Human genes 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 239000002510 pyrogen Substances 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000009962 secretion pathway Effects 0.000 description 1
- 108010027322 single cell proteins Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
- C12N15/815—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts for yeasts other than Saccharomyces
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y402/00—Carbon-oxygen lyases (4.2)
- C12Y402/02—Carbon-oxygen lyases (4.2) acting on polysaccharides (4.2.2)
- C12Y402/02008—Heparin-sulfate lyase (4.2.2.8)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention provides heparinase III, a coding nucleotide sequence thereof, a recombinant vector and a host cell comprising the nucleotide sequence and application, and belongs to the field of genetic engineering and fermentation engineering. The heparinase III comprises an amino acid sequence shown as Seq ID No.2 or Seq ID No.3, and compared with the original heparinase III, the activity of the modified heparinase III is not reduced, and the stability is enhanced.
Description
Technical Field
The invention relates to the field of genetic engineering and fermentation engineering, in particular to heparinase III, a coding nucleotide sequence thereof, a recombinant vector and a host cell comprising the nucleotide sequence and application.
Background
Heparinases (heparinases) are a class of polysaccharide lyases that act on heparin or heparan sulfate and are present in a wide variety of microorganisms, with heparinases from Flavobacterium heparinum being the most common. There are only three heparinases from Flavobacterium heparinum, heparinase I (EC 4.2.2.7), heparinase II (No EC code), and heparinase III (EC 4.2.2.8) (Robert J. Linhardt et al purification and catalysis of heparin from Flavobacterium heparinum JBC 1992 Vol.267: 24347-24355).
Heparinase III acts mainly on heparan sulfate and has a molecular weight of 73 kDa. Compared with the other two heparinases, the study shows that heparinase III can act on heparinoids in extracellular matrix to produce active heparin small molecules which can inhibit the proliferation of capillary epithelial cells, thereby inhibiting the growth of tumor cells and reducing the metastasis and spread of cancer cells (Liu DF, Prjanek K, Shriver Z, et al. heparinase III and uses thermoof United States, US 6869789B2[ P ]. 205-5-22).
Disclosure of Invention
At present, the stability of heparinase III is poor, and few heparinase III which can improve the enzyme stability on the basis of ensuring the enzyme catalytic activity are researched and developed by a genetic engineering means.
In view of the above-mentioned drawbacks, the present invention aims to provide heparinase III and a nucleotide sequence encoding the same, a recombinant vector and a host cell comprising the nucleotide sequence.
The invention provides heparinase III which comprises an amino acid sequence shown as Seq ID No.2 or Seq ID No. 3.
The original amino acid sequence of heparinase III is shown in Seq ID No. 1.
Seq ID No.2 indicates that all of the Q glutamines at positions 85, 114, 403 and 547 in the amino acid sequence of Seq ID No.1 are replaced by A alanine.
The amino acid sequence of Seq ID No.3 is that the 403-position Q glutamine is replaced by V valine, and the 85-, 114-and 547-position Q glutamine is replaced by A alanine in Seq ID No. 1.
Another objective of the invention is to provide a nucleotide sequence encoding the heparinase III.
Further, the nucleotide sequence is shown as Seq ID No.4 or Seq ID No.5, wherein Seq ID No.4 is the nucleotide sequence for coding the heparinase III described by Seq ID No.2, and Seq ID No.5 is the nucleotide sequence for coding the heparinase III described by Seq ID No. 3.
It is a further object of the present invention to provide a recombinant vector comprising at least one nucleotide sequence as described above.
Further, the recombinant vector is a eukaryotic cell recombinant vector.
Furthermore, the eukaryotic cell recombinant vector is any one of pPink-HC, pPICZaA and pPICZ A.
Preferably, the eukaryotic cell recombinant vector is pPink-HC.
It is a further object of the present invention to provide a host cell comprising any of the above recombinant vectors.
Further, the host cell is pichia pastoris.
Further, the pichia pastoris is X33.
The invention also provides a preparation method of the heparinase III, which comprises the following steps:
(a) And synthesizing a nucleotide sequence encoding the heparinase III
(b) Combining the nucleotide sequence in the step (a) with a eukaryotic cell recombinant expression vector to obtain a recombinant vector;
(c) and (c) introducing the recombinant vector in the step (b) into a host cell, then culturing, inducing and expressing, and purifying to obtain the heparinase III.
Preferably, the step (c) uses Buffer W to purify the Srep-Tactin column.
Compared with the prior art, the invention has the beneficial effects that:
the invention modifies protease enzyme cutting sites which may affect the stability of heparinase III in the amino acid sequence (as shown in Seq ID No. 1) of the existing heparinase III, and specifically, Q glutamine at 85, 114, 403 and 547 sites in the amino acid sequence of the existing heparinase III is subjected to site-specific mutagenesis to A alanine or V valine, so that the stability of the heparinase III, especially the thermal stability and the enzyme activity half-life are improved, when the 85, 114, 403 and 547 sites in the original amino acid sequence are replaced by the A alanine, the stability is obviously improved, the enzyme activity half-life at the optimal reaction temperature of 30 ℃ reaches about 47 hours, when the 85, 114 and 547 sites in the original amino acid sequence are replaced by the A alanine and the 403 site is replaced by the V valine, the stability is obviously improved, and the enzyme activity half-life at the optimal reaction temperature of 30 ℃ is longer.
The nucleotide sequence provided by the invention is used for coding the heparinase III, a rare codon optimization method is adopted, DNA sequence optimization is carried out according to codon usage preference of pichia pastoris, and the expression efficiency of the optimized sequence in the pichia pastoris is obviously improved.
The recombinant vector and the host cell provided by the invention comprise the nucleotide sequence and can be used for expressing the heparinase III.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is an SDS-PAGE electrophoresis chart of example 8 of the present invention
FIG. 2 is a graph showing stability measurements before and after modification of heparinase III according to example 10 of the present invention.
Detailed Description
"amino acid" refers to any monomeric unit that can be incorporated into a peptide, polypeptide, or protein. As used herein, the term "amino acid" includes the following 20 naturally or genetically encoded α -amino acids: alanine (Ala or A), arginine (Arg or R), asparagine (Asn or N), aspartic acid (Asp or D), cysteine (Cys or C), glutamine (Gln or Q), glutamic acid (Glu or E), glycine (Gly or G), histidine (His or H), isoleucine (Ile or I), leucine (Leu or L), lysine (Lys or K), methionine (Met or M), phenylalanine (Phe or F), proline (Pro or P), serine (Ser or S), threonine (Thr or T), tryptophan (Trp or W), tyrosine (Tyr or Y) and valine (Val or V). In cases where the "X" residues are undefined, these should be defined as "any amino acid". The structures of these 20 natural amino acids are shown, for example, in Stryer et al, BioChemistry, 5 th edition, Freeman and Company (202). Additional amino acids such as Selenocysteine and pyrrolysine can also be genetically encoded (Stadtman (1996) "Selenocysteine," Annu RevBiochem.65:83-100 and Ibba et al (202) "Genetic code: interconnecting pyrolysine," Curr biol.12(13): R464-R466). The term "amino acid" also includes unnatural amino acids, modified amino acids (e.g., having modified side chains and/or backbones), and amino acid analogs. (see, for example, Zhang et al (204) "selection characterization of 5-hydroxytryptophan proteins in macromolecular cells," Proc. Natl. Acad. Sci. U.S.A.11(24):8882-8887, Anderson et al (204) "An expression code with a functional quartz" Proc. Natl. Acad. Sci. U.S.A.11(20):7566-7571, Ikeda et al (203) "Synthesis of a novel heterologous gene and expression interaction of Protein in vivo," Protein Engineer. Des.16 (9 699, 9, 203) "expression of Protein in cells" 1. 31. Biocoding. C.31, expression of Protein in vivo, "expression of Protein, III. 12, 1. D.201, and D.31, Escherichia coli, and No. 31, No. 5-7, No. 5-amino acids in cells, No. 31, No. 5-amino acids, No. 5, No. 7, No. 1, No. 7, No. 1, 7, No. 1, No. 7, 1, 2, 1, 2, 1, a, bacher et al (201) "Selection and catalysis of Escherichia coli Variants Cap of Growth on and thermal toxin Tryptophan Analyze," J.Bacterol.183 (18): 5414-minus 5425, Hamano-Takaku et al (2000) "A mutation Escherichia coli Tyrosyl-tRNA said Amino acids said Amino Acid molecule affinity ligand, J.biol.Chem.275(51): 4324-minus 4328, and Bursa et al (201)" Property with { factor a } - (thio) ligands Amino acids ligand Amino Acid ligand and polypeptide molecule antigen ligand, molecular Amino Acid ligand and polypeptide 1297 (10) } Process enzyme.
To further illustrate, an amino acid is typically an organic acid that includes a substituted or unsubstituted amino group, a substituted or unsubstituted carboxyl group, and one or more side chains or groups, or analogs of any of these groups. Exemplary side chains include, for example, mercapto, seleno, sulfonyl, alkyl, aryl, acyl, keto, azido, hydroxyl, hydrazine, cyano, halogen, hydrazide, alkenyl, alkynyl, ether, borate, phospho, phosphono, phosphine, heterocycle, enone, imine, aldehyde, ester, thioacid, hydroxylamine, or any combination of these groups. Other representative amino acids include, but are not limited to, amino acids comprising photoactive crosslinkers, metal-binding amino acids, spin-labeled amino acids, fluorescing amino acids, metal-containing amino acids, amino acids containing new functional groups, amino acids that interact covalently or non-covalently with other molecules, photolabile (photocaged) and/or photoisomerizable amino acids, radioactive amino acids, amino acids comprising biotin or biotin analogs, glycosylated amino acids, other carbohydrate-modified amino acids, amino acids comprising polyethylene glycol or polyethers, heavy atom substituted amino acids, chemically cleavable and/or photocleavable amino acids, amino acids comprising carbon-linked sugars, redox-active amino acids, amino acids comprising aminothioacids, and amino acids comprising one or more toxic moieties.
The original amino acid sequence of heparinase III in the invention (as shown in Seq ID No. 1) is the sequence of heparinase III published in NCBI database.
Then, on the basis of the amino acid sequence, according to the preference of the expression host cell for codon usage, the nucleotide sequence corresponding to the amino acid sequence is subjected to codon optimization, so as to improve the translation efficiency of the target sequence in the fermentation process of the host cell, and obtain more target proteins as much as possible.
After a target sequence is synthesized by a target company, a target amino acid is mutated on the basis of the sequence, and the fragments are connected by a Gibson reaction after the product is amplified by fragment PCR to form a mutated vector.
Polymerase Chain Reaction (PCR) is a method of using a DNA fragment as a template, and amplifying the DNA fragment to a sufficient amount for structural and functional analysis in the presence of DNA polymerase and nucleotide substrates. The PCR detection method has extremely important significance in the aspects of clinical rapid diagnosis of bacterial infectious diseases and the like.
The principle of PCR is used to amplify a DNA fragment located between two known sequences, similar to the process of replication of natural DNA. The DNA molecule to be amplified is used as a template, a pair of oligonucleotide fragments which are respectively complementary with the 5 'end and the 3' end of the template are used as primers, under the action of DNA polymerase, the DNA molecule is extended along the template chain according to a semi-reserved replication mechanism until new DNA synthesis is completed, and the process is repeated, so that the target DNA fragment can be amplified.
Gibson assembly was first proposed in 2009 by Daniel Gibson doctor and his colleagues j. Gibson assembly is well suited for splicing multiple linear DNA fragments, as well as for inserting the DNA of interest into a vector. First, it is necessary to add a homologous fragment to the end of a DNA fragment (by PCR); then, these DNA fragments were mixed with a Master Mix (containing three enzymes) and incubated for one hour.
Wherein the Master Mix contains three different types of enzymes:
an exonuclease which digests the DNA starting from the 5' end to produce a cohesive end which grows to facilitate paired binding to another homologous end;
a polymerase for repairing a gap;
a DNA ligase realizes traceless splicing to form a complete DNA molecule.
Designing a primer near a mutation amino acid site, wherein a modified amino acid codon is contained in the primer, and the sequence also comprises a homologous sequence required by connecting adjacent segments, purifying the PCR product after amplification, and connecting the PCR amplified segments by a Gibson reaction reagent to obtain a heparinase III recombinant expression vector sequence with the amino acid mutation at a specified position.
Finally, the optimized amino acid sequence shown as Seq ID No.2 or Seq ID No.3 is obtained. In addition, Seq ID No.2 indicates that all of the Q glutamines at positions 85, 114, 403 and 547 in the amino acid sequence of Seq ID No.1 are replaced with A alanine. The amino acid sequence of Seq ID No.3 is that Q glutamine at position 403 of Seq ID No.1 is replaced by V valine, and Q glutamine at positions 85, 114 and 547 is replaced by A alanine.
A coding sequence: the term "coding sequence" as used herein includes a nucleotide sequence which directly indicates the amino acid sequence of its protein product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon. The coding sequence typically includes DNA, cDNA, and recombinant nucleotide sequences.
The term "nucleotide", in addition to referring to naturally occurring ribonucleotide or deoxyribonucleotide monomers, is also understood herein to refer to related structural variants thereof, including derivatives and analogs, which are functionally equivalent with respect to the particular context in which the nucleotide is used, unless the context clearly indicates otherwise.
The terms "codon-optimized," "codon-optimized," or "codon-usage bias" refer to the practice of selecting codons (i.e., codon usage) in a manner that optimizes or customizes expression as desired (i.e., a technique that improves protein expression in an organism by increasing the translation efficiency of a gene of interest). In other words, codon optimization is a method of adjusting codons to match host tRNA abundance and has traditionally been used to express heterologous genes. New strategies for optimizing heterologous expression take into account global nucleotide content such as local mRNA folding, codon pair bias, codon ramp (codon ramp) or codon dependence. Codon optimization is possible because codon degeneracy is inherent. Since there are more codons than can encode an amino acid, resulting in degeneracy. Thus, the vast majority of amino acids are encoded by multiple codons, meaning that there are multiple trnas (with different anti-codon loops) that carry any given amino acid. Thus, different codons can be used without changing the encoded amino acid sequence. That is, a gene or fragment of a nucleic acid may be mutated/altered (or de novo synthesized) to change the codon used to encode a particular amino acid, without altering the amino acid sequence of the polypeptide/protein itself. For example, rare codons can be replaced with more abundant codons while keeping the amino acid sequence unchanged.
The term "host cell" refers to unicellular prokaryotic and eukaryotic organisms (e.g., bacteria, yeast, and actinomycetes) as well as unicellular cells from higher plants or animals when grown in cell culture. A "host cell" can be an animal host cell, a plant host cell, a yeast host cell, a fungal host cell, a protozoan host cell and a prokaryotic host cell.
For example, the host cell is selected from: pichia pastoris (Pichia pastoris), Pichia angusta (Pichia angusta) (Hansenula polymorpha)), Pichia finnanensis (Pichia finlandica), Pichia mycorrhiza (Pichia pastoris), Pichia pastoris (Ogataea minuta), Pichia stipitis (Pichia pastoris), Pichia pastoris (Ogataea minuta), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia stipityrium), Pichia pastoris (Pichia pastoris), Pichia pastoris (Aspergillus niger (Pichia pastoris), Pichia pastoris (Pichia pastoris) or (Pichia pastoris), Pichia pastoris (Pichia pastoris) strain (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris) or (Pichia pastoris), Pichia pastoris) stem (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris) stem (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris) or (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris) or (Pichia pastoris) or (Pichia pastoris) strain (Pichia pastoris), Pichia pastoris) or (Pichia pastoris), Pichia pastoris (Pichia pastoris) strain (Pichia pastoris) or (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris (Pichia pastoris), Pichia pastoris) or (Pichia pastoris) or (Pichia pastoris), Pichia pastoris) or (Pichia pastoris) or (Pichia pastoris), Pichia pastoris) or (Pichia pastoris) strain (Pichia pastoris (, Arxula adeninivorans, Aspergillus nidulans (Aspergillus nidulans), Aspergillus went (Aspergillus wentii), Aspergillus aureobasidium (Aspergillus aureus), Aspergillus flavus (Aspergillus flavus), Ashbya gossypii (Ashbya gossypii), Methylophilus methylotrophus (Methyphus methylotrophus), Schizosaccharomyces pombe (Schizosaccharomyces pombe), Candida boidinii (Candida boidinii), Candida utilis (Candida utilis), Rhizopus oryzae (Rhizopus oryzae), Debaryomyces hansenii (Debaromyces hansenii) and Saccharomyces cerevisiae. In one embodiment of the invention, the host cell may be pichia or saccharomyces cerevisiae. In one embodiment of the invention, the host cell is of the genus Saccharomyces cerevisiae. In a preferred embodiment of the invention, the host cell is of the genus Pichia (Pichia), using the Pichia pastoris expression system. Compared with a prokaryotic expression system, the yeast as a unicellular eukaryote has the following characteristics: first, yeast is a eukaryote that can make glycosylation of certain proteins more stable, and post-translational modifications such as correct disulfide bond formation and removal of signal peptides, N-and O-binding glycosylation modifications; secondly, the yeast has a unicellular microbial structure and has the advantages of rapid growth of a bacterial system, easy genetic engineering operation and the like; compared with Escherichia coli, yeast has no endotoxin, no lysogenic virus, and has close relationship with human, no pathogenicity, and has been used in bread and wine industry for a long time. Yeast is also an ideal secretory expression system. Normally, no protein is added into the yeast culture medium, and the normal yeast secretion protein is only 0.5 percent of the total protein of the yeast cells; it can also secrete the produced foreign protein into the culture medium, thus facilitating the separation and purification of the product, and greatly reducing the cost. For those proteins that require correct post-translational folding, natural secretion, and foreign proteins that are not stable or toxic intracellularly, expression in such secretory expression systems is well suited.
The pichia pastoris system has several advantages: (1) the system adopts a methanol-induced promoter, a single-copy ethanol oxidase gene can produce 30 percent of ethanol oxidase of the total soluble protein amount of cells under the drive of the promoter, and the promoter is introduced into an expression vector to drive the expression of a foreign gene. (2) High density growth, in the 70 s, pichia was used to produce single cell proteins, thus producing fermentation technology of pichia up to 100 g of dry cells per liter of culture. (3) High level secretion in protein-free culture medium, and the expression vector with Saccharomyces cerevisiae factor a signal peptide can secrete the expression product to outside cell. The medium for expression contains inorganic salts, trace elements, biotin and carbon source, and has no toxin and pyrogen, so that the secreted expression product is easy to purify. (4) The glycosylation form of Pichia is closer to that of mammals, and the expression product N-combined mannose is generally only (8-14), but is as much as (50-150) in Saccharomyces cerevisiae. Furthermore, these products do not contain a combination of terminal a-1, 3-mannose as in Saccharomyces cerevisiae, and this modification can result in immunogenetics. (5) The stability of the plasmid is good, and the pichia pastoris has no plasmid, allows the tandem repetition of the expression components, and is suitable for integrating to a chromosome to express the foreign gene. (6) The permeability of the secretion product is good, and the permeability is further influenced by the fact that in Saccharomyces cerevisiae, due to excessive glycosylation, many mannose residues are present on each N-bonded oligosaccharide chain. Generally, foreign proteins with molecular weight of more than 20KDa can not be secreted into a culture solution, and pichia does not have the phenomenon, has a set of secretion pathways extremely similar to those of eukaryotes in vivo, and is secreted from endoplasmic reticulum to the outside through Golgi bodies and vesicles. Both the diverting enzyme with the molecular weight of more than 50KDa and the human serum protein can be secreted to the outside of cells, and the expression amount is 1 g/L.
In a preferred embodiment of the present invention, the nucleotide sequence corresponding to the above amino acid sequence is codon optimized according to the codon usage preference of pichia pastoris for expression, and the protein sequence of heparinase III is recorded in codon preference analysis software, such as GenSmart, which is self-developed by tsingji biotechnology limited of NanjingTMSequence Optimization was performed by Codon Optimization sequence Optimization software (https:// www.genscript.com/tools/generational-code-Optimization); on the premise of ensuring that the protein sequence of heparinase III is unchanged and only utilizing the degeneracy of codons, the codons which are low in use frequency and can influence the passing efficiency of a ribosome in the translation process in pichia pastoris are replaced by the codons with high use frequency to obtain a nucleotide sequence with optimized codons, wherein the nucleotide sequence is shown as Seq ID No.4 or Seq ID No.5 in a sequence table. Wherein, Seq ID No.4 is the nucleotide sequence for coding the heparinase III described by Seq ID No.2, and Seq ID No.5 is the nucleotide sequence for coding the heparinase III described by Seq ID No. 3.
Expressing: the term "expression" in this context includes any step involved in the production of a polypeptide, including but not limited to transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
The term "vector" refers to a piece of DNA, usually double stranded, into which a foreign piece of DNA may have been inserted. The vector may be of plasmid origin, for example. The vector contains a "replicon" polynucleotide sequence that facilitates autonomous replication of the vector in a host cell. Exogenous DNA is defined as heterologous DNA, which is DNA not naturally found in the host cell, which, for example, replicates a vector molecule, encodes a selectable or screenable marker, or encodes a transgene. The vector is used to transport the foreign or heterologous DNA into a suitable host cell. Once in the host cell, the vector may replicate independently of or simultaneously with the host chromosomal DNA, and several copies of the vector and its inserted DNA may be generated. In addition, the vector may contain the necessary elements to permit transcription of the inserted DNA into an mRNA molecule or to otherwise cause replication of the inserted DNA into multiple copies of RNA. Some expression vectors additionally contain sequence elements near the inserted DNA that increase the half-life of the expressed mRNA and/or allow translation of the mRNA into a protein molecule. Thus, many molecules of mRNA and polypeptide encoded by the inserted DNA can be synthesized rapidly.
Expression vector: the term "expression vector" in this context includes a linear or circular DNA molecule comprising a fragment encoding a polypeptide of the present invention, which fragment may be operably linked to other fragments which allow its transcription.
The recombinant expression vector may be any vector (e.g., a plasmid or virus) which can be conveniently treated with recombinant DNA procedures and can express the nucleotide sequence. The choice of vector will generally depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector may be a linear or closed circular plasmid.
The vector may be an autonomously replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome.
The vector may contain any means for ensuring self-replication (means). Alternatively, the vector may be integrated into the genome and replicated together with the chromosome(s) into which it has been integrated, when introduced into the host cell. Alternatively, a single vector or plasmid, or two or more vectors or plasmids which collectively contain the entire DNA to be introduced into the genome of the host cell, may be used, or a transposon may be used.
In one embodiment of the invention, the recombinant vector comprises a nucleotide sequence encoding the amino acid sequence shown in Seq ID No.2 or Seq ID No.3, preferably, when the host cell is selected from Pichia pastoris, said recombinant vector comprises the nucleotide sequence shown in Seq ID No.4 or Seq ID No. 5.
In one embodiment of the invention, the recombinant vector is a eukaryotic recombinant vector. The expression vector adopted by the recombinant vector can be pPink-HC, pPICZaA, pPICZ A, pPICZ, pPICZ alpha, pGAPZ, pGAPZ, pHBM905A, pPIC9K, pPIC9K-His, pPIC3.5K, pPIC9, pPICZ alpha A, pAO815, pPIC9k-His, pHIL-S1, pGADT7, pGB 7, pWB980, pT3 and the like; preference is given to pPink-HC, pPICZaA and pPICZ A, particular preference to pPink-HC. The above-mentioned vectors are commercially available.
In a preferred embodiment of the present invention, the eukaryotic cell recombinant vector is pPink-HC. In another preferred embodiment of the present invention, the eukaryotic recombinant vector is pPICZaA.
In another preferred embodiment of the invention, the eukaryotic recombinant vector is pPICZ a.
In one embodiment of the invention, the nucleotide sequence corresponding to the amino acid sequence is codon optimized according to the codon usage preference of pichia pastoris, and the protein sequence of heparinase III is recorded into GenSmart autonomously developed by codon preference analysis software Nanjing Kingsler Biotech Co TMSequence Optimization was performed by the Codon Optimization sequence Optimization software (https:// www.genscript.com/tools/genetic-code-Optimization); on the premise of ensuring that the protein sequence of heparinase III is not changed and only utilizing the degeneracy of codons, the codons which are low in use frequency and can influence the passing efficiency of a ribosome in the translation process in pichia pastoris are replaced by the codons with high use frequency, so that the nucleotide sequence of the amino acid sequence shown in Seq ID No.2 or Seq ID No.3 is obtained.
In a preferred embodiment of the invention, the codon-optimized nucleotide sequence is the nucleotide sequence shown in Seq ID No.4 or Seq ID No. 5.
In one embodiment of the present invention, the above nucleotide sequence is combined with a eukaryotic recombinant vector to obtain a recombinant vector. For example, the above nucleotide sequence is combined with pPink-HC to obtain a pPink-HC recombinant vector.
In one embodiment of the present invention, the above recombinant vector is introduced into a host cell, such as a pPink-HC recombinant vector into Pichia pastoris.
In one embodiment of the present invention, the above nucleotide sequence is combined with pPICZaA to obtain a pPICZaA recombinant vector.
In one embodiment of the present invention, the above recombinant vector is introduced into a host cell, such as a pPICZaA recombinant vector into Pichia pastoris.
In one embodiment of the present invention, the above nucleotide sequence is combined with pPICZ A to obtain a pPICZ A recombinant vector.
In one embodiment of the present invention, the above recombinant vector is introduced into a host cell, such as a pPICZ A recombinant vector into Pichia pastoris.
Means for purifying recombinant proteins are well known in the art and may include clarification (e.g., filtration or centrifugation), affinity chromatography, immunoaffinity chromatography, protein a (or G) chromatography, ion exchange (i.e., cation and/or anion) chromatography, size exclusion chromatography, adsorption chromatography, hydrophobic interaction chromatography, reverse phase chromatography, ultracentrifugation, precipitation, immunoprecipitation, extraction, phase separation, and the like.
The methods described herein comprise expressing a recombinant protein in a host cell line engineered to have at least one labeled Host Cell Protein (HCP), wherein the at least one labeled HCP is labeled with at least one purification tag. Typically, labeled HCPs are proteins that are highly abundant, difficult to remove during downstream purification processes, and/or affect product quality (e.g., residual proteases may degrade the biotherapeutic product, thereby reducing its efficacy). An HCP with these characteristics is referred to as a "problematic" HCP. Typically, labeled HCPs are proteins essential for cell survival and/or cell function (and, therefore, are not good candidates for gene knockout strategies). They can be captured in a separation process (e.g., a separation column). Non-limiting embodiments of such additional domains include peptide motifs referred to as Myc tags, HAT tags, HA tags, TAP tags, GST tags, chitin binding domains (CBD tags), maltose binding proteins (MBP tags), Flag tags, Strep tags and variants thereof (e.g., strell tags), and His tags.
In some embodiments, the disclosed polypeptides comprise a Strep tag, e.g., a Strepll tag. Adding a strepII tag sequence at the C end of the heparinase III, purifying a Srep-Tactin column by using Buffer W through a desulfurated biotin purification column by using a crude cell extract, and realizing the purification of the target protein heparinase III by virtue of the interaction between the strepII tag and biotin.
In one embodiment of the invention, a disruption pretreatment is performed prior to purification, specifically as follows:
(a) taking thalli fermentation liquor for induced expression and carrying out low-temperature centrifugation;
(b) collecting the supernatant, and suspending by using Buffer W;
(c) breaking cells on ice-water mixture by ultrasonic;
(d) the suspension was centrifuged.
Preferably, the lysate is e.g.viscous, RNase A (10ug/ml) and DNase I (5ug/ml) can be added and incubated on ice.
In one embodiment of the invention, the purification steps of heparinase III are as follows:
(a) washing the Srep-Tactin column by using Buffer W, and then carrying out column loading operation;
(b) washing the column by using Buffer W, and collecting eluent;
(c) buffer E was added and the eluate was collected.
(d) The D-biotin sulfate was removed in several portions using Buffer R, and after complete removal, the column was washed using Buffer W.
Examples
The invention will be further illustrated with reference to specific embodiments. It should be understood that these embodiments are merely illustrative of the present invention and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the contents of the present invention, and those equivalents may fall within the scope of the present invention defined by the appended claims.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified. In the following examples, unless otherwise specified, all methods are conventional. The DNA sequence optimization was done by sumizia jingzhi biotechnology limited. (Pichia pastoris, available from Thermo Fisher corporation, product Cat. A11152. Pichia PinkTMThe vector kit was purchased from Thermo Fisher, inc, product No. a 11152. pPink-HC was purchased from Thermo Fisher, product No. A11152 (both expression vector and strain are included in the kit).
EXAMPLE 1 Synthesis and screening of amino acid III sequences
The original amino acid sequence of heparinase III is a sequence from heparinase III that has been published in the NCBI database.
Then, on the basis of the amino acid sequence, according to the preference of an expression host pichia pastoris for codon use, codon optimization is carried out on the nucleotide sequence corresponding to the amino acid sequence, so as to improve the translation efficiency of the target sequence in the pichia pastoris fermentation process, and obtain more target proteins as far as possible.
After a target sequence is synthesized by Suzhou Jinzhi biotechnology, Inc., mutation of a target amino acid is carried out on the basis of the sequence, and each fragment is connected through Gibson reaction after a fragment PCR amplification product to form a mutated vector.
Designing a primer near a mutation amino acid site, wherein a modified amino acid codon is contained in the primer, and the sequence also comprises a homologous sequence required by connecting adjacent segments, purifying the PCR product after amplification, and connecting the PCR amplified segments by a Gibson reaction reagent to obtain a heparinase III recombinant expression vector sequence with the amino acid mutation at a specified position.
The amino acid sequence shown in Seq ID No.2 or Seq ID No.3 was obtained.
Example 2 Synthesis of heparinase III nucleotide sequence and construction of recombinant expression vectors
(a) The sequence of the coding region of heparinase III from Flavobacterium heparinum contains a large amount of rare codons, and the translation efficiency of protein is influenced when Pichia pastoris is used as a host cell for expression, so the codon optimization is required. The protein sequence of heparinase III is recorded into codon preference analysis software, GenSmart which is independently developed by Nanjing Kingsler Biotech Co., Ltd TMSequence Optimization was performed by the Codon Optimization sequence Optimization software (https:// www.genscript.com/tools/genetic-code-Optimization);
(b) according to the codon usage preference of pichia pastoris in a pichia pastoris codon usage preference data table, optimizing the coding region sequence of heparinase III preliminarily screened in the embodiment 1 by using the software, replacing codons which are low in use frequency and can influence the passing efficiency of a ribosome in the translation process in pichia pastoris with codons with high use frequency on the premise of ensuring that the protein sequence of the heparinase III is unchanged and only utilizing the degeneracy of the codons to obtain a nucleotide sequence after codon optimization, wherein the obtained sequence is shown as Seq ID No.4 or Seq ID No.5 in a sequence table;
and then connecting the obtained codon-optimized nucleotide sequence of the heparinase III with a pPink-HC eukaryotic cell expression vector by a conventional means of molecular biology to construct a pPink-HC-heparinase III recombinant expression vector.
Example 3 eukaryotic cell expression vector is pPICZaA, other operation steps as shown in example 2, construction of pPICZaA-heparinase III recombinant expression vector.
Example 4 construction of pPICZ A-heparinase III recombinant expression vector for eukaryotic expression vector pPICZ A and other procedures as shown in example 2
Example 5 expression of heparinase III
Taking the recombinant expression vectors of example 2, example 3 and example 4, preparing competence andreference for electrotransformation and induction expression of thallus culture medium for Pichia PinkTMAnd (5) carrying out operation by the carrier kit instruction.
Example 6 disruption pretreatment of heparinase III
(a) Taking the thallus fermentation liquor induced and expressed in the embodiment 5 for low-temperature centrifugation, and centrifuging for 15min at 4500g and 5 ℃;
(b) collecting supernatant, suspending 100ml of the thallus obtained in (a) with 1ml of Buffer W, and adding appropriate amount of protease inhibitor if necessary;
(c) breaking cells on ice-water mixture by ultrasonic;
(d) and (c) centrifuging the suspension obtained in the step (c) for 15min at the temperature of 4 ℃ and at the speed of 13000 rpm.
Preferably, the suspension is e.g.viscous, RNase A (10ug/ml) and DNase I (5ug/ml) can be added and incubated on ice for 10-15 min.
Example 7 purification of heparinase III
(a) The Srep-Tactin column is cleaned by using Buffer W of 2CVs, and then the suspension obtained in the example 6 is subjected to column loading operation;
(b) washing the column by using Buffer W of 5CV, and collecting eluent;
(c) 6 times of Buffer E at 0.5CVs was added and collected every 0.5 CV.
(d) D-Biotin sulfate was removed 3 times using a Buffer R of 15CVs, and after complete removal, the column was washed using 8CVs Buffer W.
Example 8 SDS-PAGE detection of heparinase III
The Buffer E treated sample of example 7 was collected and 20uL of the sample was subjected to heparinase III SDS-PAGE, and the molecular weight of heparinase III was about 73.36kDa as shown in FIG. 1. After the analysis by adopting gel density scanning software Bandscan 5.0, the protein purity of the target heparinase III corresponding to the amino acid sequence shown in Seq ID No.2 or Seq ID No.3 can reach more than 85%.
Example 9 enzyme Activity assay
The scanning wavelength was 232nm and the time was 3 min. Reaction buffer (20mM Tris, 200mM NaCl, pH 7.4) was mixed with the enzyme solution in a total of 1000uL (the ratio was adjusted according to the activity of the enzyme solution)Section), 500uL substrate solution (17mM Tris, 44mM NaCl, 3.5mM CaCl2,25g/L heparin sodium pH 7.4) is placed in a quartz cuvette, mixed uniformly and placed in a spectrophotometer to scan (reaction buffer and substrate solution are preheated to 30 ℃ in a water bath), the scanning time is 70s, data of 40-60s time period are taken, and the slope k (min) of the curve is calculated-1) The enzyme activity (IU/L) of the heparinase III is calculated as follows (V is the volume of the enzyme solution added into the reaction system):
through determination, the enzyme activity of the modified heparinase III (Seq ID No.3) is 473.68IU/L, and the catalytic activity is not reduced.
Example 10 stability analysis
The purified heparinase III is placed in a constant temperature incubator at 30 ℃, certain enzyme liquid is periodically sucked, the enzyme activity is measured according to the method described in the embodiment 9, and the result is shown in figure 2 by comparing the enzyme activity with the enzyme activity of unmodified heparinase III. In FIG. 2, the enzyme of SEQ ID No. 1 represents a protein consisting of original amino acids, the enzyme of SEQ ID No.2 represents a protein consisting of Seq ID No.2 in the sequence Listing, and the enzyme of SEQ ID No.3 represents a protein consisting of Seq ID No.3 in the sequence Listing.
The comparison shows that the stability of the modified heparinase III is obviously improved, compared with the enzyme with the sequence 1, the half-life period of the enzyme activity of the heparinase III corresponding to the enzyme with the sequence 2 is improved to about 47h from about 30h, and the stability of the heparinase III corresponding to the enzyme with the mutant sequence 3 is obviously improved.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Sequence listing
<110> Liu Ying
<120> heparinase III, nucleotide sequence encoding same, recombinant vector and host cell comprising nucleotide sequence and application
<130> TPE01497
<160> 5
<170> PatentIn version 3.5
<210> 1
<211> 644
<212> PRT
<213> Artificial sequence
<220>
<223> Artificial sequence description: artificially synthesized sequences
<400> 1
Met Gln Ser Ser Ser Ile Thr Arg Lys Asp Phe Asp His Ile Asn Leu
1 5 10 15
Glu Tyr Ser Gly Leu Glu Lys Val Asn Lys Ala Val Ala Ala Gly Asn
20 25 30
Tyr Asp Asp Ala Ala Lys Ala Leu Leu Ala Tyr Tyr Arg Glu Lys Ser
35 40 45
Lys Ala Arg Glu Pro Asp Phe Ser Asn Ala Glu Lys Pro Ala Asp Ile
50 55 60
Arg Gln Pro Ile Asp Lys Val Thr Arg Glu Met Ala Asp Lys Ala Leu
65 70 75 80
Val His Gln Phe Gln Pro His Lys Gly Tyr Gly Tyr Phe Asp Tyr Gly
85 90 95
Lys Asp Ile Asn Trp Gln Met Trp Pro Val Lys Asp Asn Glu Val Arg
100 105 110
Trp Gln Leu His Arg Val Lys Trp Trp Gln Ala Met Ala Leu Val Tyr
115 120 125
His Ala Thr Gly Asp Glu Lys Tyr Ala Arg Glu Trp Val Tyr Gln Tyr
130 135 140
Ser Asp Trp Ala Arg Lys Asn Pro Leu Gly Leu Ser Gln Asp Asn Asp
145 150 155 160
Lys Phe Val Trp Arg Pro Leu Glu Val Ser Asp Arg Val Gln Ser Leu
165 170 175
Pro Pro Thr Phe Ser Leu Phe Val Asn Ser Pro Ala Phe Thr Pro Ala
180 185 190
Phe Leu Met Glu Phe Leu Asn Ser Tyr His Gln Gln Ala Asp Tyr Leu
195 200 205
Ser Thr His Tyr Ala Glu Gln Gly Asn His Arg Leu Phe Glu Ala Gln
210 215 220
Arg Asn Leu Phe Ala Gly Val Ser Phe Pro Glu Phe Lys Asp Ser Pro
225 230 235 240
Arg Trp Arg Gln Thr Gly Ile Ser Val Leu Asn Thr Glu Ile Lys Lys
245 250 255
Gln Val Tyr Ala Asp Gly Met Gln Phe Glu Leu Ser Pro Ile Tyr His
260 265 270
Val Ala Ala Ile Asp Ile Phe Leu Lys Ala Tyr Gly Ser Ala Lys Arg
275 280 285
Val Asn Leu Glu Lys Glu Phe Pro Gln Ser Tyr Val Gln Thr Val Glu
290 295 300
Asn Met Ile Met Ala Leu Ile Ser Ile Ser Leu Pro Asp Tyr Asn Thr
305 310 315 320
Pro Met Phe Gly Asp Ser Trp Ile Thr Asp Lys Asn Phe Arg Met Ala
325 330 335
Gln Phe Ala Ser Trp Ala Arg Val Phe Pro Ala Asn Gln Ala Ile Lys
340 345 350
Tyr Phe Ala Thr Asp Gly Lys Gln Gly Lys Ala Pro Asn Phe Leu Ser
355 360 365
Lys Ala Leu Ser Asn Ala Gly Phe Tyr Thr Phe Arg Ser Gly Trp Asp
370 375 380
Lys Asn Ala Thr Val Met Val Leu Lys Ala Ser Pro Pro Gly Glu Phe
385 390 395 400
His Ala Gln Pro Asp Asn Gly Thr Phe Glu Leu Phe Ile Lys Gly Arg
405 410 415
Asn Phe Thr Pro Asp Ala Gly Val Phe Val Tyr Ser Gly Asp Glu Ala
420 425 430
Ile Met Lys Leu Arg Asn Trp Tyr Arg Gln Thr Arg Ile His Ser Thr
435 440 445
Leu Thr Leu Asp Asn Gln Asn Met Val Ile Thr Lys Ala Arg Gln Asn
450 455 460
Lys Trp Glu Thr Gly Asn Asn Leu Asp Val Leu Thr Tyr Thr Asn Pro
465 470 475 480
Ser Tyr Pro Asn Leu Asp His Gln Arg Ser Val Leu Phe Ile Asn Lys
485 490 495
Lys Tyr Phe Leu Val Ile Asp Arg Ala Ile Gly Glu Ala Thr Gly Asn
500 505 510
Leu Gly Val His Trp Gln Leu Lys Glu Asp Ser Asn Pro Val Phe Asp
515 520 525
Lys Thr Lys Asn Arg Val Tyr Thr Thr Tyr Arg Asp Gly Asn Asn Leu
530 535 540
Met Ile Gln Ser Leu Asn Ala Asp Arg Thr Ser Leu Asn Glu Glu Glu
545 550 555 560
Gly Lys Val Ser Tyr Val Tyr Asn Lys Glu Leu Lys Arg Pro Ala Phe
565 570 575
Val Phe Glu Lys Pro Lys Lys Asn Ala Gly Thr Gln Asn Phe Val Ser
580 585 590
Ile Val Tyr Pro Tyr Asp Gly Gln Lys Ala Pro Glu Ile Ser Ile Arg
595 600 605
Glu Asn Lys Gly Asn Asp Phe Glu Lys Gly Lys Leu Asn Leu Thr Leu
610 615 620
Thr Ile Asn Gly Lys Gln Gln Leu Val Leu Val Pro Trp Ser His Pro
625 630 635 640
Gln Phe Glu Lys
<210> 2
<211> 644
<212> PRT
<213> Artificial sequence
<220>
<223> Artificial sequence description: artificially synthesized sequences
<400> 2
Met Gln Ser Ser Ser Ile Thr Arg Lys Asp Phe Asp His Ile Asn Leu
1 5 10 15
Glu Tyr Ser Gly Leu Glu Lys Val Asn Lys Ala Val Ala Ala Gly Asn
20 25 30
Tyr Asp Asp Ala Ala Lys Ala Leu Leu Ala Tyr Tyr Arg Glu Lys Ser
35 40 45
Lys Ala Arg Glu Pro Asp Phe Ser Asn Ala Glu Lys Pro Ala Asp Ile
50 55 60
Arg Gln Pro Ile Asp Lys Val Thr Arg Glu Met Ala Asp Lys Ala Leu
65 70 75 80
Val His Gln Phe Ala Pro His Lys Gly Tyr Gly Tyr Phe Asp Tyr Gly
85 90 95
Lys Asp Ile Asn Trp Gln Met Trp Pro Val Lys Asp Asn Glu Val Arg
100 105 110
Trp Ala Leu His Arg Val Lys Trp Trp Gln Ala Met Ala Leu Val Tyr
115 120 125
His Ala Thr Gly Asp Glu Lys Tyr Ala Arg Glu Trp Val Tyr Gln Tyr
130 135 140
Ser Asp Trp Ala Arg Lys Asn Pro Leu Gly Leu Ser Gln Asp Asn Asp
145 150 155 160
Lys Phe Val Trp Arg Pro Leu Glu Val Ser Asp Arg Val Gln Ser Leu
165 170 175
Pro Pro Thr Phe Ser Leu Phe Val Asn Ser Pro Ala Phe Thr Pro Ala
180 185 190
Phe Leu Met Glu Phe Leu Asn Ser Tyr His Gln Gln Ala Asp Tyr Leu
195 200 205
Ser Thr His Tyr Ala Glu Gln Gly Asn His Arg Leu Phe Glu Ala Gln
210 215 220
Arg Asn Leu Phe Ala Gly Val Ser Phe Pro Glu Phe Lys Asp Ser Pro
225 230 235 240
Arg Trp Arg Gln Thr Gly Ile Ser Val Leu Asn Thr Glu Ile Lys Lys
245 250 255
Gln Val Tyr Ala Asp Gly Met Gln Phe Glu Leu Ser Pro Ile Tyr His
260 265 270
Val Ala Ala Ile Asp Ile Phe Leu Lys Ala Tyr Gly Ser Ala Lys Arg
275 280 285
Val Asn Leu Glu Lys Glu Phe Pro Gln Ser Tyr Val Gln Thr Val Glu
290 295 300
Asn Met Ile Met Ala Leu Ile Ser Ile Ser Leu Pro Asp Tyr Asn Thr
305 310 315 320
Pro Met Phe Gly Asp Ser Trp Ile Thr Asp Lys Asn Phe Arg Met Ala
325 330 335
Gln Phe Ala Ser Trp Ala Arg Val Phe Pro Ala Asn Gln Ala Ile Lys
340 345 350
Tyr Phe Ala Thr Asp Gly Lys Gln Gly Lys Ala Pro Asn Phe Leu Ser
355 360 365
Lys Ala Leu Ser Asn Ala Gly Phe Tyr Thr Phe Arg Ser Gly Trp Asp
370 375 380
Lys Asn Ala Thr Val Met Val Leu Lys Ala Ser Pro Pro Gly Glu Phe
385 390 395 400
His Ala Ala Pro Asp Asn Gly Thr Phe Glu Leu Phe Ile Lys Gly Arg
405 410 415
Asn Phe Thr Pro Asp Ala Gly Val Phe Val Tyr Ser Gly Asp Glu Ala
420 425 430
Ile Met Lys Leu Arg Asn Trp Tyr Arg Gln Thr Arg Ile His Ser Thr
435 440 445
Leu Thr Leu Asp Asn Gln Asn Met Val Ile Thr Lys Ala Arg Gln Asn
450 455 460
Lys Trp Glu Thr Gly Asn Asn Leu Asp Val Leu Thr Tyr Thr Asn Pro
465 470 475 480
Ser Tyr Pro Asn Leu Asp His Gln Arg Ser Val Leu Phe Ile Asn Lys
485 490 495
Lys Tyr Phe Leu Val Ile Asp Arg Ala Ile Gly Glu Ala Thr Gly Asn
500 505 510
Leu Gly Val His Trp Gln Leu Lys Glu Asp Ser Asn Pro Val Phe Asp
515 520 525
Lys Thr Lys Asn Arg Val Tyr Thr Thr Tyr Arg Asp Gly Asn Asn Leu
530 535 540
Met Ile Ala Ser Leu Asn Ala Asp Arg Thr Ser Leu Asn Glu Glu Glu
545 550 555 560
Gly Lys Val Ser Tyr Val Tyr Asn Lys Glu Leu Lys Arg Pro Ala Phe
565 570 575
Val Phe Glu Lys Pro Lys Lys Asn Ala Gly Thr Gln Asn Phe Val Ser
580 585 590
Ile Val Tyr Pro Tyr Asp Gly Gln Lys Ala Pro Glu Ile Ser Ile Arg
595 600 605
Glu Asn Lys Gly Asn Asp Phe Glu Lys Gly Lys Leu Asn Leu Thr Leu
610 615 620
Thr Ile Asn Gly Lys Gln Gln Leu Val Leu Val Pro Trp Ser His Pro
625 630 635 640
Gln Phe Glu Lys
<210> 3
<211> 644
<212> PRT
<213> Artificial sequence
<220>
<223> Artificial sequence description: artificially synthesized sequences
<400> 3
Met Gln Ser Ser Ser Ile Thr Arg Lys Asp Phe Asp His Ile Asn Leu
1 5 10 15
Glu Tyr Ser Gly Leu Glu Lys Val Asn Lys Ala Val Ala Ala Gly Asn
20 25 30
Tyr Asp Asp Ala Ala Lys Ala Leu Leu Ala Tyr Tyr Arg Glu Lys Ser
35 40 45
Lys Ala Arg Glu Pro Asp Phe Ser Asn Ala Glu Lys Pro Ala Asp Ile
50 55 60
Arg Gln Pro Ile Asp Lys Val Thr Arg Glu Met Ala Asp Lys Ala Leu
65 70 75 80
Val His Gln Phe Ala Pro His Lys Gly Tyr Gly Tyr Phe Asp Tyr Gly
85 90 95
Lys Asp Ile Asn Trp Gln Met Trp Pro Val Lys Asp Asn Glu Val Arg
100 105 110
Trp Ala Leu His Arg Val Lys Trp Trp Gln Ala Met Ala Leu Val Tyr
115 120 125
His Ala Thr Gly Asp Glu Lys Tyr Ala Arg Glu Trp Val Tyr Gln Tyr
130 135 140
Ser Asp Trp Ala Arg Lys Asn Pro Leu Gly Leu Ser Gln Asp Asn Asp
145 150 155 160
Lys Phe Val Trp Arg Pro Leu Glu Val Ser Asp Arg Val Gln Ser Leu
165 170 175
Pro Pro Thr Phe Ser Leu Phe Val Asn Ser Pro Ala Phe Thr Pro Ala
180 185 190
Phe Leu Met Glu Phe Leu Asn Ser Tyr His Gln Gln Ala Asp Tyr Leu
195 200 205
Ser Thr His Tyr Ala Glu Gln Gly Asn His Arg Leu Phe Glu Ala Gln
210 215 220
Arg Asn Leu Phe Ala Gly Val Ser Phe Pro Glu Phe Lys Asp Ser Pro
225 230 235 240
Arg Trp Arg Gln Thr Gly Ile Ser Val Leu Asn Thr Glu Ile Lys Lys
245 250 255
Gln Val Tyr Ala Asp Gly Met Gln Phe Glu Leu Ser Pro Ile Tyr His
260 265 270
Val Ala Ala Ile Asp Ile Phe Leu Lys Ala Tyr Gly Ser Ala Lys Arg
275 280 285
Val Asn Leu Glu Lys Glu Phe Pro Gln Ser Tyr Val Gln Thr Val Glu
290 295 300
Asn Met Ile Met Ala Leu Ile Ser Ile Ser Leu Pro Asp Tyr Asn Thr
305 310 315 320
Pro Met Phe Gly Asp Ser Trp Ile Thr Asp Lys Asn Phe Arg Met Ala
325 330 335
Gln Phe Ala Ser Trp Ala Arg Val Phe Pro Ala Asn Gln Ala Ile Lys
340 345 350
Tyr Phe Ala Thr Asp Gly Lys Gln Gly Lys Ala Pro Asn Phe Leu Ser
355 360 365
Lys Ala Leu Ser Asn Ala Gly Phe Tyr Thr Phe Arg Ser Gly Trp Asp
370 375 380
Lys Asn Ala Thr Val Met Val Leu Lys Ala Ser Pro Pro Gly Glu Phe
385 390 395 400
His Ala Val Pro Asp Asn Gly Thr Phe Glu Leu Phe Ile Lys Gly Arg
405 410 415
Asn Phe Thr Pro Asp Ala Gly Val Phe Val Tyr Ser Gly Asp Glu Ala
420 425 430
Ile Met Lys Leu Arg Asn Trp Tyr Arg Gln Thr Arg Ile His Ser Thr
435 440 445
Leu Thr Leu Asp Asn Gln Asn Met Val Ile Thr Lys Ala Arg Gln Asn
450 455 460
Lys Trp Glu Thr Gly Asn Asn Leu Asp Val Leu Thr Tyr Thr Asn Pro
465 470 475 480
Ser Tyr Pro Asn Leu Asp His Gln Arg Ser Val Leu Phe Ile Asn Lys
485 490 495
Lys Tyr Phe Leu Val Ile Asp Arg Ala Ile Gly Glu Ala Thr Gly Asn
500 505 510
Leu Gly Val His Trp Gln Leu Lys Glu Asp Ser Asn Pro Val Phe Asp
515 520 525
Lys Thr Lys Asn Arg Val Tyr Thr Thr Tyr Arg Asp Gly Asn Asn Leu
530 535 540
Met Ile Ala Ser Leu Asn Ala Asp Arg Thr Ser Leu Asn Glu Glu Glu
545 550 555 560
Gly Lys Val Ser Tyr Val Tyr Asn Lys Glu Leu Lys Arg Pro Ala Phe
565 570 575
Val Phe Glu Lys Pro Lys Lys Asn Ala Gly Thr Gln Asn Phe Val Ser
580 585 590
Ile Val Tyr Pro Tyr Asp Gly Gln Lys Ala Pro Glu Ile Ser Ile Arg
595 600 605
Glu Asn Lys Gly Asn Asp Phe Glu Lys Gly Lys Leu Asn Leu Thr Leu
610 615 620
Thr Ile Asn Gly Lys Gln Gln Leu Val Leu Val Pro Trp Ser His Pro
625 630 635 640
Gln Phe Glu Lys
<210> 4
<211> 1935
<212> DNA
<213> Artificial sequence
<220>
<223> Artificial sequence description: artificially synthesized sequences
<400> 4
atgcaatctt cttctatcac tagaaaggat ttcgatcata tcaacttgga gtattctggt 60
ttggaaaagg ttaacaaagc tgttgctgct ggtaattacg atgatgctgc taaggctttg 120
ttggcttact atagagagaa gtctaaagct agagaaccag atttttctaa tgctgagaag 180
ccagctgata tcagacaacc tatcgataag gttactagag aaatggctga taaggctttg 240
gttcatcaat tcgctccaca caagggttac ggttacttcg attacggtaa agatatcaac 300
tggcaaatgt ggcctgttaa ggataacgaa gttagatggg ctttgcatcg tgttaagtgg 360
tggcaagcta tggctttggt ttatcatgct actggagatg agaagtatgc tagagaatgg 420
gtttaccaat attctgattg ggctagaaag aacccattgg gtttgtctca agataacgat 480
aagttcgttt ggagaccttt ggaggtttct gatagagttc aatctttgcc acctactttt 540
tctttgttcg ttaactctcc agcttttact cctgctttct tgatggaatt tttgaactct 600
taccatcaac aagctgatta cttgtctact cattatgctg agcaaggtaa ccacagattg 660
ttcgaagctc aaagaaattt gtttgctggt gtttcttttc cagagtttaa ggattctcct 720
agatggagac aaactggtat ctctgttttg aacactgaga ttaagaaaca agtttacgct 780
gatggtatgc aattcgaatt gtctccaatc taccacgttg ctgctattga tattttcttg 840
aaggcttacg gttctgctaa aagagttaac ttggagaagg aatttcctca atcttacgtt 900
caaactgttg aaaacatgat catggctttg atctctattt ctttgccaga ttacaacact 960
cctatgtttg gagattcttg gatcactgat aagaacttca gaatggctca atttgcttct 1020
tgggctagag ttttcccagc taaccaagct attaaatact tcgctactga tggtaaacag 1080
ggtaaagctc ctaacttctt gtctaaggct ttgtctaatg ctggttttta cactttcaga 1140
tctggttggg ataaaaatgc tactgttatg gttttgaagg cttctccacc tggagagttt 1200
catgctgctc cagataacgg tactttcgaa ttgttcatta agggtagaaa cttcactcct 1260
gatgctggtg tttttgttta ttctggagat gaggctatta tgaagttgag aaactggtac 1320
agacaaacta gaatccattc tactttgact ttggataacc aaaacatggt tattactaag 1380
gctagacaaa acaagtggga aactggtaac aatttggatg ttttgactta cactaaccca 1440
tcttatccta atttggatca tcaaagatct gttttgttca ttaacaagaa atactttttg 1500
gttattgata gagctatcgg agaggctact ggtaatttgg gtgttcactg gcaattgaag 1560
gaagattcta acccagtttt cgataagact aaaaatagag tttacactac ttacagagat 1620
ggtaacaatt tgatgattgc ttctttgaac gctgatagaa cttctttgaa tgaagaggag 1680
ggtaaagttt cttacgttta caataaggag ttgaaaagac cagcttttgt tttcgaaaag 1740
cctaagaaaa acgctggtac tcaaaacttc gtttctatcg tttacccata tgatggtcaa 1800
aaagctcctg agatctctat cagagaaaac aagggtaacg atttcgagaa gggtaaattg 1860
aacttgactt tgactattaa tggtaaacaa caattggttt tggttccatg gtctcaccct 1920
caatttgaaa agtaa 1935
<210> 5
<211> 1935
<212> DNA
<213> Artificial sequence
<220>
<223> Artificial sequence description: artificially synthesized sequences
<400> 5
atgcaatctt cttctatcac tagaaaggat ttcgatcata tcaacttgga gtattctggt 60
ttggaaaagg ttaacaaagc tgttgctgct ggtaattacg atgatgctgc taaggctttg 120
ttggcttact atagagagaa gtctaaagct agagaaccag atttttctaa tgctgagaag 180
ccagctgata tcagacaacc tatcgataag gttactagag aaatggctga taaggctttg 240
gttcatcaat tcgctccaca caagggttac ggttacttcg attacggtaa agatatcaac 300
tggcaaatgt ggcctgttaa ggataacgaa gttagatggg ctttgcatcg tgttaagtgg 360
tggcaagcta tggctttggt ttatcatgct actggagatg agaagtatgc tagagaatgg 420
gtttaccaat attctgattg ggctagaaag aacccattgg gtttgtctca agataacgat 480
aagttcgttt ggagaccttt ggaggtttct gatagagttc aatctttgcc acctactttt 540
tctttgttcg ttaactctcc agcttttact cctgctttct tgatggaatt tttgaactct 600
taccatcaac aagctgatta cttgtctact cattatgctg agcaaggtaa ccacagattg 660
ttcgaagctc aaagaaattt gtttgctggt gtttcttttc cagagtttaa ggattctcct 720
agatggagac aaactggtat ctctgttttg aacactgaga ttaagaaaca agtttacgct 780
gatggtatgc aattcgaatt gtctccaatc taccacgttg ctgctattga tattttcttg 840
aaggcttacg gttctgctaa aagagttaac ttggagaagg aatttcctca atcttacgtt 900
caaactgttg aaaacatgat catggctttg atctctattt ctttgccaga ttacaacact 960
cctatgtttg gagattcttg gatcactgat aagaacttca gaatggctca atttgcttct 1020
tgggctagag ttttcccagc taaccaagct attaaatact tcgctactga tggtaaacag 1080
ggtaaagctc ctaacttctt gtctaaggct ttgtctaatg ctggttttta cactttcaga 1140
tctggttggg ataaaaatgc tactgttatg gttttgaagg cttctccacc tggagagttt 1200
catgctgttc cagataacgg tactttcgaa ttgttcatta agggtagaaa cttcactcct 1260
gatgctggtg tttttgttta ttctggagat gaggctatta tgaagttgag aaactggtac 1320
agacaaacta gaatccattc tactttgact ttggataacc aaaacatggt tattactaag 1380
gctagacaaa acaagtggga aactggtaac aatttggatg ttttgactta cactaaccca 1440
tcttatccta atttggatca tcaaagatct gttttgttca ttaacaagaa atactttttg 1500
gttattgata gagctatcgg agaggctact ggtaatttgg gtgttcactg gcaattgaag 1560
gaagattcta acccagtttt cgataagact aaaaatagag tttacactac ttacagagat 1620
ggtaacaatt tgatgattgc ttctttgaac gctgatagaa cttctttgaa tgaagaggag 1680
ggtaaagttt cttacgttta caataaggag ttgaaaagac cagcttttgt tttcgaaaag 1740
cctaagaaaa acgctggtac tcaaaacttc gtttctatcg tttacccata tgatggtcaa 1800
aaagctcctg agatctctat cagagaaaac aagggtaacg atttcgagaa gggtaaattg 1860
aacttgactt tgactattaa tggtaaacaa caattggttt tggttccatg gtctcaccct 1920
caatttgaaa agtaa 1935
Claims (10)
1. Heparinase III comprising the amino acid sequence shown as Seq ID No.2 or Seq ID No. 3.
2. A nucleotide sequence encoding the heparinase III of claim 1.
3. The nucleotide sequence of claim 2, wherein the nucleotide sequence is as set forth in Seq ID No.4 or Seq ID No. 5.
4. A recombinant vector comprising the nucleotide sequence of claim 2 or 3.
5. The recombinant vector according to claim 4, wherein the recombinant vector is a eukaryotic recombinant vector.
6. The recombinant vector according to claim 5, wherein the eukaryotic recombinant vector is any one of pPink-HC, pPICZaA, pPICZ A.
7. The recombinant vector according to claim 6, wherein the eukaryotic recombinant vector is pPink-HC.
8. A host cell comprising the recombinant vector of any one of claims 4 to 7.
9. The host cell of claim 8, wherein the host cell is pichia pastoris; preferably, the pichia pastoris is X33.
10. A process for the preparation of heparinase III according to claim 1, characterized in that it comprises the following steps:
(a) synthesizing a nucleotide sequence encoding the heparinase III of claim 1;
(b) combining the nucleotide sequence in the step (a) with a eukaryotic cell recombinant expression vector to obtain a recombinant vector;
(c) introducing the recombinant vector in the step (b) into a host cell, then culturing, inducing and expressing, and purifying to obtain heparinase III;
preferably, the step (c) uses Buffer W to purify the Srep-Tactin column.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210203224.7A CN114703168B (en) | 2022-03-02 | 2022-03-02 | Heparinase III |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210203224.7A CN114703168B (en) | 2022-03-02 | 2022-03-02 | Heparinase III |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114703168A true CN114703168A (en) | 2022-07-05 |
| CN114703168B CN114703168B (en) | 2023-08-18 |
Family
ID=82167546
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202210203224.7A Active CN114703168B (en) | 2022-03-02 | 2022-03-02 | Heparinase III |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114703168B (en) |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1995034635A1 (en) * | 1994-06-10 | 1995-12-21 | Ibex Technologies | Nucleic acid sequences and expression systems for heparinase ii and heparinase iii derived from flavobacterium heparinum |
| WO2001066772A2 (en) * | 2000-03-08 | 2001-09-13 | Massachusetts Institute Of Technology | Heparinase iii and uses thereof |
| CN101942025A (en) * | 2010-08-20 | 2011-01-12 | 清华大学 | Heparanase III fusion protein and coding gene and expression method thereof |
| CN110343691A (en) * | 2019-07-24 | 2019-10-18 | 宝锐生物科技泰州有限公司 | Saltant type heparinase I and its coding nucleotide sequence, the recombinant vector including the nucleotide sequence and host cell and application |
| CN113025598A (en) * | 2021-01-25 | 2021-06-25 | 上海宝维医药技术有限公司 | Method for preparing recombinant heparinase III by utilizing SUMO fusion expression system and SUMO _ heparinase III fusion protein prepared by method |
-
2022
- 2022-03-02 CN CN202210203224.7A patent/CN114703168B/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1995034635A1 (en) * | 1994-06-10 | 1995-12-21 | Ibex Technologies | Nucleic acid sequences and expression systems for heparinase ii and heparinase iii derived from flavobacterium heparinum |
| WO2001066772A2 (en) * | 2000-03-08 | 2001-09-13 | Massachusetts Institute Of Technology | Heparinase iii and uses thereof |
| CN101942025A (en) * | 2010-08-20 | 2011-01-12 | 清华大学 | Heparanase III fusion protein and coding gene and expression method thereof |
| CN110343691A (en) * | 2019-07-24 | 2019-10-18 | 宝锐生物科技泰州有限公司 | Saltant type heparinase I and its coding nucleotide sequence, the recombinant vector including the nucleotide sequence and host cell and application |
| CN113025598A (en) * | 2021-01-25 | 2021-06-25 | 上海宝维医药技术有限公司 | Method for preparing recombinant heparinase III by utilizing SUMO fusion expression system and SUMO _ heparinase III fusion protein prepared by method |
Non-Patent Citations (3)
| Title |
|---|
| HASHIMOTO, W. 等: "ACCESSION:4MMH_A,"Chain A, Heparinase III protein"", 《GENBANK》 * |
| WANG HAO 等: "Engineering the heparin-binding pocket to enhance the catalytic efficiency of a thermostable heparinase III from Bacteroides thetaiotaomicron", 《ENZYME MICROB TECHNOL 》, vol. 137 * |
| 孙腾腾 等: "肝素酶的分子结构与改造及其应用进展", 《西华师范大学学报(自然科学版)》, vol. 42, no. 1, pages 7 - 13 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114703168B (en) | 2023-08-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101728906B1 (en) | A mutant enzyme for production of cephalosporin antibiotics | |
| CN113767168B (en) | Efficient introduction of aminoacyl-tRNA synthetases into lysine derivatives | |
| AU2020242724B2 (en) | Aminoacyl-tRNA synthetase for efficiently introducing lysine derivative in protein | |
| CN113631712A (en) | Introduction of unnatural amino acids in proteins using a two-plasmid system | |
| CN110511916B (en) | Production process of recombinant trypsin | |
| JP4964284B2 (en) | Recombinantly expressed carboxypeptidase B and purification thereof | |
| CN111850020B (en) | Introduction of Unnatural Amino Acids into Proteins Using a Plasmid System | |
| JP2016523539A (en) | N-terminal truncation glycosyltransferase | |
| CN113234704B (en) | Method for preparing recombinant serratia marcescens nuclease | |
| CN109136209B (en) | Enterokinase light chain mutant and application thereof | |
| CN114703168B (en) | Heparinase III | |
| US8557558B2 (en) | Compositions and methods of producing enterokinase in yeast | |
| CN107109446A (en) | The manufacture method of cis 5 hydroxyl L pipecolic acids | |
| RU2447151C1 (en) | ALKALINE PHOSPHATASE CmAP SYNTHESIS-DETERMINING 40Ph PLASMID, E. coli rosetta(DE3)/40Ph STRAIN - PRODUCER OF CHIMERIC PROTEIN, CONTAINING AMINO ACID SEQUENCE OF RECOMBINANT ALKALINE PHOSPHATASE CmAP, AND PRODUCTION METHOD THEREOF | |
| CN116478969A (en) | Collagenase mutant, method for promoting secretory expression of recombinant collagenase and application thereof | |
| CN113061598A (en) | Trypsin mutant, preparation method and application thereof | |
| JP2517861B2 (en) | Method for producing immobilized enzyme | |
| WO2025104126A1 (en) | Tags for enhanced expression of recombinant proteins | |
| CN113913414A (en) | Double-base enzyme Kex2 mutant with high stability and high catalytic efficiency | |
| Patoli et al. | APPROACHES FOR THE PURIFICATION OF E. COLI Β-CLAMP IN COMPLEX WITH THE C-TERMINAL DOMAIN OF THE Α-SUBUNIT OF DNA POLYMERASE III. | |
| CN116286753A (en) | Preparation method and application of recombinant type I collagenase and mutant thereof | |
| CN113913413A (en) | Salt-tolerant RPK mutant and application thereof | |
| WO2025163019A1 (en) | Tags for enhanced expression of recombinant proteins | |
| CN117126831A (en) | Raw starch saccharifying enzyme mutant and encoding gene and application thereof | |
| Adamec et al. | Mitochondrial processing peptidase/mitochondrial intermediate peptidase |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| TR01 | Transfer of patent right | ||
| TR01 | Transfer of patent right |
Effective date of registration: 20231005 Address after: 101204 No. 739-1927, zone 1, Mafang Industrial Park, Zhongguancun Science Park, Pinggu District, Beijing Patentee after: Beijing ed Hauck International Technology Co.,Ltd. Address before: 101399 room 505, block D, Nanfaxin Xuhui airport, Shunyi District, Beijing Patentee before: Liu Ying |