CN1351079A - Human protein with cancer cell growth suppressing function and its coding sequence
- Publication number
- CN1351079A (CN 00125900 A)
- Authority
- CN
- China
- Prior art keywords
- seq
- leu
- gag
- ctg
- ser
- Prior art date
- Legal status
- Granted
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 18
- 201000011510 cancer Diseases 0.000 title claims abstract description 13
- 108091026890 Coding region Proteins 0.000 title claims description 13
- 108090000144 Human Proteins Proteins 0.000 title abstract description 39
- 102000003839 Human Proteins Human genes 0.000 title abstract description 38
- 230000010261 cell growth Effects 0.000 title 1
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 90
- 229920001184 polypeptide Polymers 0.000 claims abstract description 89
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 89
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 54
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 54
- 239000002157 polynucleotide Substances 0.000 claims abstract description 54
- 108090000623 proteins and genes Proteins 0.000 claims description 99
- 102000004169 proteins and genes Human genes 0.000 claims description 62
- 238000000034 method Methods 0.000 claims description 60
- 239000002773 nucleotide Substances 0.000 claims description 28
- 125000003729 nucleotide group Chemical group 0.000 claims description 28
- 239000012634 fragment Substances 0.000 claims description 24
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 230000014509 gene expression Effects 0.000 claims description 13
- 239000008194 pharmaceutical composition Substances 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 3
- 239000003937 drug carrier Substances 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 235000018102 proteins Nutrition 0.000 claims 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 19
- 201000010099 disease Diseases 0.000 abstract description 18
- 238000004519 manufacturing process Methods 0.000 abstract description 7
- 238000005215 recombination Methods 0.000 abstract description 4
- 230000006798 recombination Effects 0.000 abstract description 3
- OVBICQMTCPFEBS-SATRDZAXSA-N (2s)-1-[(2s)-2-[[(2s)-2-[[(2r)-2-[[(2s)-2-[[(2s)-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-acetamido-3-naphthalen-2-ylpropanoyl]amino]-3-(4-chlorophenyl)propanoyl]amino]-3-pyridin-3-ylpropanoyl]amino]-3-hydroxypropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-6-[bi Chemical compound CC(O)=O.CC(O)=O.C([C@@H](C(=O)N[C@H](CCCCN=C(NCC)NCC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN=C(NCC)NCC)C(=O)N1[C@@H](CCC1)C(=O)N[C@H](C)C(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](CC=1C=NC=CC=1)NC(=O)[C@@H](CC=1C=CC(Cl)=CC=1)NC(=O)[C@@H](CC=1C=C2C=CC=CC2=CC=1)NC(C)=O)C1=CC=C(O)C=C1 OVBICQMTCPFEBS-SATRDZAXSA-N 0.000 abstract 1
- 108700032141 ganirelix Proteins 0.000 abstract 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 132
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 111
- 230000006870 function Effects 0.000 description 96
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 88
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 88
- 210000004027 cell Anatomy 0.000 description 55
- 239000002299 complementary DNA Substances 0.000 description 44
- 108020004414 DNA Proteins 0.000 description 27
- 108091028043 Nucleic acid sequence Proteins 0.000 description 21
- 239000013598 vector Substances 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 15
- 241000282414 Homo sapiens Species 0.000 description 13
- 150000001413 amino acids Chemical class 0.000 description 11
- 210000000349 chromosome Anatomy 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 11
- 239000013615 primer Substances 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- 239000005557 antagonist Substances 0.000 description 10
- 238000007796 conventional method Methods 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 10
- 238000012216 screening Methods 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 150000001875 compounds Chemical class 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 230000002759 chromosomal effect Effects 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 230000002159 abnormal effect Effects 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 229940079593 drug Drugs 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 210000003527 eukaryotic cell Anatomy 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 238000013507 mapping Methods 0.000 description 5
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 230000004663 cell proliferation Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000001415 gene therapy Methods 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000007901 in situ hybridization Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 3
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 210000003917 human chromosome Anatomy 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- 239000002502 liposome Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 241001430294 unidentified retrovirus Species 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 108020004491 Antisense DNA Proteins 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 230000006820 DNA synthesis Effects 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 2
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 239000003816 antisense DNA Substances 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000005757 colony formation Effects 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 238000002405 diagnostic procedure Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 208000014018 liver neoplasm Diseases 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 210000002826 placenta Anatomy 0.000 description 2
- 210000005059 placental tissue Anatomy 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000002285 radioactive effect Effects 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 239000004017 serum-free culture medium Substances 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 239000000225 tumor suppressor protein Substances 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- JWDFQMWEFLOOED-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-(pyridin-2-yldisulfanyl)propanoate Chemical compound O=C1CCC(=O)N1OC(=O)CCSSC1=CC=CC=N1 JWDFQMWEFLOOED-UHFFFAOYSA-N 0.000 description 1
- OGILYBDMVOATLU-CQJMVLFOSA-N (2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-n-[(2s)-1-[[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methylpentanamide Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)C1=CC=C(O)C=C1 OGILYBDMVOATLU-CQJMVLFOSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- HGHOBRRUMWJWCU-FXQIFTODSA-N (4s)-4-[[(2s)-2-aminopropanoyl]amino]-5-[[(2s)-3-carboxy-1-(carboxymethylamino)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O HGHOBRRUMWJWCU-FXQIFTODSA-N 0.000 description 1
- UZGKAASZIMOAMU-UHFFFAOYSA-N 124177-85-1 Chemical compound NP(=O)=O UZGKAASZIMOAMU-UHFFFAOYSA-N 0.000 description 1
- BSFODEXXVBBYOC-UHFFFAOYSA-N 8-[4-(dimethylamino)butan-2-ylamino]quinolin-6-ol Chemical compound C1=CN=C2C(NC(CCN(C)C)C)=CC(O)=CC2=C1 BSFODEXXVBBYOC-UHFFFAOYSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- YXALGQBWVHQVLC-UHFFFAOYSA-N Asp-Pro-Ser-Leu-Lys Natural products CC(C)CC(NC(=O)C(CO)NC(=O)C1CCCN1C(=O)C(N)CC(=O)O)C(=O)NC(CCCCN)C(=O)O YXALGQBWVHQVLC-UHFFFAOYSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- HNNGTYHNYDOSKV-FXQIFTODSA-N Cys-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N HNNGTYHNYDOSKV-FXQIFTODSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- YFAFBAPQHGULQT-HJPIBITLSA-N Cys-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N YFAFBAPQHGULQT-HJPIBITLSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 108010053187 Diphtheria Toxin Proteins 0.000 description 1
- 102000016607 Diphtheria Toxin Human genes 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Firefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 241001123946 Gaga Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- XEZWLWNGUMXAJI-QEJZJMRPSA-N Gln-Cys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 XEZWLWNGUMXAJI-QEJZJMRPSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- WBBVTGIFQIZBHP-JBACZVJFSA-N Gln-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N WBBVTGIFQIZBHP-JBACZVJFSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 1
- IMPKSPYRPUXYAP-SZMVWBNQSA-N His-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CN=CN3)N IMPKSPYRPUXYAP-SZMVWBNQSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- YHBHDYYHOUAKLR-AVGNSLFASA-N Met-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YHBHDYYHOUAKLR-AVGNSLFASA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- VYXIKLFLGRTANT-HRCADAONSA-N Met-Tyr-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N VYXIKLFLGRTANT-HRCADAONSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PKFBJSDMCRJYDC-GEZSXCAASA-N N-acetyl-s-geranylgeranyl-l-cysteine Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CSC[C@@H](C(O)=O)NC(C)=O PKFBJSDMCRJYDC-GEZSXCAASA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 101710118186 Neomycin resistance protein Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- APXXVISUHOLGEE-ILWGZMRPSA-N Phe-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=CC=C4)N)C(=O)O APXXVISUHOLGEE-ILWGZMRPSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 1
- 231100000742 Plant toxin Toxicity 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- HJWLQSFTGDQSRX-BPUTZDHNSA-N Trp-Met-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HJWLQSFTGDQSRX-BPUTZDHNSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- GNCPKOZDOCQRAF-BPUTZDHNSA-N Trp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GNCPKOZDOCQRAF-BPUTZDHNSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 108010066270 beta-lactorphin Proteins 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000005907 cancer growth Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000000857 drug effect Effects 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010019154 glycyl-leucyl-leucyl-aspartyl-leucyl-lysine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000014726 immortalization of host cell Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000002055 immunohistochemical effect Effects 0.000 description 1
- 239000002596 immunotoxin Substances 0.000 description 1
- 230000002637 immunotoxin Effects 0.000 description 1
- 231100000608 immunotoxin Toxicity 0.000 description 1
- 229940051026 immunotoxin Drugs 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000002700 inhibitory effect on cancer Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000012177 large-scale sequencing Methods 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000031864 metaphase Effects 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 238000009377 nuclear transmutation Methods 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000003169 placental effect Effects 0.000 description 1
- 239000003123 plant toxin Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- -1 promoters Substances 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 238000010814 radioimmunoprecipitation assay Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000012679 serum free medium Substances 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Landscapes
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
The present invention belongs to the field of biotechnology. Specifically, it relates to novel polynucleotides encoding human proteins with tumor suppressor function, and to the polypeptides encoded by these polynucleotides. The invention also relates to the uses and preparation of these polynucleotides and polypeptides.
Human genomics research is currently an international focus. Apart from large-scale sequencing of human chromosomal DNA and expressed sequence tag (EST) sequencing, high-throughput methods that screen for functional genes starting from function are still lacking.
Cancer is one of the major diseases endangering human health. To treat and prevent tumors effectively, increasing attention is being paid to gene therapy of tumors. There is therefore an urgent need in the art to develop and study human proteins with tumor suppressor function and their agonists/inhibitors.
An object of the present invention is to provide a new class of human polypeptides with tumor suppressor function, as well as fragments, analogs and derivatives thereof.
Another object of the present invention is to provide polynucleotides encoding these polypeptides.
Another object of the present invention is to provide methods for producing these polypeptides, and uses of the polypeptides and coding sequences.
In a first aspect of the present invention, a novel isolated polypeptide with tumor suppressor function is provided, which comprises a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 20; or a conservative variant polypeptide, an active fragment, or an active derivative thereof.
Preferably, the polypeptide is a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 20.
In a second aspect of the present invention, an isolated polynucleotide is provided, which comprises a nucleotide sequence having at least 85% identity to a nucleotide sequence selected from the group consisting of: (a) a polynucleotide encoding the above polypeptide with tumor suppressor function; and (b) a polynucleotide complementary to polynucleotide (a). Preferably, the polypeptide encoded by the polynucleotide has an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 5, SEQ ID NO: 8, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 20. More preferably, the sequence of the polynucleotide is selected from the group consisting of the coding region sequences or full-length sequences of SEQ ID NO: 3, SEQ ID NO: 6, SEQ ID NO: 9, SEQ ID NO: 12, SEQ ID NO: 15, SEQ ID NO: 18 and SEQ ID NO: 21.
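For illustration only, the 85% identity criterion and the notion of a complementary polynucleotide can be sketched as follows. This is a minimal sketch, not the method used in the application: it assumes the two sequences have already been aligned to equal length (with `-` marking gaps), it defines identity as matching positions divided by alignment length, and the function names and example sequences are hypothetical.

```python
# Illustrative sketch only: assumes pre-aligned, equal-length sequences.
# Function names and example sequences are hypothetical, not from the application.

def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which both sequences carry the same base."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    # A gap character never counts as a match.
    matches = sum(
        1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def meets_identity_threshold(seq_a: str, seq_b: str, threshold: float = 85.0) -> bool:
    """True if the two aligned sequences share at least `threshold` percent identity."""
    return percent_identity(seq_a, seq_b) >= threshold


if __name__ == "__main__":
    reference = "ATGGAGCTGA"
    candidate = "ATGGAGCTGT"  # differs from the reference at one of ten positions
    print(percent_identity(reference, candidate))
    print(meets_identity_threshold(reference, candidate))
    print(reverse_complement("ATGC"))
```

In practice the identity figure depends on the alignment method and on how gaps are counted, neither of which is specified in the text above.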
In a third aspect of the present invention, a vector containing the above polynucleotide is provided, together with a host cell transformed or transduced with the vector, or a host cell directly transformed or transduced with the above polynucleotide.
In a fourth aspect of the present invention, a method is provided for preparing a polypeptide having the activity of a protein with tumor suppressor function, the method comprising: (a) culturing the above transformed or transduced host cell under conditions suitable for expressing the protein with tumor suppressor function; and (b) isolating from the culture the polypeptide having the activity of the protein with tumor suppressor function.
In a fifth aspect of the present invention, an antibody that specifically binds to the above polypeptide with tumor suppressor function is provided. Also provided is a nucleic acid molecule that can be used for detection, which contains from 10 consecutive nucleotides up to the full length of the above polynucleotide, and preferably contains about 10-800 consecutive nucleotides.
In a sixth aspect of the present invention, a pharmaceutical composition is provided, which contains a safe and effective amount of the polypeptide with tumor suppressor function of the present invention and a pharmaceutically acceptable carrier. These pharmaceutical compositions can be used to treat conditions such as cancer and abnormal cell proliferation.
Other aspects of the present invention will be apparent to those skilled in the art from the technical disclosure herein.
The present invention transfects cancer cells with cDNA clones on a large scale. Clones found to have a tumor-suppressing effect were shown by sequencing to be novel genes, and full-length cDNA clones were then obtained. DNA transfection experiments demonstrate that the proteins with tumor suppressor function of the present invention inhibit colony formation by cancer cells (hepatoma cells), with an inhibition rate of 50% or more.
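The text above does not state how the inhibition rate is calculated. One common convention, shown here purely as an assumption, compares the number of colonies formed after transfection with the test clone against the number formed after transfection with a control (for example, an empty vector). The function name and the colony counts below are hypothetical.

```python
# Illustrative sketch only. The formula is an assumed convention; the
# application does not define how its inhibition rate was computed.

def colony_inhibition_rate(control_colonies: int, test_colonies: int) -> float:
    """Percent reduction in colony number relative to the control transfection."""
    if control_colonies <= 0:
        raise ValueError("control colony count must be positive")
    if test_colonies < 0:
        raise ValueError("test colony count cannot be negative")
    return 100.0 * (control_colonies - test_colonies) / control_colonies


def passes_screen(control_colonies: int, test_colonies: int, cutoff: float = 50.0) -> bool:
    """True if the clone reaches the 50%-or-more inhibition level cited in the text."""
    return colony_inhibition_rate(control_colonies, test_colonies) >= cutoff


if __name__ == "__main__":
    # Hypothetical counts: 200 colonies with the control, 80 with the test clone.
    print(colony_inhibition_rate(200, 80))
    print(passes_screen(200, 80))
```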
As used herein, "isolated" means that a substance has been separated from its original environment (for a natural substance, the original environment is its natural environment). For example, polynucleotides and polypeptides in their natural state within living cells are not isolated or purified, but the same polynucleotides or polypeptides are isolated and purified if they have been separated from the other substances that accompany them in the natural state.
As used herein, "isolated protein or polypeptide with tumor suppressor function" means that the polypeptide with tumor suppressor function is substantially free of other proteins, lipids, carbohydrates or other substances with which it is naturally associated. Those skilled in the art can purify proteins with tumor suppressor function using standard protein purification techniques. A substantially pure polypeptide yields a single major band on a non-reducing polyacrylamide gel. The purity of the polypeptide with tumor suppressor function can be analyzed by amino acid sequencing.
The polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide or a synthetic polypeptide, and is preferably a recombinant polypeptide. It may be a naturally purified product or a chemically synthesized product, or may be produced by recombinant techniques from prokaryotic or eukaryotic hosts (for example, bacteria, yeast, higher plant, insect and mammalian cells). Depending on the host used in the recombinant production protocol, the polypeptide of the invention may be glycosylated or non-glycosylated. The polypeptide of the invention may or may not include an initial methionine residue.
The present invention also includes fragments, derivatives and analogs of the human proteins with tumor suppressor function. As used herein, the terms "fragment", "derivative" and "analog" refer to polypeptides that substantially retain the same biological function or activity as the natural human proteins with tumor suppressor function of the present invention. A polypeptide fragment, derivative or analog of the present invention may be (i) a polypeptide in which one or more conservative or non-conservative amino acid residues (preferably conservative amino acid residues) are substituted, where the substituted amino acid residues may or may not be encoded by the genetic code; or (ii) a polypeptide having a substituent group in one or more amino acid residues; or (iii) a polypeptide formed by fusing the mature polypeptide with another compound (such as a compound that extends the half-life of the polypeptide, for example polyethylene glycol); or (iv) a polypeptide formed by fusing an additional amino acid sequence to the polypeptide sequence (such as a leader sequence, a secretory sequence, a sequence used to purify the polypeptide, or a proprotein sequence). In light of the teachings herein, such fragments, derivatives and analogs are within the knowledge of those skilled in the art.
The polynucleotide of the present invention may be in the form of DNA or RNA. DNA forms include cDNA, genomic DNA and synthetic DNA. The DNA may be single-stranded or double-stranded, and may be the coding strand or the non-coding strand. Taking the PP8153 protein as an example (in this application, proteins are named by their clone numbers), the coding region sequence encoding the mature polypeptide may be identical to the coding region sequence shown in SEQ ID NO: 3 or may be a degenerate variant of it. As used herein, a "degenerate variant" is a nucleic acid sequence that encodes the protein of SEQ ID NO: 2 but differs from the coding region sequence shown in SEQ ID NO: 3. Likewise, taking the PP8332 protein as an example, the coding region sequence encoding the mature polypeptide may be identical to the coding region sequence shown in SEQ ID NO: 6 or may be a degenerate variant of it, that is, a nucleic acid sequence that encodes the protein of SEQ ID NO: 5 but differs from the coding region sequence shown in SEQ ID NO: 6. The same applies by analogy to the other proteins with tumor suppressor function.
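The idea of a degenerate variant can be illustrated with a short sketch: two coding sequences that differ at the nucleotide level but translate to the same amino acid sequence under the standard genetic code. The sequences below are short hypothetical examples, not taken from SEQ ID NO: 3 or SEQ ID NO: 6, and the function names are illustrative.

```python
# Illustrative sketch only: the example sequences are hypothetical and are
# not drawn from the sequence listing of the application.
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = coding_sequence.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def is_degenerate_variant(reference: str, candidate: str) -> bool:
    """True if the candidate differs in nucleotides but encodes the same protein."""
    return (
        reference.upper() != candidate.upper()
        and translate(reference) == translate(candidate)
    )


if __name__ == "__main__":
    reference = "ATGCTGGAGTCC"   # Met-Leu-Glu-Ser
    synonymous = "ATGTTAGAAAGC"  # different codons, same peptide
    print(translate(reference), translate(synonymous))
    print(is_degenerate_variant(reference, synonymous))
```

A sequence with a codon change that alters an amino acid (for example GAG to GAT, Glu to Asp) would not qualify as a degenerate variant under this check.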
Polynucleotides encoding the mature polypeptide include: a coding sequence that encodes only the mature polypeptide; the coding sequence of the mature polypeptide together with various additional coding sequences; and the coding sequence of the mature polypeptide (with optional additional coding sequences) together with non-coding sequences.
The term "polynucleotide encoding a polypeptide" may refer to a polynucleotide that includes the sequence encoding the polypeptide, or to a polynucleotide that also includes additional coding and/or non-coding sequences.
The present invention also relates to variants of the above polynucleotides, which encode polypeptides having the same amino acid sequences as those of the invention, or fragments, analogs and derivatives of such polypeptides. A variant of the polynucleotide may be a naturally occurring allelic variant or a non-naturally occurring variant. These nucleotide variants include substitution variants, deletion variants and insertion variants. As is known in the art, an allelic variant is an alternative form of a polynucleotide; it may involve a substitution, deletion or insertion of one or more nucleotides, but does not substantially alter the function of the polypeptide it encodes.
The present invention also relates to polynucleotides that hybridize to the above sequences and have at least 50%, preferably at least 70%, and more preferably at least 80% identity between the two sequences. The invention particularly relates to polynucleotides that hybridize under stringent conditions to the polynucleotides of the invention. In the present invention, "stringent conditions" means: (1) hybridization and washing at relatively low ionic strength and relatively high temperature, such as 0.2×SSC, 0.1% SDS, 60°C; or (2) hybridization in the presence of a denaturing agent, such as 50% (v/v) formamide, 0.1% calf serum/0.1% Ficoll, 42°C; or (3) hybridization that occurs only when the identity between the two sequences is at least 95%, and preferably at least 97%. Moreover, the polypeptide encoded by the hybridizable polynucleotide has the same biological function and activity as the mature polypeptide shown in SEQ ID NO: 2 (taking the PP8153 protein as an example).
The present invention also relates to nucleic acid fragments that hybridize to the above sequences. As used herein, a "nucleic acid fragment" is at least 15 nucleotides in length, preferably at least 30 nucleotides, more preferably at least 50 nucleotides, and most preferably at least 100 nucleotides. Nucleic acid fragments can be used in nucleic acid amplification techniques (such as PCR) to identify and/or isolate polynucleotides encoding proteins with tumor suppressor function.
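As a simple illustration of the length requirement above, candidate fragments can be enumerated as contiguous windows of a polynucleotide sequence. This sketch covers only the 15-nucleotide minimum length; it makes no claim about primer or probe design criteria such as melting temperature or specificity, and the function name and example sequence are hypothetical.

```python
# Illustrative sketch only: enumerates contiguous windows of a sequence.
# The example sequence is hypothetical, not taken from the sequence listing.
from typing import Iterator, Tuple

MIN_FRAGMENT_LENGTH = 15  # minimum fragment length stated in the text


def candidate_fragments(
    sequence: str, length: int = MIN_FRAGMENT_LENGTH
) -> Iterator[Tuple[int, str]]:
    """Yield (start position, fragment) for every contiguous window of `length`."""
    if length < MIN_FRAGMENT_LENGTH:
        raise ValueError(
            f"fragments must be at least {MIN_FRAGMENT_LENGTH} nucleotides long"
        )
    seq = sequence.upper()
    # If the sequence is shorter than `length`, the range is empty and nothing is yielded.
    for start in range(len(seq) - length + 1):
        yield start, seq[start:start + length]


if __name__ == "__main__":
    example = "ATGCTGGAGTCCGAGCTGAG"  # 20 nucleotides
    for start, fragment in candidate_fragments(example):
        print(start, fragment)
```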
The polypeptides and polynucleotides of the present invention are preferably provided in isolated form, and more preferably are purified to homogeneity.
The DNA sequences of the present invention can be obtained by several methods. For example, DNA can be isolated using hybridization techniques well known in the art. These techniques include, but are not limited to: 1) hybridizing probes to genomic or cDNA libraries to detect homologous nucleotide sequences, and 2) antibody screening of expression libraries to detect cloned DNA fragments with shared structural features.
Specific DNA fragment sequences encoding the proteins with tumor suppressor function can also be obtained by the following methods: 1) isolating double-stranded DNA sequences from genomic DNA; 2) chemically synthesizing DNA sequences to obtain double-stranded DNA encoding the desired polypeptide.
Of the methods mentioned above, isolation of genomic DNA is the least commonly used. When the entire amino acid sequence of the desired polypeptide product is known, direct chemical synthesis of the DNA sequence is often the method of choice. When the entire amino acid sequence is not known, direct chemical synthesis is not possible, and the method of choice is isolation of the cDNA sequence. The standard method for isolating a cDNA of interest is to isolate mRNA from donor cells that express the gene at a high level and reverse-transcribe it to form a plasmid or phage cDNA library. Many mature techniques exist for extracting mRNA, and kits are commercially available (Qiagen). Construction of a cDNA library is likewise a routine method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, New York, 1989). Commercially supplied cDNA libraries are also available, such as the various cDNA libraries from Clontech. When polymerase chain reaction techniques are used in combination, even very rare expression products can be cloned.
The gene of the present invention can be screened from these cDNA libraries by conventional methods. These methods include (but are not limited to): (1) DNA-DNA or DNA-RNA hybridization; (2) the appearance or loss of marker gene function; (3) measuring the level of transcripts of the protein with tumor suppressor function; (4) detecting the protein product of gene expression by immunological techniques or by assaying biological activity. These methods can be used alone or in combination.
In method (1), the probe used for hybridization is homologous to any part of the polynucleotide of the present invention and is at least 15 nucleotides long, preferably at least 30, more preferably at least 50, and most preferably at least 100 nucleotides long. In addition, the probe is usually no longer than 2 kb, preferably no longer than 1 kb. The probes used here are usually DNA sequences chemically synthesized on the basis of the DNA sequence information of the gene of the present invention. The gene of the present invention itself, or a fragment of it, can of course also be used as a probe. DNA probes can be labeled with radioisotopes, fluorescein, or enzymes (such as alkaline phosphatase).
In method (4), the protein product expressed from the gene for the protein with tumor suppressor function can be detected by immunological techniques such as Western blotting, radioimmunoprecipitation, and enzyme-linked immunosorbent assay (ELISA).
Amplification of DNA/RNA by the PCR technique (Saiki, et al. Science 1985; 230: 1350-1354) is preferably used to obtain the gene of the present invention. In particular, when it is difficult to obtain a full-length cDNA from a library, the RACE method (rapid amplification of cDNA ends) can preferably be used. The primers used for PCR can be selected appropriately according to the sequence information of the present invention disclosed herein and synthesized by conventional methods. The amplified DNA/RNA fragments can be separated and purified by conventional methods such as gel electrophoresis.
The nucleotide sequence of the gene of the present invention obtained as described above, or of the various DNA fragments, can be determined by conventional methods such as the dideoxy chain termination method (Sanger et al. PNAS, 1977, 74: 5463-5467). Commercial sequencing kits can also be used for this purpose. Obtaining the full-length cDNA sequence requires repeated rounds of sequencing. Sometimes the cDNA sequences of several clones must be determined before they can be assembled into the full-length cDNA sequence.
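The assembly of overlapping clone sequences into a longer cDNA sequence can be sketched as follows. This is a simplified illustration that assumes error-free reads in the same orientation and an exact suffix/prefix overlap; the function name `merge_overlap` and the `min_overlap` parameter are hypothetical, and real assembly software also handles sequencing errors and reverse-complemented reads.

```python
def merge_overlap(left: str, right: str, min_overlap: int = 10) -> str:
    """Join two reads when a suffix of `left` equals a prefix of `right`.

    The longest exact overlap of at least `min_overlap` bases is used.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(left), len(right))
    for k in range(longest_possible, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            # Keep `left` whole and append the non-overlapping tail of `right`.
            return left + right[k:]
    raise ValueError("no overlap of the required length")


if __name__ == "__main__":
    # Toy reads that share the five bases "CAAGT".
    print(merge_overlap("ATGGCCAAGT", "CAAGTTTGA", min_overlap=4))
```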
The present invention also relates to vectors containing the polynucleotide of the present invention, to host cells genetically engineered with the vector of the present invention or with the coding sequence of the protein with tumor suppressor function, and to methods for producing the polypeptide of the present invention by recombinant techniques.
The polynucleotide sequence of the present invention can be used to express or produce a recombinant polypeptide of the protein with tumor suppressor function by conventional recombinant DNA techniques (Science, 1984; 224: 1431). In general, the steps are as follows:
(1). transforming or transducing a suitable host cell with the polynucleotide (or a variant) of the present invention encoding the human protein with tumor suppressor function, or with a recombinant expression vector containing the polynucleotide;
(2). culturing the host cell in a suitable medium;
(3). isolating and purifying the protein from the medium or the cells.
In the present invention, the polynucleotide sequence of the human protein with tumor suppressor function can be inserted into a recombinant expression vector. The term "recombinant expression vector" refers to bacterial plasmids, phages, yeast plasmids, plant cell viruses, mammalian cell viruses such as adenoviruses and retroviruses, or other vectors well known in the art. Vectors suitable for use in the present invention include, but are not limited to: T7-based expression vectors for expression in bacteria (Rosenberg, et al. Gene, 1987, 56: 125); the pMSXND expression vector for expression in mammalian cells (Lee and Nathans, J Biol Chem. 263: 3521, 1988); and baculovirus-derived vectors for expression in insect cells. In short, any plasmid or vector can be used as long as it replicates and is stable in the host. An important feature of expression vectors is that they usually contain an origin of replication, a promoter, a marker gene, and translational control elements.
Methods well known to those skilled in the art can be used to construct expression vectors containing the DNA sequence encoding the human protein with tumor suppressor function together with appropriate transcription/translation control signals. These methods include in vitro recombinant DNA techniques, DNA synthesis techniques, and in vivo recombination techniques (Sambrook, et al. Molecular Cloning, a Laboratory Manual, Cold Spring Harbor Laboratory, New York, 1989). The DNA sequence can be operably linked to an appropriate promoter in the expression vector to direct mRNA synthesis. Representative examples of such promoters are: the lac or trp promoter of E. coli; the PL promoter of λ phage; eukaryotic promoters including the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, and retroviral LTRs; and other promoters known to control gene expression in prokaryotic or eukaryotic cells or their viruses. The expression vector also includes a ribosome binding site for translation initiation and a transcription terminator.
In addition, the expression vector preferably contains one or more selectable marker genes that provide a phenotypic trait for selecting transformed host cells, such as dihydrofolate reductase, neomycin resistance, and green fluorescent protein (GFP) for eukaryotic cell culture, or tetracycline or ampicillin resistance for E. coli.
A vector containing the appropriate DNA sequence described above together with an appropriate promoter or control sequence can be used to transform an appropriate host cell so that the cell expresses the protein.
The host cell may be a prokaryotic cell, such as a bacterial cell; a lower eukaryotic cell, such as a yeast cell; or a higher eukaryotic cell, such as a mammalian cell. Representative examples are: bacterial cells such as Escherichia coli, Streptomyces, and Salmonella typhimurium; fungal cells such as yeast; plant cells; insect cells such as Drosophila S2 or Sf9; and animal cells such as CHO, COS, or Bowes melanoma cells.
When the polynucleotide of the present invention is expressed in higher eukaryotic cells, transcription is enhanced if an enhancer sequence is inserted into the vector. Enhancers are cis-acting DNA elements, usually about 10 to 300 base pairs long, that act on a promoter to increase transcription of the gene. Examples include the SV40 enhancer of 100 to 270 base pairs on the late side of the replication origin, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
Those of ordinary skill in the art know how to select appropriate vectors, promoters, enhancers, and host cells.
Transformation of host cells with recombinant DNA can be carried out by conventional techniques well known to those skilled in the art. When the host is a prokaryote such as E. coli, competent cells capable of taking up DNA can be harvested after the exponential growth phase and treated by the CaCl2 method, using procedures well known in the art. MgCl2 can be used as an alternative. If desired, transformation can also be performed by electroporation. When the host is a eukaryote, the following DNA transfection methods can be used: calcium phosphate co-precipitation, or conventional mechanical methods such as microinjection, electroporation, and liposome packaging.
The resulting transformants can be cultured by conventional methods to express the polypeptide encoded by the gene of the present invention. Depending on the host cell used, the culture medium can be selected from various conventional media. Culture is carried out under conditions suitable for growth of the host cells. Once the host cells have grown to an appropriate density, the selected promoter is induced by a suitable method (such as a temperature shift or chemical induction), and the cells are cultured for a further period.
The recombinant polypeptide in the above method may be expressed inside the cell or on the cell membrane, or secreted outside the cell. If desired, the recombinant protein can be isolated and purified by various separation methods that exploit its physical, chemical, and other properties. These methods are well known to those skilled in the art. Examples include, but are not limited to: conventional renaturation treatment, treatment with a protein precipitant (salting out), centrifugation, osmotic lysis of bacteria, sonication, ultracentrifugation, molecular sieve chromatography (gel filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC), various other liquid chromatography techniques, and combinations of these methods.
The recombinant human protein or polypeptide with tumor suppressor function has many uses. These include (but are not limited to): direct use as a drug to treat diseases caused by reduced or lost function of the protein with tumor suppressor function, and use in screening for antibodies, polypeptides, or other ligands that promote or counteract the function of the protein. For example, antibodies can be used to activate or inhibit the function of the human protein with tumor suppressor function. Screening a polypeptide library with the expressed recombinant protein can be used to find therapeutically valuable polypeptide molecules that inhibit or stimulate the function of the human protein with tumor suppressor function.
The invention also provides methods of screening drugs to identify agents that enhance (agonists) or repress (antagonists) the human protein with tumor suppressor function. Agonists enhance biological functions of the human protein with tumor suppressor function, such as stimulating cell proliferation, whereas antagonists prevent and treat disorders associated with excessive cell proliferation, such as various cancers. For example, mammalian cells, or membrane preparations expressing the human protein with tumor suppressor function, can be incubated with the labeled protein in the presence of a drug. The ability of the drug to enhance or repress this interaction is then determined.
Antagonists of the human protein with tumor suppressor function include antibodies and compounds identified by screening, receptor deletion variants, and analogs. An antagonist may bind to the human protein with tumor suppressor function and abolish its function, inhibit production of the protein, or bind to the active site of the polypeptide so that the polypeptide cannot exert its biological function. Antagonists of the human protein with tumor suppressor function can be used therapeutically.
When screening compounds as antagonists, the protein with tumor suppressor function can be added to a bioanalytical assay, and whether a compound is an antagonist is determined by measuring its effect on the interaction between the protein and its receptor. Receptor deletion variants and analogs that act as antagonists can be screened in the same way as the compounds described above.
The polypeptide of the present invention can be used directly in the treatment of diseases, for example, various malignant tumors and abnormal cell proliferation.
The polypeptides of the present invention, their fragments, derivatives, and analogs, or cells expressing them, can be used as antigens to produce antibodies. These antibodies may be polyclonal or monoclonal. Polyclonal antibodies can be obtained by injecting the polypeptide directly into an animal. Techniques for preparing monoclonal antibodies include the hybridoma technique, the trioma technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique.
The polypeptides and antagonists of the present invention can be used in combination with a suitable pharmaceutical carrier. Such carriers may be water, glucose, ethanol, salts, buffers, glycerol, and combinations thereof. The composition contains a safe and effective amount of the polypeptide or antagonist together with carriers and excipients that do not impair the effect of the drug. These compositions can be used as drugs for the treatment of disease.
The invention also provides a pharmaceutical pack or kit comprising one or more containers filled with one or more ingredients of the pharmaceutical composition of the invention. Accompanying these containers there may be a notice in the form prescribed by a governmental agency regulating the manufacture, use, or sale of pharmaceuticals or biological products, the notice reflecting that agency's approval of manufacture, use, or sale for human administration. In addition, the polypeptides of the invention can be used in combination with other therapeutic compounds.
The pharmaceutical composition can be administered in any convenient manner, such as by the topical, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, or intradermal route. The protein with tumor suppressor function is administered in an amount effective to treat and/or prevent the specific indication. The amount and dosage range administered to a patient will depend on many factors, such as the mode of administration, the health of the person being treated, and the judgment of the diagnosing physician.
Polynucleotides encoding the human protein with tumor suppressor function can also be used for a variety of therapeutic purposes. Gene therapy techniques can be used to treat abnormalities of cell proliferation, development, or metabolism caused by lack of expression of the protein, or by expression of an abnormal or inactive form of it. Recombinant gene therapy vectors (such as viral vectors) can be designed to express a variant protein with tumor suppressor function in order to inhibit the activity of the endogenous protein. For example, a variant may be a truncated protein that lacks the signal transduction domain: it can still bind downstream substrates but lacks signaling activity. Recombinant gene therapy vectors can therefore be used to treat diseases caused by abnormal expression or activity of the protein with tumor suppressor function. Virus-derived expression vectors, such as retroviruses, adenoviruses, adeno-associated viruses, herpes simplex viruses, and parvoviruses, can be used to transfer the gene for the protein into cells. Methods for constructing recombinant viral vectors carrying the gene are described in the literature (Sambrook, et al.). Alternatively, the recombinant gene for the human protein with tumor suppressor function can be packaged into liposomes and transferred into cells.
Oligonucleotides (including antisense RNA and DNA) and ribozymes that inhibit the mRNA of the human protein with tumor suppressor function are also within the scope of the invention. A ribozyme is an enzyme-like RNA molecule that specifically cleaves a particular RNA; it acts by hybridizing specifically to a complementary target RNA and then carrying out endonucleolytic cleavage. Antisense RNA and DNA and ribozymes can be obtained by any existing RNA or DNA synthesis technique; for example, solid-phase phosphoramidite chemical synthesis of oligonucleotides is widely used. Antisense RNA molecules can be obtained by in vitro or in vivo transcription of the DNA sequence encoding the RNA, where that DNA sequence has been integrated into a vector downstream of an RNA polymerase promoter. To increase the stability of the nucleic acid molecules, they can be modified in various ways, such as lengthening the flanking sequences or linking the ribonucleosides with phosphorothioate or peptide bonds instead of phosphodiester bonds.
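Because an antisense RNA hybridizes to its target by base complementarity, its sequence is the reverse complement of the targeted mRNA stretch. The sketch below shows only that derivation; the function name `antisense_rna` and the toy input are illustrative assumptions, and the code says nothing about which region of an mRNA makes an effective target.

```python
# Watson-Crick pairing for RNA: A-U and G-C.
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str) -> str:
    """Return the antisense RNA (reverse complement), written 5'->3'."""
    seq = mrna.upper()
    if set(seq) - set("ACGU"):
        raise ValueError("mRNA must contain only A, C, G and U")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return seq.translate(_RNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(antisense_rna("AUGGCC"))
```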
Methods for introducing polynucleotides into tissues or cells include: injecting the polynucleotide directly into tissue in vivo; or first introducing the polynucleotide into cells in vitro by means of a vector (such as a virus, phage, or plasmid) and then transplanting the cells into the body.
The polypeptide of the present invention can also be used for peptide mapping; for example, the polypeptide can be specifically cleaved by physical, chemical, or enzymatic means and analyzed by one-, two-, or three-dimensional gel electrophoresis.
The invention also provides antibodies against antigenic determinants of the human protein with tumor suppressor function. These antibodies include (but are not limited to): polyclonal antibodies, monoclonal antibodies, chimeric antibodies, single-chain antibodies, Fab fragments, and fragments produced by a Fab expression library.
Antibodies against the human protein with tumor suppressor function can be used in immunohistochemical techniques to detect the protein in biopsy specimens.
Monoclonal antibodies that bind to the human protein with tumor suppressor function can also be labeled with radioisotopes and injected into the body to track their location and distribution. Such radiolabeled antibodies can serve as a non-invasive diagnostic method for locating tumor cells and determining whether metastasis has occurred.
The antibodies of the present invention can be used to treat or prevent diseases related to the human protein with tumor suppressor function. Administering an appropriate dose of antibody can stimulate or block the production or activity of the protein.
Antibodies can also be used to design immunotoxins directed at a particular site in the body. For example, a monoclonal antibody with high affinity for the human protein with tumor suppressor function can be covalently coupled to a bacterial or plant toxin (such as diphtheria toxin, ricin, or abrin). A common approach is to use a sulfhydryl cross-linking agent such as SPDP to attack the amino groups of the antibody and couple the toxin to the antibody through disulfide exchange. The resulting hybrid antibody can be used to kill cells that are positive for the human protein with tumor suppressor function.
Polyclonal antibodies can be produced by immunizing animals, such as rabbits, mice, or rats, with the human protein or polypeptide with tumor suppressor function. Various adjuvants can be used to enhance the immune response, including but not limited to Freund's adjuvant.
Monoclonal antibodies against the human protein with tumor suppressor function can be produced by the hybridoma technique (Kohler and Milstein. Nature, 1975, 256: 495-497). Chimeric antibodies that combine a human constant region with a non-human variable region can be produced by existing techniques (Morrison et al, PNAS, 1985, 81: 6851). Existing techniques for producing single-chain antibodies (U.S. Pat. No. 4946778) can also be used to produce single-chain antibodies against the human protein with tumor suppressor function.
Polypeptide molecules that bind to the human protein with tumor suppressor function can be obtained by screening random polypeptide libraries consisting of all possible combinations of amino acids bound to a solid phase. For screening, the human protein with tumor suppressor function must be labeled.
The invention also relates to diagnostic assays for quantifying and localizing the human protein with tumor suppressor function. Such assays are well known in the art and include FISH assays and radioimmunoassays. The protein levels measured in these assays can be used to explain the importance of the protein in various diseases and to diagnose diseases in which the protein plays a role.
Polynucleotides encoding the protein with tumor suppressor function can be used in the diagnosis and treatment of diseases related to the protein. For diagnosis, the polynucleotides can be used to detect whether the protein is expressed, or whether it is abnormally expressed in a disease state. For example, the DNA sequence of the protein can be hybridized to biopsy specimens to assess abnormal expression. Hybridization techniques include Southern blotting, Northern blotting, and in situ hybridization. These are all published, mature techniques, and the relevant kits are commercially available. Part or all of the polynucleotide of the present invention can be immobilized as a probe on a microarray or DNA chip (also called a "gene chip") for differential expression analysis of genes in tissues and for gene diagnosis. Transcripts of the protein with tumor suppressor function can also be detected by in vitro amplification using reverse transcription-polymerase chain reaction (RT-PCR) with primers specific for the protein.
Detecting mutations in the gene for the protein with tumor suppressor function can also be used to diagnose diseases related to the protein. Mutant forms include point mutations, translocations, deletions, recombinations, and any other abnormalities relative to the normal wild-type DNA sequence. Mutations can be detected by existing techniques such as Southern blotting, DNA sequence analysis, PCR, and in situ hybridization. In addition, mutations may affect expression of the protein, so Northern blotting and Western blotting can be used to judge indirectly whether the gene is mutated.
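Where mutations are detected by DNA sequence analysis, the simplest case, point mutations, amounts to a position-by-position comparison with the wild-type sequence. The sketch below is a minimal illustration that assumes both sequences have the same length and are already in register; the function name `point_mutations` and the toy sequences are hypothetical, and insertions, deletions, and translocations would need a real alignment step.

```python
from typing import List, Tuple


def point_mutations(wild_type: str, sample: str) -> List[Tuple[int, str, str]]:
    """List substitutions as (1-based position, wild-type base, sample base).

    Assumption: the sequences are the same length and in register, so
    only substitutions can be reported.
    """
    if len(wild_type) != len(sample):
        raise ValueError("sequences must have the same length")
    return [
        (i + 1, w, s)
        for i, (w, s) in enumerate(zip(wild_type.upper(), sample.upper()))
        if w != s
    ]


if __name__ == "__main__":
    # Toy example: a single G-to-A substitution at position 4.
    print(point_mutations("ATGGCC", "ATGACC"))
```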
The sequences of the invention are also valuable for chromosome identification. The sequence is specifically targeted to, and can hybridize with, a particular location on an individual human chromosome. At present, there is a need to identify the specific site of each gene on the chromosomes. Only a few chromosomal markers based on actual sequence data (repeat polymorphisms) are currently available for marking chromosomal locations. According to the present invention, mapping these DNA sequences to chromosomes is an important first step in correlating them with disease-associated genes.
Briefly, sequences can be mapped to chromosomes by preparing PCR primers (preferably 15-35 bp) from the cDNA. These primers are then used for PCR screening of somatic cell hybrids containing individual human chromosomes. Only those hybrids that contain the human gene corresponding to the primers yield an amplified fragment.
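The preferred primer length range can be checked mechanically, as in the sketch below. The melting temperature estimate uses the Wallace rule, Tm = 2(A+T) + 4(G+C) in °C, which is a rough rule of thumb for short oligonucleotides and is not mentioned in the text; it is included only as an assumed example of a further check one might apply. The function names are hypothetical.

```python
def primer_length_ok(primer: str, min_len: int = 15, max_len: int = 35) -> bool:
    """True when the primer length lies in the preferred 15-35 base range."""
    return min_len <= len(primer) <= max_len


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule.

    Assumption: this rule of thumb is not part of the source text.
    """
    seq = primer.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("primer must contain only A, C, G and T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    primer = "ATGC" * 5  # toy 20-mer, not a sequence from the patent
    print(primer_length_ok(primer), wallace_tm(primer))
```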
PCR mapping of somatic cell hybrids is a rapid method for assigning DNA to a particular chromosome. Using the oligonucleotide primers of the present invention, sublocalization can be achieved in a similar way with a panel of fragments from a specific chromosome or with a large pool of genomic clones. Other similar strategies that can be used for chromosome mapping include in situ hybridization, prescreening with labeled flow-sorted chromosomes, and preselection by hybridization to construct chromosome-specific cDNA libraries.
Fluorescence in situ hybridization (FISH) of a cDNA clone to metaphase chromosomes allows precise chromosomal mapping in a single step. For a review of this technique, see Verma et al., Human Chromosomes: a Manual of Basic Techniques, Pergamon Press, New York (1988).
Once a sequence has been mapped to a precise chromosomal location, its physical position on the chromosome can be correlated with genetic map data. Such data can be found, for example, in V. McKusick, Mendelian Inheritance in Man (available online through the Johns Hopkins University Welch Medical Library). The relationship between the gene and a disease that has been mapped to the same chromosomal region can then be determined by linkage analysis.
Next, differences in the cDNA or genomic sequence between affected and unaffected individuals need to be determined. If a mutation is observed in some or all of the affected individuals but in no normal individual, the mutation is likely to be the cause of the disease. Comparing affected and unaffected individuals usually involves first looking for structural changes in the chromosomes, such as deletions or translocations that are visible at the chromosome level or detectable by PCR based on the cDNA sequence. Given the resolution of current physical mapping and gene mapping techniques, a cDNA precisely mapped to a disease-associated chromosomal region may be one of between 50 and 500 potential causative genes (assuming a mapping resolution of 1 megabase and one gene per 20 kb).
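The gene-count estimate follows from dividing the size of the mapped region by the assumed gene spacing. The sketch below reproduces that arithmetic. The function name is hypothetical, and the 10-megabase region in the second call is an assumption added here to show how the upper figure of 500 could arise; the text itself states only the 1-megabase case.

```python
def candidate_gene_count(region_bp: int, bp_per_gene: int = 20_000) -> int:
    """Estimate how many genes fit in a mapped region at one gene per bp_per_gene."""
    return region_bp // bp_per_gene


# Stated assumption in the text: 1 Mb resolution, one gene per 20 kb.
print(candidate_gene_count(1_000_000))   # 50

# Illustrative assumption (not in the text): a coarser 10 Mb region.
print(candidate_gene_count(10_000_000))  # 500
```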
The full-length nucleotide sequence of the protein with tumor suppressor function of the invention, or a fragment thereof, can generally be obtained by PCR amplification, recombinant methods or chemical synthesis. For PCR amplification, primers can be designed from the nucleotide sequences disclosed herein, in particular the open reading frame sequences, and the relevant sequence is amplified using as template a commercially available cDNA library or a cDNA library prepared by conventional methods known to those skilled in the art. When the sequence is long, two or more rounds of PCR amplification are often required, and the fragments from each round are then spliced together in the correct order.
Once the relevant sequence has been obtained, it can be produced in large quantities by recombinant methods. This is usually done by cloning it into a vector, transferring the vector into cells, and then isolating the sequence from the propagated host cells by conventional methods.
In addition, the relevant sequences can be produced by chemical synthesis, particularly when the fragment is short. Long fragments are usually obtained by first synthesizing several small fragments and then ligating them.
At present, a DNA sequence encoding the protein of the invention (or a fragment or derivative thereof) can be obtained entirely by chemical synthesis. The DNA sequence can then be introduced into the various DNA molecules (such as vectors) and cells known in the art. Mutations can also be introduced into the protein sequence of the invention by chemical synthesis.
Moreover, because the protein with tumor suppressor function of the invention has a natural amino acid sequence of human origin, it is expected to show higher activity and/or fewer side effects (for example, lower or no immunogenicity in humans) when administered to humans than homologous proteins derived from other species.
The invention is further illustrated below with reference to specific examples. It should be understood that these examples serve only to illustrate the invention and are not intended to limit its scope. Experimental methods for which no specific conditions are given in the following examples are generally carried out under conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or under the conditions recommended by the manufacturer.
Example 1: Isolation of the cDNA genes and their inhibition of cancer cell colony formation
SP2114a was obtained from a liver cDNA library purchased from GIBCO BRL (catalog number: 10422-012). PP8153, PP8332, PP9177, PP9445, PP10199 and PP10226 were obtained from a human placenta cDNA library constructed by conventional methods. Placental tissue at 3, 6 and 10 months was collected, total RNA was extracted with Trizol reagent (GIBCO BRL) according to the manufacturer's instructions, and mRNA was isolated with an mRNA purification kit (Pharmacia). A cDNA library was constructed from this mRNA with the pCMV-Script™ XR cDNA library construction kit (Stratagene), except that the reverse transcriptase was replaced with MMLV-RT-Superscript II (GIBCO BRL) and the reverse transcription was carried out at 42°C. XL10-Gold competent cells were transformed, giving a cDNA library with a titer of 1×10⁶ cfu/μg cDNA. In the first round, cDNA clones were picked at random; thereafter, high-abundance cDNA clones and cDNA clones already shown to inhibit cancer cell growth were used as probes to screen the cDNA library by hybridization, and weakly positive and negative clones were picked.
Plasmid DNA was extracted with the Qiagen 96-well plasmid extraction kit according to the manufacturer's instructions. Plasmid DNA and empty vector were transfected in parallel into the hepatoma cell line 7721. For each sample, 100 ng of DNA was ethanol-precipitated, dried and dissolved in 6 μl of H2O for transfection. To each DNA sample were added 0.74 μl of liposome and 9.3 μl of serum-free medium; the mixture was mixed and left at room temperature for 10 minutes. A further 150 μl of serum-free medium was added to each tube, and the contents were divided equally among 3 wells of 7721 cells growing in a 96-well plate. After 2 hours at 37°C, 50 μl of serum-free medium was added to each well, followed by 24 hours at 37°C. The medium in each well was replaced with 100 μl of complete medium for 24 hours at 37°C, and then with 100 μl of complete medium containing G418 for 24 to 48 hours at 37°C; while under observation, the medium was changed to media with varying G418 concentrations. After about 2 to 3 changes, once colonies were visible under the microscope, the colonies were counted. The clones above were found to inhibit colony formation; the results are shown in the table below.
Colony formation by cells (7721) transfected with the cDNA clones
The cDNA clones above were sequenced by the dideoxy termination method on an ABI377 automated DNA sequencer, reading nearly 500 bp from one end. Clones identified by analysis as new genes were sequenced from the other end; where the full-length cDNA sequence was still not obtained, primers were designed and sequencing was repeated until the full-length sequences were obtained (SEQ ID NO: 1, 4, 7, 10, 13, 16, 19).
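One simple consistency check on a full-length sequence is whether the reported start and stop codon positions agree with the stated protein length. The sketch below applies this check to the coordinates given in the sequence listing for PP8153 (start 115, stop 1678, 521 amino acids) and PP8332 (start 688, stop 1204, 172 amino acids). The function name is hypothetical, and it assumes that both positions are 1-based coordinates of the first base of the respective codon, as the combined listings suggest.

```python
def protein_length(start_codon_pos: int, stop_codon_pos: int) -> int:
    """Number of amino acids encoded between the ATG and the stop codon.

    Both positions are 1-based and refer to the first base of the codon;
    the stop codon itself encodes no residue.
    """
    coding_nt = stop_codon_pos - start_codon_pos
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3


# (start, stop, stated protein length) taken from the sequence listing.
orfs = {
    "PP8153": (115, 1678, 521),
    "PP8332": (688, 1204, 172),
}

for clone, (start, stop, stated) in orfs.items():
    print(clone, protein_length(start, stop), stated)
```

For both clones the computed length matches the length stated in the listing.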
Example 2: Obtaining the full-length genes from placental cDNA by PCR
Placental tissue at 3, 6 and 10 months was collected, total RNA was extracted with Trizol reagent (GIBCO BRL) according to the manufacturer's instructions, and mRNA was isolated with an mRNA purification kit (Pharmacia). Reverse transcription was carried out at 42°C with MMLV-RT-Superscript II reverse transcriptase (GIBCO BRL) to obtain placental cDNA. PCR amplification was then performed with the gene-specific primers for each gene (shown in the table below) using the following program: 97°C for 3 minutes, 1 cycle; 94°C for 30 seconds → 60°C for 30 seconds → 72°C for 1 minute, 35 cycles; 72°C for 10 minutes, 1 cycle. This yielded amplification products for each protein gene containing the complete open reading frame. The amplification products were verified by sequencing and matched the sequences determined in Example 1; they were then transferred into host cells by conventional techniques to obtain the recombinant proteins. (Note: for SP2114a, the liver cDNA library purchased from GIBCO BRL (catalog number: 10422-012) can be used as the template.)
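The cycling program described above can be written down as a small data structure, which makes the total programmed hold time easy to compute. This is only a sketch: the class and function names are hypothetical, and the result excludes temperature ramp times, which depend on the thermal cycler.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int   # hold temperature in degrees Celsius
    seconds: int  # hold time in seconds


@dataclass(frozen=True)
class Stage:
    steps: tuple  # ordered Step objects run once per cycle
    cycles: int   # number of times the steps are repeated


# Program as stated in Example 2.
PROGRAM = (
    Stage((Step(97, 180),), 1),                               # initial denaturation
    Stage((Step(94, 30), Step(60, 30), Step(72, 60)), 35),    # denature, anneal, extend
    Stage((Step(72, 600),), 1),                               # final extension
)


def hold_time_seconds(program) -> int:
    """Sum of all programmed hold times, ignoring ramping between temperatures."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


total = hold_time_seconds(PROGRAM)
print(total, "s =", total / 60, "min")
```

With the stated settings the programmed hold time is 180 + 35 × 120 + 600 = 4980 seconds, or 83 minutes.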
Gene-specific primers
1. PP8153
A: Nucleotide sequence (SEQ ID NO:1)
Length: 2349 bp
1 GTGGAAGTAG AAGGCGGTGG CTGAGGCGGT TCCGGAGGTT CTAGTGTCGG AGTTGGGTGC
61 AGGCAGGTGC CATGGGCCCG CTTGAGGCAC ACTGAGGGGA CGCGGGGCTG GGCCATGGCC
121 GGCGCTCGGG CCGCCGCCGC CGCTGCCTCG GCGGGGTCCT CGGCCTCTTC AGGCAACCAG
181 CCGCCTCAGG AGCTGGGGCT TGGGGAGCTG CTGGAGGAGT TCTCCCGGAC TCAGTACCGG
241 GCCAAGGATG GCAGCGGGAC CGGCGGCTCT AAGGTTGAGC GCATTGAGAA GAGATGTCTG
301 GAGCTGTTTG GCCGAGACTA CTGTTTCAGC GTGATTCCAA ACACGAATGG GGATATCTGT
361 GGCCACTATC CCCGGCACAT CGTGTTCCTG GAGTATGAGA GTTCTGAGAA GGAGAAAGAC
421 ACGTTTGAGA GTACCGTACA GGTGAGCAAG TTGCAAGACC TCATCCACCG CAGCAAGATG
481 GCCCGGTGCA GAGGACGGTT TGTCTGCCCA GTAATCCTGT TCAAGGGCAA GCACATTTGC
541 AGGTCGGCCA CACTGGCTGG ATGGGGAGAG CTGTATGGAC GCTCAGGCTA CAACTATTTT
601 TTCTCAGGGG GTGCAGATGA TGCCTGGGCA GATGTGGAGG ACGTCACGGA GGAGGACTGT
661 GCTCTTCGAA GTGGTGACAC GCATCTTTTT GATAAGGTCA GAGGCTATGA CATCAAGCTG
721 CTTCGATACC TGTCAGTCAA ATACATCTGT GACCTGATGG TGGAGAACAA GAAGGTGAAG
781 TTTGGCATGA ATGTAACCTC CTCTGAGAAG GTGGACAAAG CCCAGCGCTA TGCCCACTTC
841 ACTCTCCTCT CCATCCCGTA TCCAGGCTGT GAATTTTTCA AGGAATATAA AGATCGGGAT
901 TACATGGCAG AAGGGCTCAT ATTTAACTGG AAGCAGGACT ACGTTGATGC CCCATTGAGC
961 ATCCCCGACT TCCTGACTCA CTCTCTGAAC ATTGACTGGA GCCAGTATCA GTGTTGGGAT
1021 CTGGTGCAAC AAACACAAAA CTACCTGAAG CTGCTGCTTT CCTTAGTTAA CAGTGATGAT
1081 GACAGCGGGC TGCTGGTACA CTGTATCTCA GGCTGGGATC GGACCCCCCT CTTCATCTCC
1141 CTCCTGCGCC TTTCCTTGTG GGCTGATGGG CTCATCCACA CGTCCCTGAA GCCCACTGAG
1201 ATCCTCTACC TCACTGTGGC CTATGACTGG TTCCTCTTCG GGCACATGTT GGTAGATCGG
1261 CTCAGCAAAG GGGAGGAGAT TTTCTTCTTC TGCTTCAATT TTTTGAAGCA TATTACCTCC
1321 GAGGAGTTCT CTGCTCTGAA GACCCAGAGG AGGAAGAGTT TGCCAGCCCG GGATGGAGGC
1381 TTCACCCTGG AAGACATCTG CATGCTGAGA CGAAAGGACC GTGGCAGCAC CACCAGCCTT
1441 GGCAGCGACT TCTCCCTGGT CATGGAGAGT TCCCCAGGAG CCACTGGGAG CTTCACCTAT
1501 GAAGGCCGTG GAGCTGGTCC CAGCAGGAGC GCCAACTCAG GCAGCTTGAA GGAAGAGCCA
1561 CTCATCCTCT CCACAGAGTG TCCTCTGGAA CCGGCCACAA CCCTCAGAGG ACCGCTTGCC
1621 TTCCCAGCAG GGGCTGGCGG AAGCCAGGTC TTCCAGCTCC TCTTCCTCAA ACCATTCTGA
1681 TAACTTTTTC AGGATGGGTA GCAGTCCCCT GGAGGTCCCC AAACCCAGGC TTGCAGCCCT
1741 GAGTGATCGA GAGACTCGGC TGCAGGAGGT GCGCTCAGCC TTCTTGGCTG CGTACAGCAG
1801 CACAGTGGGG CTTCGGGCAG TAGCCCCCAG TCCTTCCGGT GCCATCGGGG GCCTGCTGGA
1861 GCAATTTGCC CGTGGTGTTG GACTCCGGAG CATCAGCAGC AATGCCTTGT GAAGAAGCCA
1921 GCCCATGACA TTTTCCTGCT CCTCTCTCAG CTGAGCCCTT AGCAGAGAAT CAAAGCCATG
1981 CCTGGCCGAA GGGGTACTTC CAGGTCAGGG GAAATTTCAG TCCCCCATCT CCATCATGAA
2041 CATGGCAGCC CCAAAGCTGA GCAAGGCCAA AGACAGGGTT TTCCAACCCC CAGCCTCTTG
2101 ACTGGTGACC ACCACCCCTT CTTGTCACTG TCTCCCACCC ACCCCATCTT TGCTGGGATT
2161 CCCATCAACT CTCAGAACTG TGTGGGGTTT CCCTGGGGCC TTGTGGAAGC CATGACTTCA
2221 CAAAGACCCT ACCTGTCAGT TCTTGTTTCT GGGGAGGAGG GATCACCTGC ACTGAGAATG
2281 AGGCAGTTTG ACACAGATCA CAAAATAAAA TCAAAGTCTT TTTGAATAGC CAAAAAAAAA
2341 AAAAAAAAAA
B: Amino acid sequence (SEQ ID NO:2)
Length: 521 amino acids
1 MAGARAAAAA ASAGSSASSG NQPPQELGLG ELLEEFSRTQ YRAKDGSGTG GSKVERIEKR
61 CLELFGRDYC FSVIPNTNGD ICGHYPRHIV FLEYESSEKE KDTFESTVQV SKLQDLIHRS
121 KMARCRGRFV CPVILFKGKH ICRSATLAGW GELYGRSGYN YFFSGGADDA WADVEDVTEE
181 DCALRSGDTH LFDKVRGYDI KLLRYLSVKY ICDLMVENKK VKFGMNVTSS EKVDKAQRYA
241 DFTLLSIPYP GCEFFKEYKD RDYMAEGLIF NWKQDYVDAP LSIPDFLTHS LNIDWSQYQC
301 WDLVQQTQNY LKLLLSLVNS DDDSGLLVHC ISGWDRTPLF ISLLRLSLWA DGLIHTSLKP
361 TEILYLTVAY DWFLFGHMLV DRLSKGEEIF FFCFNFLKHI TSEEFSALKT QRRKSLPARD
421 GGFTLEDICM LRRKDRGSTT SLGSDFSLVM ESSPGATGSF TYEGRGAGPS RSANSGSLKE
481 EPLILSTECP LEPATTLRGP LAFPAGAGGS QVFQLLFLKP F
C. Combined nucleotide and amino acid sequence (SEQ ID NO:3)
Clone number: PP8153
Start codon: 115 ATG    Stop codon: 1678 TGA
Protein molecular weight: 58350.05
1 GTG GAA GTA GAA GGC GGT GGC TGA GGC GGT TCC GGA GGT TCT AGT GTC 48
49 GGA GTT GGG TGC AGG CAG GTG CCA TGG GCC CGC TTG AGG CAC ACT GAG 96
97 GGG ACG CGG GGC TGG GCC ATG GCC GGC GCT CGG GCC GCC GCC GCC GCT 144
1 Met Ala Gly Ala Arg Ala Ala Ala Ala Ala 10
145 GCC TCG GCG GGG TCC TCG GCC TCT TCA GGC AAC CAG CCG CCT CAG GAG 192
11 Ala Ser Ala Gly Ser Ser Ala Ser Ser Gly Asn Gln Pro Pro Gln Glu 26
193 CTG GGG CTT GGG GAG CTG CTG GAG GAG TTC TCC CGG ACT CAG TAC CGG 240
27 Leu Gly Leu Gly Glu Leu Leu Glu Glu Phe Ser Arg Thr Gln Tyr Arg 42
241 GCC AAG GAT GGC AGC GGG ACC GGC GGC TCT AAG GTT GAG CGC ATT GAG 288
43 Ala Lys Asp Gly Ser Gly Thr Gly Gly Ser Lys Val Glu Arg Ile Glu 58
289 AAG AGA TGT CTG GAG CTG TTT GGC CGA GAC TAC TGT TTC AGC GTG ATT 336
59 Lys Arg Cys Leu Glu Leu Phe Gly Arg Asp Tyr Cys Phe Ser Val Ile 74
337 CCA AAC ACG AAT GGG GAT ATC TGT GGC CAC TAT CCC CGG CAC ATC GTG 384
75 Pro Asn Thr Asn Gly Asp Ile Cys Gly His Tyr Pro Arg His Ile Val 90
385 TTC CTG GAG TAT GAG AGT TCT GAG AAG GAG AAA GAC ACG TTT GAG AGT 432
91 Phe Leu Glu Tyr Glu Ser Ser Glu Lys Glu Lys Asp Thr Phe Glu Ser 106
433 ACC GTA CAG GTG AGC AAG TTG CAA GAC CTC ATC CAC CGC AGC AAG ATG 480
107 Thr Val Gln Val Ser Lys Leu Gln Asp Leu Ile His Arg Ser Lys Met 122
481 GCC CGG TGC AGA GGA CGG TTT GTC TGC CCA GTA ATC CTG TTC AAG GGC 528
123 Ala Arg Cys Arg Gly Arg Phe Val Cys Pro Val Ile Leu Phe Lys Gly 138
529 AAG CAC ATT TGC AGG TCG GCC ACA CTG GCT GGA TGG GGA GAG CTG TAT 576
139 Lys His Ile Cys Arg Ser Ala Thr Leu Ala Gly Trp Gly Glu Leu Tyr 154
577 GGA CGC TCA GGC TAC AAC TAT TTT TTC TCA GGG GGT GCA GAT GAT GCC 624
155 Gly Arg Ser Gly Tyr Asn Tyr Phe Phe Ser Gly Gly Ala Asp Asp Ala 170
625 TGG GCA GAT GTG GAG GAC GTC ACG GAG GAG GAC TGT GCT CTT CGA AGT 672
171 Trp Ala Asp Val Glu Asp Val Thr Glu Glu Asp Cys Ala Leu Arg Ser 186
673 GGT GAC ACG CAT CTT TTT GAT AAG GTC AGA GGC TAT GAC ATC AAG CTG 720
187 Gly Asp Thr His Leu Phe Asp Lys Val Arg Gly Tyr Asp Ile Lys Leu 202
721 CTT CGA TAC CTG TCA GTC AAA TAC ATC TGT GAC CTG ATG GTG GAG AAC 768
203 Leu Arg Tyr Leu Ser Val Lys Tyr Ile Cys Asp Leu Met Val Glu Asn 218
769 AAG AAG GTG AAG TTT GGC ATG AAT GTA ACC TCC TCT GAG AAG GTG GAC 816
219 Lys Lys Val Lys Phe Gly Met Asn Val Thr Ser Ser Glu Lys Val Asp 234
817 AAA GCC CAG CGC TAT GCC GAC TTC ACT CTC CTC TCC ATC CCG TAT CCA 864
235 Lys Ala Gln Arg Tyr Ala Asp Phe Thr Leu Leu Ser Ile Pro Tyr Pro 250
865 GGC TGT GAA TTT TTC AAG GAA TAT AAA GAT CGG GAT TAC ATG GCA GAA 912
251 Gly Cys Glu Phe Phe Lys Glu Tyr Lys Asp Arg Asp Tyr Met Ala Glu 266
913 GGG CTC ATA TTT AAC TGG AAG CAG GAC TAC GTT GAT GCC CCA TTG AGC 960
267 Gly Leu Ile Phe Asn Trp Lys Gln Asp Tyr Val Asp Ala Pro Leu Ser 282
961 ATC CCC GAC TTC CTG ACT CAC TCT CTG AAC ATT GAC TGG AGC CAG TAT 1008
283 Ile Pro Asp Phe Leu Thr His Ser Leu Asn Ile Asp Trp Ser Gln Tyr 298
1009 CAG TGT TGG GAT CTG GTG CAA CAA ACA CAA AAC TAC CTG AAG CTG CTG 1056
299 Gln Cys Trp Asp Leu Val Gln Gln Thr Gln Asn Tyr Leu Lys Leu Leu 314
1057 CTT TCC TTA GTT AAC AGT GAT GAT GAC AGC GGG CTG CTG GTA CAC TGT 1104
315 Leu Ser Leu Val Asn Ser Asp Asp Asp Ser Gly Leu Leu Val His Cys 330
1105 ATC TCA GGC TGG GAT CGG ACC CCC CTC TTC ATC TCC CTC CTG CGC CTT 1152
331 Ile Ser Gly Trp Asp Arg Thr Pro Leu Phe Ile Ser Leu Leu Arg Leu 346
1153 TCC TTG TGG GCT GAT GGG CTC ATC CAC ACG TCC CTG AAG CCC ACT GAG 1200
347 Ser Leu Trp Ala Asp Gly Leu Ile His Thr Ser Leu Lys Pro Thr Glu 362
1201 ATC CTC TAC CTC ACT GTG GCC TAT GAC TGG TTC CTC TTC GGG CAC ATG 1248
363 Ile Leu Tyr Leu Thr Val Ala Tyr Asp Trp Phe Leu Phe Gly His Met 378
1249 TTG GTA GAT CGG CTC AGC AAA GGG GAG GAG ATT TTC TTC TTC TGC TTC 1296
379 Leu Val Asp Arg Leu Ser Lys Gly Glu Glu Ile Phe Phe Phe Cys Phe 394
1297 AAT TTT TTG AAG CAT ATT ACC TCC GAG GAG TTC TCT GCT CTG AAG ACC 1344
395 Asn Phe Leu Lys His Ile Thr Ser Glu Glu Phe Ser Ala Leu Lys Thr 410
1345 CAG AGG AGG AAG AGT TTG CCA GCC CGG GAT GGA GGC TTC ACC CTG GAA 1392
411 Gln Arg Arg Lys Ser Leu Pro Ala Arg Asp Gly Gly Phe Thr Leu Glu 426
1393 GAC ATC TGC ATG CTG AGA CGA AAG GAC CGT GGC AGC ACC ACC AGC CTT 1440
427 Asp Ile Cys Met Leu Arg Arg Lys Asp Arg Gly Ser Thr Thr Ser Leu 442
1441 GGC AGC GAC TTC TCC CTG GTC ATG GAG AGT TCC CCA GGA GCC ACT GGG 1488
443 Gly Ser Asp Phe Ser Leu Val Met Glu Ser Ser Pro Gly Ala Thr Gly 458
1489 AGC TTC ACC TAT GAA GGC CGT GGA GCT GGT CCC AGC AGG AGC GCC AAC 1536
459 Ser Phe Thr Tyr Glu Gly Arg Gly Ala Gly Pro Ser Arg Ser Ala Asn 474
1537 TCA GGC AGC TTG AAG GAA GAG CCA CTC ATC CTC TCC ACA GAG TGT CCT 1584
475 Ser Gly Ser Leu Lys Glu Glu Pro Leu Ile Leu Ser Thr Glu Cys Pro 490
1585 CTG GAA CCG GCC ACA ACC CTC AGA GGA CCG CTT GCC TTC CCA GCA GGG 1632
491 Leu Glu Pro Ala Thr Thr Leu Arg Gly Pro Leu Ala Phe Pro Ala Gly 506
1633 GCT GGC GGA AGC CAG GTC TTC CAG CTC CTC TTC CTC AAA CCA TTC TGA 1680
507 Ala Gly Gly Ser Gln Val Phe Gln Leu Leu Phe Leu Lys Pro Phe *** 522
1681 TAA CTT TTT CAG GAT GGG TAG CAG TCC CCT GGA GGT CCC CAA ACC CAG 1728
1729 GCT TGC AGC CCT GAG TGA TCG AGA GAC TCG GCT GCA GGA GGT GCG CTC 1776
1777 AGC CTT CTT GGC TGC GTA CAG CAG CAC AGT GGG GCT TCG GGC AGT AGC 1824
1825 CCC CAG TCC TTC CGG TGC CAT CGG GGG CCT GCT GGA GCA ATT TGC CCG 1872
1873 TGG TGT TGG ACT CCG GAG CAT CAG CAG CAA TGC CTT GTG AAG AAG CCA 1920
1921 GCC CAT GAC ATT TTC CTG CTC CTC TCT CAG CTG AGC CCT TAG CAG AGA 1968
1969 ATC AAA GCC ATG CCT GGC CGA AGG GGT ACT TCC AGG TCA GGG GAA ATT 2016
2017 TCA GTC CCC CAT CTC CAT CAT GAA CAT GGC AGC CCC AAA GCT GAG CAA 2064
2065 GGC CAA AGA CAG GGT TTT CCA ACC CCC AGC CTC TTG ACT GGT GAC CAC 2112
2113 CAC CCC TTC TTG TCA CTG TCT CCC ACC CAC CCC ATC TTT GCT GGG ATT 2160
2161 CCC ATC AAC TCT CAG AAC TGT GTG GGG TTT CCC TGG GGC CTT GTG GAA 2208
2209 GCC ATG ACT TCA CAA AGA CCC TAC CTG TCA GTT CTT GTT TCT GGG GAG 2256
2257 GAG GGA TCA CCT GCA CTG AGA ATG AGG CAG TTT GAC ACA GAT CAC AAA 2304
2305 ATA AAA TCA AAG TCT TTT TGA ATA GCC AAA AAA AAA AAA AAA AAA 2349
2. PP8332
A: Nucleotide sequence (SEQ ID NO:4)
Length: 1771 bp
1 GCCTGGGGCG TCCCCGCGAA GCCTGGGCCT GTCAGGCGGT TCCGTCCGGG TCTCGGCCAC
61 CGTCGAGTTC CGTCGAGTTC CGTCCCGGCC CTGCTCACAG CAGCGCCCTC GGAGCGCCCA
121 GCACCTGCGG CCGGCCAGGC AGCGCGATCC TGCGGCGTCT GGCCATCCCG AATGCTATGG
181 CCGCCGTCGC CGTCTTGCGG GCCTTCGGGG CAAGTGGGCC CATGTGTCTC CGGCGCGGCC
241 CCTGGGCCCA GCTCCCCGCC CGCTTCTGCA GCCGGGACCC GGCCGGGGCG GGGCGGCGGG
301 AGTCGGAGCC GCGGCCCACC AGCGCGCGGC AGCTGGACGG CATAAGGAAC ATCGTCTTGA
361 GCAATCCCAA GAAGAGGAAC ACGTTGTCAC TTGCAATGCT GAAATCTCTC CAAAGTGACA
421 TTCTTCATGA CGCTGACAGC AACGATCTGA AAGTCATTAT CATCTCGGCT GAGGGGCCTG
481 TGTTTTCTTC TGGGCATGAC TTAAAGGAGC TGACAGAGGA GCAAGGCCGT GATTACCATG
541 CCGAAGTATT TCAGACCTGT TCCAAGGGTC TCGCTCTGTC GCCCAGGCTG GATTACAGTG
601 GCATGATCTC GGCTCACTGC AACCTCTGCC TCCCGGGTTC AAGCAATTCT CCTGCCTCAG
661 CCTCCTGAGT AGCTGGGACT ACAGGTCATG ATGCACATCC GGAACCACCC CGTCCCCGTC
721 ATTGCCATGG TCAATGGCCT GGCCACGGCT GCCGGCTGTC AACTGGTTGC CAGCTGCGAC
781 ATTGCCGTGG CGAGCGACAA GTCCTCTTTT GCCACTCCTG GGGTGAACGT CGGGCTCTTC
841 TGTTCTACCC CTGGGGTTGC CTTGGCAAGA GCAGTGCCTA GAAAGGTGGC CTTGGAGATG
901 CTCTTTACTG GTGAGCCCAT TTCTGCCCAG GAGGCCCTGC TCCACGGGCT GCTTAGCAAG
961 GTGGTGCCAG AGGCGGAGCT GCAGGAGGAG ACCATGCGGA TCGCTAGGAA GATCGCATCG
1021 CTGAGCCGTC CGGTGGTGTC CCTGGGCAAA GCCACCTTCT ACAAGCAGCT GCCCCAGGAC
1081 CTGGGGACGG CTTACTACCT CACCTCCCAG GCCATGGTGG ACAACCTGGC CCTGCGGGAC
1141 GGGCAGGAGG GCATCACGGC CTTCCTCCAG AAGAGAAAAC CTGTCTGGTC ACACGAGCCA
1201 GTGTGAGTGG AGGCAGAGGA GTGAGGCCCA CGGGCAGCGC CCAGGAGCCC ACCTTCCCCT
1261 CTGGCCCAGC CACCACTGCC TCTCAGCTTC AACAGGTGAC AGGCTGCTTT CGTGACTTGA
1321 TATTGGTGTC ATAGCATTTG GCCTACATTA AAAGCCACAA TTTCATGGGG AAAGGACAAA
1381 ATGGAGAGTG ACTGAGGTGC TGACCTCAGT GCAAGGCTGG TGAACCCTGC AGCGGGCCAG
1441 CTATGGTGGG AAGCCTGGCA TTTGGGGTGC TCCTTGCAAC GTCTTAAGCA AGCGACCCCC
1501 CTGACATAGC AAAAGGTGGC AACCCATGGA GGCAGAAAGA AGGACGCCAG CCTGACCCTT
1561 ATCTTGAAAC GTCCTAAGCA GAGTTAATCC TGGCTGCTCA GGAGAGGCGA CACATTTCAA
1621 ATCTCCACGA GATATTCTCC ACACAGAAAA TCTTCTTGAT TCTATAGAGA CTTAATCATG
1681 CCTATGGCTT TGAATAATCT TATGTGATTT AAATAAATTA AATCTTTATA GAGACTGGAA
1741 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A
B: Amino acid sequence (SEQ ID NO: 5) Length: 172 amino acids
1 MMHIRNHPVP VIAMVNGLAT AAGCQLVASC DIAVASDKSS FATPGVNVGL FCSTPGVALA
61 RAVPRKVALE MLFTGEPISA QEALLHGLLS KVVPEAELQE ETMRIARKIA SLSRPVVSLG
121 KATFYKQLPQ DLGTAYYLTS QAMVDNLALR DGQEGITAFL QKRKPVWSHE PV
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 6)
Clone number: PP8332
Start codon: 688 ATG  Stop codon: 1204 TGA
Protein molecular weight: 18409.48
1 GCC TGG GGC GTC CCC GCG AAG CCT GGG CCT GTC AGG CGG TTC CGT CCG 48
49 GGT CTC GGC CAC CGT CGA GTT CCG TCG AGT TCC GTC CCG GCC CTG CTC 96
97 ACA GCA GCG CCC TCG GAG CGC CCA GCA CCT GCG GCC GGC CAG GCA GCG 144
145 CGA TCC TGC GGC GTC TGG CCA TCC CGA ATG CTA TGG CCG CCG TCG CCG 192
193 TCT TGC GGG CCT TCG GGG CAA GTG GGC CCA TGT GTC TCC GGC GCG GCC 240
241 CCT GGG CCC AGC TCC CCG CCC GCT TCT GCA GCC GGG ACC CGG CCG GGG 288
289 CGG GGC GGC GGG AGT CGG AGC CGC GGC CCA CCA GCG CGC GGC AGC TGG 336
337 ACG GCA TAA GGA ACA TCG TCT TGA GCA ATC CCA AGA AGA GGA ACA CGT 384
385 TGT CAC TTG CAA TGC TGA AAT CTC TCC AAA GTG ACA TTC TTC ATG ACG 432
433 CTG ACA GCA ACG ATC TGA AAG TCA TTA TCA TCT CGG CTG AGG GGC CTG 480
481 TGT TTT CTT CTG GGC ATG ACT TAA AGG AGC TGA CAG AGG AGC AAG GCC 528
529 GTG ATT ACC ATG CCG AAG TAT TTC AGA CCT GTT CCA AGG GTC TCG CTC 576
577 TGT CGC CCA GGC TGG ATT ACA GTG GCA TGA TCT CGG CTC ACT GCA ACC 624
625 TCT GCC TCC CGG GTT CAA GCA ATT CTC CTG CCT CAG CCT CCT GAG TAG 672
673 CTG GGA CTA CAG GTC ATG ATG CAC ATC CGG AAC CAC CCC GTC CCC GTC 720
1 Met Met His Ile Arg Asn His Pro Val Pro Val 11
721 ATT GCC ATG GTC AAT GGC CTG GCC ACG GCT GCC GGC TGT CAA CTG GTT 768
12 Ile Ala Met Val Asn Gly Leu Ala Thr Ala Ala Gly Cys Gln Leu Val 27
769 GCC AGC TGC GAC ATT GCC GTG GCG AGC GAC AAG TCC TCT TTT GCC ACT 816
28 Ala Ser Cys Asp Ile Ala Val Ala Ser Asp Lys Ser Ser Phe Ala Thr 43
817 CCT GGG GTG AAC GTC GGG CTC TTC TGT TCT ACC CCT GGG GTT GCC TTG 864
44 Pro Gly Val Asn Val Gly Leu Phe Cys Ser Thr Pro Gly Val Ala Leu 59
865 GCA AGA GCA GTG CCT AGA AAG GTG GCC TTG GAG ATG CTC TTT ACT GGT 912
60 Ala Arg Ala Val Pro Arg Lys Val Ala Leu Glu Met Leu Phe Thr Gly 75
913 GAG CCC ATT TCT GCC CAG GAG GCC CTG CTC CAC GGG CTG CTT AGC AAG 960
76 Glu Pro Ile Ser Ala Gln Glu Ala Leu Leu His Gly Leu Leu Ser Lys 91
961 GTG GTG CCA GAG GCG GAG CTG CAG GAG GAG ACC ATG CGG ATC GCT AGG 1008
92 Val Val Pro Glu Ala Glu Leu Gln Glu Glu Thr Met Arg Ile Ala Arg 107
1009 AAG ATC GCA TCG CTG AGC CGT CCG GTG GTG TCC CTG GGC AAA GCC ACC 1056
108 Lys Ile Ala Ser Leu Ser Arg Pro Val Val Ser Leu Gly Lys Ala Thr 123
1057 TTC TAC AAG CAG CTG CCC CAG GAC CTG GGG ACG GCT TAC TAC CTC ACC 1104
124 Phe Tyr Lys Gln Leu Pro Gln Asp Leu Gly Thr Ala Tyr Tyr Leu Thr 139
1105 TCC CAG GCC ATG GTG GAC AAC CTG GCC CTG CGG GAC GGG CAG GAG GGC 1152
140 Ser Gln Ala Met Val Asp Asn Leu Ala Leu Arg Asp Gly Gln Glu Gly 155
1153 ATC ACG GCC TTC CTC CAG AAG AGA AAA CCT GTC TGG TCA CAC GAG CCA 1200
156 Ile Thr Ala Phe Leu Gln Lys Arg Lys Pro Val Trp Ser His Glu Pro 171
1201 GTG TGA GTG GAG GCA GAG GAG TGA GGC CCA CGG GCA GCG CCC AGG AGC 1248
172 Val *** 173
1249 CCA CCT TCC CCT CTG GCC CAG CCA CCA CTG CCT CTC AGC TTC AAC AGG 1296
1297 TGA CAG GCT GCT TTC GTG ACT TGA TAT TGG TGT CAT AGC ATT TGG CCT 1344
1345 ACA TTA AAA GCC ACA ATT TCA TGG GGA AAG GAC AAA ATG GAG AGT GAC 1392
1393 TGA GGT GCT GAC CTC AGT GCA AGG CTG GTG AAC CCT GCA GCG GGC CAG 1440
1441 CTA TGG TGG GAA GCC TGG CAT TTG GGG TGC TCC TTG CAA CGT CTT AAG 1488
1489 CAA GCG ACC CCC CTG ACA TAG CAA AAG GTG GCA ACC CAT GGA GGC AGA 1536
1537 AAG AAG GAC GCC AGC CTG ACC CTT ATC TTG AAA CGT CCT AAG CAG AGT 1584
1585 TAA TCC TGG CTG CTC AGG AGA GGC GAC ACA TTT CAA ATC TCC ACG AGA 1632
1633 TAT TCT CCA CAC AGA AAA TCT TCT TGA TTC TAT AGA GAC TTA ATC ATG 1680
1681 CCT ATG GCT TTG AAT AAT CTT ATG TGA TTT AAA TAA ATT AAA TCT TTA 1728
1729 TAG AGA CTG GAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 1771
3. PP9177
A: Nucleotide sequence (SEQ ID NO: 7) Length: 2160 bp
1 GCGTCTGCCA GCCGGCTTGG CTAGCGCGCG GCGGCCGTGG CTAAGGCTGC TACGAAGCGA
61 GCTTGGGAGG AGCAGCGGCC TGCGGGGCAG AGGAGCATCC CGTCTACCAG GTCCCAAGCG
121 GCCGTGGCCC GCGGGTCATG GCCAAAGGAG AAGGCGCCGA GAGCGGCTCC GCGGCGGGGC
181 TGCTACCCAC CAGCATCCTC CAAAGCACTG AACGCCCGGC CCAGGTGAAG AAAGAACCGA
241 AAAAGAAGAA ACAACAGTTG TCTGTTTGCA ACAAGCTTTG CTATGCACTT GGGGGAGCCC
301 CCTACCAGGT GACGGGCTGT GCCCTGGGTT TCTTCCTTCA GATCTACCTA TTGGATGTGG
361 CTCAGGTGGG CCCTTTCTCT GCCTCCATCA TCCTGTTTGT GGGCCGAGCC TGGGATGCCA
421 TCACAGACCC CCTGGTGGGC CTCTGCATCA GCAAATCCCC CTGGACCTGC CTGGGTCGCC
481 TTATGCCCTG GATCATCTTC TCCACGCCCC TGGCCGTCAT TGCCTACTTC CTCATCTGGT
541 TCGTGCCCGA CTTCCCACAC GGCCAGACCT ATTGGTACCT GCTTTTCTAT TGCCTCTTTG
601 AAACAATGGT CACGTGTTTC CATGTTCCCT ACTCGGCTCT CACCATGTTC ATCAGCACCG
661 AGCAGACTGA GCGGGATTCT GCCACCGCCT ATCGGATGAC TGTGGAAGTG CTGGGCACAG
721 TGCTGGGCAC GGCGATCCAG GGACAAATCG TGGGCCAAGC AGACACGCCT TGTTTCCAGG
781 ACCTCAATAG CTCTACAGTA GCTTCACAAA GTGCCAACCA TACACATGGC ACCACCTCAC
841 ACAGGGAAAC GCAAAAGGCA TACCTGCTGG CAGCGGGGGT CATTGTCTGT ATCTATATAA
901 TCTGTGCTGT CATCCTGATC CTGGGCGTGC GGGAGCAGAG AGAACCCTAT GAAGCCCAGC
961 AGTCTGAGCC AATCGCCTAC TTCCGGGGCC TACGGCTGGT CATGAGCCAC GGCCCATACA
1021 TCAAACTTAT TACTGGCTTC CTCTTCACCT CCTTGGCTTT CATGCTGGTG GAGGGGAACT
1081 TTGTCTTGTT TTGCACCTAC ACCTTGGGCT TCCGCAATGA ATTCCAGAAT CTACTCCTGG
1141 CCATCATGCT CTCGGCCACT TTAACCATTC CCATCTGGCA GTGGTTCTTG ACCCGGTTTG
1201 GCAAGAAGAC AGCTGTATAT GTTGGGATCT CATCAGCAGT GCCATTTCTC ATCTTGGTGG
1261 CCCTCATGGA GAGTAACCTC ATCATTACAT ATGCGGTAGC TGTGGCAGCT GGCATCAGTG
1321 TGGCAGCTGC CTTCTTACTA CCCTGGTCCA TGCTGCCTGA TGTCATTGAC GACTTCCATC
1381 TGAAGCAGCC CCACTTCCAT GGAACCGAGC CCATCTTCTT CTCCTTCTAT GTCTTCTTCA
1441 CCAAGTTTGC CTCTGGAGTG TCACTGGGCA TTTCTACCCT CAGTCTGGAC TTTGCAGGGT
1501 ACCAGACCCG TGGCTGCTCG CAGCCGGAAC GTGTCAAGTT TACACTGAAC ATGCTCGTGA
1561 CCATGGCTCC CATAGTTCTC ATCCTGCTGG GCCTGCTGCT CTTCAAAATG TACCCCATTG
1621 ATGAGGAGAG GCGGCGGCAG AATAAGAAGG CCCTGCAGGC ACTGAGGGAC GAGGCCAGCA
1681 GCTCTGGCTG CTCAGAAACA GACTCCACAG AGCTGGCTAG CATCCTCTAG GGCCCGCCAC
1741 GTTGCCCGAA GCCACCATGC AGAAGGCCAC AGAAGGGATC AGGACCTGTC TGCCGGCTTG
1801 CTGAGCAGCT GGACTGCAGG TGCTAGGAAG GGAACTGAAG ACTCAAGGAG GTGGCCCAGG
1861 ACACTTGCTG TGCTCACTGT GGGGCCGGCT GCTCTGTGGC CTCCTGCCTC CCCTCTGCCT
1921 GCCTGTGGGG CCAAGCCCTG GGGCTGCCAC TGTGAATATG CCAAGGACTG ATCGGGCCTA
1981 GCCCGGAACA CTAATGTAGA AACCTTTTTT TTACAGAGCC TAATTAATAA CTTAATGACT
2041 GTGTACATAG CAATGTGTGT GTATGTATAT GTCTGTGAGC TATTAATGTT ATTAATTTTC
2101 ATAAAAGCTG GAAAGCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
B: Amino acid sequence (SEQ ID NO: 8) Length: 530 amino acids
1 MAKGEGAESG SAAGLLPTSI LQSTERPAQV KKEPKKKKQQ LSVCNKLCYA LGGAPYQVTG
61 CALGFFLQIY LLDVAQVGPF SASIILFVGR AWDAITDPLV GLCISKSPWT CLGRLMPWII
121 FSTPLAVIAY FLIWFVPDFP HGQTYWYLLF YCLFETMVTC FHVPYSALTM FISTEQTERD
181 SATAYRMTVE VLGTVLGTAI QGQIVGQADT PCFQDLNSST VASQSANHTH GTTSHRETQK
241 AYLLAAGVIV CIYIICAVIL ILGVREQREP YEAQQSEPIA YFRGLRLVMS HGPYIKLITG
301 FLFTSLAFML VEGNFVLFCT YTLGFRNEFQ NLLLAIMLSA TLTIPIWQWF LTRFGKKTAV
361 YVGISSAVPF LILVALMESN LIITYAVAVA AGISVAAAFL LPWSMLPDVI DDFHLKQPHF
421 HGTEPIFFSF YVFFTKFASG VSLGISTLSL DFAGYQTRGC SQPERVKFTL NMLVTMAPIV
481 LILLGLLLFK MYPIDEERRR QNKKALQALR DEASSSGCSE TDSTELASIL
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 9)
Clone number: PP9177
Start codon: 138 ATG  Stop codon: 1728 TAG
Protein molecular weight: 58620.36
1 GC GTC TGC CAG CCG GCT TGG CTA GCG CGC GGC GGC CGT GGC TAA GGC 47
48 TGC TAC GAA GCG AGC TTG GGA GGA GCA GCG GCC TGC GGG GCA GAG GAG 95
96 CAT CCC GTC TAC CAG GTC CCA AGC GGC CGT GGC CCG CGG GTC ATG GCC 143
1 Met Ala 2
144 AAA GGA GAA GGC GCC GAG AGC GGC TCC GCG GCG GGG CTG CTA CCC ACC 191
3 Lys Gly Glu Gly Ala Glu Ser Gly Ser Ala Ala Gly Leu Leu Pro Thr 18
192 AGC ATC CTC CAA AGC ACT GAA CGC CCG GCC CAG GTG AAG AAA GAA CCG 239
19 Ser Ile Leu Gln Ser Thr Glu Arg Pro Ala Gln Val Lys Lys Glu Pro 34
240 AAA AAG AAG AAA CAA CAG TTG TCT GTT TGC AAC AAG CTT TGC TAT GCA 287
35 Lys Lys Lys Lys Gln Gln Leu Ser Val Cys Asn Lys Leu Cys Tyr Ala 50
288 CTT GGG GGA GCC CCC TAC CAG GTG ACG GGC TGT GCC CTG GGT TTC TTC 335
51 Leu Gly Gly Ala Pro Tyr Gln Val Thr Gly Cys Ala Leu Gly Phe Phe 66
336 CTT CAG ATC TAC CTA TTG GAT GTG GCT CAG GTG GGC CCT TTC TCT GCC 383
67 Leu Gln Ile Tyr Leu Leu Asp Val Ala Gln Val Gly Pro Phe Ser Ala 82
384 TCC ATC ATC CTG TTT GTG GGC CGA GCC TGG GAT GCC ATC ACA GAC CCC 431
83 Ser Ile Ile Leu Phe Val Gly Arg Ala Trp Asp Ala Ile Thr Asp Pro 98
432 CTG GTG GGC CTC TGC ATC AGC AAA TCC CCC TGG ACC TGC CTG GGT CGC 479
99 Leu Val Gly Leu Cys Ile Ser Lys Ser Pro Trp Thr Cys Leu Gly Arg 114
480 CTT ATG CCC TGG ATC ATC TTC TCC ACG CCC CTG GCC GTC ATT GCC TAC 527
115 Leu Met Pro Trp Ile Ile Phe Ser Thr Pro Leu Ala Val Ile Ala Tyr 130
528 TTC CTC ATC TGG TTC GTG CCC GAC TTC CCA CAC GGC CAG ACC TAT TGG 575
131 Phe Leu Ile Trp Phe Val Pro Asp Phe Pro His Gly Gln Thr Tyr Trp 146
576 TAC CTG CTT TTC TAT TGC CTC TTT GAA ACA ATG GTC ACG TGT TTC CAT 623
147 Tyr Leu Leu Phe Tyr Cys Leu Phe Glu Thr Met Val Thr Cys Phe His 162
624 GTT CCC TAC TCG GCT CTC ACC ATG TTC ATC AGC ACC GAG CAG ACT GAG 671
163 Val Pro Tyr Ser Ala Leu Thr Met Phe Ile Ser Thr Glu Gln Thr Glu 178
672 CGG GAT TCT GCC ACC GCC TAT CGG ATG ACT GTG GAA GTG CTG GGC ACA 719
179 Arg Asp Ser Ala Thr Ala Tyr Arg Met Thr Val Glu Val Leu Gly Thr 194
720 GTG CTG GGC ACG GCG ATC CAG GGA CAA ATC GTG GGC CAA GCA GAC ACG 767
195 Val Leu Gly Thr Ala Ile Gln Gly Gln Ile Val Gly Gln Ala Asp Thr 210
768 CCT TGT TTC CAG GAC CTC AAT AGC TCT ACA GTA GCT TCA CAA AGT GCC 815
211 Pro Cys Phe Gln Asp Leu Asn Ser Ser Thr Val Ala Ser Gln Ser Ala 226
816 AAC CAT ACA CAT GGC ACC ACC TCA CAC AGG GAA ACG CAA AAG GCA TAC 863
227 Asn His Thr His Gly Thr Thr Ser His Arg Glu Thr Gln Lys Ala Tyr 242
864 CTG CTG GCA GCG GGG GTC ATT GTC TGT ATC TAT ATA ATC TGT GCT GTC 911
243 Leu Leu Ala Ala Gly Val Ile Val Cys Ile Tyr Ile Ile Cys Ala Val 258
912 ATC CTG ATC CTG GGC GTG CGG GAG CAG AGA GAA CCC TAT GAA GCC CAG 959
259 Ile Leu Ile Leu Gly Val Arg Glu Gln Arg Glu Pro Tyr Glu Ala Gln 274
960 CAG TCT GAG CCA ATC GCC TAC TTC CGG GGC CTA CGG CTG GTC ATG AGC 1007
275 Gln Ser Glu Pro Ile Ala Tyr Phe Arg Gly Leu Arg Leu Val Met Ser 290
1008 CAC GGC CCA TAC ATC AAA CTT ATT ACT GGC TTC CTC TTC ACC TCC TTG 1055
291 His Gly Pro Tyr Ile Lys Leu Ile Thr Gly Phe Leu Phe Thr Ser Leu 306
1056 GCT TTC ATG CTG GTG GAG GGG AAC TTT GTC TTG TTT TGC ACC TAC ACC 1103
307 Ala Phe Met Leu Val Glu Gly Asn Phe Val Leu Phe Cys Thr Tyr Thr 322
1104 TTG GGC TTC CGC AAT GAA TTC CAG AAT CTA CTC CTG GCC ATC ATG CTC 1151
323 Leu Gly Phe Arg Asn Glu Phe Gln Asn Leu Leu Leu Ala Ile Met Leu 338
1152 TCG GCC ACT TTA ACC ATT CCC ATC TGG CAG TGG TTC TTG ACC CGG TTT 1199
339 Ser Ala Thr Leu Thr Ile Pro Ile Trp Gln Trp Phe Leu Thr Arg Phe 354
1200 GGC AAG AAG ACA GCT GTA TAT GTT GGG ATC TCA TCA GCA GTG CCA TTT 1247
355 Gly Lys Lys Thr Ala Val Tyr Val Gly Ile Ser Ser Ala Val Pro Phe 370
1248 CTC ATC TTG GTG GCC CTC ATG GAG AGT AAC CTC ATC ATT ACA TAT GCG 1295
371 Leu Ile Leu Val Ala Leu Met Glu Ser Asn Leu Ile Ile Thr Tyr Ala 386
1296 GTA GCT GTG GCA GCT GGC ATC AGT GTG GCA GCT GCC TTC TTA CTA CCC 1343
387 Val Ala Val Ala Ala Gly Ile Ser Val Ala Ala Ala Phe Leu Leu Pro 402
1344 TGG TCC ATG CTG CCT GAT GTC ATT GAC GAC TTC CAT CTG AAG CAG CCC 1391
403 Trp Ser Met Leu Pro Asp Val Ile Asp Asp Phe His Leu Lys Gln Pro 418
1392 CAC TTC CAT GGA ACC GAG CCC ATC TTC TTC TCC TTC TAT GTC TTC TTC 1439
419 His Phe His Gly Thr Glu Pro Ile Phe Phe Ser Phe Tyr Val Phe Phe 434
1440 ACC AAG TTT GCC TCT GGA GTG TCA CTG GGC ATT TCT ACC CTC AGT CTG 1487
435 Thr Lys Phe Ala Ser Gly Val Ser Leu Gly Ile Ser Thr Leu Ser Leu 450
1488 GAC TTT GCA GGG TAC CAG ACC CGT GGC TGC TCG CAG CCG GAA CGT GTC 1535
451 Asp Phe Ala Gly Tyr Gln Thr Arg Gly Cys Ser Gln Pro Glu Arg Val 466
1536 AAG TTT ACA CTG AAC ATG CTC GTG ACC ATG GCT CCC ATA GTT CTC ATC 1583
467 Lys Phe Thr Leu Asn Met Leu Val Thr Met Ala Pro Ile Val Leu Ile 482
1584 CTG CTG GGC CTG CTG CTC TTC AAA ATG TAC CCC ATT GAT GAG GAG AGG 1631
483 Leu Leu Gly Leu Leu Leu Phe Lys Met Tyr Pro Ile Asp Glu Glu Arg 498
1632 CGG CGG CAG AAT AAG AAG GCC CTG CAG GCA CTG AGG GAC GAG GCC AGC 1679
499 Arg Arg Gln Asn Lys Lys Ala Leu Gln Ala Leu Arg Asp Glu Ala Ser 514
1680 AGC TCT GGC TGC TCA GAA ACA GAC TCC ACA GAG CTG GCT AGC ATC CTC 1727
515 Ser Ser Gly Cys Ser Glu Thr Asp Ser Thr Glu Leu Ala Ser Ile Leu 530
1728 TAG GGC CCG CCA CGT TGC CCG AAG CCA CCA TGC AGA AGG CCA CAG AAG 1775
531 *** 531
1776 GGA TCA GGA CCT GTC TGC CGG CTT GCT GAG CAG CTG GAC TGC AGG TGC 1823
1824 TAG GAA GGG AAC TGA AGA CTC AAG GAG GTG GCC CAG GAC ACT TGC TGT 1871
1872 GCT CAC TGT GGG GCC GGC TGC TCT GTG GCC TCC TGC CTC CCC TCT GCC 1919
1920 TGC CTG TGG GGC CAA GCC CTG GGG CTG CCA CTG TGA ATA TGC CAA GGA 1967
1968 CTG ATC GGG CCT AGC CCG GAA CAC TAA TGT AGA AAC CTT TTT TTT ACA 2015
2016 GAG CCT AAT TAA TAA CTT AAT GAC TGT GTA CAT AGC AAT GTG TGT GTA 2063
2064 TGT ATA TGT CTG TGA GCT ATT AAT GTT ATT AAT TTT CAT AAA AGC TGG 2111
2112 AAA GCA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2159
2160 A 2160
4. PP9445
A: Nucleotide sequence (SEQ ID NO: 10) Length: 1831 bp
1 GCCGCCGCGG AGCGAGGTTG ACTGGAGAGA GCGCCTGGGC GCAGAAGGGT TAACGGGCCA
61 CCGGGGGCTC GCAGAGCAGG AGGGTGCTCT CGGACGGTGT GTCCCCCACT GCACTCCTGA
121 ACTTGGAGGA CAGGGTCGCC GCGAGGGACG CAGGTGGGTG CCCTTGATCC AGCTCAGCCC
181 GATGGCAGAA GAGGTTGACA AAAAAGAAAG ACACCTGTTG GGGTGGCCTG CCAGACCCAG
241 GAGTGGAGGG CTCTGTGAGG GCCCGGGAAT TCGGACTCAG GACAGGGATT CTCCATGGCT
301 AGGCCCAGAA ACACAGGGTC CAACCACTCT CCAGCAGGGA GACCTGGGGG TGAAGGGGTG
361 AGCCCTGCGC AGGTCTCTGT TCCTTGGTCT TCACTGGGCA GTGTGGAGAG GTGTGGCCAG
421 GAGGAGCCCG CGTTTGTCCA GAGCAGGGTC TACTCTGGCA CCAGAGTGAC CACCTCTGAC
481 CTCTCCTTTC CTCGTCCTGG GCCGGGAACG ACACCAAATG AGGGACATGG AAAGGGCTGG
541 AGTAACAAGA GTCAGGCAGA GCCTGAAGAC TTGGGTGGAA CATGGGCCCT TCTCTGGAGA
601 TCCTGGCCTC CCCCGTTCAG TCAGGGTGGA GTTGCTGACC TTAGTGGCCG GCCCAGCCAG
661 GGGAAGGAGT GGCCATCGGC AACCCCCACC CCAACCCCAA TCCCTGAGGC GCCCGCTCTG
721 GCTCAGCCAC TCTGACCCCT CCCTCAAATT CCGAACCCTA GGTCTCAGGG AGGGCAGTGG
781 GGCTGAGTGT CTGCCCCCAG GCACATTCCT ACCCTTCTCT TGGTCATTTT CTGCCCCAGA
841 GCTGGCCCAC CTCAGCAATG CGAGGGCTCC CTGGATTCCT CTCCCGGGTG CCTTTCAGAT
901 CCAACAGAAA CAGATTTTTT TTTTCCTGGA AAGCAGAACT AAGAGTGGGA TGAGGAGCAG
961 GGGTGGGAAG GACTCAAAGT GAGAAGAAGG GGGCAAAGAG AGTCAGGCTT GGTGGCTGGG
1021 GTGGCTTCCA AGCCTCACTT CTCCAGTGTT CAAAGCTGAA CTTCAGATGG ACTTCCCGGC
1081 TCTTCAGAAT GAGAGGCCTG TGGCTGGGGC ATGAGGCAGC CCCGGCTGCA CCTCTCCTTC
1141 CCGCTTCCCC AGCTGGTAGA GACGCACAGG AAACAAGCCC TCACTGAACC AACTCCAGAT
1201 GCTGGCACCC AGAGTGGGTG TTACATTGCC GGCTTCTTCT CTAGAGATTA AACCGTCAAC
1261 CCATTTAGCT TATCCCTTGG CCAAAAAGTG TATGAGATGT GCCTGGATGT TCCCTAAAGA
1321 GCTTATCTAA GAAGGGAAGA GAAAGCCGGG AGGCAAGTAG GACAGAGAGA TGACTGGGGA
1381 AGGTCTTGTG TCTGGAAGAC CCAAGGAAGG GGCTTCTGGT GGGTCCTCAG AGAGAGTGTC
1441 TGGCGCATCC TCAGTGGAGC CTTCCTCCTC TACTTTCTAG GCACCTCTGG GAGGGCAGGA
1501 GTGGGAGCAG ATGACAACCA TTTTAGAAGG AGCCCTCTGG CTGGGTGCGG TGGCTCACAC
1561 CTGTCATCCC AGCACTTTGG GAGGCCAAGG CAGGAGAAGC GCTTGAGGCC TGGAGTTCAA
1621 GACCAGCCTG TGCAATTTAG CTGGATCCCA TCTCCACCAA AAAATACCAA AATTAGCTGG
1681 GTGTGGTGGT GCACGCATGT AGTCCCACCT ACTCAGGAGG CTGAGGAAGG AGAGCCTGTG
1741 AGTTTGAGGC TGCAATGAGC TTTGGTGGCA CCACTGCCCT CCAGCCTGGA TGACAGAGTG
1801 AGATCTCCAT CTCAAAAAAA AAAAAAAAAA A
B: Amino acid sequence (SEQ ID NO: 11) Length: 154 amino acids
1 MRDMERAGVT RVRQSLKTWV EHGPFSGDPG LPRSVRVELL TLVAGPARGR SGHRQPPPQP
61 QSLRRPLWLS HSDPSLKFRT LGLREGSGAE CLPPGTFLPF SWSFSAPELA HLSNARAPWI
121 PLPGAFQIQQ KQIFFFLESR TKSGMRSRGG KDSK
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 12)
Clone number: PP9445
Start codon: 518 ATG  Stop codon: 980 TGA
Protein molecular weight: 17150.75
1 G CCG CCG CGG AGC GAG GTT GAC TGG AGA GAG CGC CTG GGC GCA GAA 46
47 GGG TTA ACG GGC CAC CGG GGG CTC GCA GAG CAG GAG GGT GCT CTC GGA 94
95 CGG TGT GTC CCC CAC TGC ACT CCT GAA CTT GGA GGA CAG GGT CGC CGC 142
143 GAG GGA CGC AGG TGG GTG CCC TTG ATC CAG CTC AGC CCG ATG GCA GAA 190
191 GAG GTT GAC AAA AAA GAA AGA CAC CTG TTG GGG TGG CCT GCC AGA CCC 238
239 AGG AGT GGA GGG CTC TGT GAG GGC CCG GGA ATT CGG ACT CAG GAC AGG 286
287 GAT TCT CCA TGG CTA GGC CCA GAA ACA CAG GGT CCA ACC ACT CTC CAG 334
335 CAG GGA GAC CTG GGG GTG AAG GGG TGA GCC CTG CGC AGG TCT CTG TTC 382
383 CTT GGT CTT CAC TGG GCA GTG TGG AGA GGT GTG GCC AGG AGG AGC CCG 430
431 CGT TTG TCC AGA CCA GGG TCT ACT CTG GCA CCA GAG TGA CCA CCT CTG 478
479 ACC TCT CCT TTC CTC GTC CTG GGC CGG GAA CGA CAC CAA ATG AGG GAC 526
1 Met Arg Asp 3
527 ATG GAA AGG GCT GGA GTA ACA AGA GTC AGG CAG AGC CTG AAG ACT TGG 574
4 Met Glu Arg Ala Gly Val Thr Arg Val Arg Gln Ser Leu Lys Thr Trp 19
575 GTG GAA CAT GGG CCC TTC TCT GGA GAT CCT GGC CTC CCC CGT TCA GTC 622
20 Val Glu His Gly Pro Phe Ser Gly Asp Pro Gly Leu Pro Arg Ser Val 35
623 AGG GTG GAG TTG CTG ACC TTA GTG GCC GGC CCA GCC AGG GGA AGG AGT 670
36 Arg Val Glu Leu Leu Thr Leu Val Ala Gly Pro Ala Arg Gly Arg Ser 51
671 GGC CAT CGG CAA CCC CCA CCC CAA CCC CAA TCC CTG AGG CGC CCG CTC 718
52 Gly His Arg Gln Pro Pro Pro Gln Pro Gln Ser Leu Arg Arg Pro Leu 67
719 TGG CTC AGC CAC TCT GAC CCC TCC CTC AAA TTC CGA ACC CTA GGT CTC 766
68 Trp Leu Ser His Ser Asp Pro Ser Leu Lys Phe Arg Thr Leu Gly Leu 83
767 AGG GAG GGC AGT GGG GCT GAG TGT CTG CCC CCA GGC ACA TTC CTA CCC 814
84 Arg Glu Gly Ser Gly Ala Glu Cys Leu Pro Pro Gly Thr Phe Leu Pro 99
815 TTC TCT TGG TCA TTT TCT GCC CCA GAG CTG GCC CAC CTC AGC AAT GCG 862
100 Phe Ser Trp Ser Phe Ser Ala Pro Glu Leu Ala His Leu Ser Asn Ala 115
863 AGG GCT CCC TGG ATT CCT CTC CCG GGT GCC TTT CAG ATC CAA CAG AAA 910
116 Arg Ala Pro Trp Ile Pro Leu Pro Gly Ala Phe Gln Ile Gln Gln Lys 131
911 CAG ATT TTT TTT TTC CTG GAA AGC AGA ACT AAG AGT GGG ATG AGG AGC 958
132 Gln Ile Phe Phe Phe Leu Glu Ser Arg Thr Lys Ser Gly Met Arg Ser 147
959 AGG GGT GGG AAG GAC TCA AAG TGA GAA GAA GGG GGC AAA GAG AGT CAG 1006
148 Arg Gly Gly Lys Asp Ser Lys *** 155
1007 GCT TGG TGG CTG GGG TGG CTT CCA AGC CTC ACT TCT CCA GTG TTC AAA 1054
1055 GCT GAA CTT CAG ATG GAC TTC CCG GCT CTT CAG AAT GAG AGG CCT GTG 1102
1103 GCT GGG GCA TGA GGC AGC CCC GGC TGC ACC TCT CCT TCC CGC TTC CCC 1150
1151 AGC TGG TAG AGA CGC ACA GGA AAC AAG CCC TCA CTG AAC CAA CTC CAG 1198
1199 ATG CTG GCA CCC AGA GTG GGT GTT ACA TTG CCG GCT TCT TCT CTA GAG 1246
1247 ATT AAA CCG TCA ACC CAT TTA GCT TAT CCC TTG GCC AAA AAG TGT ATG 1294
1295 AGA TGT GCC TGG ATG TTC CCT AAA GAG CTT ATC TAA GAA GGG AAG AGA 1342
1343 AAG CCG GGA GGC AAG TAG GAC AGA GAG ATG ACT GGG GAA GGT CTT GTG 1390
1391 TCT GGA AGA CCC AAG GAA GGG GCT TCT GGT GGG TCC TCA GAG AGA GTG 1438
1439 TCT GGC GCA TCC TCA GTG GAG CCT TCC TCC TCT ACT TTC TAG GCA CCT 1486
1487 CTG GGA GGG CAG GAG TGG GAG CAG ATG ACA ACC ATT TTA GAA GGA GCC 1534
1535 CTC TGG CTG GGT GCG GTG GCT CAC ACC TGT CAT CCC AGC ACT TTG GGA 1582
1583 GGC CAA GGC AGG AGA AGC GCT TGA GGC CTG GAG TTC AAG ACC AGC CTG 1630
1631 TGC AAT TTA GCT GGA TCC CAT CTC CAC CAA AAA ATA CCA AAA TTA GCT 1678
1679 GGG TGT GGT GGT GCA CGC ATG TAG TCC CAC CTA CTC AGG AGG CTG AGG 1726
1727 AAG GAG AGC CTG TGA GTT TGA GGC TGC AAT GAG CTT TGG TGG CAC CAC 1774
1775 TGC CCT CCA GCC TGG ATG ACA GAG TGA GAT CTC CAT CTC AAA AAA AAA 1822
1823 AAA AAA AAA 1831
5. PP10199
A: Nucleotide sequence (SEQ ID NO: 13), length: 1739 bp
1 GTTCCAGAGC CACTTTTAAG ATTCTTCAAT TCCAAATGCA TGTCTTTTTT TAAAAAAAAG
61 AAAGAAAGAA AAATAAGTTT CTAATATTAG AGAAGTACAG CCCTGAATTG GGTTTTGTGT
121 CCACTGCTGG ACCCCATGAG GGCCAGGTGG AGTGGACCTC TGCAGCCCCA GTTGTGTGCA
181 CTCTCTGTTT GGTGCAAATT CCAGTTTGCT GGTTCTCAAT AGCAAGACCA GCCTGAGACC
241 ACCTGTCCTG CTCTTCCCAT GAGAGGGCCG AATGCTCCCA GCCTCCATGC CATGTCCTGT
301 TCCTGGGGTC CTGGGGGTCA TTGCAGCCTG TATGTGCTTC CTCCAGCCAG GGTGATCATC
361 GGGTGCCCCA GTGAGCCCCA GCACTGAGGG TCAGCCCCAG GCACTGTCAA AGGTGAGAGC
421 TCAGAGGCTG TGCCCAGAAA GAGAGGTGGG CCCTGCCTGC CCTGGACGGA GGGAGAGAGG
481 CTTCTCAGAG CCCGAGGCAT GAACCCTCAG GTGGGTCGTG GCCATAGTCA GATGATGGCT
541 GCTGGTGAGC TCAGTGACCA GGCGTCTTCA GGCAGCTCAT AAGTTTGAGA GGACACAGCC
601 TAAGGGAGGT TTGCTGGGGA GTAGCCCCAC TTCCACCCTG AATAGACAAG AGATGGTAAA
661 GCAGGTACCC AGCACTTAGT GCTTTCTTGG GGATATCGCG TGGGTCCCCG GGGGCCTGGG
721 TGCCCGAAGT GCCGCAGTAC TCCATGGTGC AGAGAGCTTG CTCCTGTGGA GGAAGTGTCT
781 ATGTGGTCCC CAGCTCCTCT GTCTGCCTGT CCACTGAGGG GCACCCATGG CTCAGCAGAA
841 GGGCTATTCT TGGGGTTCCC GGTCCTCCTC CAGCCCCGCT AATCTGTGTA GGCCTCAAGT
901 GCTGTGTGTT TGTAAGCATT GTCATCCACA GTCCTATTGT ACGAGCTGGT TCACCCGCAG
961 CTCTGAGCTG CTCTCCAGCC CCAGCCCTTT CTTCCTGTGC CCCTACCCCC GCTGGGATGA
1021 CTCTCCTCAC CCTCCCTGGG GCGACAACCG CCCTGTCTGT AATGAGTGGC AGTCCCAAGC
1081 TTCCTGACTG GCTTCCGCAG CTCTCTGACT CCCCTAAACA AGGCCTCAGG GACTCCACAT
1141 CCAAATTAAG GCGGCACCTG GTGGCAGGTT GGCATTTTCC GGTGTCCTAT CTATGAAAGA
1201 CAGGAAGACA GCTGGGAGCA AACTCCCCTG GGCCAGACTC TTGGAAACAT AAAGGCTTGG
1261 GTGCCCAGCT GGGGACCGGG AGAAAGTCTA AAACACGGGA CTGGGCCAAG GACCCCACAG
1321 GTCCCTGTCT CATTAGGTCC CCTGAAACGT GTGGAAGCTA AAATGGCATT CACGTGATTC
1381 TTGATCATTT AACAGTGGAT TCTGATCTGA TACTACACTG AGAAGTGCCC CTGGGCCGGG
1441 CGCGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGC CGAGGCGGGC GGATCACAAG
1501 GAGATTGAGA CCATCCTGGC TAACACGGTG AAACCCTGTC TCTACTAAAA ATACAAAAAA
1561 TTAGCCAGGC ATGGTGGCAG GCGCCTCTAG TACTAGCTAC TCGGGAGGCT GAGGCAGGAG
1621 AATGGTGTGA ACCCGGGAGG CGGAACTTGC AGTGAGCCAA GATTGTGCCA CTGCACTCTA
1681 GCATGGGCGA CAGAGCAAGA CTCAGTCTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA
B: Amino acid sequence (SEQ ID NO: 14), length: 150 amino acids
1 MVQRACSCGG SVYVVPSSSV CLSTEGHPML SRRAILGVPG PPPAPLICVG LKCCVFVSIV
61 IHSPIVRAGS PAALSCSPAP ALSSCAPTPA GMTLLTLPGA TTALSVMSGS PKLPDWLPQL
121 SDSPKQGLRD STSKLRRHLV AGWHFPVSYL
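The numbered listing format used here (a position number, then blocks of ten residues) can be collapsed into a plain sequence string before further analysis. The following is a minimal sketch, not part of the original filing; the names `LISTING` and `listing_to_sequence` are hypothetical, and the only assumption is that anything other than a letter (position numbers, spaces, line breaks) is layout rather than sequence.

```python
import re

# Section B listing for clone PP10199 (SEQ ID NO: 14), copied from the text above.
LISTING = """
1 MVQRACSCGG SVYVVPSSSV CLSTEGHPML SRRAILGVPG PPPAPLICVG LKCCVFVSIV
61 IHSPIVRAGS PAALSCSPAP ALSSCAPTPA GMTLLTLPGA TTALSVMSGS PKLPDWLPQL
121 SDSPKQGLRD STSKLRRHLV AGWHFPVSYL
"""


def listing_to_sequence(listing: str) -> str:
    """Drop position numbers and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


sequence = listing_to_sequence(LISTING)
# Compare the residue count with the length stated in the section header.
print(len(sequence))
```

The same helper works for the nucleotide listings in the A sections, since those also contain only letters, digits, and whitespace.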
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 15)
Clone number: PP10199
Start codon: 744 ATG; stop codon: 1194 TGA
Protein molecular weight: 15496.34
1 GT TCC AGA GCC ACT TTT AAG ATT CTT CAA TTC CAA ATG CAT GTC TTT 47
48 TTT TAA AAA AAA GAA AGA AAG AAA AAT AAG TTT CTA ATA TTA GAG AAG 95
96 TAC AGC CCT GAA TTG GGT TTT GTG TCC ACT GCT GGA CCC CAT GAG GGC 143
144 CAG GTG GAG TGG ACC TCT GCA GCC CCA GTT GTG TGC ACT CTC TGT TTG 191
192 GTG CAA ATT CCA GTT TGC TGG TTC TCA ATA GCA AGA CCA GCC TGA GAC 239
240 CAC CTG TCC TGC TCT TCC CAT GAG AGG GCC GAA TGC TCC CAG CCT CCA 287
288 TGC CAT GTC CTG TTC CTG GGG TCC TGG GGG TCA TTG CAG CCT GTA TGT 335
336 GCT TCC TCC AGC CAG GGT GAT CAT CGG GTG CCC CAG TGA GCC CCA GCA 383
384 CTG AGG GTC AGC CCC AGG CAC TGT CAA AGG TGA GAG CTC AGA GGC TGT 431
432 GCC CAG AAA GAG AGG TGG GCC CTG CCT GCC CTG GAC GGA GGG AGA GAG 479
480 GCT TCT CAG AGC CCG AGG CAT GAA CCC TCA GGT GGG TCG TGG CCA TAG 527
528 TCA GAT GAT GGC TGC TGG TGA GCT CAG TGA CCA GGC GTC TTC AGG CAG 575
576 CTC ATA AGT TTG AGA GGA CAC AGC CTA AGG GAG GTT TGC TGG GGA GTA 623
624 GCC CCA CTT CCA CCC TGA ATA GAC AAG AGA TGG TAA AGC AGG TAC CCA 671
672 GCA CTT AGT GCT TTC TTG GGG ATA TCG CGT GGG TCC CCG GGG GCC TGG 719
720 GTG CCC GAA GTG CCG CAG TAC TCC ATG GTG CAG AGA GCT TGC TCC TGT 767
1 Met Val Gln Arg Ala Cys Ser Cys 8
768 GGA GGA AGT GTC TAT GTG GTC CCC AGC TCC TCT GTC TGC CTG TCC ACT 815
9 Gly Gly Ser Val Tyr Val Val Pro Ser Ser Ser Val Cys Leu Ser Thr 24
816 GAG GGG CAC CCA TGG CTC AGC AGA AGG GCT ATT CTT GGG GTT CCC GGT 863
25 Glu Gly His Pro Trp Leu Ser Arg Arg Ala Ile Leu Gly Val Pro Gly 40
864 CCT CCT CCA GCC CCG CTA ATC TGT GTA GGC CTC AAG TGC TGT GTG TTT 911
41 Pro Pro Pro Ala Pro Leu Ile Cys Val Gly Leu Lys Cys Cys Val Phe 56
912 GTA AGC ATT GTC ATC CAC AGT CCT ATT GTA CGA GCT GGT TCA CCC GCA 959
57 Val Ser Ile Val Ile His Ser Pro Ile Val Arg Ala Gly Ser Pro Ala 72
960 GCT CTG AGC TGC TCT CCA GCC CCA GCC CTT TCT TCC TGT GCC CCT ACC 1007
73 Ala Leu Ser Cys Ser Pro Ala Pro Ala Leu Ser Ser Cys Ala Pro Thr 88
1008 CCC GCT GGG ATG ACT CTC CTC ACC CTC CCT GGG GCG ACA ACC GCC CTG 1055
89 Pro Ala Gly Met Thr Leu Leu Thr Leu Pro Gly Ala Thr Thr Ala Leu 104
1056 TCT GTA ATG AGT GGC AGT CCC AAG CTT CCT GAC TGG CTT CCG CAG CTC 1103
105 Ser Val Met Ser Gly Ser Pro Lys Leu Pro Asp Trp Leu Pro Gln Leu 120
1104 TCT GAC TCC CCT AAA CAA GGC CTC AGG GAC TCC ACA TCC AAA TTA AGG 1151
121 Ser Asp Ser Pro Lys Gln Gly Leu Arg Asp Ser Thr Ser Lys Leu Arg 136
1152 CGG CAC CTG GTG GCA GGT TGG CAT TTT CCG GTG TCC TAT CTA TGA AAG 1199
137 Arg His Leu Val Ala Gly Trp His Phe Pro Val Ser Tyr Leu *** 151
1200 ACA GGA AGA CAG CTG GGA GCA AAC TCC CCT GGG CCA GAC TCT TGG AAA 1247
1248 CAT AAA GGC TTG GGT GCC CAG CTG GGG ACC GGG AGA AAG TCT AAA ACA 1295
1296 CGG GAC TGG GCC AAG GAC CCC ACA GGT CCC TGT CTC ATT AGG TCC CCT 1343
1344 GAA ACG TGT GGA AGC TAA AAT GGC ATT CAC GTG ATT CTT GAT CAT TTA 1391
1392 ACA GTG GAT TCT GAT CTG ATA CTA CAC TGA GAA GTG CCC CTG GGC CGG 1439
1440 GCG CGG TGG CTC ACG CCT GTA ATC CCA GCA CTT TGG GAG GCC GAG GCG 1487
1488 GGC GGA TCA CAA GGA GAT TGA GAC CAT CCT GGC TAA CAC GGT GAA ACC 1535
1536 CTG TCT CTA CTA AAA ATA CAA AAA ATT AGC CAG GCA TGG TGG CAG GCG 1583
1584 CCT CTA GTA CTA GCT ACT CGG GAG GCT GAG GCA GGA GAA TGG TGT GAA 1631
1632 CCC GGG AGG CGG AAC TTG CAG TGA GCC AAG ATT GTG CCA CTG CAC TCT 1679
1680 AGC ATG GGC GAC AGA GCA AGA CTC AGT CTC AAA AAA AAA AAA AAA AAA 1727
1728 AAA AAA AAA AAA 1739
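The combined listing pairs each codon with its amino acid, reading from the stated start codon until the first stop codon. A minimal sketch of that reading is shown below, assuming the standard genetic code and 1-based nucleotide positions; the names `CODON_TABLE`, `translate`, and `ROW_720` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, start: int) -> str:
    """Translate from 1-based position `start` until a stop codon or the end."""
    protein = []
    for i in range(start - 1, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


# Row 720-767 of the PP10199 listing; the start codon at position 744
# is base 25 of this row (744 - 720 + 1).
ROW_720 = "GTGCCCGAAGTGCCGCAGTACTCCATGGTGCAGAGAGCTTGCTCCTGT"
print(translate(ROW_720, 25))
```

For this row the result can be compared with the first eight residues shown under it in the listing (Met Val Gln Arg Ala Cys Ser Cys).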
6. PP10226
A: Nucleotide sequence (SEQ ID NO: 16), length: 1012 bp
1 GTGAGAGAGG GGTTTGGAAA TACCAGACTA TAATTGTGGA TTTGTCCATT ACTCCTTTCA
61 GTTCTAGCAG TTTTTGCTTC TTGTGTTTTG AAGCTCTGTT ATTTGATAAA AATTTTTAGA
121 ATTTTTAATG TTTATTTTAG AATGTATAAA ATTTTAGAAT TTATATGGAT AAATTGAATC
181 CTCTATCATT ATAACATTAT GTTCTTTATG CCTGTAATAT TTTTTGCTGC AAAATCTACT
241 GTCTTAAATA ATATAGACAC AACAGCCTGA TTAGTGTTTG CATAGTACAT CTTCCCCTTC
301 TTCCATTGTT TTACATTTAG CCTATTTGTG CTTTAAAAAA ATTTAAGTAC CTATATTGTA
361 GGCAGCATAG AGTTGGATCT TGTTTTATTA ATGCACCCTG TTTTGAGAGA GAGAGAGAGA
421 GAGACAGAGA CAGAGACACA GAGAGAGAGT GTGAGCGAGC AAAAGAGATT TATTCTGGTT
481 TTTTTTTGTT TGTTTTTGAG ATGGAGTCTT GCTCTCTTGC TCAGGCTGGA GTGCAGTGCC
541 GCAATCTCAG CTCACTGCAA CCTCCACCTC CTGGGTTCAA GTTATTCTCC TGTCTCAGCC
601 TCCCAAGTAG CTGGGACTAC AGGCCTGTGC CACCATGCCC GGCTACGTTT TGTATTTTTA
661 GTACAGACGG TGTTTCACCA TGTTGGCCAG CCTGGTCTCA AACTCCTGGC CTCAAGTTGA
721 TCTGCTGGCC TCACGCCTGT AATCCTAGTA CTTTGGGAGG CCGAGGCGGG CGGATCTCGA
781 GTTCAGGAGA TCGACCATCC TGGCTAACAC GGTGAAACCT CGTCTCTACT AAAAATACAA
841 AAAATTAGCC GGGCATGGTG GTGGGCACCC GTAGTCCCAG CTACTTGGGA GGCTGAGGCA
901 GGAGAATGGC ATGAATCCAG TAGGCGGAGC TTGCAGTGAG CCAAGATCAC GCCACTGCAC
961 TCCACCCTGG GTGACAGAGC GAGACTTTGT CTCAAAAAAA AAAAAAAAAA AA
B: Amino acid sequence (SEQ ID NO: 17), length: 109 amino acids
1 MHPVLRERER ETETETQRES VSEQKRFILV FFCLFLRWSL ALLLRLECSG AISAHCNLHL
61 LGSSYSPVSA SQVAGTTGLC HHARLRFVFL VQTVFHHVGQ AGLKLLASS
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 18)
Clone number: PP10226
Start codon: 391 ATG; stop codon: 718 TGA
Protein molecular weight: 12262.59
1 GTG AGA GAG GGG TTT GGA AAT ACC AGA CTA TAA TTG TGG ATT TGT CCA 48
49 TTA CTC CTT TCA GTT CTA GCA GTT TTT GCT TCT TGT GTT TTG AAG CTC 96
97 TGT TAT TTG ATA AAA ATT TTT AGA ATT TTT AAT GTT TAT TTT AGA ATG 144
145 TAT AAA ATT TTA GAA TTT ATA TGG ATA AAT TGA ATC CTC TAT CAT TAT 192
193 AAC ATT ATG TTC TTT ATG CCT GTA ATA TTT TTT GCT GCA AAA TCT ACT 240
241 GTC TTA AAT AAT ATA GAC ACA ACA GCC TGA TTA GTG TTT GCA TAG TAC 288
289 ATC TTC CCC TTC TTC CAT TGT TTT ACA TTT AGC CTA TTT GTG CTT TAA 336
337 AAA AAT TTA AGT ACC TAT ATT GTA GGC AGC ATA GAG TTG GAT CTT GTT 384
385 TTA TTA ATG CAC CCT GTT TTG AGA GAG AGA GAG AGA GAG ACA GAG ACA 432
1 Met His Pro Val Leu Arg Glu Arg Glu Arg Glu Thr Glu Thr 14
433 GAG ACA CAG AGA GAG AGT GTG AGC GAG CAA AAG AGA TTT ATT CTG GTT 480
15 Glu Thr Gln Arg Glu Ser Val Ser Glu Gln Lys Arg Phe Ile Leu Val 30
481 TTT TTT TGT TTG TTT TTG AGA TGG AGT CTT GCT CTC TTG CTC AGG CTG 528
31 Phe Phe Cys Leu Phe Leu Arg Trp Ser Leu Ala Leu Leu Leu Arg Leu 46
529 GAG TGC AGT GGC GCA ATC TCA GCT CAC TGC AAC CTC CAC CTC CTG GGT 576
47 Glu Cys Ser Gly Ala Ile Ser Ala His Cys Asn Leu His Leu Leu Gly 62
577 TCA AGT TAT TCT CCT GTC TCA GCC TCC CAA GTA GCT GGG ACT ACA GGC 624
63 Ser Ser Tyr Ser Pro Val Ser Ala Ser Gln Val Ala Gly Thr Thr Gly 78
625 CTG TGC CAC CAT GCC CGG CTA CGT TTT GTA TTT TTA GTA CAG ACG GTG 672
79 Leu Cys His His Ala Arg Leu Arg Phe Val Phe Leu Val Gln Thr Val 94
673 TTT CAC CAT GTT GGC CAG GCT GGT CTC AAA CTC CTG GCC TCA AGT TGA 720
95 Phe His His Val Gly Gln Ala Gly Leu Lys Leu Leu Ala Ser Ser *** 110
721 TCT GCT GGC CTC ACG CCT GTA ATC CTA GTA CTT TGG GAG GCC GAG GCG 768
769 GGC GGA TCT CGA GTT CAG GAG ATC GAC CAT CCT GGC TAA CAC GGT GAA 816
817 ACC TCG TCT CTA CTA AAA ATA CAA AAA ATT AGC CGG GCA TGG TGG TGG 864
865 GCA CCC GTA GTC CCA GCT ACT TGG GAG GCT GAG GCA GGA GAA TGG CAT 912
913 GAA TCC AGT AGG CGG AGC TTG CAG TGA GCC AAG ATC ACG CCA CTG CAC 960
961 TCC AGC CTG GGT GAC AGA GCG AGA CTT TGT CTC AAA AAA AAA AAA AAA 1008
1009 AAA A 1012
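The start and stop codon positions given for each clone can be cross-checked against the stated protein length. The sketch below assumes that both coordinates are the 1-based position of the first base of the codon (as the rows above suggest) and that the stop codon itself is not counted; `CLONES` and `codon_count` are hypothetical names, and the figures are those stated in this listing.

```python
# (start codon position, stop codon position, stated length in amino acids)
CLONES = {
    "PP10199": (744, 1194, 150),
    "PP10226": (391, 718, 109),
}


def codon_count(start: int, stop: int) -> int:
    """Number of codons from the start codon up to, but excluding, the stop codon."""
    span = stop - start
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return span // 3


for clone, (start, stop, stated_length) in CLONES.items():
    # A mismatch here would point to a transcription error in the coordinates.
    print(clone, codon_count(start, stop), stated_length)
```

A check of this kind is useful mainly for catching digit errors introduced when the listing was extracted or transcribed.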
7. SP2114a
A: Nucleotide sequence (SEQ ID NO: 19), length: 2546 bp
1 GGCCAGTCAA GATGGCCGCC GCTGGGTGAG GCAAGCTGGC GCGCCGCGGG GGCGTCTGGG
61 AGTTGTAGTT CGGGACGGCG GGCTGACGCA CTTCGCCGCC GGCCGACGGG CGCCATTGTG
121 CGGCGCGCGC CGGGACTCTG CCCACTTCCA CCAGAGACAC ATTGAGAAGG AGGAAACTAT
181 GGCCTCCAGG CTTCCGACGG CCTGGTCCTG TGAACCAGAG ACCTTTGAAG ATCTAACACT
241 GGGTTTTACC CCGGAAGAGT GGGGACTGCT GGACCTCAAA CAGAAGTCCC TGTACAGGGA
301 AGTGATGCTG GAGAACTACA GGAACCTGGT CTCAGTGGAA CATCAGCTTT CCAAACCAGA
361 TGTGGTATCT CAGTTAGAGG AGGCAGAAGA TTTCTGGCCA GTGGAGAGAG GAATTCCTCA
421 AGACACCATT CCTGAGTATC CTGAGCTCCA GCTGGACCCT AAATTGGATC CTCTTCCTGC
481 TGAGAGTCCC CTAATGAACA TTGAGGTTGT TGAGGTCCTC ACACTGAACC AGGAGGTGGC
541 TGGTCCCCGG AATGCCCAGA TCCAGGCCCT ATATGCTGAA GATGGAAGCC TGAGTGCAGA
601 TGCCCCCAGT GAGCAGATCC AACAGCAGGG CAAGCATCCA GGTGACCCTG AGGCCGCGCG
661 CCAGAGGTTC CGGCAGTTCC GTTATAAGGA CATGACAGGT CCCCGGGAGG CCCTGGACCA
721 GCTCCGAGAG CTGTGTCACC AGTGGCTACA GCCTAAGGCA CGCTCCAAGG AGCAGATCCT
781 GGAGCTGCTG GTGCTGGAGC AGTTCCTAGG TACACTGCCT GTGAAGCTCC GGACATGGGT
841 GGAATCGCAG CACCCAGAGA ACTGCCAAGA GGTGGTGGCC CTGGTAGAGG GTGTGACCTG
901 GATGTCTGAG GAGGAAGTAC TTCCTGCAGG ACAACCTGCC GAGGGCACCA CCTGCTGCCT
961 CGAGGTCACT GCCCAGCAGG AGGAGAAGCA GGAGGATGCA GCCATCTGCC CAGTGACAGT
1021 GCTCCCTGAG GAGCCAGTGA CCTTCCAGGA TGTGGCTGTG GACTTCAGCC GGGAGGAGTG
1081 GGGGCTGCTG GGCCCGACAC AGAGGACCGA GTACCGCGAT GTGATGCTGG AGACCTTTGG
1141 GCACCTGGTC TCTGTGGGGT GGGAGACTAC ACTGGAAAAT AAAGAGTTAG CTCCAAATTC
1201 TGACATTCCT GAGGAAGAAC CAGCCCCCAG CCTGAAAGTA CAAGAATCCT CAAGGGATTG
1261 TGCCTTGTCC TCTACATTAG AAGATACCTT GCAGGGTGGG GTCCAGGAAG TCCAAGACAC
1321 AGTGTTGAAG CAGATGGAGT CTGCTCAGGA AAAAGACCTT CCTCAGAAGA AGCACTTTGA
1381 CAACCGTGAG TCCCAGGCAA ACAGTGGTGC TCTTGACACA AACCAAGTTT CGCTCCAGAA
1441 AATTGACAAC CCTGAGTCCC AGGCAAACAG TGGCGCTCTT GACACAAACC AAGTTTTGCT
1501 CCACAAAATT CCTCCTAGAA AACGATTGCG CAAACGTGAC TCACAAGTTA AAAGTATGAA
1561 ACATAATTCA CGTGTAAAAA TTCATCAGAA GAGCTGTGAA AGGCAAAAGG CCAAGGAAGG
1621 CAATGGTTGT AGGAAAACCT TCAGTCGGAG TACTAAACAG ATTACGTTTA TAAGAATTCA
1681 CAAGGGGAGC CAAGTTTGCC GATGCAGTGA ATGTGGTAAA ATATTCCGGA ACCCAAGATA
1741 CTTTTCTGTG CATAAGAAAA TCCATACCGG AGAGAGGCCC TATGTGTGTC AAGACTGTGG
1801 GAAAGGATTT GTTCAGAGCT CTTCCCTCAC ACAGCATCAG AGAGTTCATT CTGGAGAGAG
1861 ACCATTTGAA TGTCAGGAGT GTGGGAGGAC CTTCAATGAT CGCTCAGCCA TCTCCCAGCA
1921 CCTGAGGACT CACACTGGCG CTAAGCCCTA CAAGTGTCAG GACTGTGGAA AAGCCTTCCG
1981 CCAGAGTTCC CACCTCATCA GACATCAGAG GACTCACACC GGGGAGCGCC CATATGCATG
2041 CAACAAATGT GGAAAGGCCT TCACCCAGAG CTCACACCTT ATTGGGCACC AGAGAACCCA
2101 CAATAGGACA AAGCGAAAGA AGAAACAGCC TACCTCATAG CTCTCAAGCC AGTTGAAGAA
2161 ACCTTGCCTT TTCAGCTTGA CCCTGCAATA TAACATGCAC AGGCCTGCTT GTGAATCAGG
2221 ACTGAATGTG AAAGGGAAGT ATTGAGTGAG GACATTCCCA AAACCAAAGG ACAACTGAGG
2281 AGACTGCCCA GCACATAATG AATAAATAAG AAAATGAGTG AGGAGTTATT AACATCATTT
2341 GGAAAAAAGA TTTCCCATTC ACTTGATATT GTTTGTTCAC TCATTTAGTC ATTAAAAGTG
2401 AGATTAATAA AATCTGAAAA TGTTATATAA TAACTTTAAA AAGCCAGGTA ATTAATAATC
2461 TGCACTGATA TTACATCCAC AGTACCACAG TATTTATGTG TATGAATTAA GGATTAAAAG
2521 ATAATGTGGA TAAAAAAAAA AAAAAA
B: Amino acid sequence (SEQ ID NO: 20), length: 653 amino acids
1 MASRLPTAWS CEPETFEDVT LGFTPEEWGL LDLKQKSLYR EVMLENYRNL VSVEHQLSKP
61 DVVSQLEEAE DFWPVERGIP QDTIPEYPEL QLDPKLDPLP AESPLMNIEV VEVLTLNQEV
121 AGPRNAQIQA LYAEDGSLSA DAPSEQIQQQ GKHPGDPEAA RQRFRQFRYK DMTGPREALD
181 QLRELCHQWL QPKARSKEQI LELLVLEQFL GTLPVKLRTW VESQHPENCQ EVVALVEGVT
241 WMSEEEVLPA GQPAEGTTCC LEVTAQQEEK QEDAAICPVT VLPEEPVTFQ DVAVDFSREE
301 WGLLGPTQRT EYRDVMLETF GHLVSVGWET TLENKELAPN SDIPEEEPAP SLKVQESSRD
361 CALSSTLEDT LQGGVQEVQD TVLKQMESAQ EKDLPQKKHF DNRESQANSG ALDTNQVSLQ
421 KIDNPESQAN SGALDTNQVL LHKIPPRKRL RKRDSQVKSM KHNSRVKIHQ KSCERQKAKE
481 GNGCRKTFSR STKQITFIRI HKGSQVCRCS ECGKIFRNPR YFSVHKKIHT GERPYVCQDC
541 GKGFVQSSSL TQHQRVHSGE RPFECQECGR TFNDRSAISQ HLRTHTGAKP YKCQDCGKAF
601 RQSSHLIRHQ RTHTGERPYA CNKCGKAFTQ SSHLIGHQRT HNRTKRKKKQ PTS
C. Combined nucleotide and amino acid sequence (SEQ ID NO: 21)
Clone number: SP2114a
Start codon: 179 ATG; stop codon: 2138 TAG
Protein molecular weight: 74247.90
1 G GCC AGT CAA GAT GGC CGC CGC TGG GTG AGG CAA GCT GGC GCG CCG 46
47 CGG GGG CGT CTG GGA GTT GTA GTT CGG GAC GGC GGG CTG ACG CAG TTC 94
95 GCC GCC GGC CGA CGG GCG CCA TTG TGC GGC GCG CGC CGG GAC TCT GCC 142
143 CAC TTC CAC CAG AGA CAC ATT GAG AAG GAG GAA ACT ATG GCC TCC AGG 190
1 Met Ala Ser Arg 4
191 CTT CCG ACG GCC TGG TCC TGT GAA CCA GAG ACC TTT GAA GAT GTA ACA 238
5 Leu Pro Thr Ala Trp Ser Cys Glu Pro Glu Thr Phe Glu Asp Val Thr 20
239 CTG GGT TTT ACC CCG GAA GAG TGG GGA CTG CTG GAC CTC AAA CAG AAG 286
21 Leu Gly Phe Thr Pro Glu Glu Trp Gly Leu Leu Asp Leu Lys Gln Lys 36
287 TCC CTG TAC AGG GAA GTG ATG CTG GAG AAC TAC AGG AAC CTG GTC TCA 334
37 Ser Leu Tyr Arg Glu Val Met Leu Glu Asn Tyr Arg Asn Leu Val Ser 52
335 GTG GAA CAT CAG CTT TCC AAA CCA GAT GTG GTA TCT CAG TTA GAG GAG 382
53 Val Glu His Gln Leu Ser Lys Pro Asp Val Val Ser Gln Leu Glu Glu 68
383 GCA GAA GAT TTC TGG CCA GTG GAG AGA GGA ATT CCT CAA GAC ACC ATT 430
69 Ala Glu Asp Phe Trp Pro Val Glu Arg Gly Ile Pro Gln Asp Thr Ile 84
431 CCT GAG TAT CCT GAG CTC CAG CTG GAC CCT AAA TTG GAT CCT CTT CCT 478
85 Pro Glu Tyr Pro Glu Leu Gln Leu Asp Pro Lys Leu Asp Pro Leu Pro 100
479 GCT GAG AGT CCC CTA ATG AAC ATT GAG GTT GTT GAG GTC CTC ACA CTG 526
101 Ala Glu Ser Pro Leu Met Asn Ile Glu Val Val Glu Val Leu Thr Leu 116
527 AAC CAG GAG GTG GCT GGT CCC CGG AAT GCC CAG ATC CAG GCC CTA TAT 574
117 Asn Gln Glu Val Ala Gly Pro Arg Asn Ala Gln Ile Gln Ala Leu Tyr 132
575 GCT GAA GAT GGA AGC CTG AGT GCA GAT GCC CCC AGT GAG CAG ATC CAA 622
133 Ala Glu Asp Gly Ser Leu Ser Ala Asp Ala Pro Ser Glu Gln Ile Gln 148
623 CAG CAG GGC AAG CAT CCA GGT GAC CCT GAG GCC GCG CGC CAG AGG TTC 670
149 Gln Gln Gly Lys His Pro Gly Asp Pro Glu Ala Ala Arg Gln Arg Phe 164
671 CGG CAG TTC CGT TAT AAG GAC ATG ACA GGT CCC CGG GAG GCC CTG GAC 718
165 Arg Gln Phe Arg Tyr Lys Asp Met Thr Gly Pro Arg Glu Ala Leu Asp 180
719 CAG CTC CGA GAG CTG TGT CAC CAG TGG CTA CAG CCT AAG GCA CGC TCC 766
181 Gln Leu Arg Glu Leu Cys His Gln Trp Leu Gln Pro Lys Ala Arg Ser 196
767 AAG GAG CAG ATC CTG GAG CTG CTG GTG CTG GAG CAG TTC CTA GGT ACA 814
197 Lys Glu Gln Ile Leu Glu Leu Leu Val Leu Glu Gln Phe Leu Gly Thr 212
815 CTG CCT GTG AAG CTC CGG ACA TGG GTG GAA TCG CAG CAC CCA GAG AAC 862
213 Leu Pro Val Lys Leu Arg Thr Trp Val Glu Ser Gln His Pro Glu Asn 228
863 TGC CAA GAG GTG GTG GCC CTG GTA GAG GGT GTG ACC TGG ATG TCT GAG 910
229 Cys Gln Glu Val Val Ala Leu Val Glu Gly Val Thr Trp Met Ser Glu 244
911 GAG GAA GTA CTT CCT GCA GGA CAA CCT GCC GAG GGC ACC ACC TGC TGC 958
245 Glu Glu Val Leu Pro Ala Gly Gln Pro Ala Glu Gly Thr Thr Cys Cys 260
959 CTC GAG GTC ACT GCC CAG CAG GAG GAG AAG CAG GAG GAT GCA GCC ATC 1006
261 Leu Glu Val Thr Ala Gln Gln Glu Glu Lys Gln Glu Asp Ala Ala Ile 276
1007 TGC CCA GTG ACA GTG CTC CCT GAG GAG CCA GTG ACC TTC CAG GAT GTG 1054
277 Cys Pro Val Thr Val Leu Pro Glu Glu Pro Val Thr Phe Gln Asp Val 292
1055 GCT GTG GAC TTC AGC CGG GAG GAG TGG GGG CTG CTG GGC CCG ACA CAG 1102
293 Ala Val Asp Phe Ser Arg Glu Glu Trp Gly Leu Leu Gly Pro Thr Gln 308
1103 AGG ACC GAG TAC CGC GAT GTG ATG CTG GAG ACC TTT GGG CAC CTG GTC 1150
309 Arg Thr Glu Tyr Arg Asp Val Met Leu Glu Thr Phe Gly His Leu Val 324
1151 TCT GTG GGG TGG GAG ACT ACA CTG GAA AAT AAA GAG TTA GCT CCA AAT 1198
325 Ser Val Gly Trp Glu Thr Thr Leu Glu Asn Lys Glu Leu Ala Pro Asn 340
1199 TCT GAC ATT CCT GAG GAA GAA CCA GCC CCC AGC CTG AAA GTA CAA GAA 1246
341 Ser Asp Ile Pro Glu Glu Glu Pro Ala Pro Ser Leu Lys Val Gln Glu 356
1247 TCC TCA AGG GAT TGT GCC TTG TCC TCT ACA TTA GAA GAT ACC TTG CAG 1294
357 Ser Ser Arg Asp Cys Ala Leu Ser Ser Thr Leu Glu Asp Thr Leu Gln 372
1295 GGT GGG GTC CAG GAA GTC CAA GAC ACA GTG TTG AAG CAG ATG GAG TCT 1342
373 Gly Gly Val Gln Glu Val Gln Asp Thr Val Leu Lys Gln Met Glu Ser 388
1343 GCT CAG GAA AAA GAC CTT CCT CAG AAG AAG CAC TTT GAC AAC CGT GAG 1390
389 Ala Gln Glu Lys Asp Leu Pro Gln Lys Lys His Phe Asp Asn Arg Glu 404
1391 TCC CAG GCA AAC AGT GGT GCT CTT GAC ACA AAC CAA GTT TCG CTC CAG 1438
405 Ser Gln Ala Asn Ser Gly Ala Leu Asp Thr Asn Gln Val Ser 
Leu Gln 4201439 AAA ATT GAC AAC CCT GAG TCC CAG GCA AAC AGT GGC GCT CTT GAC ACA 1486421 Lys Ile Asp Asn Pro Glu Ser Gln Ala Asn Ser Gly Ala Leu Asp Thr 4361487 AAC CAA GTT TTG CTC CAC AAA ATT CCT CCT AGA AAA CGA TTG CGC AAA 1534437 Asn Gln Val Leu Leu His Lys Ile Pro Pro Arg Lys Arg Leu Arg Lys 4521535 CGT GAC TCA CAA GTT AAA AGT ATG AAA CAT AAT TCA CGT GTA AAA ATT 1582453 Arg Asp Ser Gln Val Lys Ser Met Lys His Asn Ser Arg Val Lys Ile 4681583 CAT CAG AAG AGC TGT GAA AGG CAA AAG GCC AAG GAA GGC AAT GGT TGT 1630469 His Gln Lys Ser Cys Glu Arg Gln Lys Ala Lys Glu Gly Asn Gly Cys 4841631 AGG AAA ACC TTC AGT CGG AGT ACT AAA CAG ATT ACG TTT ATA AGA ATT 1678485 Arg Lys Thr Phe Ser Arg Ser Thr Lys Gln Ile Thr Phe Ile Arg Ile 5001679 CAC AAG GGG AGC CAA GTT TGC CGA TGC AGT GAA TGT GGT AAA ATA TTC 1726501 His Lys Gly Ser Gln Val Cys Arg Cys Ser Glu Cys Gly Lys Ile Phe 5161727 CGG AAC CCA AGA TAC TTT TCT GTG CAT AAG AAA ATC CAT ACC GGA GAG 1774517 Arg Asn Pro Arg Tyr Phe Ser Val His Lys Lys Ile His Thr Gly Glu 5321775 AGG CCC TAT GTG TGT CAA GAC TGT GGG AAA GGA TTT GTT CAG AGC TCT 1822533 Arg Pro Tyr Val Cys Gln Asp Cys Gly Lys Gly Phe Val Gln Ser Ser 5481823 TCC CTC ACA CAG CAT CAG AGA GTT CAT TCT GGA GAG AGA CCA TTT GAA 1870549 Ser Leu Thr Gln His Gln Arg Val His Ser Gly Glu Arg Pro Phe Glu 5641871 TGT CAG GAG TGT GGG AGG ACC TTC AAT GAT CGC TCA GCC ATC TCC CAG 1918565 Cys Gln Glu Cys Gly Arg Thr Phe Asn Asp Arg Ser Ala Ile Ser Gln 5801919 CAC CTG AGG ACT CAC ACT GGC GCT AAG CCC TAC AAG TGT CAG GAC TGT 1966581 His Leu Arg Thr His Thr Gly Ala Lys Pro Tyr Lys Cys Gln Asp Cys 5961967 GGA AAA GCC TTC CGC CAG AGT TCC CAC CTC ATC AGA CAT CAG AGG ACT 2014597 Gly Lys Ala Phe Arg Gln Ser Ser His Leu Ile Arg His Gln Arg Thr 6122015 CAC ACC GGG GAG CGC CCA TAT GCA TGC AAC AAA TGT GGA AAG GCC TTC 2062613 His Thr Gly Glu Arg Pro Tyr Ala Cys Asn Lys Cys Gly Lys Ala Phe 6282063 ACC CAG AGC TCA CAC CTT ATT GGG CAC CAG AGA ACC CAC AAT AGG ACA 2110629 Thr Gln Ser Ser His Leu Ile Gly His Gln 
Arg Thr His Asn Arg Thr 6442111 AAG CGA AAG AAG AAA CAG CCT ACC TCA TAG CTC TCA AGC CAG TTG AAG 2158645 Lys Arg Lys Lys Lys Gln Pro Thr Ser *** 6542159 AAA CCT TGC CTT TTC AGC TTG ACC CTG CAA TAT AAC ATG CAC AGG CCT 22062207 GCT TGT GAA TCA GGA CTG AAT GTG AAA GGG AAG TAT TGA GTG AGG ACA 22542255 TTC CCA AAA CCA AAG GAC AAC TGA GGA GAC TGC CCA GCA CAT AAT GAA 23022303 TAA ATA AGA AAA TGA GTG AGG AGT TAT TAA CAT CAT TTG GAA AAA AGA 23502351 TTT CCC ATT CAC TTG ATA TTG TTT GTT CAC TCA TTT AGT CAT TAA AAG 23982399 TGA GAT TAA TAA AAT CTG AAA ATG TTA TAT AAT AAC TTT AAA AAG CCA 24462447 GGT AAT TAA TAA TCT GCA CTG ATA TTA CAT CCA CAG TAC CAC AGT ATT 24942495 TAT GTG TAT GAA TTA AGG ATT AAA AGA TAA TGT GGA TAA AAA AAA AAA 25422543 AAA A 2546蛋白质分子量:74247.901 G GCC AGT CAA GAT GGC CGC CGC TGG GTG AGG CAA GCT GGC GCG CCG 4647 CGG GGG CGT CTG GGA GTT GTA GTT CGG GAC GGC GGG CTG ACG CAG TTC 9495 GCC GCC GGC CGA CGG GCG CCA TTG TGC GGC GCG CGC CGG GAC TCT GCC 142143 CAC TTC CAC CAG AGA CAC ATT GAG AAG GAG GAA ACT ATG GCC TCC AGG 1901 Met Ala Ser Arg 4191 CTT CCG ACG GCC TGG TCC TGT GAA CCA GAG ACC TTT GAA GAT GTA ACA 2385 Leu Pro Thr Ala Trp Ser Cys Glu Pro Glu Thr Phe Glu Asp Val Thr 20239 CTG GGT TTT ACC CCG GAA GAG TGG GGA CTG CTG GAC CTC AAA CAG AAG 28621 Leu Gly Phe Thr Pro Glu Glu Trp Gly Leu Leu Asp Leu Lys Gln Lys 36287 TCC CTG TAC AGG GAA GTG ATG CTG GAG AAC TAC AGG AAC CTG GTC TCA 33437 Ser Leu Tyr Arg Glu Val Met Leu Glu Asn Tyr Arg Asn Leu Val Ser 52335 GTG GAA CAT CAG CTT TCC AAA CCA GAT GTG GTA TCT CAG TTA GAG GAG 38253 Val Glu His Gln Leu Ser Lys Pro Asp Val Val Ser Gln Leu Glu Glu 68383 GCA GAA GAT TTC TGG CCA GTG GAG AGA GGA ATT CCT CAA GAC ACC ATT 43069 Ala Glu Asp Phe Trp Pro Val Glu Arg Gly Ile Pro Gln Asp Thr Ile 84431 CCT GAG TAT CCT GAG CTC CAG CTG GAC CCT AAA TTG GAT CCT CTT CCT 47885 Pro Glu Tyr Pro Glu Leu Gln Leu Asp Pro Lys Leu Asp Pro Leu Pro 100479 GCT GAG AGT CCC CTA ATG AAC ATT GAG GTT GTT GAG GTC CTC ACA CTG 526101 Ala Glu Ser Pro Leu Met Asn 
Ile Glu Val Val Glu Val Leu Thr Leu 116527 AAC CAG GAG GTG GCT GGT CCC CGG AAT GCC CAG ATC CAG GCC CTA TAT 574117 Asn Gln Glu Val Ala Gly Pro Arg Asn Ala Gln Ile Gln Ala Leu Tyr 132575 GCT GAA GAT GGA AGC CTG AGT GCA GAT GCC CCC AGT GAG CAG ATC CAA 622133 Ala Glu Asp Gly Ser Leu Ser Ala Asp Ala Pro Ser Glu Gln Ile Gln 148623 CAG CAG GGC AAG CAT CCA GGT GAC CCT GAG GCC GCG CGC CAG AGG TTC 670149 Gln Gln Gly Lys His Pro Gly Asp Pro Glu Ala Ala Arg Gln Arg Phe 164671 CGG CAG TTC CGT TAT AAG GAC ATG ACA GGT CCC CGG GAG GCC CTG GAC 718165 Arg Gln Phe Arg Tyr Lys Asp Met Thr Gly Pro Arg Glu Ala Leu Asp 180719 CAG CTC CGA GAG CTG TGT CAC CAG TGG CTA CAG CCT AAG GCA CGC TCC 766181 Gln Leu Arg Glu Leu Cys His Gln Trp Leu Gln Pro Lys Ala Arg Ser 196767 AAG GAG CAG ATC CTG GAG CTG CTG GTG CTG GAG CAG TTC CTA GGT ACA 814197 Lys Glu Gln Ile Leu Glu Leu Leu Val Leu Glu Gln Phe Leu Gly Thr 212815 CTG CCT GTG AAG CTC CGG ACA TGG GTG GAA TCG CAG CAC CCA GAG AAC 862213 Leu Pro Val Lys Leu Arg Thr Trp Val Glu Ser Gln His Pro Glu Asn 228863 TGC CAA GAG GTG GTG GCC CTG GTA GAG GGT GTG ACC TGG ATG TCT GAG 910 229 Cys Gln Glu Val Val Ala Leu Val Glu Gly Val Thr Trp Met Ser Glu 244911 GAG GAA GTA CTT CCT GCA GGA CAA CCT GCC GAG GGC ACC ACC TGC TGC 958245 Glu Glu Val Leu Pro Ala Gly Gln Pro Ala Glu Gly Thr Thr Cys Cys 260959 CTC GAG GTC ACT GCC CAG CAG GAG GAG AAG CAG GAG GAT GCA GCC ATC 1006261 Leu Glu Val Thr Ala Gln Gln Glu Glu Lys Gln Glu Asp Ala Ala Ile 2761007 TGC CCA GTG ACA GTG CTC CCT GAG GAG CCA GTG ACC TTC CAG GAT GTG 1054277 Cys Pro Val Thr Val Leu Pro Glu Glu Pro Val Thr Phe Gln Asp Val 2921055 GCT GTG GAC TTC AGC CGG GAG GAG TGG GGG CTG CTG GGC CCG ACA CAG 1102293 Ala Val Asp Phe Ser Arg Glu Glu Trp Gly Leu Leu Gly Pro Thr Gln 3081103 AGG ACC GAG TAC CGC GAT GTG ATG CTG GAG ACC TTT GGG CAC CTG GTC 1150309 Arg Thr Glu Tyr Arg Asp Val Met Leu Glu Thr Phe Gly His Leu Val 3241151 TCT GTG GGG TGG GAG ACT ACA CTG GAA AAT AAA GAG TTA GCT CCA AAT 1198325 Ser Val Gly Trp Glu Thr Thr 
Leu Glu Asn Lys Glu Leu Ala Pro Asn 3401199 TCT GAC ATT CCT GAG GAA GAA CCA GCC CCC AGC CTG AAA GTA CAA GAA 1246341 Ser Asp Ile Pro Glu Glu Glu Pro Ala Pro Ser Leu Lys Val Gln Glu 3561247 TCC TCA AGG GAT TGT GCC TTG TCC TCT ACA TTA GAA GAT ACC TTG CAG 1294357 Ser Ser Arg Asp Cys Ala Leu Ser Ser Thr Leu Glu Asp Thr Leu Gln 3721295 GGT GGG GTC CAG GAA GTC CAA GAC ACA GTG TTG AAG CAG ATG GAG TCT 1342373 Gly Gly Val Gln Glu Val Gln Asp Thr Val Leu Lys Gln Met Glu Ser 3881343 GCT CAG GAA AAA GAC CTT CCT CAG AAG AAG CAC TTT GAC AAC CGT GAG 1390389 Ala Gln Glu Lys Asp Leu Pro Gln Lys Lys His Phe Asp Asn Arg Glu 4041391 TCC CAG GCA AAC AGT GGT GCT CTT GAC ACA AAC CAA GTT TCG CTC CAG 1438405 Ser Gln Ala Asn Ser Gly Ala Leu Asp Thr Asn Gln Val Ser Leu Gln 4201439 AAA ATT GAC AAC CCT GAG TCC CAG GCA AAC AGT GGC GCT CTT GAC ACA 1486421 Lys Ile Asp Asn Pro Glu Ser Gln Ala Asn Ser Gly Ala Leu Asp Thr 4361487 AAC CAA GTT TTG CTC CAC AAA ATT CCT CCT AGA AAA CGA TTG CGC AAA 1534437 Asn Gln Val Leu Leu His Lys Ile Pro Pro Arg Lys Arg Leu Arg Lys 4521535 CGT GAC TCA CAA GTT AAA AGT ATG AAA CAT AAT TCA CGT GTA AAA ATT 1582453 Arg Asp Ser Gln Val Lys Ser Met Lys His Asn Ser Arg Val Lys Ile 4681583 CAT CAG AAG AGC TGT GAA AGG CAA AAG GCC AAG GAA GGC AAT GGT TGT 1630469 His Gln Lys Ser Cys Glu Arg Gln Lys Ala Lys Glu Gly Asn Gly Cys 4841631 AGG AAA ACC TTC AGT CGG AGT ACT AAA CAG ATT ACG TTT ATA AGA ATT 1678485 Arg Lys Thr Phe Ser Arg Ser Thr Lys Gln Ile Thr Phe Ile Arg Ile 5001679 CAC AAG GGG AGC CAA GTT TGC CGA TGC AGT GAA TGT GGT AAA ATA TTC 1726501 His Lys Gly Ser Gln Val Cys Arg Cys Ser Glu Cys Gly Lys Ile Phe 5161727 CGG AAC CCA AGA TAC TTT TCT GTG CAT AAG AAA ATC CAT ACC GGA GAG 1774517 Arg Asn Pro Arg Tyr Phe Ser Val His Lys Lys Ile His Thr Gly Glu 5321775 AGG CCC TAT GTG TGT CAA GAC TGT GGG AAA GGA TTT GTT CAG AGC TCT 1822533 Arg Pro Tyr Val Cys Gln Asp Cys Gly Lys Gly Phe Val Gln Ser Ser 5481823 TCC CTC ACA CAG CAT CAG AGA GTT CAT TCT GGA GAG AGA CCA TTT GAA 1870549 Ser Leu Thr 
Gln His Gln Arg Val His Ser Gly Glu Arg Pro Phe Glu 5641871 TGT CAG GAG TGT GGG AGG ACC TTC AAT GAT CGC TCA GCC ATC TCC CAG 1918565 Cys Gln Glu Cys Gly Arg Thr Phe Asn Asp Arg Ser Ala Ile Ser Gln 5801919 CAC CTG AGG ACT CAC ACT GGC GCT AAG CCC TAC AAG TGT CAG GAC TGT 1966581 His Leu Arg Thr His Thr Gly Ala Lys Pro Tyr Lys Cys Gln Asp Cys 5961967 GGA AAA GCC TTC CGC CAG AGT TCC CAC CTC ATC AGA CAT CAG AGG ACT 2014597 Gly Lys Ala Phe Arg Gln Ser Ser His Leu Ile Arg His Gln Arg Thr 6122015 CAC ACC GGG GAG CGC CCA TAT GCA TGC AAC AAA TGT GGA AAG GCC TTC 2062613 His Thr Gly Glu Arg Pro Tyr Ala Cys Asn Lys Cys Gly Lys Ala Phe 6282063 ACC CAG AGC TCA CAC CTT ATT GGG CAC CAG AGA ACC CAC AAT AGG ACA 2110629 Thr Gln Ser Ser His Leu Ile Gly His Gln Arg Thr His Asn Arg Thr 6442111 AAG CGA AAG AAG AAA CAG CCT ACC TCA TAG CTC TCA AGC CAG TTG AAG 2158645 Lys Arg Lys Lys Lys Gln Pro Thr Ser *** 6542159 AAA CCT TGC CTT TTC AGC TTG ACC CTG CAA TAT AAC ATG CAC AGG CCT 22062207 GCT TGT GAA TCA GGA CTG AAT GTG AAA GGG AAG TAT TGA GTG AGG ACA 22542255 TTC CCA AAA CCA AAG GAC AAC TGA GGA GAC TGC CCA GCA CAT AAT GAA 23022303 TAA ATA AGA AAA TGA GTG AGG AGT TAT TAA CAT CAT TTG GAA AAA AGA 23502351 TTT CCC ATT CAC TTG ATA TTG TTT GTT CAC TCA TTT AGT CAT TAA AAG 23982399 TGA GAT TAA TAA AAT CTG AAA ATG TTA TAT AAT AAC TTT AAA AAG CCA 24462447 GGT AAT TAA TAA TCT GCA CTG ATA TTA CAT CCA CAG TAC CAC AGT ATT 24942495 TAT GTG TAT GAA TTA AGG ATT AAA AGA TAA TGT GGA TAA AAA AAA AAA 25422543 AAA A 2546
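The numbering in the sequence listing can be sanity-checked with a short script. The following is a minimal sketch, not part of the patent: it builds the standard genetic code, translates the first and last codons of the open reading frame as they appear in the listing, and counts the codons between nucleotide 179 (the `ATG` start) and nucleotide 2140 (the end of the `TAG` stop). Those two positions are read off the listing's own coordinates, and the helper names `translate` and `codon_count` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def codon_count(first_nt: int, last_nt: int) -> int:
    """Number of codons in an inclusive, 1-based nucleotide range."""
    length = last_nt - first_nt + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


# First eight codons of the ORF (listing positions 179-202).
orf_head = "ATG GCC TCC AGG CTT CCG ACG GCC"
# Last codons before and after the stop (listing positions 2126-2146).
orf_tail = "CAG CCT ACC TCA TAG CTC TCA"

print(translate(orf_head))     # MASRLPTA
print(translate(orf_tail))     # QPTS
print(codon_count(179, 2140))  # 654
```

The count of 654 codons includes the stop codon, so the encoded polypeptide has 653 residues; the listing assigns position 654 to the `***` stop marker.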
All documents mentioned in the present invention are incorporated by reference in this application, as if each document were individually incorporated by reference. In addition, it should be understood that, after reading the above teachings of the present invention, those skilled in the art may make various changes or modifications to the invention, and such equivalent forms likewise fall within the scope defined by the claims appended to this application.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB001259008A CN1155614C (en) | 2000-10-31 | 2000-10-31 | Novel human protein with the function of inhibiting the growth of cancer cells and its coding sequence |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1351079A (en) | 2002-05-29 |
| CN1155614C (en) | 2004-06-30 |
Family
ID=4591681
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB001259008A Expired - Fee Related CN1155614C (en) | 2000-10-31 | 2000-10-31 | Novel human protein with the function of inhibiting the growth of cancer cells and its coding sequence |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN1155614C (en) |
- 2000-10-31: CN application CNB001259008A, patent CN1155614C (en), not active (Expired - Fee Related)
Also Published As
| Publication number | Publication date |
|---|---|
| CN1155614C (en) | 2004-06-30 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| CN1150804A (en) | Fibroblast Growth Factor-10 | |
| CN1308346C (en) | Novel human phosphotidylethanolamine binding protein, its coding sequence and application | |
| CN1313297A (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
| CN1351079A (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
| CN1351081A (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
| CN1368510A (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
| CN1323803A (en) | New human protein with the function of inhibiting cancer cell growth and its coding sequence | |
| US20030148331A1 (en) | Novel human hepatoma associated protein and the polynucleotide encoding said polypeptide | |
| CN1329065A (en) | Novel human protein with function of promoting growth of cancer cell and its code sequence | |
| US7087715B2 (en) | Human paris-1 antigen and nucleic acids: diagnostic and therapeutic uses | |
| CN100425695C (en) | Novel human Rab GTP enzyme, its coding sequence and application | |
| CN1369506A (en) | Human Protein for promoting transform of 3T3 cell and its coding sequence | |
| CN1342713A (en) | Human macrobiosis-ensuring protein and its coding sequence and application | |
| CN1369505A (en) | Human protein for promoting transform of 3T3 cell and its coding sequence | |
| CN1324819A (en) | New human protein with the function of inhibiting tumor cell growth and its encoding sequence | |
| CN1368511A (en) | Human protein with cancer inhibiting function and its coding sequence | |
| CN1323802A (en) | New human protein with the function of inhibiting cancer cell growth and its coding sequence | |
| CN1352138A (en) | Sperm formation relative protein and its coding sequence and use | |
| CN1324820A (en) | New human protein with the function of inhibiting cancer cell growth and its encoding sequence | |
| CN1368509A (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
| CN1475503A (en) | Novel growth inhibitory factor derived from human bone marrow stromal cells, its coding sequence and use | |
| CN1343688A (en) | Human NIP2 associated protein and coding sequence and application thereof | |
| US20040005658A1 (en) | Novel polypeptide-human an1-like protein 16 and the polynucleotide encoding the same | |
| AU1118301A (en) | Tumour suppressor genes from chromosome 16 | |
| WO2004033493A1 (en) | Human protein for promoting transformation of 3t3 cell and its coding sequence |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| C19 | Lapse of patent right due to non-payment of the annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |