CN1231497C - Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof - Google Patents
Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof Download PDFInfo
- Publication number
- CN1231497C CN1231497C CNB021369984A CN02136998A CN1231497C CN 1231497 C CN1231497 C CN 1231497C CN B021369984 A CNB021369984 A CN B021369984A CN 02136998 A CN02136998 A CN 02136998A CN 1231497 C CN1231497 C CN 1231497C
- Authority
- CN
- China
- Prior art keywords
- pro
- ser
- leu
- gly
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000010307 cell transformation Effects 0.000 title claims abstract description 95
- 108091026890 Coding region Proteins 0.000 title claims description 11
- 230000001737 promoting effect Effects 0.000 title abstract description 5
- 102000003839 Human Proteins Human genes 0.000 title abstract 3
- 108090000144 Human Proteins Proteins 0.000 title abstract 3
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 78
- 229920001184 polypeptide Polymers 0.000 claims abstract description 75
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 75
- 238000000034 method Methods 0.000 claims abstract description 66
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 54
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 54
- 239000002157 polynucleotide Substances 0.000 claims abstract description 54
- 108090000623 proteins and genes Proteins 0.000 claims description 90
- 102000004169 proteins and genes Human genes 0.000 claims description 50
- 235000018102 proteins Nutrition 0.000 claims description 48
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 19
- 238000002360 preparation method Methods 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 28
- 239000005557 antagonist Substances 0.000 abstract description 14
- 230000001225 therapeutic effect Effects 0.000 abstract description 3
- 230000006870 function Effects 0.000 description 93
- 108020004414 DNA Proteins 0.000 description 69
- 241000282414 Homo sapiens Species 0.000 description 61
- 210000004027 cell Anatomy 0.000 description 59
- 239000002773 nucleotide Substances 0.000 description 45
- 125000003729 nucleotide group Chemical group 0.000 description 45
- 239000002299 complementary DNA Substances 0.000 description 37
- 235000001014 amino acid Nutrition 0.000 description 23
- 150000001413 amino acids Chemical class 0.000 description 23
- 239000012634 fragment Substances 0.000 description 19
- 108091028043 Nucleic acid sequence Proteins 0.000 description 17
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 15
- 201000010099 disease Diseases 0.000 description 14
- 238000009396 hybridization Methods 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 11
- 241000282326 Felis catus Species 0.000 description 11
- 239000002131 composite material Substances 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 11
- 238000004519 manufacturing process Methods 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- 108010026333 seryl-proline Proteins 0.000 description 10
- 230000008859 change Effects 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 238000012216 screening Methods 0.000 description 9
- 206010028980 Neoplasm Diseases 0.000 description 8
- 150000001875 compounds Chemical class 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 201000011510 cancer Diseases 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 6
- 241000700605 Viruses Species 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 108010060199 cysteinylproline Proteins 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- 230000008521 reorganization Effects 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 238000011282 treatment Methods 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 5
- 230000005856 abnormality Effects 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 230000008034 disappearance Effects 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 239000008194 pharmaceutical composition Substances 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 108010038745 tryptophylglycine Proteins 0.000 description 5
- 108090000994 Catalytic RNA Proteins 0.000 description 4
- 102000053642 Catalytic RNA Human genes 0.000 description 4
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 4
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 4
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 230000010261 cell growth Effects 0.000 description 4
- 230000002759 chromosomal effect Effects 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 210000003917 human chromosome Anatomy 0.000 description 4
- 238000007901 in situ hybridization Methods 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 3
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 3
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 3
- XKDYWGLNSCNRGW-WDSOQIARSA-N Arg-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)CCCCN)C(O)=O)=CNC2=C1 XKDYWGLNSCNRGW-WDSOQIARSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 3
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- 206010027336 Menstruation delayed Diseases 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 3
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 3
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 3
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 3
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 210000003754 fetus Anatomy 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 210000004754 hybrid cell Anatomy 0.000 description 3
- 239000002502 liposome Substances 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 230000004853 protein function Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 2
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 2
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 2
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 2
- ISWAQPWFWKGCAL-ACZMJKKPSA-N Cys-Cys-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISWAQPWFWKGCAL-ACZMJKKPSA-N 0.000 description 2
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102100037840 Dehydrogenase/reductase SDR family member 2, mitochondrial Human genes 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 2
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 2
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 2
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 2
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 2
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 101710188053 Protein D Proteins 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 101710132893 Resolvase Proteins 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 2
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 2
- BGWSLEYVITZIQP-DCPHZVHLSA-N Trp-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O BGWSLEYVITZIQP-DCPHZVHLSA-N 0.000 description 2
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 2
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 2
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 2
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 230000001093 anti-cancer Effects 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000014107 chromosome localization Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000002405 diagnostic procedure Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 210000003494 hepatocyte Anatomy 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 210000005228 liver tissue Anatomy 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 210000002826 placenta Anatomy 0.000 description 2
- -1 polyoxyethylene Polymers 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- JWDFQMWEFLOOED-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-(pyridin-2-yldisulfanyl)propanoate Chemical compound O=C1CCC(=O)N1OC(=O)CCSSC1=CC=CC=N1 JWDFQMWEFLOOED-UHFFFAOYSA-N 0.000 description 1
- AOXKZRNFFMKAOV-JYJNAYRXSA-N (2s)-2-[[(2s)-6-amino-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-(1h-imidazol-5-yl)propanoyl]amino]hexanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)NC(=O)CN)C1=CN=CN1 AOXKZRNFFMKAOV-JYJNAYRXSA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- UZGKAASZIMOAMU-UHFFFAOYSA-N 124177-85-1 Chemical compound NP(=O)=O UZGKAASZIMOAMU-UHFFFAOYSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- DZIGZIIJIGGANI-FXQIFTODSA-N Cys-Glu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DZIGZIIJIGGANI-FXQIFTODSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 108010053187 Diphtheria Toxin Proteins 0.000 description 1
- 102000016607 Diphtheria Toxin Human genes 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- CZCIKBSVHDNIDH-NSHDSACASA-N N(alpha)-methyl-L-tryptophan Chemical compound C1=CC=C2C(C[C@H]([NH2+]C)C([O-])=O)=CNC2=C1 CZCIKBSVHDNIDH-NSHDSACASA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- CZCIKBSVHDNIDH-UHFFFAOYSA-N Nalpha-methyl-DL-tryptophan Natural products C1=CC=C2C(CC(NC)C(O)=O)=CNC2=C1 CZCIKBSVHDNIDH-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108010067902 Peptide Library Proteins 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- FZNNGIHSIPKFRE-QEJZJMRPSA-N Ser-Trp-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZNNGIHSIPKFRE-QEJZJMRPSA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 108700007696 Tetrahydrofolate Dehydrogenase Proteins 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- 241000218636 Thuja Species 0.000 description 1
- HJWVPKJHHLZCNH-DVXDUOKCSA-N Trp-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)C)C(O)=O)=CNC2=C1 HJWVPKJHHLZCNH-DVXDUOKCSA-N 0.000 description 1
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 1
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- MZDJYWGXAIEYEP-BPUTZDHNSA-N Trp-Cys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MZDJYWGXAIEYEP-BPUTZDHNSA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000008485 antagonism Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 238000011953 bioanalysis Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 230000005907 cancer growth Effects 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000036449 good health Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 230000002637 immunotoxin Effects 0.000 description 1
- 239000002596 immunotoxin Substances 0.000 description 1
- 231100000608 immunotoxin Toxicity 0.000 description 1
- 229940051026 immunotoxin Drugs 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 238000012177 large-scale sequencing Methods 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000031864 metaphase Effects 0.000 description 1
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Natural products C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002969 morbid Effects 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- 239000003123 plant toxin Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 238000003156 radioimmunoprecipitation Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000007781 signaling event Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Landscapes
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to a class of human proteins with a function of promoting 3T3 cell transformation, polynucleotide for coding the polypeptide, and a method for generating the polypeptide by a recombinant technology. The present invention also discloses an antagonist resisting the polypeptide and the therapeutic function of the antagonist. The present invention also discloses the applications of the polynucleotide for coding the class of human proteins with the function of promoting 3T3 cell transformation.
Description
Technical field
The invention belongs to biological technical field, specifically, the present invention relates to new coding and have the proteic polynucleotide of people that promote 3T3 cell transformation function, and the polypeptide of this polynucleotide encoding.The invention still further relates to the purposes and the preparation of these polynucleotide and polypeptide.
Background technology
The research of people's gene group is international focus at present, removes human chromosome DNA large scale sequencing, outside the method for expressed sequence order-checking (EST), also lacks the screening that begins from function and has the high-throughout method of functional gene.
Cancer is one of principal disease of harm humans health.In order to treat effectively and prophylaxis of tumours, people more and more pay close attention to genetic treatment of tumor at present.Therefore, this area presses for development research people albumen and the agonist/inhibitor thereof relevant with growth of cancer cells.
Summary of the invention
The purpose of this invention is to provide people's protein polypeptide that new the having of a class promote 3T3 cell transformation function with and fragment, analogue and derivative.
Another object of the present invention provides the polynucleotide of these polypeptide of coding.
Another object of the present invention provides the method for these polypeptide of production and the purposes of this polypeptide and encoding sequence.
In a first aspect of the present invention, novel isolated protein polypeptide with promotion 3T3 cell transformation function is provided, it comprises the polypeptide of the aminoacid sequence with the group of being selected from down: SEQ ID NO:3,6,9,12,15,18,21,24,27,30; Or its conservative property variation polypeptide or its active fragments or its reactive derivative.
Preferably, this polypeptide is the polypeptide with aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9,12,15,18,21,24,27,30.
In a second aspect of the present invention, a kind of isolating polynucleotide are provided, it comprises a nucleotide sequence, and this nucleotide sequence is shown at least 85% homogeny with a kind of nucleotides sequence that is selected from down group: the polynucleotide with the protein polypeptide that promotes 3T3 cell transformation function that (a) coding is above-mentioned; (b) with polynucleotide (a) complementary polynucleotide.Preferably, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9,12,15,18,21,24,27,30.More preferably, the sequence of these polynucleotide is selected from down group: SEQ ID NO:2,5,8,11,14,17,20,23,26,29 coding region sequence or full length sequence.
In a third aspect of the present invention, the carrier that contains above-mentioned polynucleotide is provided, and has been transformed or host cell of transduceing or the host cell that is directly transformed or transduce by above-mentioned polynucleotide by this carrier.
In a fourth aspect of the present invention, provide preparation to have the preparation method of the polypeptide of the protein-active that promotes 3T3 cell transformation function, this method comprises: (a) being fit to express under the proteic condition with promotion 3T3 cell transformation function, cultivate the above-mentioned host cell that is transformed or transduce; (b) from culture, isolate polypeptide with the protein-active that promotes 3T3 cell transformation function.
In a fifth aspect of the present invention, provide and the above-mentioned protein polypeptide specificity bonded antibody that promotes 3T3 cell transformation function that has.The nucleic acid molecule that can be used for detecting also is provided, and it contains, and continuous 10 Nucleotide are to full length nucleotide in the above-mentioned polynucleotide, and preferably it contains the about 15-1000 of a successive Nucleotide.
In a sixth aspect of the present invention, a kind of pharmaceutical composition is provided, it contains the protein polypeptide and the pharmaceutically acceptable carrier with promotion 3T3 cell transformation function of the present invention of safe and effective amount.These pharmaceutical compositions can be used for promoting the growth of cell.The present invention also provides a kind of pharmaceutical composition, it contain safe and effective amount at antagonist (as antibody) and the pharmaceutically acceptable carrier with the protein polypeptide that promotes 3T3 cell transformation function of the present invention.This pharmaceutical composition can be treated illnesss such as cancer and cellular abnormality propagation.
Others of the present invention are because disclosing of the technology of this paper is conspicuous to those skilled in the art.
Embodiment
The 3T3 cell is a kind of l cell (J.Cell.Biol., 17:299,1963) (being also referred to as the NIH/3T3 cell).In the cancer research field, often foreign gene (especially people's gene) is introduced the 3T3 cell, observe its situation that influences to the growth of 3T3 cell.It has been generally acknowledged that, to 3T3 cell growth (or vicious transformation or transfection) influential gene is cancer related gene, wherein to 3T3 cell growth or transform that inhibiting gene is arranged is cancer suppressor gene mostly, and to the growth of 3T3 cell or transform (former) oncogene that has the gene of promoter action to be mostly.
The present invention adopts large-scale cDNA clone transfection mouse embryo fibroblasts 3T3, has on the basis that promotes the growth effect in acquisition, proves new gene through order-checking, further obtains full length cDNA clone.DNA transfection evidence, the albumen with promotion 3T3 cell transformation function of the present invention has the effect that promotes that the clone forms, its promotion rate 〉=50% to the 3T3 cell.
As used herein, " isolating " is meant that material separates (if natural substance, primal environment promptly is a natural surroundings) from its primal environment.Do not have separation and purification as polynucleotide under the native state in the active somatic cell and polypeptide, but same polynucleotide or polypeptide as from native state with in other materials that exist separately, then for separation and purification.
As used herein, " isolating albumen or polypeptide with promotion 3T3 cell transformation function " is meant to have and promotes the protein polypeptide of 3T3 cell transformation function to be substantially free of natural relative other albumen, lipid, carbohydrate or other material.Those skilled in the art can have the albumen that promotes 3T3 cell transformation function with the purified technology of protein purifying of standard.Basically pure polypeptide can produce single master tape on non-reduced polyacrylamide gel.
Polypeptide of the present invention can be recombinant polypeptide, natural polypeptides, synthetic polypeptide, preferred recombinant polypeptide.Polypeptide of the present invention can be the product of natural purifying, or the product of chemosynthesis, or uses recombinant technology to produce from protokaryon or eucaryon host (for example, bacterium, yeast, higher plant, insect and mammalian cell).The host used according to the recombinant production scheme, polypeptide of the present invention can be glycosylated, maybe can be nonglycosylated.Polypeptide of the present invention also can comprise or not comprise initial methionine residues.
The present invention also comprises having the proteic fragment of people, derivative and the analogue that promotes 3T3 cell transformation function.As used herein, term " fragment ", " derivative " are meant with " analogue " and keep natural identical biological function or the active polypeptide of people's albumen that promotes 3T3 cell transformation function that have of the present invention basically.Polypeptide fragment of the present invention, derivative or analogue can be that (i) has one or more conservative or substituted polypeptide of non-conservation amino-acid residue (preferred conservative amino acid residue), and the amino-acid residue of such replacement can be also can not encoded by genetic code, or (ii) in one or more amino-acid residues, has a polypeptide of substituted radical, or (iii) mature polypeptide and another compound (such as the compound that prolongs the polypeptide transformation period, polyoxyethylene glycol for example) merge formed polypeptide, or (iv) additional aminoacid sequence is fused to this peptide sequence and the polypeptide that forms (as leader sequence or secretion sequence or be used for the sequence or the proteinogen sequence of this polypeptide of purifying).According to the instruction of this paper, these fragments, derivative and analogue belong to the known scope of those skilled in the art.
Polynucleotide of the present invention can be dna form or rna form.Dna form comprises the DNA of cDNA, genomic dna or synthetic.DNA can be strand or double-stranded.DNA can be coding strand or noncoding strand.Be example with FP18315 albumen (in this application, its clone numbering is adopted in proteinic name), the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:2 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that for FP18315 coding has the protein of SEQ ID NO:3, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:2.Be example with LP2209 albumen again, the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:5 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that for LP2209 coding has the protein of SEQ ID NO:6, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:5.Other have the albumen that promotes 3T3 cell transformation function for the present invention, and the rest may be inferred.
The polynucleotide of encoding mature polypeptide comprise: the encoding sequence of an encoding mature polypeptide; The encoding sequence of mature polypeptide and various additional code sequence: encoding sequence of mature polypeptide (with optional additional code sequence) and non-coding sequence.
Term " polynucleotide of coded polypeptide " can be the polynucleotide that comprise this polypeptide of encoding, and also can be the polynucleotide that also comprise additional code and/or non-coding sequence.
The invention still further relates to the varient of above-mentioned polynucleotide, its coding has the polypeptide of identical aminoacid sequence or fragment, analogue and the derivative of polypeptide with the present invention.The varient of these polynucleotide can be the allelic variant of natural generation or the varient that non-natural takes place.These nucleotide diversity bodies comprise and replace varient, deletion mutation body and insert varient.As known in the art, allelic variant is the replacement form of polynucleotide, and it may be replacement, disappearance or the insertion of one or more Nucleotide, but can be from not changing the function of its encoded polypeptides in fact.
The invention still further relates to and above-mentioned sequence hybridization and two sequences between have at least 50%, preferably at least 70%, the polynucleotide of at least 80% homogeny more preferably.The present invention be more particularly directed under stringent condition and the interfertile polynucleotide of polynucleotide of the present invention.In the present invention, " stringent condition " is meant: (1) than hybridization under low ionic strength and the comparatively high temps and wash-out, as 0.2 * SSC, and 0.1%SDS, 60 ℃; Or (2) hybridization the time is added with denaturing agent, as 50% (v/v) methane amide, 0.1% calf serum/0.1%Ficoll, 42 ℃ etc.; Or (3) only at the homogeny between the two sequences at least more than 95%, be more preferably 97% and just hybridize when above.And the polypeptide of interfertile polynucleotide encoding has identical biological function and activity with the mature polypeptide shown in the SEQ IDNO:3 (is example with FP18315 albumen).
The invention still further relates to nucleic acid fragment with above-mentioned sequence hybridization.As used herein, the length of " nucleic acid fragment " contains 15 Nucleotide at least, better is at least 30 Nucleotide, is more preferably at least 50 Nucleotide, preferably more than at least 100 Nucleotide.The amplification technique (as PCR) that nucleic acid fragment can be used for nucleic acid has the proteic polynucleotide that promotes 3T3 cell transformation function to determine and/or to separate coding.
Polypeptide among the present invention and polynucleotide preferably provide with isolating form, more preferably are purified to homogeneous.
Dna sequence dna of the present invention can obtain with several method.For example, with hybridization technique DNA isolation well known in the art.These technology including, but not limited to: 1) with probe and genome or the hybridization of cDNA library to detect homology nucleotide sequence and 2) antibody screening of expression library to be to detect the dna fragmentation of the clone with common structure feature.
Coding has the proteic specific DNA fragment sequence that promotes 3T3 cell transformation function and produces also and can obtain with following method: 1) separate double chain DNA sequence from genomic dna; 2) the chemical synthesising DNA sequence is to obtain the double-stranded DNA of required polypeptide.
When the whole aminoacid sequence of the polypeptide product of needs was known, the direct chemical of dna sequence dna is synthetic to be the method for often selecting for use.When if required amino acid whose whole sequence is not known, the direct chemical of dna sequence dna is synthetic to be impossible, and the method for selecting for use is the separation of cDNA sequence.The standard method that separates interested cDNA is from the donorcells separating mRNA of this gene of high expression level and carries out reverse transcription, forms plasmid or phage cDNA library.Extract the existing multiple proven technique of method of mRNA, test kit also can obtain (Qiagene) from commercial channels.And the construction cDNA library also is usual method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold Spring HarborLaboratory.New York, 1989).Also can obtain the cDNA library of commercial offers, as the different cDNA library of Clontech company.When being used in combination the polymeric enzyme reaction technology, even few expression product also can be cloned.
Available ordinary method is screened gene of the present invention from these cDNA libraries.These methods include, but is not limited to: (1) DNA-DNA or DNA-RNA hybridization; (2) function of marker gene occurs or forfeiture; (3) measure level with the proteic transcript that promotes 3T3 cell transformation function; (4), detect the protein product of genetic expression by immunological technique or mensuration biologic activity.Aforesaid method can singly be used, but also several different methods combined utilization.
In (1) kind method, hybridizing used probe is and any a part of homology of polynucleotide of the present invention that at least 15 Nucleotide of its length better are at least 30 Nucleotide, are more preferably at least 50 Nucleotide, preferably at least 100 Nucleotide.In addition, the length of probe within 2kb, preferably is within the 1kb usually.Probe used herein is the dna sequence dna of chemosynthesis on the basis of gene DNA sequence information of the present invention normally.Gene of the present invention itself or fragment are certainly as probe.The mark of dna probe can be used radio isotope, fluorescein or enzyme (as alkaline phosphatase) etc.
In (4) kind method, detect protein product and can use immunological technique such as Western blotting, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA) etc. with the protein gene expression that promotes 3T3 cell transformation function.
Use method (Saiki, the et al.Science 1985 of round pcr DNA amplification/RNA; 230:1350-1354) be optimized for acquisition gene of the present invention.When particularly being difficult to obtain the cDNA of total length from the library, can preferably use RACE method (the terminal rapid amplifying method of RACE-cDNA), the primer that is used for PCR can suitably be selected according to sequence information of the present invention disclosed herein, and available ordinary method is synthetic.Available ordinary method is as the DNA/RNA fragment by gel electrophoresis separation and purifying amplification.
The gene of the present invention that obtains as mentioned above, perhaps the available ordinary method of mensuration of the nucleotide sequence of various dna fragmentations etc. such as dideoxy chain termination (Sanger et al.PNAS, 1977,74:5463-5467).This class nucleotide sequencing is available commercial sequencing kit etc. also.In order to obtain the cDNA sequence of total length, order-checking need be carried out repeatedly.Sometimes need to measure a plurality of clones' cDNA sequence, just can be spliced into the cDNA sequence of total length.
The present invention also relates to comprise the carrier of polynucleotide of the present invention, and with carrier of the present invention or have the host cell that the albumen coded sequence that promotes 3T3 cell transformation function produces through genetically engineered, and the method that produces polypeptide of the present invention through recombinant technology.
Recombinant DNA technology (Science, 1984 by routine; 224:1431), can utilize polymerized nucleoside acid sequence of the present invention to can be used to express or produce the protein polypeptide that promotes 3T3 cell transformation function that has of reorganization.In general following steps are arranged:
(1). have the proteic polynucleotide of people (or varient) that promote 3T3 cell transformation function with coding of the present invention, or transform or the transduction proper host cell with the recombinant expression vector that contains these polynucleotide;
(2). the host cell of in suitable medium, cultivating;
(3). separation, protein purification from substratum or cell.
Among the present invention, the people's albumen polynucleotide sequence with promotion 3T3 cell transformation function can be inserted in the recombinant expression vector.Term " recombinant expression vector " refers to that bacterial plasmid well known in the art, phage, yeast plasmid, vegetable cell virus, mammalian cell virus are as adenovirus, retrovirus or other carriers.The carrier of Shi Yonging includes but not limited in the present invention: and the expression vector based on T7 of in bacterium, expressing (Rosenberg, et al.Gene, 1987,56:125); The pMSXND expression vector of in mammalian cell, expressing (Lee and Nathans, J Bio Chem.263:3521,1988) and at the carrier that derives from baculovirus of expressed in insect cells.In a word, as long as can duplicate in host and stablize, any plasmid and carrier can be used.A key character of expression vector is to contain replication orgin, promotor, marker gene and translation controlling elements usually.
Method well-known to those having ordinary skill in the art can be used to make up contain and has people's encoding histone dna sequence dna of promoting 3T3 cell transformation function and suitable transcribing/translate the expression vector of control signal.These methods comprise (Sambroook, et al) such as extracorporeal recombinant DNA technology, DNA synthetic technology, the interior recombinant technologys of body.Described dna sequence dna can effectively be connected on the suitable promotor in the expression vector, and is synthetic to instruct mRNA.The representative example of these promotors has: colibacillary lac or trp promotor; Lambda particles phage PL promotor; Eukaryotic promoter comprises CMV immediate early promoter, early stage and late period SV40 promotor and some other known may command gene expression promoter in protokaryon or eukaryotic cell or its virus.Expression vector also comprises ribosome bind site and the transcription terminator that translation initiation is used.
In addition, expression vector preferably comprises one or more selected markers, to be provided for selecting the phenotypic character of transformed host cells, cultivate Tetrahydrofolate dehydrogenase, neomycin resistance and the green fluorescent protein (GFP) of usefulness as eukaryotic cell, or be used for colibacillary tsiklomitsin or amicillin resistance.
Comprise the carrier of above-mentioned suitable dna sequence dna and suitable promotor or control sequence, can be used to transform appropriate host cell, so that it can marking protein.
Host cell can be a prokaryotic cell prokaryocyte, as bacterial cell; Or eukaryotic cell such as low, as yeast cell; Or higher eucaryotic cells, as mammalian cell.Representative example has: intestinal bacteria, streptomyces; The bacterial cell of Salmonella typhimurium; Fungal cell such as yeast; Vegetable cell; The insect cell of fruit bat S2 or Sf9; The zooblast of CHO, COS or Bowes melanoma cells etc.
When polynucleotide of the present invention are expressed in higher eucaryotic cells, be enhanced if will make to transcribe when in carrier, inserting enhancer sequence.Enhanser is the cis acting factor of DNA, and nearly 10 to 300 base pairs act on promotor transcribing with enhancing gene usually.Can for example be included in the SV40 enhanser of 100 to 270 base pairs of replication origin side in late period one, at the polyoma enhanser of replication origin side in late period one and adenovirus enhanser etc.
Persons skilled in the art all know how to select appropriate carriers, promotor, enhanser and host cell.
Can carry out with routine techniques well known to those skilled in the art with the recombinant DNA transformed host cell.When the host was prokaryotic organism such as intestinal bacteria, the competent cell that can absorb DNA can be used CaCl in exponential growth after date results
2Method is handled, and used step is well-known in this area.Alternative is to use MgCl
2If desired, transforming also the method for available electroporation carries out.When the host is an eukaryote, can select following DNA transfection method for use: coprecipitation of calcium phosphate method, conventional mechanical method such as microinjection, electroporation, liposome packing etc.
The transformant that obtains can be cultivated with ordinary method, expresses the polypeptide of coded by said gene of the present invention.According to used host cell, used substratum can be selected from various conventional substratum in the cultivation.Under the condition that is suitable for the host cell growth, cultivate.After host cell grows into suitable cell density, induce the promotor of selection with suitable method (as temperature transition or chemical induction), cell is cultivated for some time again.
Recombinant polypeptide in the above methods can wrap by in the daughter cell, extracellular or on cytolemma, express or be secreted into the extracellular.If desired, can utilize its physics, the separating by various separation methods with other characteristic and the albumen of purification of Recombinant of chemistry.These methods are well-known to those skilled in the art.The example of these methods includes, but are not limited to: conventional renaturation handles, with protein precipitant handle (salt analysis method), centrifugal, the broken bacterium of infiltration, superly handle, the combination of super centrifugal, sieve chromatography (gel-filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography (LC) technology and these methods.
Having of reorganization promotes the people's albumen or the polypeptide of 3T3 cell transformation function to be of use in many ways.These purposes include, but is not limited to: directly have the disease due to the low or forfeiture of the protein function that promotes 3T3 cell transformation function as pharmacological agent and be used to screen and promote or antagonism has antibody, polypeptide or other part of the protein function that promotes 3T3 cell transformation function.For example, this antibody can be used for treating cancer or cellular abnormality propagation.The peptide molecule that can suppress or stimulate people's protein function that can be used for seeking therapeutic value with recombinant expressed protein screening peptide library of the present invention with promotion 3T3 cell transformation function.
The present invention also provides screening of medicaments to improve (agonist) or check the method that (antagonist) has the proteic medicament of people that promotes 3T3 cell transformation function to identify.Agonist improves and to have the people's albumen that promotes 3T3 cell transformation function biological function such as stimulate cellular proliferation, and antagonist prevention disorder such as the various cancer relevant with cell hyperproliferation with treatment.
Have the proteic antagonist of people that promotes 3T3 cell transformation function and comprise antibody, compound, acceptor disappearance thing and the analogue etc. that filter out.Have the proteic antagonist of people that promotes 3T3 cell transformation function and can and eliminate its function with people's protein binding with promotion 3T3 cell transformation function, or suppress to have the proteic generation of people that promotes 3T3 cell transformation function, or combine with the avtive spot of polypeptide and to make polypeptide can not bring into play biological function.Have and promote the proteic antagonist of people of 3T3 cell transformation function to can be used for therepic use.
In screening during as the compound of antagonist, can add in the bioanalysis mensuration having the albumen that promotes 3T3 cell transformation function, the albumen and the interaction between its acceptor that have promotion 3T3 cell transformation function by the mensuration compounds affect determine whether compound is antagonist.With the same quadrat method of above-mentioned SCREENED COMPOUND, can filter out the acceptor disappearance thing and the analogue of antagonist action.
The proteic antagonist of the present invention can be directly used in disease treatment, for example, and various malignant tumours and cellular abnormality propagation etc.
Polypeptide of the present invention, and fragment, derivative, analogue or their cell can be used as antigen to produce antibody.These antibody can be polyclone or monoclonal antibody.Polyclonal antibody can obtain by the method with this polypeptide direct injection animal.The technology of preparation monoclonal antibody comprises hybridoma technology, three knurl technology, people B-quadroma technology, EBV-hybridoma technology etc.
Can be with polypeptide of the present invention and antagonist and suitable pharmaceutical carrier combination back use.These carriers can be water, glucose, ethanol, salt, damping fluid, glycerine and their combination.Composition comprises the polypeptide or the antagonist of safe and effective amount and carrier and the vehicle that does not influence effect of drugs.These compositions can be used as medicine and are used for disease treatment.
The present invention also provides medicine box or the test kit that contains one or more containers, and one or more medicinal compositions compositions of the present invention are housed in the container.With these containers, can have by the given indicative prompting of government authorities of making, using or selling medicine or biological products, the government authorities that this prompting reflects production, uses or sells permits it to use on human body.In addition, polypeptide of the present invention can be used in combination with other treatment compound.
Pharmaceutical composition can be with mode administration easily, as by in part, intravenously, intraperitoneal, intramuscular, subcutaneous, the nose or the route of administration of intracutaneous.Have the albumen or its specific antibody that promote 3T3 cell transformation function, can come administration by the amount that treats and/or prevents concrete indication effectively.Be applied to having of patient and promote the proteic amount and the dosage range of 3T3 cell transformation function will depend on many factors, as administering mode, person's to be treated healthiness condition and diagnostician's judgement.
Have and promote the proteic polynucleotide of people of 3T3 cell transformation function also to can be used for multiple therapeutic purpose.Gene therapy technology can be used for treating since have that the proteic nothing that promotes 3T3 cell transformation function is expressed or unusual/non-activity have cell development or a metabolic disturbance due to the proteic expression that promotes 3T3 cell transformation function.The gene therapy vector (as virus vector) of reorganization can be designed to express the albumen that promotes 3T3 cell transformation function that has of variation, to suppress the endogenic protein-active that promotes 3T3 cell transformation function that has.For example, a kind of albumen that promotes 3T3 cell transformation function that has of variation can be the albumen with promotion 3T3 cell transformation function that shortens, lacked signal conduction function territory, though can combine with the substrate in downstream, lacks signaling activity.Therefore the gene therapy vector of reorganization can be used for treating and has the protein expression that promotes 3T3 cell transformation function or the disease of active caused by abnormal.Deriving from the expression vector of virus such as retrovirus, adenovirus, adeno-associated virus (AAV), hsv, parvovirus etc. can be used for having and promotes the protein gene of 3T3 cell transformation function to be transferred in the cell.The method that structure carries the recombinant viral vector with the protein gene that promotes 3T3 cell transformation function is found in existing document (Sambrook, et al.).Reorganization has the people's protein gene that promotes 3T3 cell transformation function and can be packaged in the liposome and be transferred in the cell in addition.
Inhibition has the oligonucleotide (comprising sense-rna and DNA) of the people's protein mRNA that promotes 3T3 cell transformation function and ribozyme also within the scope of the invention.Ribozyme is the enzyme sample RNA molecule that a kind of energy specificity is decomposed specific RNA, and its mechanism of action is to carry out the endonuclease effect after ribozyme molecule and the hybridization of complementary target RNA-specific.The RNA of antisense and DNA and ribozyme can obtain with existing any RNA or DNA synthetic technology, as the technology widespread use of solid phase phosphoamide chemical synthesis synthetic oligonucleotide.Antisense rna molecule can be transcribed acquisition by the dna sequence dna of this RNA that encodes in external or body.This dna sequence dna has been incorporated into the downstream of rna polymerase promoter of carrier.In order to increase the stability of nucleic acid molecule, available several different methods is modified it, and as increasing the sequence length of both sides, the connection between the ribonucleoside is used phosphoric acid thioester bond or peptide bond but not phosphodiester bond.
Polynucleotide imports tissue or intracellular method comprises: directly be injected into polynucleotide in the in-vivo tissue; Or external by carrier (as virus, phage or plasmid etc.) earlier with the polynucleotide transfered cell in, again cell is transplanted in the body etc.Because albumen of the present invention has the function that promotes the 3T3 cell transformation, so the antisense sequences of albumen coded sequence of the present invention, can be introduced into cell to suppress the abnormality proliferation (as canceration) of cell.
The present invention also provides at the antibody with the people's proteantigen determinant that promotes 3T3 cell transformation function.These antibody include, but is not limited to: the fragment that polyclonal antibody, monoclonal antibody, chimeric antibody, single-chain antibody, Fab fragment and Fab expression library produce.
The anti-proteic antibody of people with promotion 3T3 cell transformation function can be used in the immunohistochemistry technology, detects the people's albumen that promotes 3T3 cell transformation function that has in the biopsy specimen.
The also available labelled with radioisotope of the protein bound monoclonal antibody of people with having promotion 3T3 cell transformation function injects in the body and can follow the tracks of its position and distribution.This radiolabeled antibody can be used as a kind of atraumatic diagnostic method and is used for the location of tumour cell and has judged whether transfer.
Antibody among the present invention can be used for treating or preventing and have the relevant disease of people's albumen of promotion 3T3 cell transformation function.The antibody that gives suitable dosage can be blocked proteic generation of people or the activity with promotion 3T3 cell transformation function, thus the abnormality proliferation of the growth of anticancer and/or cell.
Antibody also can be used for designing the immunotoxin at a certain privileged sites in the body.As have the people's albumen high-affinity that promotes 3T3 cell transformation function monoclonal antibody can with bacterium or plant poison (as diphtheria toxin, ricin, abrine etc.) covalent attachment.A kind of usual method is with sulfydryl linking agent such as SPDP, attacks the amino of antibody, by the exchange of disulfide linkage, toxin is incorporated on the antibody, and this hybrid antibody can be used for killing relevant positive cell (as cancer cells).
Available people's albumen or the polypeptide immune animal of the production of polyclonal antibody with promotion 3T3 cell transformation function, as rabbit, mouse, rat etc.Multiple adjuvant can be used for the enhancing immunity reaction, includes but not limited to freund's adjuvant etc.
Have promote 3T3 cell transformation function people's protein monoclonal antibody can with hybridoma technology production (Kohler andMilstein.Nature, 1975,256:495-497).With the variable region bonded chimeric antibody in human constant region and inhuman source can with existing technology production (Morrison et al, PNAS, 1985,81:6851).And the technology of existing manufacture order chain antibody (U.S.Pat No.4946778) also can be used for producing the anti-proteic single-chain antibody of people that promotes 3T3 cell transformation function that has.
Can with have the protein bound peptide molecule of people that promotes 3T3 cell transformation function and can be incorporated into the rondom polypeptide storehouse that solid formation forms by the various amino acid that may make up by screening and obtain.During screening, must promote people's protein molecular of 3T3 cell transformation function to carry out mark to having.
The invention still further relates to quantitatively and detection and localization has the diagnostic testing process of people's protein level of promotion 3T3 cell transformation function.These tests are known in the art, and comprise that FISH measures and radioimmunoassay.That is detected in the test has a protein level that promotes 3T3 cell transformation function, can have the importance of albumen in various diseases that promotes 3T3 cell transformation function with laying down a definition and be used to diagnose to have the disease that the albumen that promotes 3T3 cell transformation function works.
Proteic polynucleotide with promotion 3T3 cell transformation function can be used for having the diagnosis and the treatment of the protein related diseases that promotes 3T3 cell transformation function.Aspect diagnosis, have the proteic polynucleotide that promotes 3T3 cell transformation function can be used for detecting have promote 3T3 cell transformation function proteic expression whether or under morbid state, have an abnormal exprssion that promotes 3T3 cell transformation function.As the protein D NA sequence with promotion 3T3 cell transformation function can be used for that the hybridization of biopsy specimen is had the proteic abnormal expression that promotes 3T3 cell transformation function with judgement.Hybridization technique comprises the Southern blotting, Northern blotting, in situ hybridization etc.These technological methods all are disclosed mature technologies, and relevant test kit all can obtain from commercial channels.Part or all of polynucleotide of the present invention can be used as probe stationary on microarray (Microarray) or DNA chip (being gene chip), is used for analyzing the differential expression analysis and the gene diagnosis of tissue gene.Carry out RNA-polymerase chain reaction (RT-PCR) amplification in vitro with the special primer of albumen and also can detect proteic transcription product with promotion 3T3 cell transformation function with promotion 3T3 cell transformation function.
The sudden change that detection has the protein gene that promotes 3T3 cell transformation function also can be used for diagnosing the relevant disease of albumen with promotion 3T3 cell transformation function.Form with the protein mutation that promotes 3T3 cell transformation function comprises that to have point mutation that the protein D NA sequence that promotes 3T3 cell transformation function compares, transposition, disappearance, reorganization and other any unusual etc. with normal wild type.Available existing technology such as Southern blotting, dna sequence analysis, PCR and in situ hybridization detect sudden change.In addition, sudden change might influence proteic expression, therefore can judge indirectly that with Northern blotting, Western blotting gene has or not sudden change.
Sequence of the present invention identifies it also is valuable to karyomit(e).These sequences can be specifically at certain bar human chromosome particular location and and can with its hybridization.At present, need to identify the concrete site of each gene on the karyomit(e).Yet have only chromosomal marker thing seldom to can be used for the marker chromosomes position now based on actual sequence data (repetition polymorphism).For these sequences are associated with disease related gene.The first step is positioned dna sequence dna of the present invention on the karyomit(e) exactly.
In brief, prepare PCR primer (preferred 15-35bp), sequence can be positioned on the karyomit(e) according to cDNA.Then, these primers are used for the somatocyte hybrid cell that the PCR screening contains each bar human chromosome.Have only those hybrid cells that contain corresponding to the people's gene of primer can produce the fragment of amplification.
The PCR localization method of somatocyte hybrid cell is that DNA is navigated to concrete chromosomal quick method.Use Oligonucleolide primers of the present invention,, can utilize one group to realize inferior location from specific chromosomal fragment or a large amount of genomic clone by similar approach.Other the similar strategy that can be used for chromosomal localization comprises in situ hybridization, uses the karyomit(e) prescreen and the hybridization preliminary election of the airflow classification of mark, thereby makes up the special cDNA storehouse of karyomit(e).
The cDNA clone is carried out fluorescence in situ hybridization (FISH) with Metaphase Chromosome, can in a step, accurately carry out chromosomal localization.The summary of this technology is referring to Verma etc., Human Chromosomes:a Manual of BasicTechniques, Pergamon Press, New York (1988).
In case sequence is positioned to chromosome position accurately, the physical location of this sequence on karyomit(e) just can be associated with the gene map data.These data for example are found in, V.Mckusick, Mendelian Inheritance in Man (can by with the online acquisition of Johns Hopkins University Welch Medical Library).Can pass through linkage analysis then, determine gene and navigated to relation between the disease on the chromosomal region already.
Then, need to measure ill and not cDNA between diseased individuals or genome sequence difference.If observe certain sudden change in some or all of diseased individuals, and this sudden change is not observed in any normal individual, then this sudden change may be the cause of disease of disease.More ill and diseased individuals not is usually directed at first seek the variation of structure in the karyomit(e), as from the horizontal visible of karyomit(e) or use based on detectable disappearance of the PCR of cDNA sequence or transposition.
Pyrenoids thuja acid full length sequence or its fragment with promotion 3T3 cell transformation function of the present invention can obtain with the method for pcr amplification method, recombination method or synthetic usually.For the pcr amplification method, can be disclosed according to the present invention about nucleotide sequence, especially open reading frame sequence designs primer, and with commercially available cDNA storehouse or by the prepared cDNA storehouse of ordinary method well known by persons skilled in the art as template, amplification and must relevant sequence.When sequence is longer, usually needs to carry out twice or pcr amplification repeatedly, and then the fragment that each time amplifies is stitched together by proper order.
In case obtained relevant sequence, just can obtain relevant sequence in large quantity with recombination method.This normally is cloned into carrier with it, changes cell again over to, separates obtaining relevant sequence then from the host cell after the propagation by ordinary method.
In addition, also the method for available synthetic is synthesized relevant sequence, especially fragment length more in short-term.Usually, by first synthetic a plurality of small segments, and then connect and to obtain the very long fragment of sequence.
At present, can be fully come the dna sequence dna of code book invention albumen (or its fragment, or derivatives thereof) by chemosynthesis.This dna sequence dna can be introduced then in the various dna moleculars (as carrier) and cell in this area.In addition, also can will suddenly change and introduce in the protein sequence of the present invention by chemosynthesis.
Because the albumen with promotion 3T3 cell transformation function of the present invention has the natural acid sequence that is derived from the people, therefore, compare with the albumen of the same clan that derives from other species, estimate to have higher active and/or lower side effect (for example in the intravital immunogenicity of people lower or do not have) being applied to man-hour.
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used to the present invention is described and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example, usually according to people such as normal condition such as Sambrook, molecular cloning: laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in, or the condition of advising according to manufacturer.Notice that in Nucleotide and amino acid composite sequence, what (1) provided is the position that initial sum stops first Nucleotide of coding, (2) molecular weight unit is dalton.
The acquisition of embodiment 1:cDNA gene and the promoter action that mouse NIH/3T3 cell clone is formed
FP18315 comes from the human fetal cDNA library that makes up with ordinary method: LP2209, LP2570, LP3317, LP3428, LP4947, LP5553, LP6347, LP8067 and LP8151 come from the normal hepatocytes cDNA library that makes up with ordinary method.Get fetal tissue (FP clone) or normal liver tissue (LP clone), (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.Make up the cDNA library of above-mentioned mRNA with pCMV-script TMXRcDNA library construction test kit (Stratagene company).Wherein ThermoScript II is used MMLV-RT-Superscript II (GIBCO BRL) instead, and reverse transcription reaction carries out at 42 ℃.Transform XL 10-Gold recipient cell, obtained the cDNA library of 1 * 106cfu/ μ g titre.The first round is picking cDNA clone at random, is probe with high abundance cDNA clone with the cDNA clone who has proved anticancer growth function thereafter, screening by hybridization cDNA library, weak positive and negative clone of picking.With Qiagen 96 orifice plate plasmid extraction test kits, carry out the extraction of plasmid DNA by shop instruction.Plasmid DNA and empty carrier transfection simultaneously mouse NIH/3T3 cell.After the 100ng DNA alcohol precipitation drying, add 6 μ l H2O dissolving, treat transfection.Add 0.74 μ l liposome and 9.3 μ l serum-free mediums in every part of DNA sample, behind the mixing, room temperature was placed 10 minutes.Add 150 μ l serum-free mediums in every pipe, divide equally and add 3 holes and grow in the mouse NIH/3T3 cell of 96 orifice plates, placed 2 hours for 37 ℃, every hole adds 50 μ l serum-free mediums again, 37 ℃ 24 hours.Every hole is changed 100 μ l and is trained liquid entirely, 37 ℃ 24 hours, change the full training liquid 100 μ l that contain G418,37 ℃ 24-48 hour, the limit is observed, the training liquid that G418 concentration does not wait is changed on the limit.After about 2-3 time, there is the clone to form up to the microscopy cell, counting.Find that above-mentioned clone has the cell clone of promotion formation effect, the result is as shown in the table.
CDNA clone's transfectional cell (3T3) clone formation situation
| CDNA clones title | CDNA clones number (three repetitions) | Empty carrier clone number (three repetitions) | ||||
| FP18315 | 84 | 88 | 78 | 39 | 41 | 38 |
| LP2209 | 62 | 68 | 63 | 4 | 9 | 13 |
| LP2570 | 68 | 76 | 83 | 22 | 25 | 29 |
| LP3317 | 61 | 56 | 52 | 35 | 32 | 38 |
| LP3428 | 68 | 60 | 63 | 37 | 35 | 30 |
| LP4947 | 50 | 51 | 40 | 29 | 36 | 31 |
| LP5553 | 45 | 48 | 50 | 20 | 15 | 17 |
| LP6347 | 48 | 53 | 50 | 27 | 30 | 34 |
| LP8067 | 66 | 62 | 65 | 41 | 36 | 30 |
| LP8151 | 61 | 63 | 60 | 30 | 45 | 40 |
The cDNA clone is adopted two deoxidation cessation method, on the ABI377 automatic dna sequencer, measure the nucleotide sequence of the nearly 500bp of one end.After the analysis, be defined as novel gene cloning, carry out the other end order-checking, do not obtain full length cDNA sequence yet, the design primer checks order once more, up to obtaining full length sequence (SEQ ID NO:1,4,7,10,13,16,19,22,25,28).
Embodiment 2: PCR obtains full-length gene from placenta or normal hepatocytes cDNA:
Get fetal tissue (FP clone) or normal liver tissue (LP clone), (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.With MMLV-RT-Superscript II (GIBCO BRL), ThermoScript II is carried out reverse transcription reaction at 42 ℃, obtains placenta or fetus cDNA.Utilize the special primer (as shown in the table) of each gene, by 97 ℃ of 3 ' 1 circulation.94 ℃ 30 " 60 ℃ 30 " 72 ℃ of 1 ' 35 circulation, pcr amplification is carried out in 72 ℃ of 10 ' 1 circulation, and acquisition contains the amplified production of each protein gene of complete open reading frame sequence.Amplified production is through sequence verification, and the sequence that records with embodiment 1 conforms to, and changes amplified production over to host cell with routine techniques subsequently, obtains recombinant protein (SEQ ID NO:3,6,9,12,15,18,21,24,27,30).
Gene specific primer
| Clone's title | Special primer 1 (5 ' → 3 ') | SEQ ID NO | Special primer 2 (3 ' → 5 ') | SEQ ID NO |
| FP18315 | (73)GGCCAGCTCATGGTATCTCC | 31 | ATGAACCCTCTGACTCCGTCC(1884) | 32 |
| LP2209 | (101)CAGAAACCTGGGGTAAGCAA | 33 | TACAAATCACGGGAAAAGTACG(812) | 34 |
| LP2570 | (33)GATGATGCTGCCAAGTTATTCA | 35 | ATCCCCATAGACATCCTTACCG(1729) | 36 |
| LP3317 | (57)CTGGAGGGAGATCACAAAACA | 37 | ATGAAGCACTGGTGGGTGATAC(1102) | 38 |
| LP3428 | (33)CCCGTCTGGGATGTGAGG | 39 | GCTCTTTGTGGGTTCTTACTAG(1222) | 40 |
| LP4947 | (22)TTCATACTTGGTGCGCTGTG | 41 | GAGGGAACAAAGCTAACGG(2058) | 42 |
| LP5553 | (94)GATTCTGCCCATGTGCTCC | 43 | TCCTTTATTTCGTCTTCCTTGC(1901) | 44 |
| LP6347 | (138)TAGCTGGTTGCTAGAGTTACGG | 45 | TTTCCTTTATTCGGCACGTT(1899) | 46 |
| LP8067 | (19)CGGCAGACTGGGGTTGGT | 47 | TTTATTTCGCGGTCGCGT(15548) | 48 |
| LP8151 | (27)GGGTGTTGCTGTCATCTCCA | 49 | TTCTCCAAACCTCCCTCACC(1629) | 50 |
Annotate: in the bracket is the correspondence position of primer in each gene DNA sequence.
Embodiment 3:cDNA cloned sequence is analyzed
1.FP18315
A: nucleotide sequence (SEQ ID NO:1) length: 2032 bases
B: aminoacid sequence (SEQ ID NO:3) length: 117 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:2) clone number and protein name: FP18315 start code: 898ATG stops coding: 1249TAG protein molecular weight: 12585.84
2.LP2209
A: nucleotide sequence (SEQ ID NO:4) length: 926 bases
B: aminoacid sequence (SEQ ID NO:6) length: 70 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:5) clone number and protein name: LP2209 start code: 430ATG stops coding: 640TGA protein molecular weight: 7837.38
3.LP2570
A: nucleotide sequence (SEQ ID NO:7) length: 1814 bases
B: aminoacid sequence (SEQ ID NO:9) length: 117 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:8) clone number and protein name: LP2570 start code: 1001 ATG stop coding: 1352 TAA protein molecular weights: 12882.88
4.LP2317
A: nucleotide sequence (SEQ ID NO:10) length: 1199 bases
B: aminoacid sequence (SEQ ID NO:12) length: 103 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:11) clone number and protein name: LP3317 start code: 217 ATG stop coding: 526 TAA protein molecular weights: 11308.46
5.LP3428
A: nucleotide sequence (SEQ ID NO:13) length: 1269 bases
B: aminoacid sequence (SEQ ID NO:15) length: 80 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:14) clone number and protein name: LP3428 start code: 978 ATG stop coding: 1218 TGA protein molecular weights: 8987.08
6.LP4947
A: nucleotide sequence (SEQ ID NO:16) length: 2471 bases
B: aminoacid sequence (SEQ ID NO:18) length: 146 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:17) clone number and protein name: LP4947 start code: 108 ATG stop coding: 546 TGA protein molecular weights: 16017.98
7.LP5553
A: nucleotide sequence (SEQ ID NO:19) length: 1934 bases
B: aminoacid sequence (SEQ ID NO:21) length: 96 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:20) clone number and protein name: LP5553 start code: 837 ATG stop coding: 1125 TAG protein molecular weights: 10235.89
8.LP6347
A: nucleotide sequence (SEQ ID NO:22) length: 1915 bases
B: aminoacid sequence (SEQ ID NO:24) length: 91 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:23) clone number and protein name: LP6347 start code: 343 ATG stop coding: 616 TGA protein molecular weights: 9540.07
9.LP8067
A: nucleotide sequence (SEQ ID NO:25) length: 1575 bases
B: aminoacid sequence (SEQ ID NO:27) length: 200 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:26) clone number and protein name: LP8067 start code: 754 ATG stop coding: 1354 TGA protein molecular weights: 20839.38
10.LP8151
A: nucleotide sequence (SEQ ID NO:28) length: 1800 bases
B: aminoacid sequence (SEQ ID NO:30) length: 168 amino acid
C. Nucleotide and amino acid composite sequence (SEQ ID NO:29) clone number and protein name: LP8151 start code: 409 ATG stop coding: 913 TGA protein molecular weights: 18356.29
All quote in this application as a reference at all documents that the present invention mentions, just quoted as a reference separately as each piece document.Should be understood that in addition those skilled in the art can make various changes or modifications the present invention after having read above-mentioned teachings of the present invention, these equivalent form of values fall within the application's appended claims institute restricted portion equally.
Sequence table
<110〉Shanghai Xinshijie Gene Techn Development Co., Ltd.
<120〉have new people's albumen and the encoding sequence thereof that promotes mouse NIII/3T3 cell transformation function
<130>025304
<160>50
<170>PatentIn version 3.1
<210>1
<211>2032
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>1
gctgagttaa tgttgagcca ctccatgtga cacagaagcc tcttgtcact gtgttgttca 60
ttggggtgcc gaggccagct catggtatct cctgagagcc agtcgttcac tttccagtat 120
ttttttttgc gggggcgtgg ggggacggat gttcactctt gttgcccagg ctggaacaca 180
atggcacgat ctcggttcac cacgacctcc acctcctgag gcaggagagt cgcttgaacc 240
caggaggcgg aggtcgtggt aagccaagat cacgccattg cactccagcc tggcaacaag 300
gagcaaaact ccgtctgaaa aaaaaaaaaa aaattagcca ggcctggtgg tgcacgcttg 360
taatcccagc tacttgagag gctgaggcag gagaattgct tgaacccagg aggtggaggt 420
tgcagtgagc cgagatcgtg ccactgccct ccagtctggg tgacagagtg agattacatc 480
tcaaaaaaaa aaaaagggtg ggaggaataa gttttacacc ctcctcccca ccccacatgg 540
aaatcatcct taaccactag agcggtgttg tcctgaagcg ctctctgtgg tgacggatgg 600
ttttccacct gcactgagtt ggttgtcagc agccacacgt gccttccgcg cacttgaaac 660
atggctaatg tgactggcta cctgagtctt tatttttctt taacgtacac gtggacagct 720
cagcgtgggt agtgaggcgc ggcctgagcc ccctgtggct ctgcacgcca gcctctgccg 780
tgttatgatc agctgcaaag ttggtaacca tgccatagta gtggaggctt tggggagggg 840
agagtaaggt ggtttgtagg actagatgga ccagataatg catacccggt aaaataaatg 900
cttgtgtggt ttgcagaatc gtcagtagtg gggcgacaga gaccagggtg gcgctttcca 960
gtggggtcac tggtcctgag cccccatggc atagaggtac cgcggccggc tgtgtccgca 1020
tctgctggcc tctattctcc ctgtgttatt tacaaagatg tttgtctcct agcaggtgcg 1080
ccacgtgagg cccagagccg cactcgctca ccctgtgcgc tctcccccaa ccgccccgca 1140
ccgtccccgt ccctggcacg gcgcttcgtg ccttccacag gcttcataac tgctttgttt 1200
ctaaagtccc gttctgacaa cgccatcgag agccacctgt ctctgattta gatcctaact 1260
caggaaggac ccatttcccc agcagctgca gcctctgttg atgtaatgag cacggtatgg 1320
tatggagaca gtaactcctt actgcagccc tgcgagggct cctgtcccca cttgaccctc 1380
gtagcaaccc cctgaggtcg caacagtctt tgtcttatcc actgctcttg ggagggtctg 1440
tggctcttgg ttgagcgtct tcaccgcacc ctcatcctgg tgatgccgcg atgctccaac 1500
cccagagcgg acacttggcg tcggaattgc agggagtccc tccgctgtag aaagggctcc 1560
tcctcttcct tcccccatgc catcccagga gaaaggcctg ccgtggagtg ccgcttcctt 1620
tgtctccaga cagtccagcc tgggcgacag agcaggactc tttctttgtc tcaaaaaaaa 1680
aaaaaagtac atgtttagcc aggcatggtg gctcatgcct gtaatcccag cactttggga 1740
ggccgaggca ggtggatcaa aaggtcaaga gatggagacc atctggccaa catggtgaaa 1800
ccccatctct actaaaaata aaaaaatcag ctgggcgtgg tggtgcacgc ctgtagtccc 1860
atctacttgg gagactgagg caggagaatc tcttgaaccg gggaggcaga ggttgcagtg 1920
agccgagatt gcgccactgc actccagcct ggcaacagag agagactcca tctctaaaaa 1980
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2032
<210>2
<211>2032
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(898)..(1248)
<223>
<400>2
gctgagttaa tgttgagcca ctccatgtga cacagaagcc tcttgtcact gtgttgttca 60
ttggggtgcc gaggccagct catggtatct cctgagagcc agtcgttcac tttccagtat 120
ttttttttgc gggggcgtgg ggggacggat gttcactctt gttgcccagg ctggaacaca 180
atggcacgat ctcggttcac cacgacctcc acctcctgag gcaggagagt cgcttgaacc 240
caggaggcgg aggtcgtggt aagccaagat cacgccattg cactccagcc tggcaacaag 300
gagcaaaact ccgtctgaaa aaaaaaaaaa aaattagcca ggcctggtgg tgcacgcttg 360
taatcccagc tacttgagag gctgaggcag gagaattgct tgaacccagg aggtggaggt 420
tgcagtgagc cgagatcgtg ccactgccct ccagtctggg tgacagagtg agattacatc 480
tcaaaaaaaa aaaaagggtg ggaggaataa gttttacacc ctcctcccca ccccacatgg 540
aaatcatcct taaccactag agcggtgttg tcctgaagcg ctctctgtgg tgacggatgg 600
ttttccacct gcactgagtt ggttgtcagc agccacacgt gccttccgcg cacttgaaac 660
atggctaatg tgactggcta cctgagtctt tatttttctt taacgtacac gtggacagct 720
cagcgtgggt agtgaggcgc ggcctgagcc ccctgtggct ctgcacgcca gcctctgccg 780
tgttatgatc agctgcaaag ttggtaacca tgccatagta gtggaggctt tggggagggg 840
agagtaaggt ggtttgtagg actagatgga ccagataatg catacccggt aaaataa 897
atg ctt gtg tgg ttt gca gaa tcg tca gta gtg ggg cga cag aga cca 945
Met Leu Val Trp Phe Ala Glu Ser Ser Val Val Gly Arg Gln Arg Pro
1 5 10 15
ggg tgg cgc ttt cca gtg ggg tca ctg gtc ctg agc ccc cat ggc ata 993
Gly Trp Arg Phe Pro Val Gly Ser Leu Val Leu Ser Pro His Gly Ile
20 25 30
gag gta ccg cgg ccg gct gtg tcc gca tct gct ggc ctc tat tct ccc 1041
Glu Val Pro Arg Pro Ala Val Ser Ala Ser Ala Gly Leu Tyr Ser Pro
35 40 45
tgt gtt att tac aaa gat gtt tgt ctc cta gca ggt gcg cca cgt gag 1089
Cys Val Ile Tyr Lys Asp Val Cys Leu Leu Ala Gly Ala Pro Arg Glu
50 55 60
gcc cag agc cgc act cgc tca ccc tgt gcg ctc tcc ccc aac cgc ccc 1137
Ala Gln Ser Arg Thr Arg Ser Pro Cys Ala Leu Ser Pro Asn Arg Pro
65 70 75 80
gca ccg tcc ccg tcc ctg gca cgg cgc ttc gtg cct tcc aca ggc ttc 1185
Ala Pro Ser Pro Ser Leu Ala Arg Arg Phe Val Pro Ser Thr Gly Phe
85 90 95
ata act gct ttg ttt cta aag tcc cgt tct gac aac gcc atc gag agc 1233
Ile Thr Ala Leu Phe Leu Lys Ser Arg Ser Asp Asn Ala Ile Glu Ser
100 105 110
cac ctg tct ctg att tagatcctaa ctcaggaagg acccatttcc ccagcagctg 1288
His Leu Ser Leu Ile
115
cagcctctgt tgatgtaatg agcacggtat ggtatggaga cagtaactcc ttactgcagc 1348
cctgcgaggg ctcctgtccc cacttgaccc tcgtagcaac cccctgaggt cgcaacagtc 1408
tttgtcttat ccactgctct tgggagggtc tgtggctctt ggttgagcgt cttcaccgca 1468
ccctcatcct ggtgatgccg cgatgctcca accccagagc ggacacttgg cgtcggaatt 1528
gcagggagtc cctccgctgt agaaagggct cctcctcttc cttcccccat gccatcccag 1588
gagaaaggcc tgccgtggag tgccgcttcc tttgtctcca gacagtccag cctgggcgac 1648
agagcaggac tctttctttg tctcaaaaaa aaaaaaaagt acatgtttag ccaggcatgg 1708
tggctcatgc ctgtaatccc agcactttgg gaggccgagg caggtggatc aaaaggtcaa 1768
gagatggaga ccatctggcc aacatggtga aaccccatct ctactaaaaa taaaaaaatc 1828
agctgggcgt ggtggtgcac gcctgtagtc ccatctactt gggagactga ggcaggagaa 1888
tctcttgaac cggggaggca gaggttgcag tgagccgaga ttgcgccact gcactccagc 1948
ctggcaacag agagagactc catctctaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2008
aaaaaaaaaa aaaaaaaaaa aaaa 2032
<210>3
<211>117
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>3
Met Leu Val Trp Phe Ala Glu Ser Ser Val Val Gly Arg Gln Arg Pro
1 5 10 15
Gly Trp Arg Phe Pro Val Gly Ser Leu Val Leu Ser Pro His Gly Ile
20 25 30
Glu Val Pro Arg Pro Ala Val Ser Ala Ser Ala Gly Leu Tyr Ser Pro
35 40 45
Cys Val Ile Tyr Lys Asp Val Cys Leu Leu Ala Gly Ala Pro Arg Glu
50 55 60
Ala Gln Ser Arg Thr Arg Ser Pro Cys Ala Leu Ser Pro Asn Arg Pro
65 70 75 80
Ala Pro Ser Pro Ser Leu Ala Arg Arg Phe Val Pro Ser Thr Gly Phe
85 90 95
Ile Thr Ala Leu Phe Leu Lys Ser Arg Ser Asp Asn Ala Ile Glu Ser
100 105 110
His Leu Ser Leu Ile
115
<210>4
<211>926
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>4
gcccttctac ccatatctat tgagtgcttg ttatgaagaa agagagcagc acaccggccc 60
tctgtctgct gggggagact gacgagtgaa cagattattc cagaaacctg gggtaagcaa 120
gatagtggat aggggacaga gggccaaagc ttcctagact gatgattagc tgagctgagt 180
gataaaagag aagtcagata attacacaaa aatgagagcg gcagcccagc ccatgcaaaa 240
gcatggccct gaccggaagc taccaggaag gattctgatg tgcctggcct gggtggtgcc 300
tgctcccaga gcctgcaagc tgggttctgg ggaagcatcc tctaggtgat agagcaggct 360
gtagagcaag ccctgttctg agtgaggagg acaggccctg tgtgtgatgg ccccacctga 420
gccactggca tgggcagtca gttcccaaga cggagtccag gacagacaaa aaatgaacag 480
ctgaaaagtg caagtagtgt aggttcaagg aagtggtgta ggaagaggga gacggaatgg 540
gatctgagga gggctgcaca gcacacagta ggcggcacaa aagtctgctc agttaggtca 600
catgctccag aggcactcac tgcaaaagag cctgaagatt gaactgaaat atgccatcgg 660
ctttgctgag tatgaatgcc aagaggagca gagagaagtc aagccctcta ggtgataggc 720
aggaacgagc tgaaagaagg acataaatct tggtttgctc agacgggcct ggattatact 780
tacgttaatt atgtttagtg cccttttcat gctaagaagt gtcctacttt ggatgataaa 840
ttgtacagtc actctaggtt taagtgatac tcaggcagtc tggcttggaa agtcaagtca 900
ggagaagaaa aaaaaaaaaa aaaaaa 926
<210>5
<211>926
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(430)..(639)
<223>
<400>5
gcccttctac ccatatctat tgagtgcttg ttatgaagaa agagagcagc acaccggccc 60
tctgtctgct gggggagact gacgagtgaa cagattattc cagaaacctg gggtaagcaa 120
gatagtggat aggggacaga gggccaaagc ttcctagact gatgattagc tgagctgagt 180
gataaaagag aagtcagata attacacaaa aatgagagcg gcagcccagc ccatgcaaaa 240
gcatggccct gaccggaagc taccaggaag gattctgatg tgcctggcct gggtggtgcc 300
tgctcccaga gcctgcaagc tgggttctgg ggaagcatcc tctaggtgat agagcaggct 360
gtagagcaag ccctgttctg agtgaggagg acaggccctg tgtgtgatgg ccccacctga 420
gccactggc atg ggc agt cag ttc cca aga cgg agt cca gga cag aca aaa 471
Met Gly Ser Gln Phe Pro Arg Arg Ser Pro Gly Gln Thr Lys
1 5 10
aat gaa cag ctg aaa agt gca agt agt gta ggt tca agg aag tgg tgt 519
Asn Glu Gln Leu Lys Ser Ala Ser Ser Val Gly Ser Arg Lys Trp Cys
15 20 25 30
agg aag agg gag acg gaa tgg gat ctg agg agg gct gca cag cac aca 567
Arg Lys Arg Glu Thr Glu Trp Asp Leu Arg Arg Ala Ala Gln His Thr
35 40 45
gta ggc ggc aca aaa gtc tgc tca gtt agg tca cat gct cca gag gca 615
Val Gly Gly Thr Lys Val Cys Ser Val Arg Ser His Ala Pro Glu Ala
50 55 60
ctc act gca aaa gag cct gaa gat tgaactgaaa tatgccatcg gctttgctga 669
Leu Thr Ala Lys Glu Pro Glu Asp
65 70
gtatgaatgc caagaggagc agagagaagt caagccctct aggtgatagg caggaacgag 729
ctgaaagaag gacataaatc ttggtttgct cagacgggcc tggattatac ttacgttaat 789
tatgtttagt gcccttttca tgctaagaag tgtcctactt tggatgataa attgtacagt 849
cactctaggt ttaagtgata ctcaggcagt ctggcttgga aagtcaagtc aggagaagaa 909
aaaaaaaaaa aaaaaaa 926
<210>6
<211>70
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>6
Met Gly Ser Gln Phe Pro Arg Arg Ser Pro Gly Gln Thr Lys Asn Glu
1 5 10 15
Gln Leu Lys Ser Ala Ser Ser Val Gly Ser Arg Lys Trp Cys Arg Lys
20 25 30
Arg Glu Thr Glu Trp Asp Leu Arg Arg Ala Ala Gln His Thr Val Gly
35 40 45
Gly Thr Lys Val Cys Ser Val Arg Ser His Ala Pro Glu Ala Leu Thr
50 55 60
Ala Lys Glu Pro Glu Asp
65 70
<210>7
<211>1814
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>7
ggttttgtga actgtacaat gtacaaatat gagatgatgc tgccaagtta ttcaaagggt 60
cattcatcag tgaatcaacg tatcaacaag cctattagct ggacagttga ggatacacca 120
ataagccgtc tagctgggga tgagaagagg gagggctgca cgtgaagggt taattcacaa 180
tgtagagtga tagtggtgag cagactttta ggctgaaggc tgaagggctc atccttttta 240
aattttgaaa caaggtcctg cattgtcacc cagtcaccca ggctgagtac agtggcacaa 300
tcgtggctca ttgtagtctc cacctcccag gctcaagcaa tcctcccatc tcagcctccc 360
agctactcgg aaggctgagg caggagaatc gcttgaactc gggaggtgga ggttgtagtg 420
agctgaggtc atgccactgc actccagcct ggtgacagaa cgagactctg tctcataaat 480
aaataaataa ataaataaat aaataaatag tcaggcatgg tggcatgcac ctgtagtggg 540
aggctgacat caggggatca cttgagctta gaaggtggag gctgcagtga gctctgactg 600
caccactgca ctccagcctg gggaacagag tgagaccctg tctcaaaaac aaaaaaacaa 660
aaaacaaaaa caaggttaag actgtcttgg aaactgaagg ggcccagaaa acctacaaga 720
cctgcctgag attttattta gggataagaa gaaagcaagc aagctgaaaa ggacagaaaa 780
agaaaataat gttgacttca gtttgcaggt taaatcccat tgaccatata agggttacat 840
ccatgtaaag tgcctggctc atagtagtgc acccatcatt tccccttcac cataggagaa 900
gctgcatccc aggcgatcca agcccaacga attcctccca ctcccaggaa gggcttgctt 960
gaggccagag gctggcaggg aaaaaaaact cctctgcaaa atgttgcctc agcctctgga 1020
atttacactt accccactgc cgtctgctca acattcctgc ctgaaattcc agccgaacca 1080
taggaaacaa ctgctctgtg aaggtccagg aactggcagg ggctggcagg cctctcccaa 1140
ctcccaaatt gctgtcacag gctatgacct cacttcctta gcccgctcca gggtgagcct 1200
cctctccagt cctgaatgtt cggtgggcag gagggcaaca gcgtgggcct ggaccagatc 1260
atggcaggcc actgagcagg gcttggccag gttactgccg tggatcattt cccattttgg 1320
agagtcaaac cccaataccc ttgctcaaag ttaaatgcac agaggaaatt ccctgttctg 1380
ggaggcttgg gggaagtgga tgctccctcg gctcagcttt cttgatgagt cgaagggagg 1440
aaaggagttg tgtttcttga gtgcttactc tataccaggc cctgtgctag gcttacactc 1500
ttcagaatga ctgtgtgagg ccagatgcag tgactcacgc ctgtaatccc aaaatgctgg 1560
gattacagga gtgagccgct gtgcttggcc aagagggata tttttaaccc gagaacggtc 1620
atgaagcact tcctggagaa ggtgaaatag cactggtcct tgaaagacat ttagaattct 1680
gaccagtcaa taagatgaaa gatactgtag gggtatctgt aggaatggcc tacgtagaag 1740
tgaggagaga gaaaacctaa cagtggtggg tggtttgtgt tgggaagcaa aaaacgaaaa 1800
aaaaaaaaaa aaaa 1814
<210>8
<211>1814
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(1001)..(1351)
<223>
<400>8
ggttttgtga actgtacaat gtacaaatat gagatgatgc tgccaagtta ttcaaagggt 60
cattcatcag tgaatcaacg tatcaacaag cctattagct ggacagttga ggatacacca 120
ataagccgtc tagctgggga tgagaagagg gagggctgca cgtgaagggt taattcacaa 180
tgtagagtga tagtggtgag cagactttta ggctgaaggc tgaagggctc atccttttta 240
aattttgaaa caaggtcctg cattgtcacc cagtcaccca ggctgagtac agtggcacaa 300
tcgtggctca ttgtagtctc cacctcccag gctcaagcaa tcctcccatc tcagcctccc 360
agctactcgg aaggctgagg caggagaatc gcttgaactc gggaggtgga ggttgtagtg 420
agctgaggtc atgccactgc actccagcct ggtgacagaa cgagactctg tctcataaat 480
aaataaataa ataaataaat aaataaatag tcaggcatgg tggcatgcac ctgtagtggg 540
aggctgacat caggggatca cttgagctta gaaggtggag gctgcagtga gctctgactg 600
caccactgca ctccagcctg gggaacagag tgagaccctg tctcaaaaac aaaaaaacaa 660
aaaacaaaaa caaggttaag actgtcttgg aaactgaagg ggcccagaaa acctacaaga 720
cctgcctgag attttattta gggataagaa gaaagcaagc aagctgaaaa ggacagaaaa 780
agaaaataat gttgacttca gtttgcaggt taaatcccat tgaccatata agggttacat 840
ccatgtaaag tgcctggctc atagtagtgc acccatcatt tccccttcac cataggagaa 900
gctgcatccc aggcgatcca agcccaacga attcctccca ctcccaggaa gggcttgctt 960
gaggccagag gctggcaggg aaaaaaaact cctctgcaaa atg ttg cct cag cct 1015
Met Leu Pro Gln Pro
1 5
ctg gaa ttt aca ctt acc cca ctg ccg tct gct caa cat tcc tgc ctg 1063
Leu Glu Phe Thr Leu Thr Pro Leu Pro Ser Ala Gln His Ser Cys Leu
10 15 20
aaa ttc cag ccg aac cat agg aaa caa ctg ctc tgt gaa ggt cca gga 1111
Lys Phe Gln Pro Asn His Arg Lys Gln Leu Leu Cys Glu Gly Pro Gly
25 30 35
act ggc agg ggc tgg cag gcc tct ccc aac tcc caa att gct gtc aca 1159
Thr Gly Arg Gly Trp Gln Ala Ser Pro Asn Ser Gln Ile Ala Val Thr
40 45 50
ggc tat gac ctc act tcc tta gcc cgc tcc agg gtg agc ctc ctc tcc 1207
Gly Tyr Asp Leu Thr Ser Leu Ala Arg Ser Arg Val Ser Leu Leu Ser
55 60 65
agt cct gaa tgt tcg gtg ggc agg agg gca aca gcg tgg gcc tgg acc 1255
Ser Pro Glu Cys Ser Val Gly Arg Arg Ala Thr Ala Trp Ala Trp Thr
70 75 80 85
aga tca tgg cag gcc act gag cag ggc ttg gcc agg tta ctg ccg tgg 1303
Arg Ser Trp Gln Ala Thr Glu Gln Gly Leu Ala Arg Leu Leu Pro Trp
90 95 100
atc att tcc cat ttt gga gag tca aac ccc aat acc ctt gct caa agt 1351
Ile Ile Ser His Phe Gly Glu Ser Asn Pro Asn Thr Leu Ala Gln Ser
105 110 115
taaatgcaca gaggaaattc cctgttctgg gaggcttggg ggaagtggat gctccctcgg 1411
ctcagctttc ttgatgagtc gaagggagga aaggagttgt gtttcttgag tgcttactct 1471
ataccaggcc ctgtgctagg cttacactct tcagaatgac tgtgtgaggc cagatgcagt 1531
gactcacgcc tgtaatccca aaatgctggg attacaggag tgagccgctg tgcttggcca 1591
agagggatat ttttaacccg agaacggtca tgaagcactt cctggagaag gtgaaatagc 1651
actggtcctt gaaagacatt tagaattctg accagtcaat aagatgaaag atactgtagg 1711
ggtatctgta ggaatggcct acgtagaagt gaggagagag aaaacctaac agtggtgggt 1771
ggtttgtgtt gggaagcaaa aaacgaaaaa aaaaaaaaaa aaa 1814
<210>9
<211>117
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>9
Met Leu Pro Gln Pro Leu Glu Phe Thr Leu Thr Pro Leu Pro Ser Ala
1 5 10 15
Gln His Ser Cys Leu Lys Phe Gln Pro Asn His Arg Lys Gln Leu Leu
20 25 30
Cys Glu Gly Pro Gly Thr Gly Arg Gly Trp Gln Ala Ser Pro Asn Ser
35 40 45
Gln Ile Ala Val Thr Gly Tyr Asp Leu Thr Ser Leu Ala Arg Ser Arg
50 55 60
Val Ser Leu Leu Ser Ser Pro Glu Cys Ser Val Gly Arg Arg Ala Thr
65 70 75 80
Ala Trp Ala Trp Thr Arg Ser Trp Gln Ala Thr Glu Gln Gly Leu Ala
85 90 95
Arg Leu Leu Pro Trp Ile Ile Ser His Phe Gly Glu Ser Asn Pro Asn
100 105 110
Thr Leu Ala Gln Ser
115
<210>10
<211>1199
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>10
gaagactcag tcttctctgt tgttggttga ccaataaata aatatttagc ctttttctgg 60
agggagatca caaaacagga cagagctctt cctgaatcct tcaggcggat tcgccctctc 120
ttcctgttct gcggaggaca gtcctggggc ctttgctgtg tcaaccacgc cccgaggcac 180
cccagaagct gcatctgtcg agtctgtgct gtaatcatgg ccttgaaggt ggaaggcagg 240
tctgagatgg gaggtgacag gagcaggtct gggccggagc aaccgcaagc gcccgtctcc 300
ttctctccct ccctccctcc ctccttccct cctccctccc tccatcaaac caggcagagc 360
tggggtctgg ggcaccccag ggagacagac ctgccacctc agcaatggct tccaggtggg 420
aatgggggtc ttttagtcca gacttttcca gagatgctgg aaacacagct tgtcctgctg 480
tttaaatgtt cgcaattatt aatttttgca atgctttgta aaaactaaag gtacaatagt 540
tgggttccac cctgggaccg ccagtttgca atccttctct tagggactga gtcaggctgg 600
cggggggcgg gctgcagggc tcccacttga accagaagtc cctgtcctag atgtctagct 660
ccaaatgtgt tccctgccct ttcctgcttc cctgtagttc ctgcccgcac ccaggatccc 720
cagccatctg cccaagcctc cccaccccca ccacgccgcc cccactcccc acatgatggg 780
gaagctggcc aagacattca ttcctccaca ccccctgccc ccatcagcag ctccaccctc 840
agcaccccca cctcagctcc tcctgcaagg cctccgtgga tggggggtgg cggtggagaa 900
agcctttctt ttataattgt tttaaaccta cttttatttt tatttttgta gagacaagat 960
ctctctatgt tgcccaggct ggtcttgaac tcctggactc aaaggatcct ccagcctcgg 1020
ccccccagag tgctgggatt acaggcgtga gccatggtgc ccagcctggt aattaacaga 1080
tacttcgtga ccacccacta tgcaccaggc cctgttctat aattcagatg aaaatccctg 1140
agaggagctg acgtgctagc aggggaggca gatagcacca caaaaaaaaa aaaaaaaaa 1199
<210>11
<211>1199
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(217)..(525)
<223>
<400>11
gaagactcag tcttctctgt tgttggttga ccaataaata aatatttagc ctttttctgg 60
agggagatca caaaacagga cagagctctt cctgaatcct tcaggcggat tcgccctctc 120
ttcctgttct gcggaggaca gtcctggggc ctttgctgtg tcaaccacgc cccgaggcac 180
cccagaagct gcatctgtcg agtctgtgct gtaatc atg gcc ttg aag gtg gaa 234
Met Ala Leu Lys Val Glu
1 5
ggc agg tct gag atg gga ggt gac agg agc agg tct ggg ccg gag caa 282
Gly Arg Ser Glu Met Gly Gly Asp Arg Ser Arg Ser Gly Pro Glu Gln
10 15 20
ccg caa gcg ccc gtc tcc ttc tct ccc tcc ctc cct ccc tcc ttc cct 330
Pro Gln Ala Pro Val Ser Phe Ser Pro Ser Leu Pro Pro Ser Phe Pro
25 30 35
cct ccc tcc ctc cat caa acc agg cag agc tgg ggt ctg ggg cac ccc 378
Pro Pro Ser Leu His Gln Thr Arg Gln Ser Trp Gly Leu Gly His Pro
40 45 50
agg gag aca gac ctg cca cct cag caa tgg ctt cca ggt ggg aat ggg 426
Arg Glu Thr Asp Leu Pro Pro Gln Gln Trp Leu Pro Gly Gly Asn Gly
55 60 65 70
ggt ctt tta gtc cag act ttt cca gag atg ctg gaa aca cag ctt gtc 474
Gly Leu Leu Val Gln Thr Phe Pro Glu Met Leu Glu Thr Gln Leu Val
75 80 85
ctg ctg ttt aaa tgt tcg caa tta tta att ttt gca atg ctt tgt aaa 522
Leu Leu Phe Lys Cys Ser Gln Leu Leu Ile Phe Ala Met Leu Cys Lys
90 95 100
aac taaaggtaca atagttgggt tccaccctgg gaccgccagt ttgcaatcct 575
Asn
tctcttaggg actgagtcag gctggcgggg ggcgggctgc agggctccca cttgaaccag 635
aagtccctgt cctagatgtc tagctccaaa tgtgttccct gccctttcct gcttccctgt 695
agttcctgcc cgcacccagg atccccagcc atctgcccaa gcctccccac ccccaccacg 755
ccgcccccac tccccacatg atggggaagc tggccaagac attcattcct ccacaccccc 815
tgcccccatc agcagctcca ccctcagcac ccccacctca gctcctcctg caaggcctcc 875
gtggatgggg ggtggcggtg gagaaagcct ttcttttata attgttttaa acctactttt 935
atttttattt ttgtagagac aagatctctc tatgttgccc aggctggtct tgaactcctg 995
gactcaaagg atcctccagc ctcggccccc cagagtgctg ggattacagg cgtgagccat 1055
ggtgcccagc ctggtaatta acagatactt cgtgaccacc cactatgcac caggccctgt 1115
tctataattc agatgaaaat ccctgagagg agctgacgtg ctagcagggg aggcagatag 1175
caccacaaaa aaaaaaaaaa aaaa 1199
<210>12
<211>103
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>12
Met Ala Leu Lys Val Glu Gly Arg Ser Glu Met Gly Gly Asp Arg Ser
1 5 10 15
Arg Ser Gly Pro Glu Gln Pro Gln Ala Pro Val Ser Phe Ser Pro Ser
20 25 30
Leu Pro Pro Ser Phe Pro Pro Pro Ser Leu His Gln Thr Arg Gln Ser
35 40 45
Trp Gly Leu Gly His Pro Arg Glu Thr Asp Leu Pro Pro Gln Gln Trp
50 55 60
Leu Pro Gly Gly Asn Gly Gly Leu Leu Val Gln Thr Phe Pro Glu Met
65 70 75 80
Leu Glu Thr Gln Leu Val Leu Leu Phe Lys Cys Ser Gln Leu Leu Ile
85 90 95
Phe Ala Met Leu Cys Lys Asn
100
<210>13
<211>1269
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>13
gctgagatgt ggggagcgcc tctccccgcc gccccgtctg ggatgtgagg agtgcctcta 60
cccggccgcc ccgtctgaga agtgaggaga ccctccgcct ggcaaccgcc ccgtctgaga 120
agtgaggagc ccctccaccc ggcagccacc ccgtctggga agtgaggagc gtctccgcca 180
gcagccaccc cgtccgggag ggaggtgggg ggggtcagcc cgcccggcca gccgccccgt 240
ccaggaggga ggtggggggg tcagccccgc ccggccagcc gccccgtccg ggagggaggt 300
gggggggtca gccccctgcc cggccagccg ccccgtcggg gaggtgaggg gcgcctttgc 360
ctggccgccc ctactgggaa gtgaggagcc cctctgcccg gccagccacc ccgtccggga 420
gggaggtggg ggggtcagcc ccccgccagg ccagccaccc cgtccgggag ggaggtgggg 480
gtgtcagccc cccgcccggc cagccgccct gtccgggagg tgaggggcgc ctctgcccgg 540
ccgcccctac tgggaagtga ggagcccctc tgcccggcca gccgccccgt ccgggaggga 600
ggtggggggg tcagccccca gcccggccag tcgccccgtc ggggaggtga ggggcgcctc 660
tgcccggctg cccctactgg gaagtgagga gcccctctgc ccagccgcca ccccgtctgg 720
gaggtgtgcc cagcagctca ttgagaacgg gccatgatga caatggcggt tttgtggaat 780
agaaaggggg gaaagttggg ggaaaagatt gagaggttgg atggttgccg tgtctgtgta 840
gaaagacgta gacatgggag acttctcatt ttgttctgta ctaagaaaaa ttattctgcc 900
ttgggatcct gttgatctgt gaccttaccc ccaaccctgt gctctctgaa acatgtgctg 960
tgtccaccca gggttaaatg gatgaagggc ggtgcaagat gtgctttgtt aaacagatgc 1020
ttgaaggcag catgctcgtt aaaagtcatc accactccct aatctcaagt accccgggac 1080
acaaacactg cggaaggccg cagggtcctc tgcctaggaa aaccagagac ctttgttcac 1140
ttgtttatct gctgaccttc cctccactat tgtcctatga ccctgccaaa tccccccctg 1200
cgagaaacac ccaagaatga tcaataaaaa taaaaaataa aaaaaataaa aaaaaaaaaa 1260
aaaaaaaaa 1269
<210>14
<211>1269
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(978)..(1217)
<223>
<400>14
gctgagatgt ggggagcgcc tctccccgcc gccccgtctg ggatgtgagg agtgcctcta 60
cccggccgcc ccgtctgaga agtgaggaga ccctccgcct ggcaaccgcc ccgtctgaga 120
agtgaggagc ccctccaccc ggcagccacc ccgtctggga agtgaggagc gtctccgcca 180
gcagccaccc cgtccgggag ggaggtgggg ggggtcagcc cgcccggcca gccgccccgt 240
ccaggaggga ggtggggggg tcagccccgc ccggccagcc gccccgtccg ggagggaggt 300
gggggggtca gccccctgcc cggccagccg ccccgtcggg gaggtgaggg gcgcctttgc 360
ctggccgccc ctactgggaa gtgaggagcc cctctgcccg gccagccacc ccgtccggga 420
gggaggtggg ggggtcagcc ccccgccagg ccagccaccc cgtccgggag ggaggtgggg 480
gtgtcagccc cccgcccggc cagccgccct gtccgggagg tgaggggcgc ctctgcccgg 540
ccgcccctac tgggaagtga ggagcccctc tgcccggcca gccgccccgt ccgggaggga 600
ggtggggggg tcagccccca gcccggccag tcgccccgtc ggggaggtga ggggcgcctc 660
tgcccggctg cccctactgg gaagtgagga gcccctctgc ccagccgcca ccccgtctgg 720
gaggtgtgcc cagcagctca ttgagaacgg gccatgatga caatggcggt tttgtggaat 780
agaaaggggg gaaagttggg ggaaaagatt gagaggttgg atggttgccg tgtctgtgta 840
gaaagacgta gacatgggag acttctcatt ttgttctgta ctaagaaaaa ttattctgcc 900
ttgggatcct gttgatctgt gaccttaccc ccaaccctgt gctctctgaa acatgtgctg 960
tgtccaccca gggttaa atg gat gaa ggg cgg tgc aag atg tgc ttt gtt 1010
Met Asp Glu Gly Arg Cys Lys Met Cys Phe Val
1 5 10
aaa cag atg ctt gaa ggc agc atg ctc gtt aaa agt cat cac cac tcc 1058
Lys Gln Met Leu Glu Gly Ser Met Leu Val Lys Ser His His His Ser
15 20 25
cta atc tca agt acc ccg gga cac aaa cac tgc gga agg ccg cag ggt 1106
Leu Ile Ser Ser Thr Pro Gly His Lys His Cys Gly Arg Pro Gln Gly
30 35 40
cct ctg cct agg aaa acc aga gac ctt tgt tca ctt gtt tat ctg ctg 1154
Pro Leu Pro Arg Lys Thr Arg Asp Leu Cys Ser Leu Val Tyr Leu Leu
45 50 55
acc ttc cct cca cta ttg tcc tat gac cct gcc aaa tcc ccc cct gcg 1202
Thr Phe Pro Pro Leu Leu Ser Tyr Asp Pro Ala Lys Ser Pro Pro Ala
60 65 70 75
aga aac acc caa gaa tgatcaataa aaataaaaaa taaaaaaaat aaaaaaaaaa 1257
Arg Asn Thr Gln Glu
80
aaaaaaaaaa aa 1269
<210>15
<211>80
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>15
Met Asp Glu Gly Arg Cys Lys Met Cys Phe Val Lys Gln Met Leu Glu
1 5 10 15
Gly Ser Met Leu Val Lys Ser His His His Ser Leu Ile Ser Ser Thr
20 25 30
Pro Gly His Lys His Cys Gly Arg Pro Gln Gly Pro Leu Pro Arg Lys
35 40 45
Thr Arg Asp Leu Cys Ser Leu Val Tyr Leu Leu Thr Phe Pro Pro Leu
50 55 60
Leu Ser Tyr Asp Pro Ala Lys Ser Pro Pro Ala Arg Asn Thr Gln Glu
65 70 75 80
<210>16
<211>2471
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>16
gttcgtttag aacagaaact gttcatactt ggtgcgctgt ggactcttgt gataattaac 60
caagagtagc tctatttgtc caacctcaca cctaaagaag aaagaaaatg gcttgtgctg 120
agttttcttt tcatgtacca agtcttgaag agcttgctgg agttatgcag aaggggttaa 180
aagataactt tgctgatgtc caggtctctg tagttgattg ccctgatttg actaaggaac 240
cctttacctt tcctgtaaaa ggcatctgtg ggaaaactag aattgcagaa gttggaggtg 300
tgccttactt attgcctctt gtaaaccaaa aaaaagttta tgatctgaat aaaattgcaa 360
aagaaatcaa gctgcctgga gcctttattc ttggagcagg agcaggtcca tttcagactc 420
tcgggttcaa ttctgagttt atgccagtta ttcagacaga aagtgaacac aagcctcctg 480
gtaaatggaa gttactttgc ccatgtgaac cctgcagatg gagggtgcct actggagaaa 540
tacagtgaga aatgtcatga ttttcagtgt gcattactgg ctaatctttt tgccagtgaa 600
ggccaacctg gcaaggtaat tgaggtgaaa gccaaaagaa gaactggacc acttaacttt 660
gtgacttgta tgagagagac cctggaaaaa cattatggaa ataagcctat aggaatggga 720
ggtactttca taattcagaa gggaaaagtg aagtctcaca ttatgcctgc agaattttct 780
tcctgcccct tgaactctga tgaagaagtg aataaatggt tgcattttta tgaaatgaaa 840
gctcctttgg tttgtctacc agtttttgtc tccagagacc cagggtttga tttgcgactg 900
gagcacactc atttttttag tcgtcatgga gaaggtggac actaccatta tgacactact 960
ccagatatag tggaatatct tggatacttc ttacctgcag agtttctcta tcgcattgat 1020
caaccaaaag agacgcattc aattgggcga gattaaatca gctgatactt atttagaaaa 1080
agaaataatt aaggttaatt aattgattga cttattaatt aatactgata taaaaccaat 1140
agaaatgatc ccacaggcca ggcacaatgg ctcatgccta tatcccagca ctttgggagg 1200
ctgaggcagg aagcacactg gagcccagga gtttgagacc agcttgggca acatagcaag 1260
accctgtcta tttttttaaa aaagtaaaaa atagaaatta tctcactact taaatcccat 1320
ttttttcact tcatatgaaa gaacatattg atagtatatt ctatattatt tcatagatct 1380
gtctgaaaga gattgggaac aaaaatatct aattgagata ttctttaatt ttttacatag 1440
cagctttatt ttttttattc tgtagtatca gcgaaatcag tcatgtttat accttgaata 1500
taaatatcag gaatcatgca attatttcta ctatgtattt agtagtatct tatatttgta 1560
taacattatt acattttgca aattagtatc acaactgcta agtagatgtt tctgagtatt 1620
agaaaaatca gtgttattac ctgcaggata ttaaaaaaca tttgaaaaag agaaaaagaa 1680
aaatcagtgt ttagaaatgt tgatagttat tgaatctttg aattgaattt taaaaatcca 1740
ttctagtaat cagagtatac tttttttata gaacaaggtg gcaggtgggg agccctttac 1800
ccttctggtg aagttaaacc atagaagttt acaatttgcc tttcacaaac attagcagtc 1860
cgggcatggt ggctgaagcc tgtgatccca gcatgttggg aggccgagtg ggaggattgc 1920
ctgagcctag gagtttaaac cagcctgggc accatggtga gaccccatct ctattattta 1980
aaatgtttct taaataaaaa aaacacaaac attagcaact agttggctag ccataattcc 2040
tcccttgttt cgattgccag ttagaagcaa tttttatttc actggttatt tgcaagagaa 2100
ggggtaaaaa aacagcatca gagtgcaaca ctgaagtgag cttactttaa tttgtaagtg 2160
aagctgttgc tgaacatgac cactatcgtt gctgaaatat aaaattatac tcactgattg 2220
taatggttga aattttcctg aataatcatt agccaaagat caactctcta atggtgctaa 2280
tggcaatcta gctaatgtgc aaatttagga agtcttcagt actaagtaca taattttcaa 2340
ataatatttt ttaattgtct actttggatg ttagctatgt cttgtgagta taaattccaa 2400
ttttaagcac tattttgatg caaaaaaatg aataaaaaat ttaatttatg aaaaaaaaaa 2460
aaaaaaaaaa a 2471
<210>17
<211>2471
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(108)..(545)
<223>
<400>17
gttcgtttag aacagaaact gttcatactt ggtgcgctgt ggactcttgt gataattaac 60
caagagtagc tctatttgtc caacctcaca cctaaagaag aaagaaa atg gct tgt 116
Met Ala Cys
1
gct gag ttt tct ttt cat gta cca agt ctt gaa gag ctt gct gga gtt 164
Ala Glu Phe Ser Phe His Val Pro Ser Leu Glu Glu Leu Ala Gly Val
5 10 15
atg cag aag ggg tta aaa gat aac ttt gct gat gtc cag gtc tct gta 212
Met Gln Lys Gly Leu Lys Asp Asn Phe Ala Asp Val Gln Val Ser Val
20 25 30 35
gtt gat tgc cct gat ttg act aag gaa ccc ttt acc ttt cct gta aaa 260
Val Asp Cys Pro Asp Leu Thr Lys Glu Pro Phe Thr Phe Pro Val Lys
40 45 50
ggc atc tgt ggg aaa act aga att gca gaa gtt gga ggt gtg cct tac 308
Gly Ile Cys Gly Lys Thr Arg Ile Ala Glu Val Gly Gly Val Pro Tyr
55 60 65
tta ttg cct ctt gta aac caa aaa aaa gtt tat gat ctg aat aaa att 356
Leu Leu Pro Leu Val Asn Gln Lys Lys Val Tyr Asp Leu Asn Lys Ile
70 75 80
gca aaa gaa atc aag ctg cct gga gcc ttt att ctt gga gca gga gca 404
Ala Lys Glu Ile Lys Leu Pro Gly Ala Phe Ile Leu Gly Ala Gly Ala
85 90 95
ggt cca ttt cag act ctc ggg ttc aat tct gag ttt atg cca gtt att 452
Gly Pro Phe Gln Thr Leu Gly Phe Asn Ser Glu Phe Met Pro Val Ile
100 105 110 115
cag aca gaa agt gaa cac aag cct cct ggt aaa tgg aag tta ctt tgc 500
Gln Thr Glu Ser Glu His Lys Pro Pro Gly Lys Trp Lys Leu Leu Cys
120 125 130
cca tgt gaa ccc tgc aga tgg agg gtg cct act gga gaa ata cag 545
Pro Cys Glu Pro Cys Arg Trp Arg Val Pro Thr Gly Glu Ile Gln
135 140 145
tgagaaatgt catgattttc agtgtgcatt actggctaat ctttttgcca gtgaaggcca 605
acctggcaag gtaattgagg tgaaagccaa aagaagaact ggaccactta actttgtgac 665
ttgtatgaga gagaccctgg aaaaacatta tggaaataag cctataggaa tgggaggtac 725
tttcataatt cagaagggaa aagtgaagtc tcacattatg cctgcagaat tttcttcctg 785
ccccttgaac tctgatgaag aagtgaataa atggttgcat ttttatgaaa tgaaagctcc 845
tttggtttgt ctaccagttt ttgtctccag agacccaggg tttgatttgc gactggagca 905
cactcatttt tttagtcgtc atggagaagg tggacactac cattatgaca ctactccaga 965
tatagtggaa tatcttggat acttcttacc tgcagagttt ctctatcgca ttgatcaacc 1025
aaaagagacg cattcaattg ggcgagatta aatcagctga tacttattta gaaaaagaaa 1085
taattaaggt taattaattg attgacttat taattaatac tgatataaaa ccaatagaaa 1145
tgatcccaca ggccaggcac aatggctcat gcctatatcc cagcactttg ggaggctgag 1205
gcaggaagca cactggagcc caggagtttg agaccagctt gggcaacata gcaagaccct 1265
gtctattttt ttaaaaaagt aaaaaataga aattatctca ctacttaaat cccatttttt 1325
tcacttcata tgaaagaaca tattgatagt atattctata ttatttcata gatctgtctg 1385
aaagagattg ggaacaaaaa tatctaattg agatattctt taatttttta catagcagct 1445
ttattttttt tattctgtag tatcagcgaa atcagtcatg tttatacctt gaatataaat 1505
atcaggaatc atgcaattat ttctactatg tatttagtag tatcttatat ttgtataaca 1565
ttattacatt ttgcaaatta gtatcacaac tgctaagtag atgtttctga gtattagaaa 1625
aatcagtgtt attacctgca ggatattaaa aaacatttga aaaagagaaa aagaaaaatc 1685
agtgtttaga aatgttgata gttattgaat ctttgaattg aattttaaaa atccattcta 1745
gtaatcagag tatacttttt ttatagaaca aggtggcagg tggggagccc tttacccttc 1805
tggtgaagtt aaaccataga agtttacaat ttgcctttca caaacattag cagtccgggc 1865
atggtggctg aagcctgtga tcccagcatg ttgggaggcc gagtgggagg attgcctgag 1925
cctaggagtt taaaccagcc tgggcaccat ggtgagaccc catctctatt atttaaaatg 1985
tttcttaaat aaaaaaaaca caaacattag caactagttg gctagccata attcctccct 2045
tgtttcgatt gccagttaga agcaattttt atttcactgg ttatttgcaa gagaaggggt 2105
aaaaaaacag catcagagtg caacactgaa gtgagcttac tttaatttgt aagtgaagct 2165
gttgctgaac atgaccacta tcgttgctga aatataaaat tatactcact gattgtaatg 2225
gttgaaattt tcctgaataa tcattagcca aagatcaact ctctaatggt gctaatggca 2285
atctagctaa tgtgcaaatt taggaagtct tcagtactaa gtacataatt ttcaaataat 2345
attttttaat tgtctacttt ggatgttagc tatgtcttgt gagtataaat tccaatttta 2405
agcactattt tgatgcaaaa aaatgaataa aaaatttaat ttatgaaaaa aaaaaaaaaa 2465
aaaaaa 2471
<210>18
<211>146
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>18
Met Ala Cys Ala Glu Phc Ser Phe His Val Pro Ser Leu Glu Glu Leu
1 5 10 15
Ala Gly Val Met Gln Lys Gly Leu Lys Asp Asn Phe Ala Asp Val Gln
20 25 30
Val Ser Val Val Asp Cys Pro Asp Leu Thr Lys Glu Pro Phe Thr Phe
35 40 45
Pro Val Lys Gly Ile Cys Gly Lys Thr Arg Ile Ala Glu Val Gly Gly
50 55 60
Val Pro Tyr Leu Leu Pro Leu Val Asn Gln Lys Lys Val Tyr Asp Leu
65 70 75 80
Asn Lys Ile Ala Lys Glu Ile Lys Leu Pro Gly Ala Phe Ile Leu Gly
85 90 95
Ala Gly Ala Gly Pro Phe Gln Thr Leu Gly Phe Asn Ser Glu Phe Met
100 105 110
Pro Val Ile Gln Thr Glu Ser Glu His Lys Pro Pro Gly Lys Trp Lys
115 120 125
Leu Leu Cys Pro Cys Glu Pro Cys Arg Trp Arg Val Pro Thr Gly Glu
130 135 140
Ile Gln
145
<210>19
<211>1934
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>19
gcccttgtgg gggcgctgca ctggggatcc tcatggtctt cggcctggca cccgcccagg 60
ggctgagaca gtggctgccc tagagggacg cccgattctg cccatgtgct ccgctgtctc 120
ttgggggtct gagtagtggg gggttccctg ggtagaagag tcaagctggg gagcacgctg 180
gaggggaggc cttctctatc ctggtgccca gccagtgggc tggcctccca aggagggttg 240
actttgtgtg tgtctgggag gctgtgccag ccgggaggct gtgcctctgt cctctagggg 300
agctgggaaa acactggggc tggaagcatc agatgcataa gatagagtta ggtttaaaat 360
gattgcctgg taactgtgaa cagcagatgg gagggtggcc aagactggag gaggcagccc 420
cccttaggag cctgtgtcag ctgtgcaggg ggtggaggtt ggcctggtct agggggtaac 480
aaagcagacg ggaaagcagc acgggggtcc tcctggcaga ctgtggggaa tggttgcgct 540
agccacaggg actatttaag taaaattaaa aattgaatgg caatatttcc cttcctttta 600
aaactacctt gttcggttga tacttagtaa ctccgtggcc agttgaagcg ctggatgttt 660
cctagaacaa gaaccaccaa agcctgttcg cacagaaggg cgaccttgca gggactcgcc 720
gccgcgacct cagtgtggct tttacaggac tccccccgag catcagcagg gaccccggcg 780
gacgtgggcg ggcgcgcgtg agctcgggct gcccggccgg gcgtgcgggc ggggacatgg 840
taacctggtc cacggagggc ggccgccacc ctcgcgtagc tttcctgtgg ttttccagga 900
ctgtccggta cagcccgggc tccgcgtgcc ccgcccgctg gagcacacct gccaccgagg 960
cgcgcgtggg gccactgccg tggcggcggc tgccctcctc acactcggct ccgcgccgcc 1020
tccggccacc gtgcgctccc gcgtggggcg cctcggacgg gcccgggagg gctgggggct 1080
cttccccgga ggaagcggcc tccgcttcgc tggcgccgcc tttttagctt ggacttcagt 1140
cctccctcgg ggactcacct ccggagtaaa cggctcttca ttagcttgga gtggccgcag 1200
gtcccgtgac cagcacccgc gagaccctgt gcgacagctg ggctgatgct ccgtttctct 1260
gggaattggt gattttttac tgtgaagatg aaattaccct aatagcatga agatcgtggg 1320
tctgtgtgtc cgtgaagtga gtcccgtctg ccaggagctg acgaaccacg aatgcttctg 1380
cctgtgctgt gcattcccgg gcccgcagct cccggtcgag gggactccga tgtgaatttg 1440
ttgtgaattt gttgtgccac aatagcagtt ctggaatgaa gctaggaaac tcgagtgtgc 1500
tttttgttta atgagcgttc accaagctga gccggagcca tcctttcggt ggtagttggg 1560
gaaggctatg cagatttgct ctgttgtgac gcgttggtga cggtgcccag cctgcggttc 1620
cccacagcat cgcaggcgct cggaaatcct ctcccgtgcc tttagaggaa ggagaggatt 1680
cctcaggaat gcagggacgc ctgtttctgt cctcacacca cagaaaaaaa cctgagggaa 1740
aaatgagacg tgaggttcct gatttaagaa gcctgggttt ttttcatggt ggggggccat 1800
ttggaatgac agcctcactt ccttttgatg gctacatctg gatacttttc ctccctgtgt 1860
ttaatgtttc cctagggaga ggaaataaag cagaaggaac gtgttaaaaa aaaaaaaaaa 1920
aaaaaaaaaa aaaa 1934
<210>20
<211>1934
<212>DNA
<213〉homo sapiens (Homo sapicns)
<220>
<221>CDS
<222>(837)..(1124)
<223>
<400>20
gcccttgtgg gggcgctgca ctggggatcc tcatggtctt cggcctggca cccgcccagg 60
ggctgagaca gtggctgccc tagagggacg cccgattctg cccatgtgct ccgctgtctc 120
ttgggggtct gagtagtggg gggttccctg ggtagaagag tcaagctggg gagcacgctg 180
gaggggaggc cttctctatc ctggtgccca gccagtgggc tggcctccca aggagggttg 240
actttgtgtg tgtctgggag gctgtgccag ccgggaggct gtgcctctgt cctctagggg 300
agctgggaaa acactggggc tggaagcatc agatgcataa gatagagtta ggtttaaaat 360
gattgcctgg taactgtgaa cagcagatgg gagggtggcc aagactggag gaggcagccc 420
cccttaggag cctgtgtcag ctgtgcaggg ggtggaggtt ggcctggtct agggggtaac 480
aaagcagacg ggaaagcagc acgggggtcc tcctggcaga ctgtggggaa tggttgcgct 540
agccacaggg actatttaag taaaattaaa aattgaatgg caatatttcc cttcctttta 600
aaactacctt gttcggttga tacttagtaa ctccgtggcc agttgaagcg ctggatgttt 660
cctagaacaa gaaccaccaa agcctgttcg cacagaaggg cgaccttgca gggactcgcc 720
gccgcgacct cagtgtggct tttacaggac tccccccgag catcagcagg gaccccggcg 780
gacgtgggcg ggcgcgcgtg agctcgggct gcccggccgg gcgtgcgggc ggggac atg 839
Met
1
gta acc tgg tcc acg gag ggc ggc cgc cac cct cgc gta gct ttc ctg 887
Val Thr Trp Ser Thr Glu Gly Gly Arg His Pro Arg Val Ala Phe Leu
5 10 15
tgg ttt tcc agg act gtc cgg tac agc ccg ggc tcc gcg tgc ccc gcc 935
Trp Phe Ser Arg Thr Val Arg Tyr Ser Pro Gly Ser Ala Cys Pro Ala
20 25 30
cgc tgg agc aca cct gcc acc gag gcg cgc gtg ggg cca ctg ccg tgg 983
Arg Trp Ser Thr Pro Ala Thr Glu Ala Arg Val Gly Pro Leu Pro Trp
35 40 45
cgg cgg ctg ccc tcc tca cac tcg gct ccg cgc cgc ctc cgg cca ccg 1031
Arg Arg Leu Pro Ser Ser His Ser Ala Pro Arg Arg Leu Arg Pro Pro
50 55 60 65
tgc gct ccc gcg tgg ggc gcc tcg gac ggg ccc ggg agg gct ggg ggc 1079
Cys Ala Pro Ala Trp Gly Ala Ser Asp Gly Pro Gly Arg Ala Gly Gly
70 75 80
tct tcc ccg gag gaa gcg gcc tcc gct tcg ctg gcg ccg cct ttt 1124
Ser Ser Pro Glu Glu Ala Ala Ser Ala Ser Leu Ala Pro Pro Phe
85 90 95
tagcttggac ttcagtcctc cctcggggac tcacctccgg agtaaacggc tcttcattag 1184
cttggagtgg ccgcaggtcc cgtgaccagc acccgcgaga ccctgtgcga cagctgggct 1244
gatgctccgt ttctctggga attggtgatt ttttactgtg aagatgaaat taccctaata 1304
gcatgaagat cgtgggtctg tgtgtccgtg aagtgagtcc cgtctgccag gagctgacga 1364
accacgaatg cttctgcctg tgctgtgcat tcccgggccc gcagctcccg gtcgagggga 1424
ctccgatgtg aatttgttgt gaatttgttg tgccacaata gcagttctgg aatgaagcta 1484
ggaaactcga gtgtgctttt tgtttaatga gcgttcacca agctgagccg gagccatcct 1544
ttcggtggta gttggggaag gctatgcaga tttgctctgt tgtgacgcgt tggtgacggt 1604
gcccagcctg cggttcccca cagcatcgca ggcgctcgga aatcctctcc cgtgccttta 1664
gaggaaggag aggattcctc aggaatgcag ggacgcctgt ttctgtcctc acaccacaga 1724
aaaaaacctg agggaaaaat gagacgtgag gttcctgatt taagaagcct gggttttttt 1784
catggtgggg ggccatttgg aatgacagcc tcacttcctt ttgatggcta catctggata 1844
cttttcctcc ctgtgtttaa tgtttcccta gggagaggaa ataaagcaga aggaacgtgt 1904
taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1934
<210>2l
<211>96
<212>PRT
<213〉homo sapiens (Homo sapicns)
<400>21
Met Val Thr Trp Ser Thr Glu Gly Gly Arg His Pro Arg Val Ala Phe
1 5 10 15
Leu Trp Phe Ser Arg Thr Val Arg Tyr Ser Pro Gly Ser Ala Cys Pro
20 25 30
Ala Arg Trp Ser Thr Pro Ala Thr Glu Ala Arg Val Gly Pro Leu Pro
35 40 45
Trp Arg Arg Leu Pro Ser Ser His Ser Ala Pro Arg Arg Leu Arg Pro
50 55 60
Pro Cys Ala Pro Ala Trp Gly Ala Ser Asp Gly Pro Gly Arg Ala Gly
65 70 75 80
Gly Ser Ser Pro Glu Glu Ala Ala Ser Ala Ser Leu Ala Pro Pro Phe
85 90 95
<210>22
<211>1915
<212>DNA
<213〉homo sapiens (Homo sapicns)
<400>22
gcacagggtg gcagcaggga aaaaaattag aaaagggtga aagattggga cttaacactt 60
cagggaagtc agctgccggg gagaaacttg ctcctaaatg aacacataag tttagatcgc 120
aatgaggagt agcagggtag ctggttgcta gagttacggt ggggatcaga aactcttcca 180
aacattttag cactgaggct ggggtagctt ttggcttttc ccaggtctca ggaggtggcc 240
tgagtcagca cacatcttcc cactcggtag acaggctggc ctctccctca ctttgagact 300
ttggcaactc ctgggccaca cggcctgcct ctttgattac taatgattgt cagtgactca 360
gagcttcctg ggacttcggg tacccacccg ctgttctcca tgcaaacaaa gcgccaggga 420
aatgacccac agggatcgca gctgcaggga gggccaggga ggttgggggt gggagtgaat 480
gctaaaagca gatcgtccag tgcccttttc agtgctaccg gcctctcacc aagcagtcct 540
ccatgtgagc aaccccgaga caaaaatgct aagtgggatc aagagagcag cactcggaga 600
gggtgtttgc cagtctgagt gtcccgcggt gcccgccaac ccgcttcctg actgacctga 660
gcaaggtctt actaagcagt cccatctctg tgggaggcat gcaacgcgtg cagggagttc 720
aggtgccggt cggcgtagcc aggcctggag gccccccagg caggaggccg cccaaaggcg 780
gggccggcgt ctcgcagact aggggctggg ggcggccaca gacggcctcg aaaccacagc 840
ccttacccca atcccacgag ccccgccaac gaaccacagg tgctgggctt tagagaacat 900
gggaaggcgg ccccagacct ggcgggaacg cctttccctc agagccaggc cccggccccg 960
tctgggaagc tcatcttgcg aagctgaggg agctcagggc aaaggccagg ctagcgcgga 1020
ccggaagggg ccgaggctgc acgggcctct gccagaacgc tcaggacatc ccggcctggg 1080
tttacaacgc tgttaggaaa attaaccaat gaataaagca acgttcagtg cgcagggagt 1140
gaaattcaat gcccaccgct aggctcctcg ctgcctctca ctcaagaggc ccaaactcag 1200
acggcgtcag ggacccggac ccagcagccg tttcacgcca atagataggg cgcatgcgca 1260
gaaatcctcc tcggctctct agcgtgagct ttcccaaggg gccacgccca gcttgccttc 1320
tgattggtcc agctggtggg gttgtcttcc gccatctttg atcagggcac taaggatgct 1380
ccccacggcc ttcacagtga cggcggagac cctgccccgc cagctgctca gtacgtgccg 1440
cgtagcccgt gcgagccaag tgtgagtccg ggcgagcgcc tgcggagcta gcactgggcc 1500
cagaatgaga gggaggcgga ggagcagcga tcacgtggtt ttagggactg tctaataatt 1560
ccacgccagc attgccggtg tttcaggggg tgggaaccgc tgcgttcccc atcaactttt 1620
ctcccaccca ccaccctccc caacctacaa gcccagctca gcttgaggta actgctgacc 1680
ggactgtcct atacagccct acaagacaga cggcgcctag ggctgaaagc gggggcctcc 1740
gtagggagcc agcgggggcc tcaatagtta ctcattttct ctacctttga tgaaaataag 1800
agctaattct taataaggct accgggtatc acgcaaaaac cctgtgctta ctattacact 1860
ttgggttgtt gcaaagatta aaggaaataa gccgtgcaaa aaaaaaaaaa aaaaa 1915
<210>23
<211>1915
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(343)..(615)
<223>
<400>23
gcacagggtg gcagcaggga aaaaaattag aaaagggtga aagattggga cttaacactt 60
cagggaagtc agctgccggg gagaaacttg ctcctaaatg aacacataag tttagatcgc 120
aatgaggagt agcagggtag ctggttgcta gagttacggt ggggatcaga aactcttcca 180
aacattttag cactgaggct ggggtagctt ttggcttttc ccaggtctca ggaggtggcc 240
tgagtcagca cacatcttcc cactcggtag acaggctggc ctctccctca ctttgagact 300
ttggcaactc ctgggccaca cggcctgcct ctttgattac ta atg att gtc agt 354
Met Ile Val Ser
1
gac tca gag ctt cct ggg act tcg ggt acc cac ccg ctg ttc tcc atg 402
Asp Ser Glu Leu Pro Gly Thr Ser Gly Thr His Pro Leu Phe Ser Met
5 10 15 20
caa aca aag cgc cag gga aat gac cca cag gga tcg cag ctg cag gga 450
Gln Thr Lys Arg Gln Gly Asn Asp Pro Gln Gly Ser Gln Leu Gln Gly
25 30 35
ggg cca ggg agg ttg ggg gtg gga gtg aat gct aaa agc aga tcg tcc 498
Gly Pro Gly Arg Leu Gly Val Gly Val Asn Ala Lys Ser Arg Ser Ser
40 45 50
agt gcc ctt ttc agt gct acc ggc ctc tca cca agc agt cct cca tgt 546
Ser Ala Leu Phe Ser Ala Thr Gly Leu Ser Pro Ser Ser Pro Pro Cys
55 60 65
gag caa ccc cga gac aaa aat gct aag tgg gat caa gag agc agc act 594
Glu Gln Pro Arg Asp Lys Asn Ala Lys Trp Asp Gln Glu Ser Ser Thr
70 75 80
cgg aga ggg tgt ttg cca gtc tgagtgtccc gcggtgcccg ccaacccgct 645
Arg Arg Gly Cys Leu Pro Val
85 90
tcctgactga cctgagcaag gtcttactaa gcagtcccat ctctgtggga ggcatgcaac 705
gcgtgcaggg agttcaggtg ccggtcggcg tagccaggcc tggaggcccc ccaggcagga 765
ggccgcccaa aggcggggcc ggcgtctcgc agactagggg ctgggggcgg ccacagacgg 825
cctcgaaacc acagccctta ccccaatccc acgagccccg ccaacgaacc acaggtgctg 885
ggctttagag aacatgggaa ggcggcccca gacctggcgg gaacgccttt ccctcagagc 945
caggccccgg ccccgtctgg gaagctcatc ttgcgaagct gagggagctc agggcaaagg 1005
ccaggctagc gcggaccgga aggggccgag gctgcacggg cctctgccag aacgctcagg 1065
acatcccggc ctgggtttac aacgctgtta ggaaaattaa ccaatgaata aagcaacgtt 1125
cagtgcgcag ggagtgaaat tcaatgccca ccgctaggct cctcgctgcc tctcactcaa 1185
gaggcccaaa ctcagacggc gtcagggacc cggacccagc agccgtttca cgccaataga 1245
tagggcgcat gcgcagaaat cctcctcggc tctctagcgt gagctttccc aaggggccac 1305
gcccagcttg ccttctgatt ggtccagctg gtggggttgt cttccgccat ctttgatcag 1365
ggcactaagg atgctcccca cggccttcac agtgacggcg gagaccctgc cccgccagct 1425
gctcagtacg tgccgcgtag cccgtgcgag ccaagtgtga gtccgggcga gcgcctgcgg 1485
agctagcact gggcccagaa tgagagggag gcggaggagc agcgatcacg tggttttagg 1545
gactgtctaa taattccacg ccagcattgc cggtgtttca gggggtggga accgctgcgt 1605
tccccatcaa cttttctccc acccaccacc ctccccaacc tacaagccca gctcagcttg 1665
aggtaactgc tgaccggact gtcctataca gccctacaag acagacggcg cctagggctg 1725
aaagcggggg cctccgtagg gagccagcgg gggcctcaat agttactcat tttctctacc 1785
tttgatgaaa ataagagcta attcttaata aggctaccgg gtatcacgca aaaaccctgt 1845
gcttactatt acactttggg ttgttgcaaa gattaaagga aataagccgt gcaaaaaaaa 1905
aaaaaaaaaa 1915
<210>24
<211>91
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>24
Met Ile Val Ser Asp Ser Glu Leu Pro Gly Thr Ser Gly Thr His Pro
1 5 10 15
Leu Phe Ser Met Gln Thr Lys Arg Gln Gly Asn Asp Pro Gln Gly Ser
20 25 30
Gln Leu Gln Gly Gly Pro Gly Arg Leu Gly Val Gly Val Asn Ala Lys
35 40 45
Ser Arg Ser Ser Ser Ala Leu Phe Ser Ala Thr Gly Leu Ser Pro Ser
50 55 60
Ser Pro Pro Cys Glu Gln Pro Arg Asp Lys Asn Ala Lys Trp Asp Gln
65 70 75 80
Glu Ser Ser Thr Arg Arg Gly Cys Leu Pro Val
85 90
<210>25
<211>1575
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>25
gcacacaggc agtgggtacg gcagactggg gttggtggca tggcccagga ccccacgctg 60
actagtgagc ccctccgcag gcatagcgtg gcgtcagccc agctgcagga gaagctggca 120
ggagcccagc gccagctggg gcagctccgg gctcaggaag ccggcctgca gcaggagcag 180
aggaaggcag acacccacaa gaagatgact gagttctaga gaccccacaa gcactatgga 240
cgaagcgtgg gaccccagca cgggctgccc tcaggaagac cagtgttgcc cgaggagggg 300
ccggcctgct ggcctggggc gtgcggacac tgctgagtgg agacagagct gcggggtccc 360
atctggacac ttacttgccc acctgccagt gtcttgggca tttccttggc aaggacatta 420
aagtgatttc atcacagtgt cattcagtgg agatcagctg gctgcagggt ctctggggag 480
agaaggggcc tcggcttcac aggatggggc tgccagtgtc ctgggcccct cctgccctct 540
gggttctagg gtgctgcgcc ctgctcctct cgctgtgggc gctgtgcaca gcctgccgca 600
ggcccgagga cgctgtagcc cccaggaaga gggcgcggag gcagcgggcg aggctgcagg 660
gcagtgcgac ggcggcggaa gcgcaagtcg gacaccagac tgcacgagct gcaccggggc 720
ccgcgcagca gcagggccct gcggcctgcc agcatggatc tcctgcgccc acactggctg 780
gaggtgtcca gggacatcac cggaccgcag gcagccccct ctgccttccc acaccaggag 840
ctgccccggg ctctgccggc agctgcagcc accgcagggt gcgctggcct cgaggccacc 900
tattccaacg tggggctggc ggcccttccc ggggtcagcc tggcggccag ccctgtggtg 960
gccgagtatg cccgcgtcca gaagcgcaaa gggacccatc gcagtcccca agagccacag 1020
caggggaaga ctgaggtgac cccggccgct caggtggacg tcctgtactc cagggtctgc 1080
aagcctaaaa ggagggaccc aggacccacc acagacccgc tggaccccaa gggccaggga 1140
gcgattctgg ccctggcggg tgacctggcc taccagaccc tcccgctcag ggccctggat 1200
gtggacagcg gccccctgga aaacgtgtat gagagcatcc gggagctggg ggaccctgct 1260
ggcaggagca gcacgtgcgg ggctgggacg ccccctgctt ccagctgccc cagcctaggg 1320
aggggctgga gacccctccc tgcctccctg ccctgaacac tcaaggacct gtgctccttc 1380
ctccagagtg aggcccgtcc cccgccccgc cccgcctcac agctgacagc gccagtccca 1440
ggtccccggg ccgccagccc gtgaggtccg tgaggtcctg gccgctctga cagccgcggc 1500
ctccccgggc tccagagaag gcccgcgtct aaataaagcg ccagcgcagg atgaaaaaaa 1560
aaaaaaaaaa aaaaa 1575
<210>26
<211>1575
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(754)..(1353)
<223>
<400>26
gcacacaggc agtgggtacg gcagactggg gttggtggca tggcccagga ccccacgctg 60
actagtgagc ccctccgcag gcatagcgtg gcgtcagccc agctgcagga gaagctggca 120
ggagcccagc gccagctggg gcagctccgg gctcaggaag ccggcctgca gcaggagcag 180
aggaaggcag acacccacaa gaagatgact gagttctaga gaccccacaa gcactatgga 240
cgaagcgtgg gaccccagca cgggctgccc tcaggaagac cagtgttgcc cgaggagggg 300
ccggcctgct ggcctggggc gtgcggacac tgctgagtgg agacagagct gcggggtccc 360
atctggacac ttacttgccc acctgccagt gtcttgggca tttccttggc aaggacatta 420
aagtgatttc atcacagtgt cattcagtgg agatcagctg gctgcagggt ctctggggag 480
agaaggggcc tcggcttcac aggatggggc tgccagtgtc ctgggcccct cctgccctct 540
gggttctagg gtgctgcgcc ctgctcctct cgctgtgggc gctgtgcaca gcctgccgca 600
ggcccgagga cgctgtagcc cccaggaaga gggcgcggag gcagcgggcg aggctgcagg 660
gcagtgcgac ggcggcggaa gcgcaagtcg gacaccagac tgcacgagct gcaccggggc 720
ccgcgcagca gcagggccct gcggcctgcc agc atg gat ctc ctg cgc cca cac 774
Met Asp Leu Leu Arg Pro His
1 5
tgg ctg gag gtg tcc agg gac atc acc gga ccg cag gca gcc ccc tct 822
Trp Leu Glu Val Ser Arg Asp Ile Thr Gly Pro Gln Ala Ala Pro Ser
10 15 20
gcc ttc cca cac cag gag ctg ccc cgg gct ctg ccg gca gct gca gcc 870
Ala Phe Pro His Gln Glu Leu Pro Arg Ala Leu Pro Ala Ala Ala Ala
25 30 35
acc gca ggg tgc gct ggc ctc gag gcc acc tat tcc aac gtg ggg ctg 918
Thr Ala Gly Cys Ala Gly Leu Glu Ala Thr Tyr Ser Asn Val Gly Leu
40 45 50 55
gcg gcc ctt ccc ggg gtc agc ctg gcg gcc agc cct gtg gtg gcc gag 966
Ala Ala Leu Pro Gly Val Ser Leu Ala Ala Ser Pro Val Val Ala Glu
60 65 70
tat gcc cgc gtc cag aag cgc aaa ggg acc cat cgc agt ccc caa gag 1014
Tyr Ala Arg Val Gln Lys Arg Lys Gly Thr His Arg Ser Pro Gln Glu
75 80 85
cca cag cag ggg aag act gag gtg acc ccg gcc gct cag gtg gac gtc 1062
Pro Gln Gln Gly Lys Thr Glu Val Thr Pro Ala Ala Gln Val Asp Val
90 95 100
ctg tac tcc agg gtc tgc aag cct aaa agg agg gac cca gga ccc acc 1110
Leu Tyr Ser Arg Val Cys Lys Pro Lys Arg Arg Asp Pro Gly Pro Thr
105 110 115
aca gac ccg ctg gac ccc aag ggc cag gga gcg att ctg gcc ctg gcg 1158
Thr Asp Pro Leu Asp Pro Lys Gly Gln Gly Ala Ile Leu Ala Leu Ala
120 125 130 135
ggt gac ctg gcc tac cag acc ctc ccg ctc agg gcc ctg gat gtg gac 1206
Gly Asp Leu Ala Tyr Gln Thr Leu Pro Leu Arg Ala Leu Asp Val Asp
140 145 150
agc ggc ccc ctg gaa aac gtg tat gag agc atc cgg gag ctg ggg gac 1254
Ser Gly Pro Leu Glu Asn Val Tyr Glu Ser Ile Arg Glu Leu Gly Asp
155 160 165
cct gct ggc agg agc agc acg tgc ggg gct ggg acg ccc cct gct tcc 1302
Pro Ala Gly Arg Ser Ser Thr Cys Gly Ala Gly Thr Pro Pro Ala Ser
170 175 180
agc tgc ccc agc cta ggg agg ggc tgg aga ccc ctc cct gcc tcc ctg 1350
Ser Cys Pro Ser Leu Gly Arg Gly Trp Arg Pro Leu Pro Ala Ser Leu
185 190 195
ccc tgaacactca aggacctgtg ctccttcctc cagagtgagg cccgtccccc 1403
Pro
200
gccccgcccc gcctcacagc tgacagcgcc agtcccaggt ccccgggccg ccagcccgtg 1463
aggtccgtga ggtcctggcc gctctgacag ccgcggcctc cccgggctcc agagaaggcc 1523
cgcgtctaaa taaagcgcca gcgcaggatg aaaaaaaaaa aaaaaaaaaa aa 1575
<210>27
<211>200
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>27
Met Asp Leu Leu Arg Pro His Trp Leu Glu Val Ser Arg Asp Ile Thr
1 5 10 15
Gly Pro Gln Ala Ala Pro Ser Ala Phe Pro His Gln Glu Leu Pro Arg
20 25 30
Ala Leu Pro Ala Ala Ala Ala Thr Ala Gly Cys Ala Gly Leu Glu Ala
35 40 45
Thr Tyr Ser Asn Val Gly Leu Ala Ala Leu Pro Gly Val Ser Leu Ala
50 55 60
Ala Ser Pro Val Val Ala Glu Tyr Ala Arg Val Gln Lys Arg Lys Gly
65 70 75 80
Thr His Arg Ser Pro Gln Glu Pro Gln Gln Gly Lys Thr Glu Val Thr
85 90 95
Pro Ala Ala Gln Val Asp Val Leu Tyr Ser Arg Val Cys Lys Pro Lys
100 105 110
Arg Arg Asp Pro Gly Pro Thr Thr Asp Pro Leu Asp Pro Lys Gly Gln
115 120 125
Gly Ala Ile Leu Ala Leu Ala Gly Asp Leu Ala Tyr Gln Thr Leu Pro
130 135 140
Leu Arg Ala Leu Asp Val Asp Ser Gly Pro Leu Glu Asn Val Tyr Glu
145 150 155 160
Ser Ile Arg Glu Leu Gly Asp Pro Ala Gly Arg Ser Ser Thr Cys Gly
165 170 175
Ala Gly Thr Pro Pro Ala Ser Ser Cys Pro Ser Leu Gly Arg Gly Trp
180 185 190
Arg Pro Leu Pro Ala Ser Leu Pro
195 200
<210>28
<211>1800
<212>DNA
<213〉homo sapiens (Homo sapiens)
<400>28
ggaatcctcg ggccgccttg tgaggggggt gttgctgtca tctccagggc taggtgacag 60
cttgttgcag gccccacagc cagggcctca gggatgggga tctggccatg tctgttttaa 120
gccgaacagg ctgttcttcc atgtggagtc tccctccgtg atcccctggg gtcaagtcct 180
gacggctggg ttcgctccct cagtgactcc gacagctctt gtctcagagg tgctggggtg 240
aaggctgtgt ggggccaggc ccctggggtc agacttcttg ccccttctgc ccgggggacg 300
ctctgggctc agggtggggc cagacagcta catcacaggg acacttgtta tgggtcgggg 360
tccagggagc acccagccag tggagtgttg gaggccactg agagggacat gactgcggtg 420
tccgagcagc gtcgtggcca ccctgtgtgc ccactccctg gtggacagac ctacgtcacc 480
tggggattgc agagcaaaag gagtgctgtt ggggtgtgtg aggtttgggg ttctgtttct 540
tgctgtgaat tgggtcatga cctgtgtcgg agcgttctct ggcctccact tgcccctgca 600
gaccagggtg tctggagggt tgtcgggggc ttgatggagg aattgctgac cacaccatgc 660
aggaagaggt gggtcccagg gacacaggaa ctttggggtc agccggttcc agattcagtt 720
gcagaggagc atcccccttg cttagccagg gaggggaagc cctcttctca cacccggcct 780
gggtatgttg tgggagagtc atttcccagg aggaaatggg gactgttacc gggggagggg 840
gtgctgtccc caggcagcag caagcgtccc ttacgcttgg gctgtccgag accctgctgc 900
tctaggcggt tgtgatgggg gagccccaca gggcacagag agccagctga aggcggcagc 960
tctgggtggg atcacatgga aaggggaggc gagccgagcc tccctcttgc caatcttggt 1020
ctccccacag ggtcatagct cctgggcccc tcacttgtca gaccctccat tcaccaaatt 1080
aaggataagt ttcacgattt cgcactccag cctggtgacg aagtgagact ccatctcaaa 1140
aaataaataa ataaataaaa ataaacctca ctgtttgttt taacaataac aagcgtggta 1200
agggtggtgt cagtccataa gttagatagg cagttgctag gcagatgtcg ttgcagatgt 1260
gtgtgtgtgt gtgtgttgtg tttaaggttg tggtggcctt tgtgcaggtt gtaaattttt 1320
gtagtctttt gatagttctt gttatcaacc atatgtacat gagaaccctt cccccttcat 1380
ggcgtttcct ggctccgttt gccagggttt taatacgagt gactccattt tgaatctgac 1440
aactttcaca tttccccttt tgatcaaggt ctttctcaaa agcatcactg atcaatcatc 1500
ctgtatttag gttttgatat ctcttggtgc cagtctgccc tcatgttgaa gggagtcact 1560
ggtaactagg aattagcatc aaaacccttt agccacattt gaacaacaaa agaggtttgg 1620
agggagtgga tctcaggcta agtctacctg gagtccactg ttaagttcaa ttctgtctgt 1680
tccataggca ctgtctatca tctcaaagtg ctgggagagc attattctct taggagttgc 1740
atttctatgc aaatttgaca agaaacaggt ataaagttct taaaaaaaaa aaaaaaaaaa 1800
<210>29
<211>1800
<212>DNA
<213〉homo sapiens (Homo sapiens)
<220>
<221>CDS
<222>(409)..(912)
<223>
<400>29
ggaatcctcg ggccgccttg tgaggggggt gttgctgtca tctccagggc taggtgacag 60
cttgttgcag gccccacagc cagggcctca gggatgggga tctggccatg tctgttttaa 120
gccgaacagg ctgttcttcc atgtggagtc tccctccgtg atcccctggg gtcaagtcct 180
gacggctggg ttcgctccct cagtgactcc gacagctctt gtctcagagg tgctggggtg 240
aaggctgtgt ggggccaggc ccctggggtc agacttcttg ccccttctgc ccgggggacg 300
ctctgggctc agggtggggc cagacagcta catcacaggg acacttgtta tgggtcgggg 360
tccagggagc acccagccag tggagtgttg gaggccactg agagggac atg act gcg 417
Met Thr Ala
1
gtg tcc gag cag cgt cgt ggc cac cct gtg tgc cca ctc cct ggt gga 465
Val Ser Glu Gln Arg Arg Gly His Pro Val Cys Pro Leu Pro Gly Gly
5 10 15
cag acc tac gtc acc tgg gga ttg cag agc aaa agg agt gct gtt ggg 513
Gln Thr Tyr Val Thr Trp Gly Leu Gln Ser Lys Arg Ser Ala Val Gly
20 25 30 35
gtg tgt gag gtt tgg ggt tct gtt tct tgc tgt gaa ttg ggt cat gac 561
Val Cys Glu Val Trp Gly Ser Val Ser Cys Cys Glu Leu Gly His Asp
40 45 50
ctg tgt cgg agc gtt ctc tgg cct cca ctt gcc cct gca gac cag ggt 609
Leu Cys Arg Ser Val Leu Trp Pro Pro Leu Ala Pro Ala Asp Gln Gly
55 60 65
gtc tgg agg gtt gtc ggg ggc ttg atg gag gaa ttg ctg acc aca cca 657
Val Trp Arg Val Val Gly Gly Leu Met Glu Glu Leu Leu Thr Thr Pro
70 75 80
tgc agg aag agg tgg gtc cca ggg aca cag gaa ctt tgg ggt cag ccg 705
Cys Arg Lys Arg Trp Val Pro Gly Thr Gln Glu Leu Trp Gly Gln Pro
85 90 95
gtt cca gat tca gtt gca gag gag cat ccc cct tgc tta gcc agg gag 753
Val Pro Asp Ser Val Ala Glu Glu His Pro Pro Cys Leu Ala Arg Glu
100 105 110 115
ggg aag ccc tct tct cac acc cgg cct ggg tat gtt gtg gga gag tca 801
Gly Lys Pro Ser Ser His Thr Arg Pro Gly Tyr Val Val Gly Glu Ser
120 125 130
ttt ccc agg agg aaa tgg gga ctg tta ccg ggg gag ggg gtg ctg tcc 849
Phe Pro Arg Arg Lys Trp Gly Leu Leu Pro Gly Glu Gly Val Leu Ser
135 140 145
cca ggc agc agc aag cgt ccc tta cgc ttg ggc tgt ccg aga ccc tgc 897
Pro Gly Ser Ser Lys Arg Pro Leu Arg Leu Gly Cys Pro Arg Pro Cys
150 155 160
tgc tct agg cgg ttg tgatggggga gccccacagg gcacagagag ccagctgaag 952
Cys Ser Arg Arg Leu
165
gcggcagctc tgggtgggat cacatggaaa ggggaggcga gccgagcctc cctcttgcca 1012
atcttggtct ccccacaggg tcatagctcc tgggcccctc acttgtcaga ccctccattc 1072
accaaattaa ggataagttt cacgatttcg cactccagcc tggtgacgaa gtgagactcc 1132
atctcaaaaa ataaataaat aaataaaaat aaacctcact gtttgtttta acaataacaa 1192
gcgtggtaag ggtggtgtca gtccataagt tagataggca gttgctaggc agatgtcgtt 1252
gcagatgtgt gtgtgtgtgt gtgttgtgtt taaggttgtg gtggcctttg tgcaggttgt 1312
aaatttttgt agtcttttga tagttcttgt tatcaaccat atgtacatga gaacccttcc 1372
cccttcatgg cgtttcctgg ctccgtttgc cagggtttta atacgagtga ctccattttg 1432
aatctgacaa ctttcacatt tccccttttg atcaaggtct ttctcaaaag catcactgat 1492
caatcatcct gtatttaggt tttgatatct cttggtgcca gtctgccctc atgttgaagg 1552
gagtcactgg taactaggaa ttagcatcaa aaccctttag ccacatttga acaacaaaag 1612
aggtttggag ggagtggatc tcaggctaag tctacctgga gtccactgtt aagttcaatt 1672
ctgtctgttc cataggcact gtctatcatc tcaaagtgct gggagagcat tattctctta 1732
ggagttgcat ttctatgcaa atttgacaag aaacaggtat aaagttctta aaaaaaaaaa 1792
aaaaaaaa 1800
<210>30
<211>168
<212>PRT
<213〉homo sapiens (Homo sapiens)
<400>30
Met Thr Ala Val Ser Glu Gln Arg Arg Gly His Pro Val Cys Pro Leu
1 5 10 15
Pro Gly Gly Gln Thr Tyr Val Thr Trp Gly Leu Gln Ser Lys Arg Ser
20 25 30
Ala Val Gly Val Cys Glu Val Trp Gly Ser Val Ser Cys Cys Glu Leu
35 40 45
Gly His Asp Leu Cys Arg Ser Val Leu Trp Pro Pro Leu Ala Pro Ala
50 55 60
Asp Gln Gly Val Trp Arg Val Val Gly Gly Leu Met Glu Glu Leu Leu
65 70 75 80
Thr Thr Pro Cys Arg Lys Arg Trp Val Pro Gly Thr Gln Glu Leu Trp
85 90 95
Gly Gln Pro Val Pro Asp Ser Val Ala Glu Glu His Pro Pro Cys Leu
100 105 110
Ala Arg Glu Gly Lys Pro Ser Ser His Thr Arg Pro Gly Tyr Val Val
115 120 125
Gly Glu Ser Phe Pro Arg Arg Lys Trp Gly Leu Leu Pro Gly Glu Gly
130 135 140
Val Leu Ser Pro Gly Ser Ser Lys Arg Pro Leu Arg Leu Gly Cys Pro
145 150 155 160
Arg Pro Cys Cys Ser Arg Arg Leu
165
<210>31
<211>20
<212>DNA
<213〉primer
<400>31
ggccagc tcatggtatc tcc 20
<210>32
<211>21
<212>DNA
<213〉primer
<400>32
atgaaccctc tgactccgtc c 21
<210>33
<211>20
<212>DNA
<213〉primer
<400>33
cagaaacctg gggtaagcaa 20
<210>34
<211>22
<212>DNA
<213〉primer
<400>34
tacaaatcac gggaaaagta cg 22
<210>35
<211>22
<212>DNA
<213〉primer
<400>35
gatgatgctg ccaagttatt ca 22
<210>36
<211>22
<212>DNA
<213〉primer
<400>36
atccccatag acatccttac cg 22
<210>37
<211>21
<212>DNA
<213〉primer
<400>37
ctggagggag atcacaaaac a 21
<210>38
<211>22
<212>DNA
<213〉primer
<400>38
atgaagcact ggtgggtgat ac 22
<210>39
<211>18
<212>DNA
<213〉primer
<400>39
cccgtctggg atgtgagg 18
<210>40
<211>22
<212>DNA
<213〉primer
<400>40
gctctttgtg ggttcttact ag 22
<210>41
<211>20
<212>DNA
<213〉primer
<400>41
ttcatacttg gtgcgctgtg 20
<210>42
<211>19
<212>DNA
<213〉primer
<400>42
gagggaacaa agctaacgg 19
<210>43
<211>19
<212>DNA
<213〉primer
<400>43
gattctgccc atgtgctcc 19
<210>44
<211>22
<212>DNA
<213〉primer
<400>44
tcctttattt cgtcttcctt gc 22
<210>45
<211>22
<212>DNA
<213〉primer
<400>45
tagctggttg ctagagttac gg 22
<210>46
<211>20
<212>DNA
<213〉primer
<400>46
tttcctttat tcggcacgtt 20
<210>47
<211>18
<212>DNA
<213〉primer
<400>47
cggcagactg gggttggt 18
<210>48
<211>18
<212>DNA
<213〉primer
<400>48
tttatttcgc ggtcgcgt 18
<210>49
<211>20
<212>DNA
<213〉primer
<400>49
gggtgttgct gtcatctcca 20
<210>50
<211>20
<212>DNA
<213〉primer
<400>50
ttctccaaac ctccctcacc 20
Claims (9)
1. isolating people's albumen with promotion 3T3 cell transformation function is characterized in that it has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9,12,15,18,21,24,27,30.
2. albumen as claimed in claim 1 is characterized in that, this polypeptide has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9,12,15,27,30.
3. isolating polynucleotide is characterized in that, it is selected from down group:
(a) the proteic according to claim 1 polynucleotide of coding;
(b) with polynucleotide (a) complementary polynucleotide.
4. polynucleotide as claimed in claim 3 is characterized in that, the albumen of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:3,6,9,12,15,27,30.
5. polynucleotide as claimed in claim 3 is characterized in that, the sequence of these polynucleotide is selected from down group:
SEQ ID NO:2,5,8,11,14,17,20,23,26,29 coding region sequence or full length sequence.
6. a carrier is characterized in that, it contains the described polynucleotide of claim 3.
7. a genetically engineered host cell is characterized in that, it is a kind of host cell that is selected from down group:
(a) host cell that transforms or transduce with the described carrier of claim 6;
(b) host cell that transforms or transduce with the described polynucleotide of claim 3.
8. a claim 1 is described has a proteic preparation method of people who promotes 3T3 cell transformation function, it is characterized in that this method comprises:
(a) being fit to express under the proteic condition of people with promotion 3T3 cell transformation function, cultivate the described host cell of claim 7;
(b) from culture, isolate people's albumen with promotion 3T3 cell transformation function.
9. an energy and claim 1 are described has the people's protein-specific bonded antibody that promotes 3T3 cell transformation function.
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB021369984A CN1231497C (en) | 2002-09-16 | 2002-09-16 | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof |
| PCT/CN2003/000636 WO2004033493A1 (en) | 2002-08-07 | 2003-08-07 | Human protein for promoting transformation of 3t3 cell and its coding sequence |
| AU2003255095A AU2003255095A1 (en) | 2002-08-07 | 2003-08-07 | Human protein for promoting transformation of 3t3 cell and its coding sequence |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB021369984A CN1231497C (en) | 2002-09-16 | 2002-09-16 | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1483738A CN1483738A (en) | 2004-03-24 |
| CN1231497C true CN1231497C (en) | 2005-12-14 |
Family
ID=34146795
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB021369984A Expired - Fee Related CN1231497C (en) | 2002-08-07 | 2002-09-16 | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN1231497C (en) |
-
2002
- 2002-09-16 CN CNB021369984A patent/CN1231497C/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| CN1483738A (en) | 2004-03-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1170850C (en) | Human angiopoietin-like protein and coding sequence and use thereof | |
| CN1169954C (en) | Polynucleotide encoding a human protein with the function of inhibiting the growth of cancer cells | |
| CN1231497C (en) | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof | |
| CN1177864C (en) | New Human Proteins and Their Coding Sequences Differentially Expressed in Liver Cancer Tissues | |
| CN1160370C (en) | Human cell cycle control related protein and its coding sequence | |
| CN1199997C (en) | New human protein having mouse NIH/3T3 cell conversion promoting function and its code sequence | |
| CN1177050C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
| CN1231496C (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
| CN1155615C (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
| CN1209373C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
| CN1177049C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
| CN1177048C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
| CN1222616C (en) | Novel human protein with cancer-inhibiting function and coding sequence thereof | |
| CN1155616C (en) | Human protein with cancer cell growth promoting function and its coding sequence | |
| CN1190446C (en) | New human protein with function of improving mouse NIH/3T3 cell transformation and its encoding sequence | |
| CN1199999C (en) | Human protein for promoting transform of 3T3 cell and its coding sequence | |
| CN1169955C (en) | Polynucleotide encoding a human protein with the function of inhibiting the growth of cancer cells | |
| CN1199998C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
| CN1177047C (en) | Human protein with cancer suppressing function and its coding sequence | |
| CN1169831C (en) | Novel human protein with the function of inhibiting the growth of cancer cells and its coding sequence | |
| CN1209374C (en) | Human Protein for promoting transform of 3T3 cell and its coding sequence | |
| CN1230445C (en) | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof | |
| CN1209370C (en) | New human protein with cancer inhibiting function and its encoding sequence | |
| CN1199994C (en) | New human protein with cancer cell growth inhibiting function and its coding sequence | |
| CN1194989C (en) | Novel human protein able to suppress cancer cell growth and its coding sequence |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| C19 | Lapse of patent right due to non-payment of the annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |