US20030186249A1 - Human TARPP genes and polypeptides - Google Patents
Human TARPP genes and polypeptides
- Publication number
- US20030186249A1 US10/112,372
- Authority
- US
- United States
- Prior art keywords
- ser
- gln
- pro
- gly
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 100
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 92
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 89
- 241000282414 Homo sapiens Species 0.000 title claims description 159
- 101000885144 Homo sapiens cAMP-regulated phosphoprotein 21 Proteins 0.000 title claims description 133
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 195
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 195
- 239000002157 polynucleotide Substances 0.000 claims abstract description 195
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 169
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 56
- 201000010099 disease Diseases 0.000 claims abstract description 27
- 238000000034 method Methods 0.000 claims description 139
- 102100039125 cAMP-regulated phosphoprotein 21 Human genes 0.000 claims description 127
- 239000000523 sample Substances 0.000 claims description 123
- 210000004027 cell Anatomy 0.000 claims description 105
- 230000014509 gene expression Effects 0.000 claims description 103
- 239000003795 chemical substances by application Substances 0.000 claims description 47
- 125000003729 nucleotide group Chemical group 0.000 claims description 44
- 239000002773 nucleotide Substances 0.000 claims description 42
- 150000007523 nucleic acids Chemical class 0.000 claims description 39
- 150000001413 amino acids Chemical class 0.000 claims description 38
- 102000039446 nucleic acids Human genes 0.000 claims description 35
- 108020004707 nucleic acids Proteins 0.000 claims description 35
- 238000009396 hybridization Methods 0.000 claims description 30
- 108020004414 DNA Proteins 0.000 claims description 29
- 238000003752 polymerase chain reaction Methods 0.000 claims description 27
- 238000012360 testing method Methods 0.000 claims description 26
- 230000000692 anti-sense effect Effects 0.000 claims description 20
- 108091026890 Coding region Proteins 0.000 claims description 19
- 239000012634 fragment Substances 0.000 claims description 19
- 230000004071 biological effect Effects 0.000 claims description 16
- 108020004999 messenger RNA Proteins 0.000 claims description 15
- 230000027455 binding Effects 0.000 claims description 12
- 230000001105 regulatory effect Effects 0.000 claims description 12
- 230000001225 therapeutic effect Effects 0.000 claims description 12
- 239000002299 complementary DNA Substances 0.000 claims description 11
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 10
- 238000012217 deletion Methods 0.000 claims description 9
- 230000037430 deletion Effects 0.000 claims description 9
- 238000013519 translation Methods 0.000 claims description 9
- 238000004458 analytical method Methods 0.000 claims description 8
- 102000054765 polymorphisms of proteins Human genes 0.000 claims description 7
- 238000000636 Northern blotting Methods 0.000 claims description 6
- 210000000987 immune system Anatomy 0.000 claims description 6
- 210000000653 nervous system Anatomy 0.000 claims description 6
- 238000007901 in situ hybridization Methods 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 4
- 238000006467 substitution reaction Methods 0.000 claims description 4
- 238000012340 reverse transcriptase PCR Methods 0.000 claims description 3
- 210000002865 immune cell Anatomy 0.000 claims description 2
- 210000002569 neuron Anatomy 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 6
- 210000005260 human cell Anatomy 0.000 claims 2
- 230000017105 transposition Effects 0.000 claims 1
- 210000001519 tissue Anatomy 0.000 abstract description 42
- 230000009870 specific binding Effects 0.000 abstract description 18
- 239000003814 drug Substances 0.000 abstract description 16
- 238000012544 monitoring process Methods 0.000 abstract description 6
- 238000011160 research Methods 0.000 abstract description 6
- 238000002560 therapeutic procedure Methods 0.000 abstract description 4
- 210000004556 brain Anatomy 0.000 abstract description 3
- 238000007876 drug discovery Methods 0.000 abstract description 3
- 210000002861 immature t-cell Anatomy 0.000 abstract description 3
- 210000003205 muscle Anatomy 0.000 abstract description 3
- 230000001817 pituitary effect Effects 0.000 abstract description 3
- 210000001541 thymus gland Anatomy 0.000 abstract description 3
- 238000003745 diagnosis Methods 0.000 abstract description 2
- 239000003596 drug target Substances 0.000 abstract description 2
- 241001465754 Metazoa Species 0.000 description 29
- 208000035475 disorder Diseases 0.000 description 29
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 28
- 108010077112 prolyl-proline Proteins 0.000 description 28
- 108010061238 threonyl-glycine Proteins 0.000 description 28
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 27
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 24
- 108010078144 glutaminyl-glycine Proteins 0.000 description 24
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 23
- 241000282326 Felis catus Species 0.000 description 22
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 22
- 230000000694 effects Effects 0.000 description 22
- 238000001514 detection method Methods 0.000 description 20
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 19
- 108010010147 glycylglutamine Proteins 0.000 description 19
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 18
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 16
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 16
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 16
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 15
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 15
- 108010070643 prolylglutamic acid Proteins 0.000 description 15
- 108010053725 prolylvaline Proteins 0.000 description 15
- 108010020532 tyrosyl-proline Proteins 0.000 description 15
- 238000003556 assay Methods 0.000 description 14
- -1 e.g. Chemical compound 0.000 description 14
- 108010049041 glutamylalanine Proteins 0.000 description 14
- 239000013598 vector Substances 0.000 description 14
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 13
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 13
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 13
- 108010044940 alanylglutamine Proteins 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 13
- 230000009261 transgenic effect Effects 0.000 description 13
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 12
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 12
- 108010077515 glycylproline Proteins 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 230000035772 mutation Effects 0.000 description 12
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 11
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 11
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 11
- 241000699666 Mus <mouse, genus> Species 0.000 description 11
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 10
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 10
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 10
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 10
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 10
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 10
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 10
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 10
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 10
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 10
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 10
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- 239000007787 solid Substances 0.000 description 10
- 238000013518 transcription Methods 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 9
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 9
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 9
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 9
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 9
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 9
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 9
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 9
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 9
- 108010065395 Neuropep-1 Proteins 0.000 description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 9
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 9
- 108010003201 RGH 0205 Proteins 0.000 description 9
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 9
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 9
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 9
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 9
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 9
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 9
- 239000000427 antigen Substances 0.000 description 9
- 108010015792 glycyllysine Proteins 0.000 description 9
- 108010025306 histidylleucine Proteins 0.000 description 9
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 9
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 8
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 8
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 8
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 8
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 8
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 8
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 8
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 8
- 108010034522 NNQQ peptide Proteins 0.000 description 8
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 8
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 8
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 8
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 8
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 8
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 8
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 8
- 108010064997 VPY tripeptide Proteins 0.000 description 8
- 239000002253 acid Substances 0.000 description 8
- 230000003321 amplification Effects 0.000 description 8
- 108091007433 antigens Proteins 0.000 description 8
- 102000036639 antigens Human genes 0.000 description 8
- 108010008355 arginyl-glutamine Proteins 0.000 description 8
- 108010060199 cysteinylproline Proteins 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 8
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 7
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 7
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 7
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 7
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 7
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 7
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 7
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 7
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 7
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 7
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 7
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 7
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 7
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 7
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 7
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 7
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 7
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 7
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 7
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 7
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 7
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 7
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 7
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 7
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 7
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 7
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 7
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 7
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 7
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 7
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 7
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 150000001875 compounds Chemical class 0.000 description 7
- 239000003550 marker Substances 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 108010015796 prolylisoleucine Proteins 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 108010073969 valyllysine Proteins 0.000 description 7
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 6
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 6
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 6
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 6
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 6
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 6
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 6
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 6
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 6
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 6
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 6
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 6
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 6
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 6
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 6
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 6
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 6
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 6
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 6
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 6
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 6
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 6
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 6
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 6
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 6
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 6
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 6
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 6
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 6
- 108091092195 Intron Proteins 0.000 description 6
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 6
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 6
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 6
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 6
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 6
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 6
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 6
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 6
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 6
- 241001529936 Murinae Species 0.000 description 6
- 241000699660 Mus musculus Species 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 6
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 6
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 6
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 6
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 6
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 6
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 6
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 6
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 6
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 6
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 6
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 6
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 6
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 6
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 6
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 6
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 6
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 6
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 6
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 210000001124 body fluid Anatomy 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 230000037361 pathway Effects 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 229940124597 therapeutic agent Drugs 0.000 description 6
- 108010051110 tyrosyl-lysine Proteins 0.000 description 6
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 5
- NNRFRJQMBSBXGO-CIUDSAMLSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NNRFRJQMBSBXGO-CIUDSAMLSA-N 0.000 description 5
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 5
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 5
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 5
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 5
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 5
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 5
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 5
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 5
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 5
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 5
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 5
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 5
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 5
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 5
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 5
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 5
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 5
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 5
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 5
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 5
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 5
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 5
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 5
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 5
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 5
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 5
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 5
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 5
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 5
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 5
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 5
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 5
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 5
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 5
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 5
- 101100273831 Homo sapiens CDS1 gene Proteins 0.000 description 5
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 5
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 5
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 5
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 5
- 108060003951 Immunoglobulin Proteins 0.000 description 5
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 5
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 5
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 5
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 5
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- 101710163270 Nuclease Proteins 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 5
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 5
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 5
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 5
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 5
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 5
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 5
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 5
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 5
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 5
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 5
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 5
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 5
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 5
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 5
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 5
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 5
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 5
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 5
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 5
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 5
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 5
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 5
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 5
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 5
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 5
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 5
- 150000007513 acids Chemical class 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 5
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 5
- 238000003491 array Methods 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 239000010839 body fluid Substances 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 229940127089 cytotoxic agent Drugs 0.000 description 5
- 210000001671 embryonic stem cell Anatomy 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 108010085325 histidylproline Proteins 0.000 description 5
- 230000006801 homologous recombination Effects 0.000 description 5
- 238000002744 homologous recombination Methods 0.000 description 5
- 238000003018 immunoassay Methods 0.000 description 5
- 102000018358 immunoglobulin Human genes 0.000 description 5
- 238000002372 labelling Methods 0.000 description 5
- 210000004698 lymphocyte Anatomy 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 4
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 4
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 4
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 4
- 108700028369 Alleles Proteins 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 4
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 4
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 4
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 4
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 4
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 4
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 4
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 4
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 4
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 4
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 4
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 4
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 4
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 4
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 4
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 4
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 4
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 4
- UUOYKFNULIOCGJ-GUBZILKMSA-N Cys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N UUOYKFNULIOCGJ-GUBZILKMSA-N 0.000 description 4
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 4
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 4
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 4
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 4
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 4
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 4
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 4
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 4
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 4
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 4
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 4
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 4
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 4
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 4
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 4
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 4
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 4
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 4
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 4
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 4
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 4
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 4
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 4
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 4
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 4
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 4
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 4
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 4
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 4
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 4
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 4
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 4
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 4
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 4
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 4
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 4
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 4
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 4
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 4
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 4
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 4
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 4
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 4
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 4
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 4
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 4
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 4
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 4
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 4
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 4
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 4
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 4
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 4
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 4
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 4
- 241000700159 Rattus Species 0.000 description 4
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 4
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 4
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 4
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 4
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 4
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 4
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 4
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 4
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 4
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 4
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 4
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 4
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 4
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 4
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 4
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 4
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 4
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 4
- 108700019146 Transgenes Proteins 0.000 description 4
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 4
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 4
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 4
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 4
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 4
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 4
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010036533 arginylvaline Proteins 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 108010083633 cyclic AMP-regulated phosphoprotein ARPP-21 Proteins 0.000 description 4
- 239000002254 cytotoxic agent Substances 0.000 description 4
- 231100000599 cytotoxic agent Toxicity 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 230000002285 radioactive effect Effects 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 230000008685 targeting Effects 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 238000011830 transgenic mouse model Methods 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 210000002700 urine Anatomy 0.000 description 4
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 3
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 3
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 3
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 3
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 3
- MRXZVZVKYZELRU-UHFFFAOYSA-N Ala-Trp-Ser-Ser Chemical compound C1=CC=C2C(CC(NC(=O)C(N)C)C(=O)NC(CO)C(=O)NC(CO)C(O)=O)=CNC2=C1 MRXZVZVKYZELRU-UHFFFAOYSA-N 0.000 description 3
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 3
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 3
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 3
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 3
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 3
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 3
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 3
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 3
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 3
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 3
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 3
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 3
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 3
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 3
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 3
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 3
- 101100035744 Chlorobium chlorochromatii (strain CaD3) rplX gene Proteins 0.000 description 3
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 3
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 3
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 3
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 3
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 3
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 3
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 3
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 3
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 3
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 3
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 3
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 3
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 3
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 3
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 3
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 3
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 3
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 3
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 3
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 3
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 3
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 3
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 3
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 3
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 3
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 3
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 3
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 3
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 3
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 3
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 3
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 3
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 3
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 3
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 3
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 3
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 3
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 3
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 3
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 108091061960 Naked DNA Proteins 0.000 description 3
- 239000000020 Nitrocellulose Substances 0.000 description 3
- 102000057297 Pepsin A Human genes 0.000 description 3
- 108090000284 Pepsin A Proteins 0.000 description 3
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 3
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 3
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 3
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 3
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 3
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 3
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 3
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 3
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 3
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 3
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 3
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 3
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 3
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 3
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 3
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 3
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 3
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 3
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 3
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 3
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 3
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 3
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 3
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 3
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 3
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 3
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 3
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 3
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 3
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 3
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 3
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 3
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 3
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 3
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 3
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 3
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 3
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 3
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 3
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 3
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 3
- XBWKCYFGRXKWGO-SRVKXCTJSA-N Tyr-Cys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XBWKCYFGRXKWGO-SRVKXCTJSA-N 0.000 description 3
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 3
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 3
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 3
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 3
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 3
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 3
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 3
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 3
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 210000004504 adult stem cell Anatomy 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 239000002246 antineoplastic agent Substances 0.000 description 3
- 239000012736 aqueous medium Substances 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000002512 chemotherapy Methods 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 238000006911 enzymatic reaction Methods 0.000 description 3
- 238000010363 gene targeting Methods 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 108010027338 isoleucylcysteine Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 210000001161 mammalian embryo Anatomy 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 229920001220 nitrocellulos Polymers 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 229940111202 pepsin Drugs 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000005855 radiation Effects 0.000 description 3
- 230000008707 rearrangement Effects 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 229910001415 sodium ion Inorganic materials 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 108700026220 vif Genes Proteins 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- IAKHMKGGTNLKSZ-INIZCTEOSA-N (S)-colchicine Chemical compound C1([C@@H](NC(C)=O)CC2)=CC(=O)C(OC)=CC=C1C1=C2C=C(OC)C(OC)=C1OC IAKHMKGGTNLKSZ-INIZCTEOSA-N 0.000 description 2
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 2
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- 108091023037 Aptamer Proteins 0.000 description 2
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 2
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 208000023275 Autoimmune disease Diseases 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 102000015554 Dopamine receptor Human genes 0.000 description 2
- 108050004812 Dopamine receptor Proteins 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 101710082714 Exotoxin A Proteins 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- 206010064571 Gene mutation Diseases 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 2
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 2
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 2
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 2
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 2
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 2
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 2
- 206010020751 Hypersensitivity Diseases 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 2
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 2
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 2
- UQSXHKLRYXJYBZ-UHFFFAOYSA-N Iron oxide Chemical compound [Fe]=O UQSXHKLRYXJYBZ-UHFFFAOYSA-N 0.000 description 2
- 102000004195 Isomerases Human genes 0.000 description 2
- 108090000769 Isomerases Proteins 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 2
- ULLIQRYQNMAAHC-RWMBFGLXSA-N Met-His-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N ULLIQRYQNMAAHC-RWMBFGLXSA-N 0.000 description 2
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 2
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 2
- QPCDCPDFJACHGM-UHFFFAOYSA-N N,N-bis{2-[bis(carboxymethyl)amino]ethyl}glycine Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(=O)O)CCN(CC(O)=O)CC(O)=O QPCDCPDFJACHGM-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 2
- 108010089430 Phosphoproteins Proteins 0.000 description 2
- 102000007982 Phosphoproteins Human genes 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 2
- CFVRJNZJQHDQPP-CYDGBPFRSA-N Pro-Ile-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 CFVRJNZJQHDQPP-CYDGBPFRSA-N 0.000 description 2
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 2
- 101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 108010039491 Ricin Proteins 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 2
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 210000004100 adrenal gland Anatomy 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 230000007815 allergy Effects 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000022131 cell cycle Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 235000010980 cellulose Nutrition 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003638 chemical reducing agent Substances 0.000 description 2
- 239000003593 chromogenic compound Substances 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- YPHMISFOHDHNIV-FSZOTQKASA-N cycloheximide Chemical compound C1[C@@H](C)C[C@H](C)C(=O)[C@@H]1[C@H](O)CC1CC(=O)NC(=O)C1 YPHMISFOHDHNIV-FSZOTQKASA-N 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 229960003638 dopamine Drugs 0.000 description 2
- 230000003291 dopaminomimetic effect Effects 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 230000005714 functional activity Effects 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000012215 gene cloning Methods 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 210000002216 heart Anatomy 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 230000001024 immunotherapeutic effect Effects 0.000 description 2
- 238000009169 immunotherapy Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 238000004020 luminiscence type Methods 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 230000005291 magnetic effect Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical compound CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 description 2
- 239000004005 microsphere Substances 0.000 description 2
- 230000037230 mobility Effects 0.000 description 2
- 201000006417 multiple sclerosis Diseases 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- 230000005298 paramagnetic effect Effects 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 210000002307 prostate Anatomy 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000000163 radioactive labelling Methods 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 2
- 206010039073 rheumatoid arthritis Diseases 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 210000000813 small intestine Anatomy 0.000 description 2
- 210000002784 stomach Anatomy 0.000 description 2
- 208000011117 substance-related disease Diseases 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 210000001685 thyroid gland Anatomy 0.000 description 2
- WYWHKKSPHMUBEB-UHFFFAOYSA-N tioguanine Chemical compound N1C(N)=NC(=S)C2=C1N=CN2 WYWHKKSPHMUBEB-UHFFFAOYSA-N 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 210000004291 uterus Anatomy 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- STGXGJRRAJKJRG-JDJSBBGDSA-N (3r,4r,5r)-5-(hydroxymethyl)-3-methoxyoxolane-2,4-diol Chemical compound CO[C@H]1C(O)O[C@H](CO)[C@H]1O STGXGJRRAJKJRG-JDJSBBGDSA-N 0.000 description 1
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 1
- HUDPLKWXRLNSPC-UHFFFAOYSA-N 4-aminophthalhydrazide Chemical compound O=C1NNC(=O)C=2C1=CC(N)=CC=2 HUDPLKWXRLNSPC-UHFFFAOYSA-N 0.000 description 1
- JYCQQPHGFMYQCF-UHFFFAOYSA-N 4-tert-Octylphenol monoethoxylate Chemical compound CC(C)(C)CC(C)(C)C1=CC=C(OCCO)C=C1 JYCQQPHGFMYQCF-UHFFFAOYSA-N 0.000 description 1
- 102100031126 6-phosphogluconolactonase Human genes 0.000 description 1
- 108010029731 6-phosphogluconolactonase Proteins 0.000 description 1
- STQGQHZAVUOBTE-UHFFFAOYSA-N 7-Cyan-hept-2t-en-4,6-diinsaeure Natural products C1=2C(O)=C3C(=O)C=4C(OC)=CC=CC=4C(=O)C3=C(O)C=2CC(O)(C(C)=O)CC1OC1CC(N)C(O)C(C)O1 STQGQHZAVUOBTE-UHFFFAOYSA-N 0.000 description 1
- UBKVUFQGVWHZIR-UHFFFAOYSA-N 8-oxoguanine Chemical compound O=C1NC(N)=NC2=NC(=O)N=C21 UBKVUFQGVWHZIR-UHFFFAOYSA-N 0.000 description 1
- 108010066676 Abrin Proteins 0.000 description 1
- 102100033639 Acetylcholinesterase Human genes 0.000 description 1
- 108010022752 Acetylcholinesterase Proteins 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- FDAZDMAFZYTHGS-XVYDVKMFSA-N Ala-His-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FDAZDMAFZYTHGS-XVYDVKMFSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- 108010025188 Alcohol oxidase Proteins 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 102100039375 Ankyrin repeat domain-containing protein 2 Human genes 0.000 description 1
- 108010032595 Antibody Binding Sites Proteins 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 208000019901 Anxiety disease Diseases 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- 108010024976 Asparaginase Proteins 0.000 description 1
- 102000015790 Asparaginase Human genes 0.000 description 1
- 238000012935 Averaging Methods 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- FERIUCNNQQJTOY-UHFFFAOYSA-M Butyrate Chemical compound CCCC([O-])=O FERIUCNNQQJTOY-UHFFFAOYSA-M 0.000 description 1
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Natural products CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 102100035882 Catalase Human genes 0.000 description 1
- 108010053835 Catalase Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 241001247197 Cephalocarida Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102000012286 Chitinases Human genes 0.000 description 1
- 108010022172 Chitinases Proteins 0.000 description 1
- 101100510093 Chlorobium chlorochromatii (strain CaD3) gmk gene Proteins 0.000 description 1
- 101100198594 Chlorobium chlorochromatii (strain CaD3) rnhA gene Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 102000008130 Cyclic AMP-Dependent Protein Kinases Human genes 0.000 description 1
- 108010049894 Cyclic AMP-Dependent Protein Kinases Proteins 0.000 description 1
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 1
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 230000007067 DNA methylation Effects 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108010092160 Dactinomycin Proteins 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 208000030453 Drug-Related Side Effects and Adverse reaction Diseases 0.000 description 1
- 102100031334 Elongation factor 2 Human genes 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Firefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 102100022624 Glucoamylase Human genes 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 108010018962 Glucosephosphate Dehydrogenase Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 1
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- 101000961307 Homo sapiens Ankyrin repeat domain-containing protein 2 Proteins 0.000 description 1
- 101000743768 Homo sapiens R3H domain-containing protein 1 Proteins 0.000 description 1
- 101000743771 Homo sapiens R3H domain-containing protein 2 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 101000829171 Hypocrea virens (strain Gv29-8 / FGSC 10586) Effector TSP1 Proteins 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 108010047357 Luminescent Proteins Proteins 0.000 description 1
- 102000006830 Luminescent Proteins Human genes 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- 102000013460 Malate Dehydrogenase Human genes 0.000 description 1
- 108010026217 Malate Dehydrogenase Proteins 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- 108010059724 Micrococcal Nuclease Proteins 0.000 description 1
- 229930192392 Mitomycin Natural products 0.000 description 1
- 108090000143 Mouse Proteins Proteins 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 101100520973 Mus musculus Ppp1r1c gene Proteins 0.000 description 1
- 101000800539 Mus musculus Translationally-controlled tumor protein Proteins 0.000 description 1
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 108091007494 Nucleic acid-binding domains Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108010077519 Peptide Elongation Factor 2 Proteins 0.000 description 1
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108010053210 Phycocyanin Proteins 0.000 description 1
- 108010004729 Phycoerythrin Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 102100038382 R3H domain-containing protein 1 Human genes 0.000 description 1
- 102100038384 R3H domain-containing protein 2 Human genes 0.000 description 1
- 102000002185 R3H domains Human genes 0.000 description 1
- 108050009559 R3H domains Proteins 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 101710100968 Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- 108010034949 Thyroglobulin Proteins 0.000 description 1
- 102000009843 Thyroglobulin Human genes 0.000 description 1
- 206010070863 Toxicity to various agents Diseases 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- 108010007780 U7 Small Nuclear Ribonucleoprotein Proteins 0.000 description 1
- IVOMOUWHDPKRLL-UHFFFAOYSA-N UNPD107823 Natural products O1C2COP(O)(=O)OC2C(O)C1N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-UHFFFAOYSA-N 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- ZVNYJIZDIRKMBF-UHFFFAOYSA-N Vesnarinone Chemical compound C1=C(OC)C(OC)=CC=C1C(=O)N1CCN(C=2C=C3CCC(=O)NC3=CC=2)CC1 ZVNYJIZDIRKMBF-UHFFFAOYSA-N 0.000 description 1
- JXLYSJRDGCGARV-WWYNWVTFSA-N Vinblastine Natural products O=C(O[C@H]1[C@](O)(C(=O)OC)[C@@H]2N(C)c3c(cc(c(OC)c3)[C@]3(C(=O)OC)c4[nH]c5c(c4CCN4C[C@](O)(CC)C[C@H](C3)C4)cccc5)[C@@]32[C@H]2[C@@]1(CC)C=CCN2CC3)C JXLYSJRDGCGARV-WWYNWVTFSA-N 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 229940022698 acetylcholinesterase Drugs 0.000 description 1
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical class C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 1
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- 108010004469 allophycocyanin Proteins 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000002669 amniocentesis Methods 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 229940025131 amylases Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 1
- 229940045799 anthracyclines and related substance Drugs 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000003302 anti-idiotype Effects 0.000 description 1
- 230000000340 anti-metabolite Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- 102000025171 antigen binding proteins Human genes 0.000 description 1
- 108091000831 antigen binding proteins Proteins 0.000 description 1
- 229940100197 antimetabolite Drugs 0.000 description 1
- 239000002256 antimetabolite Substances 0.000 description 1
- 239000003080 antimitotic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 239000002787 antisense oligonucleotide Substances 0.000 description 1
- 230000036506 anxiety Effects 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 229960003272 asparaginase Drugs 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-M asparaginate Chemical compound [O-]C(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-M 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 210000004227 basal ganglia Anatomy 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000005415 bioluminescence Methods 0.000 description 1
- 230000029918 bioluminescence Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- RSIHSRDYCUFFLA-DYKIIFRCSA-N boldenone Chemical compound O=C1C=C[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 RSIHSRDYCUFFLA-DYKIIFRCSA-N 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- UBAZGMLMVVQSCD-UHFFFAOYSA-N carbon dioxide;molecular oxygen Chemical compound O=O.O=C=O UBAZGMLMVVQSCD-UHFFFAOYSA-N 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 238000000423 cell based assay Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 229960001338 colchicine Drugs 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 229940095074 cyclic amp Drugs 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 239000000824 cytostatic agent Substances 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 229960000640 dactinomycin Drugs 0.000 description 1
- STQGQHZAVUOBTE-VGBVRHCVSA-N daunorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(C)=O)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 STQGQHZAVUOBTE-VGBVRHCVSA-N 0.000 description 1
- 229960000975 daunorubicin Drugs 0.000 description 1
- 231100000895 deafness Toxicity 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- RSIHSRDYCUFFLA-UHFFFAOYSA-N dehydrotestosterone Natural products O=C1C=CC2(C)C3CCC(C)(C(CC4)O)C4C3CCC2=C1 RSIHSRDYCUFFLA-UHFFFAOYSA-N 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 208000037765 diseases and disorders Diseases 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000002222 downregulating effect Effects 0.000 description 1
- 229960004679 doxorubicin Drugs 0.000 description 1
- 230000000857 drug effect Effects 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- VJJPUSNTGOMMGY-MRVIYFEKSA-N etoposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@H](C)OC[C@H]4O3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 VJJPUSNTGOMMGY-MRVIYFEKSA-N 0.000 description 1
- 229960005420 etoposide Drugs 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000005294 ferromagnetic effect Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- ZFKJVJIDPQDDFY-UHFFFAOYSA-N fluorescamine Chemical compound C12=CC=CC=C2C(=O)OC1(C1=O)OC=C1C1=CC=CC=C1 ZFKJVJIDPQDDFY-UHFFFAOYSA-N 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000001215 fluorescent labelling Methods 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- AWUCVROLDVIAJX-UHFFFAOYSA-N glycerol 1-phosphate Chemical compound OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 1
- 150000002337 glycosamines Chemical group 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 208000026278 immune system disease Diseases 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 239000002955 immunomodulating agent Substances 0.000 description 1
- 229940121354 immunomodulator Drugs 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000002637 immunotoxin Effects 0.000 description 1
- 239000002596 immunotoxin Substances 0.000 description 1
- 231100000608 immunotoxin Toxicity 0.000 description 1
- 229940051026 immunotoxin Drugs 0.000 description 1
- 238000012606 in vitro cell culture Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 229940096010 iron polysaccharide Drugs 0.000 description 1
- SZVJSHCCFOBDDC-UHFFFAOYSA-N iron(II,III) oxide Inorganic materials O=[Fe]O[Fe]O[Fe]=O SZVJSHCCFOBDDC-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229910052747 lanthanoid Inorganic materials 0.000 description 1
- 150000002602 lanthanoids Chemical class 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 230000031700 light absorption Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- HWYHZTIRURJOHG-UHFFFAOYSA-N luminol Chemical compound O=C1NNC(=O)C2=C1C(N)=CC=C2 HWYHZTIRURJOHG-UHFFFAOYSA-N 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 238000002826 magnetic-activated cell sorting Methods 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 229960004857 mitomycin Drugs 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 210000001178 neural stem cell Anatomy 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- HEGSGKPQLMEBJL-RKQHYHRCSA-N octyl beta-D-glucopyranoside Chemical compound CCCCCCCCO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HEGSGKPQLMEBJL-RKQHYHRCSA-N 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000007800 oxidant agent Substances 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 1
- 210000004976 peripheral blood cell Anatomy 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 230000002974 pharmacogenomic effect Effects 0.000 description 1
- RXNXLAHQOVLMIE-UHFFFAOYSA-N phenyl 10-methylacridin-10-ium-9-carboxylate Chemical compound C12=CC=CC=C2[N+](C)=C2C=CC=CC2=C1C(=O)OC1=CC=CC=C1 RXNXLAHQOVLMIE-UHFFFAOYSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229940080469 phosphocellulose Drugs 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 238000005222 photoaffinity labeling Methods 0.000 description 1
- ZWLUXSQADUDCSB-UHFFFAOYSA-N phthalaldehyde Chemical compound O=CC1=CC=CC=C1C=O ZWLUXSQADUDCSB-UHFFFAOYSA-N 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 235000020004 porter Nutrition 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000009609 prenatal screening Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000030788 protein refolding Effects 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 239000000700 radioactive tracer Substances 0.000 description 1
- 239000002287 radioligand Substances 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 239000012925 reference material Substances 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 208000019745 retinal vasculopathy with cerebral leukodystrophy Diseases 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 102200082402 rs751610198 Human genes 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 210000003079 salivary gland Anatomy 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 201000000980 schizophrenia Diseases 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 201000009032 substance abuse Diseases 0.000 description 1
- 231100000736 substance abuse Toxicity 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-M sulfamate Chemical compound NS([O-])(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-M 0.000 description 1
- NVBFHJWHLNUMCV-UHFFFAOYSA-N sulfamide Chemical compound NS(N)(=O)=O NVBFHJWHLNUMCV-UHFFFAOYSA-N 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- NRUKOCRGYNPUPR-QBPJDGROSA-N teniposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@@H](OC[C@H]4O3)C=3SC=CC=3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 NRUKOCRGYNPUPR-QBPJDGROSA-N 0.000 description 1
- 229960001278 teniposide Drugs 0.000 description 1
- 229960000814 tetanus toxoid Drugs 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- 229960002175 thyroglobulin Drugs 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- OPUPHQHVRPYOTC-UHFFFAOYSA-N vgf3hm1rrf Chemical compound C1=NC(C(=O)C=2C3=CC=CN=2)=C2C3=NC=CC2=C1 OPUPHQHVRPYOTC-UHFFFAOYSA-N 0.000 description 1
- 229960003048 vinblastine Drugs 0.000 description 1
- JXLYSJRDGCGARV-XQKSVPLYSA-N vincaleukoblastine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1NC1=CC=CC=C21 JXLYSJRDGCGARV-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
Definitions
- FIG. 1 shows the amino acid alignments of the different splice variants of human TARPP, Br137A (SEQ ID NO 4), B (SEQ ID NO 6), C (SEQ ID NO 8), D (SEQ ID NO 10; SEQ ID NO 13, NM_016300), and E (SEQ ID NO 2), and partial clone AL133109 (SEQ ID NO 13).
- FIG. 2 is a schematic drawing showing the differences between the various forms of human TARPP.
- FIG. 3 shows amino acid alignments of the different splice variants of human TARPP (Br137A, B, C, D, and E) with mouse TARPP (NM_033264; SEQ ID NO 11).
- the present invention relates to all facets of human TARPP (also known as human Br137), polypeptides encoded by it, antibodies and specific binding partners thereto, and their applications to research, diagnosis, drug discovery, therapy, clinical medicine, forensic science and medicine, etc.
- Human TARPP polynucleotides, polypeptides, antibodies, etc. are useful in a variety of ways, including, but not limited to, as molecular markers, as drug targets, and for detecting, diagnosing, staging, monitoring, prognosticating, preventing or treating, determining predisposition to, etc., diseases and conditions relating to T-cells and dopaminergic pathways.
- the identification of specific genes, and groups of genes, expressed in pathways physiologically relevant to these conditions permits the definition of functional and disease pathways, and the delineation of targets in these pathways which are useful in diagnostic, therapeutic, and clinical applications.
- the present invention also relates to methods of using the polynucleotides and related products (proteins, antibodies, etc.) in business and computer-related methods, e.g., advertising, displaying, offering, selling, etc., such products for sale, commercial use, licensing, etc.
- Br137E is an 847 amino acid polypeptide. Its nucleotide and amino acid sequences are shown in SEQ ID NOS 1 and 2.
- Br137B (SEQ ID NOS 5 and 6) has a deletion of amino acids 267-300.
- Br137A (SEQ ID NOS 3 and 4) has a deletion of amino acids 312-331.
- Br137C (SEQ ID NOS 7 and 8) has a deletion of both of these domains.
- Br137D contains only the first 87 amino acids followed by a two-amino acid C-terminus which differs from the other forms.
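The relationship between the splice forms can be sketched in code. The following is a minimal illustration only: it assumes that the deletion coordinates are 1-based, inclusive positions in Br137E (SEQ ID NO 2) and that the A, B, and C forms differ from Br137E solely by the stated deletions. The sequence is a placeholder, not the actual polypeptide, and the helper name `delete_ranges` is hypothetical.

```python
def delete_ranges(sequence, ranges):
    """Remove 1-based, inclusive residue ranges from a sequence."""
    drop = set()
    for start, end in ranges:
        drop.update(range(start, end + 1))
    return "".join(
        residue
        for position, residue in enumerate(sequence, start=1)
        if position not in drop
    )


# Placeholder of the stated Br137E length (847 residues); the real
# sequence is the one recited in SEQ ID NO 2.
br137e = "X" * 847

# Deletions as stated in the text, assumed to be relative to Br137E.
deletions = {
    "Br137B": [(267, 300)],
    "Br137A": [(312, 331)],
    "Br137C": [(267, 300), (312, 331)],
}

# Lengths implied by the assumptions above (not stated in the source).
lengths = {
    name: len(delete_ranges(br137e, ranges))
    for name, ranges in deletions.items()
}
```

Applying the same function to the actual SEQ ID NO 2 sequence would give candidate variant sequences for comparison against SEQ ID NOS 4, 6, and 8.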
- a partial clone, AL133109 as shown in FIG. 1, is missing the first 161 amino acids of Br137E, as well as having an amino acid difference at position 312 (SEQ ID NO 2).
- Br137E contains a nuclear localization signal at about amino acids 107-124, an R3H domain (single-stranded nucleic acid binding domain) at about amino acids 147-224, and a proline rich region at about amino acids 476-682. These domains are also present in the A-C splice forms, but at different amino acid positions.
- Human TARPP has nucleic acid binding activity, conferred by the corresponding binding domain, indicating that it can bind nucleic acids, preferably single-stranded DNA or RNA. This binding activity can be assayed routinely, e.g., using gel electrophoresis band shift assays (as carried out in, e.g., U.S. Pat. Nos. 6,333,407 and 5,789,538), ELISA-based assays (e.g., Mercury™ TransFactor Kit from Clontech), and other assays which detect DNA-protein interactions.
- The Br137 family represents the human homologs of murine TARPP (thymocyte ARPP) (NM_033264; SEQ ID NO 11; “Mouse” in FIG. 3).
- Br137E has about 83% amino acid identity and 87% homology with it (calculated using the BLAST algorithm). See FIG. 3 (NM_033264 is murine TARPP).
- Human TARPP has an insertion at about amino acid positions 549-572 of SEQ ID NO 2 which is not present in the mouse protein. See FIG. 3.
- A 21 kDa polypeptide was isolated from rat basal ganglia based on its phosphorylation by cAMP-dependent protein kinase (PKA). Williams et al., J. Neurosci., 9:3631-3637, 1989. It was named ARPP-21 (cAMP-regulated phosphoprotein). Activation of dopamine receptors resulted in an increase in the phosphorylation of ARPP-21.
- A high molecular weight polypeptide of ARPP-21 was subsequently identified in T-cells and named TARPP. Kisielow et al., Eur. J. Immunol., 31:1141-1149, 2001. This polypeptide contains ARPP-21 sequence at its 5′ end, but a novel 3′ end coding for more than 700 additional amino acids (for a total of 807 amino acids).
- Murine TARPP appears to be involved in the regulation of thymocyte maturation and TCR rearrangement. Expression of TARPP is down-regulated after the TCR signal is delivered. It is highly expressed in immature thymocytes and is associated with the commitment to the T-cell lineage, making it a highly selective marker for T-cell commitment. See, Kisielow, ibid. After commitment to the T-cell lineage during positive selection, its expression is turned off.
- KIAA0029 is a hypothetical protein that shares about 45% amino acid sequence identity and 59% homology with Br137E.
- KIAA1002, a second hypothetical protein, has about 42% amino acid identity and 54% homology to it.
- Human TARPP is highly expressed in brain, pituitary, muscle, and thymus. It is expressed at lower levels in adrenal gland, bone marrow, heart, small intestine, kidney, liver, ovary, prostate, stomach, testis, and thyroid. There was virtually no detectable expression in colon, lung, lymph node, peripheral lymphocytes, mammary gland, pancreas, and uterus.
- Human TARPP is involved in the maturation of T-cells, especially in the rearrangement of the TCR. For this reason, it can be used to modulate T-cells, e.g., in allergy, autoimmune disease (e.g., rheumatoid arthritis and multiple sclerosis), and graft-host disease. It can also be used as a marker to determine the index of mature versus immature T-cells, where human TARPP is a marker of immature T-cells. Additionally, human TARPP is phosphorylated upon dopamine receptor activation, indicating an involvement in dopamine pathways. Consequently, it is a target for diseases that involve dopamine, including, e.g., schizophrenia, substance abuse and addiction, anxiety, Parkinson's disease, and other dopaminergic diseases and conditions.
- Human TARPP is localized to chromosomal band 3p21.33.
- Disorders genetically mapped to this region include, e.g., retinal vasculopathy with cerebral leukodystrophy (OMIM 192315), deafness, neurosensory, autosomal recessive 6 (OMIM 600971), and lung cancer.
- Nucleic acids of the present invention can be used as linkage markers, diagnostic targets, or therapeutic targets for any of the mentioned disorders, as well as any disorders or genes mapping in proximity to it.
- a mammalian polynucleotide, or fragment thereof, of the present invention is a polynucleotide having a nucleotide sequence obtainable from a natural source. It therefore includes naturally-occurring normal, naturally-occurring mutant, and naturally-occurring polymorphic alleles (e.g., SNPs), differentially-spliced transcripts, splice-variants, etc.
- By "naturally-occurring," it is meant that the polynucleotide is obtainable from a natural source, e.g., animal tissue and cells, body fluids, tissue culture cells, and forensic samples.
- Natural sources include, e.g., living cells obtained from tissues and whole organisms, tumors, cultured cell lines, including primary and immortalized cell lines.
- Naturally-occurring mutations can include deletions (e.g., a truncated amino- or carboxy-terminus), substitutions, inversions, or additions of nucleotide sequence. These genes can be detected and isolated by polynucleotide hybridization according to methods which one skilled in the art would know, e.g., as discussed below.
- A polynucleotide according to the present invention can be obtained from a variety of different sources. It can be obtained from DNA or RNA, such as polyadenylated mRNA or total RNA, e.g., isolated from tissues, cells, or whole organisms.
- the polynucleotide can be obtained directly from DNA or RNA, from a cDNA library, from a genomic library, etc.
- The polynucleotide can be obtained from a cell or tissue (e.g., from embryonic or adult tissues) at a particular stage of development, having a desired genotype, phenotype, disease status, etc.
- a polynucleotide which “codes without interruption” refers to a polynucleotide having a continuous open reading frame (“ORF”) as compared to an ORF which is interrupted by introns or other noncoding sequences.
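The notion of a polynucleotide that "codes without interruption" can be illustrated with a simple check. This is a hedged sketch, not a definition from the source: it assumes the standard genetic code, a DNA coding strand written 5′ to 3′, and a coding sequence that begins with ATG and ends with a stop codon. The function name is hypothetical. An intron or other noncoding insertion will usually, though not always, shift the frame or introduce an early stop, which this check detects.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def codes_without_interruption(cds):
    """Return True if cds reads as one continuous open reading frame.

    The sequence must start with ATG, have a length divisible by three,
    end with a stop codon, and contain no earlier in-frame stop codon.
    """
    cds = cds.upper()
    if len(cds) < 6 or len(cds) % 3 != 0 or not cds.startswith("ATG"):
        return False
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[-1] not in STOP_CODONS:
        return False
    # Any in-frame stop before the final codon interrupts the ORF.
    return not any(codon in STOP_CODONS for codon in codons[:-1])
```

For example, `codes_without_interruption("ATGGCCTAA")` passes the check, while a sequence with an early in-frame stop such as `"ATGTAAGCCTAA"` does not.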
- Polynucleotides and polypeptides can be excluded as compositions from the present invention if, e.g., listed in a publicly available database on the day this application was filed and/or disclosed in a patent application having an earlier filing or priority date than this application and/or conceived and/or reduced to practice earlier than a polynucleotide in this application.
- The phrase "an isolated polynucleotide which is SEQ ID NO" refers to an isolated nucleic acid molecule from which the recited sequence was derived (e.g., a cDNA derived from mRNA; cDNA derived from genomic DNA). Because of sequencing errors, typographical errors, etc., the actual naturally-occurring sequence may differ from a SEQ ID listed herein.
- the phrase indicates the specific molecule from which the sequence was derived, rather than a molecule having that exact recited nucleotide sequence, analogously to how a culture depository number refers to a specific cloned fragment in a cryotube.
- a polynucleotide sequence of the invention can contain the complete sequence as shown in SEQ ID NO 1, 3, 5, 7, 9, and others, degenerate sequences thereof, anti-sense, muteins thereof, genes comprising said sequences, full-length cDNAs comprising said sequences, complete genomic sequences, fragments thereof, homologs, primers, nucleic acid molecules which hybridize thereto, derivatives thereof, etc.
- The present invention also relates to genomic DNA from which the polynucleotides of the present invention can be derived.
- genomic DNA coding for a human, mouse, or other mammalian polynucleotide can be obtained routinely, for example, by screening a genomic library (e.g., a YAC library) with a polynucleotide of the present invention, or by searching nucleotide databases, such as GenBank and EMBL, for matches.
- Promoter and other regulatory regions can be identified upstream or downstream of coding and expressed RNAs, and assayed routinely for activity, e.g., by joining to a reporter gene (e.g., CAT, GFP, alkaline phosphatase, luciferase, galactosidase).
- a promoter obtained from a gene can be used, e.g., in gene therapy to obtain tissue-specific expression of a heterologous gene (e.g., coding for a therapeutic product or cytotoxin).
- 3′-untranslated sequences (as well as introns) can be used, e.g., to stabilize transcripts, to target transcripts, etc.
- a polynucleotide of the present invention can comprise additional polynucleotide sequences, e.g., sequences to enhance expression, detection, uptake, cataloging, tagging, etc.
- a polynucleotide can include only coding sequence; a coding sequence and additional non-naturally occurring or heterologous coding sequence (e.g., sequences coding for leader, signal, secretory, targeting, enzymatic, fluorescent, antibiotic resistance, and other functional or diagnostic peptides); coding sequences and non-coding sequences, e.g., untranslated sequences at either a 5′ or 3′ end, or dispersed in the coding sequence, e.g., introns.
- a polynucleotide according to the present invention also can comprise an expression control sequence operably linked to a polynucleotide as described above.
- The phrase "expression control sequence" means a polynucleotide sequence that regulates expression of a polypeptide coded for by a polynucleotide to which it is functionally (“operably”) linked. Expression can be regulated at the level of the mRNA or polypeptide.
- the expression control sequence includes mRNA-related elements and protein-related elements. Such elements include promoters, enhancers (viral or cellular), ribosome binding sequences, transcriptional terminators, etc.
- An expression control sequence is operably linked to a nucleotide coding sequence when the expression control sequence is positioned in such a manner to effect or achieve expression of the coding sequence.
- expression control sequences can include an initiation codon and additional nucleotides to place a partial nucleotide sequence of the present invention in-frame in order to produce a polypeptide (e.g., pET vectors from Promega have been designed to permit a molecule to be inserted into all three reading frames to identify the one that results in polypeptide expression).
- Expression control sequences can be heterologous or endogenous to the normal gene.
- a polynucleotide of the present invention can also comprise nucleic acid vector sequences, e.g., for cloning, expression, amplification, selection, etc. Any effective vector can be used.
- a vector is, e.g., a polynucleotide molecule which can replicate autonomously in a host cell, e.g., containing an origin of replication. Vectors can be useful to perform manipulations, to propagate, and/or obtain large quantities of the recombinant molecule in a desired host.
- a skilled worker can select a vector depending on the purpose desired, e.g., to propagate the recombinant molecule in bacteria, yeast, insect, or mammalian cells. The following vectors are provided by way of example.
- Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, Phagescript, phiX174, pBK Phagemid, pNH8A, pNH16a, pNH18Z, pNH46A (Stratagene); Bluescript KS+II (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia).
- Eukaryotic: PWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene), pSVK3, PBPV, PMSG, pSVL (Pharmacia), pCR2.1/TOPO, pCRII/TOPO, pCR4/TOPO, pTrcHisB, pCMV6-XL4, etc.
- Any other vector, e.g., plasmids, viruses, or parts thereof, may be used as long as it is replicable and viable in the desired host.
- the vector can also comprise sequences which enable it to replicate in the host whose genome is to be modified.
- Polynucleotide hybridization is useful in a variety of applications, including, in gene detection methods, for identifying mutations, for making mutations, to identify homologs in the same and different species, to identify related members of the same gene family, in diagnostic and prognostic assays, in therapeutic applications (e.g., where an antisense polynucleotide is used to inhibit expression), etc.
- the ability of two single-stranded polynucleotide preparations to hybridize together is a measure of their nucleotide sequence complementarity, e.g., base-pairing between nucleotides, such as A-T, G-C, etc.
- the invention thus also relates to polynucleotides, and their complements, which hybridize to a polynucleotide comprising a nucleotide sequence as set forth in SEQ ID NO 1, 3, 5, 7, 9, and others and genomic sequences thereof.
- a nucleotide sequence hybridizing to the latter sequence will have a complementary polynucleotide strand, or act as a template for one in the presence of a polymerase (i.e., an appropriate polynucleotide synthesizing enzyme).
- the present invention includes both strands of polynucleotide, e.g., a sense strand and an anti-sense strand.
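Because both the sense and anti-sense strands are contemplated, it can help to see how one strand is derived from the other. The sketch below is illustrative only and assumes plain DNA written 5′ to 3′ with the four unambiguous bases; the function name is hypothetical.

```python
# Watson-Crick base pairing for DNA: A pairs with T, G pairs with C.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the complementary strand, written 5' to 3'."""
    # Complement each base, then reverse so the result reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original strand, which is a quick sanity check when preparing anti-sense probes from a sense sequence.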
- Hybridization conditions can be chosen to select polynucleotides which have a desired amount of nucleotide complementarity with the nucleotide sequences set forth in SEQ ID NO 1, 3, 5, 7, 9, and others and genomic sequences thereof.
- A polynucleotide capable of hybridizing to such a sequence preferably possesses, e.g., about 70%, 75%, 80%, 85%, 87%, 90%, 92%, 95%, 97%, 99%, or 100% complementarity between the sequences.
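Percent complementarity of the kind listed above can be computed directly for two strands of equal length. The following is a minimal sketch under stated assumptions: both strands are DNA written 5′ to 3′, they pair in antiparallel orientation without gaps, and only A-T and G-C pairs count. The function name is hypothetical and is not taken from the source.

```python
_PAIRS = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}


def percent_complementarity(probe, target):
    """Percent of positions that base-pair when the strands anneal.

    Both strands are given 5' to 3'; the target is read in reverse so
    that the two strands are compared in antiparallel orientation.
    """
    if not probe or len(probe) != len(target):
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        (p, t) in _PAIRS
        for p, t in zip(probe.upper(), reversed(target.upper()))
    )
    return 100.0 * paired / len(probe)
```

A probe and its exact reverse complement score 100%; each unpaired position lowers the value by 100/length percentage points.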
- the present invention particularly relates to polynucleotide sequences which hybridize to the nucleotide sequences set forth in SEQ ID NO 1, 3, 5, 7, 9, and others or genomic sequences thereof, under low or high stringency conditions. These conditions can be used, e.g., to select corresponding homologs in non-human species.
- Polynucleotides which hybridize to polynucleotides of the present invention can be selected in various ways.
- Filter-type blots (i.e., matrices containing polynucleotide, such as nitrocellulose), glass chips, and other matrices and substrates comprising polynucleotides (short or long) of interest, can be incubated in a prehybridization solution (e.g., 6× SSC, 0.5% SDS, 100 μg/ml denatured salmon sperm DNA, 5× Denhardt's solution, and 50% formamide), at 22-68° C., overnight, and then hybridized with a detectable polynucleotide probe under conditions appropriate to achieve the desired stringency.
- When homology between the probe and target is high, a high washing temperature can be used (e.g., 65° C.). As the homology drops, lower washing temperatures are used. For salt concentrations, the lower the salt concentration, the higher the stringency. The length of the probe is another consideration. Very short probes (e.g., less than 100 base pairs) are washed at lower temperatures, even if the homology is high. With short probes, formamide can be omitted. See, e.g., Current Protocols in Molecular Biology, Chapter 6, Screening of Recombinant Libraries; Sambrook et al., Molecular Cloning, 1989, Chapter 9.
- High stringency conditions can be achieved by incubating the blot overnight (e.g., at least 12 hours) with a long polynucleotide probe in a hybridization solution containing, e.g., about 5× SSC, 0.5% SDS, 100 μg/ml denatured salmon sperm DNA and 50% formamide, at 42° C. Blots can be washed at high stringency conditions that allow, e.g., for less than 5% bp mismatch (e.g., wash twice in 0.1× SSC and 0.1% SDS for 30 min at 65° C.), i.e., selecting sequences having 95% or greater sequence identity.
- Other examples of high stringency conditions include a final wash at 65° C. in aqueous buffer containing 30 mM NaCl and 0.5% SDS.
- Another example of high stringency conditions is hybridization in 7% SDS, 0.5 M NaPO4, pH 7, 1 mM EDTA at 50° C., e.g., overnight, followed by one or more washes with a 1% SDS solution at 42° C. Whereas high stringency washes can allow for less than 5% mismatch, reduced or low stringency conditions can permit up to 20% nucleotide mismatch.
- Hybridization at low stringency can be accomplished as above, but using lower formamide conditions, lower temperatures and/or higher salt concentrations, as well as longer periods of incubation time.
- Hybridization can also be based on a calculation of melting temperature (Tm) of the hybrid formed between the probe and its target, as described in Sambrook et al.
- $T_m = 81.5 + 16.6\,\log_{10}[\mathrm{Na}^+] + 0.41\,(\%\,\mathrm{GC}) - 600/N$, where $[\mathrm{Na}^+]$ is the molar concentration of sodium ions, % GC is the percentage of GC base pairs in the probe, and N is the length.
- Hybridization can be carried out at several degrees below this temperature to ensure that the probe and target can hybridize. Mismatches can be allowed for by lowering the temperature even further.
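The melting temperature formula given above, and the practice of hybridizing some degrees below it, can be sketched as follows. The formula terms are taken from the text; the function names, the default offset of 5° C., and the rule of thumb of roughly 1° C. per 1% mismatch are assumptions introduced for illustration and are not stated in the source.

```python
import math


def percent_gc(seq):
    """Percentage of G and C bases in a probe sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def melting_temperature(na_molar, gc_percent, length):
    """Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 600/N, in degrees C."""
    return (81.5 + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent - 600.0 / length)


def hybridization_temperature(tm, degrees_below=5.0,
                              percent_mismatch=0.0,
                              degrees_per_percent=1.0):
    """Pick a temperature below Tm, lowered further to allow mismatches.

    degrees_below and degrees_per_percent are illustrative defaults
    (assumptions), not values given in the source.
    """
    return tm - degrees_below - degrees_per_percent * percent_mismatch
```

For a 20-nucleotide probe with 50% GC content in 1 M sodium, the formula gives 81.5 + 0 + 20.5 − 30 = 72° C.; allowing 5% mismatch under the assumed defaults would lower the working temperature to 62° C.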
- Stringent conditions can be selected to isolate sequences, and their complements, which have, e.g., at least about 90%, 95%, or 97%, nucleotide complementarity between the probe (e.g., a short polynucleotide of SEQ ID NO 1, 3, 5, 7, 9, and others or genomic sequences thereof) and a target polynucleotide.
- Homologs of polynucleotides of the present invention can be obtained from mammalian and non-mammalian sources according to various methods. For example, hybridization with a polynucleotide can be employed to select homologs, e.g., as described in Sambrook et al., Molecular Cloning, Chapter 11, 1989. Such homologs can have varying amounts of nucleotide and amino acid sequence identity and similarity to such polynucleotides of the present invention.
- Mammalian organisms include, e.g., mice, rats, monkeys, pigs, cows, etc.
- Non-mammalian organisms include, e.g., vertebrates, invertebrates, zebra fish, chicken, Drosophila, C. elegans, Xenopus, yeast such as S. pombe, S. cerevisiae, roundworms, prokaryotes, plants, Arabidopsis, artemia, viruses, etc.
- the degree of nucleotide sequence identity between human and mouse can be about, e.g. 70% or more, 85% or more for open reading frames, etc.
- Alignments can be accomplished by using any effective algorithm.
- Pairs of nucleotide sequences can be aligned, e.g., by the Wilbur-Lipman method (e.g., Wilbur and Lipman, Proc. Natl. Acad. Sci., 80:726-730, 1983) or the Martinez/Needleman-Wunsch method (e.g., Martinez, Nucleic Acid Res., 11:4629-4634, 1983).
- The minimum match can be set at 9, gap penalty at 1.10, and gap length penalty at 0.33.
- Similarity index for related genes at the nucleotide level in accordance with the present invention can be greater than 70%, 80%, 85%, 90%, 95%, 99%, or more. Pairs of protein sequences can be aligned by the Lipman-Pearson method (e.g., Lipman and Pearson, Science, 227:1435-1441, 1985) with k-tuple set at 2, gap penalty set at 4, and gap length penalty set at 12.
- Results can be expressed as percent similarity index, where related genes at the amino acid level in accordance with the present invention can be greater than 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or more.
- Various commercial and free sources of alignment programs are available, e.g., MegAlign by DNA Star, BLAST (National Center for Biotechnology Information), BCM (Baylor College of Medicine) Launcher, etc.
- BLAST can be used to calculate amino acid sequence identity, amino acid sequence homology, and nucleotide sequence identity.
- Percent sequence identity can also be determined by other conventional methods, e.g., as described in Altschul et al., Bull. Math. Bio. 48: 603-616, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-10919, 1992.
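Once two sequences have been aligned by any of the methods above, percent identity is a simple count over the aligned columns. The sketch below assumes the alignment has already been produced elsewhere and is supplied as two equal-length strings with "-" marking gaps. Dividing by the number of aligned columns is one common convention and is an assumption here; alignment programs differ in the denominator they use. The function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns holding identical residues.

    Columns where both sequences have a gap are ignored; a residue
    opposite a gap counts as a non-identical column.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(a == b for a, b in columns if a != gap)
    return 100.0 * matches / len(columns)
```

The same function works for nucleotide or amino acid alignments, since it only compares characters column by column.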
- a polynucleotide of the present invention can comprise any continuous nucleotide sequence of SEQ ID NO 1, 3, 5, 7, 9, and others, sequences which share sequence identity thereto, or complements thereof.
- probe refers to any substance that can be used to detect, identify, isolate, etc., another substance.
- A polynucleotide probe is comprised of nucleic acid and can be used to detect, identify, etc., other nucleic acids, such as DNA and RNA.
- polynucleotides can be of any desired size that is effective to achieve the specificity desired.
- a probe can be from about 7 or 8 nucleotides to several thousand nucleotides, depending upon its use and purpose.
- A probe used as a PCR primer can be shorter than a probe used in an ordered array of polynucleotide probes.
- Probe sizes vary, and the invention is not limited in any way by their size, e.g., probes can be from about 7-2000 nucleotides, 7-1000, 8-700, 8-600, 8-500, 8-400, 8-300, 8-150, 8-100, 8-75, 7-50, 10-25, 14-16, at least about 8, at least about 10, at least about 15, at least about 25, etc.
- the polynucleotides can have non-naturally-occurring nucleotides, e.g., inosine, AZT, 3TC, etc.
- The polynucleotides can have 100% sequence identity or complementarity to a sequence of SEQ ID NO 1, 3, 5, 7, 9, and others, or they can have mismatches or nucleotide substitutions, e.g., 1, 2, 3, 4, or 5 substitutions.
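A probe that tolerates a small number of substitutions can be located in a target sequence by a sliding-window comparison. This is an illustrative sketch only: it compares sequence identity on a single strand, allows substitutions but not insertions or deletions, and uses a hypothetical function name. It does not model hybridization chemistry.

```python
def find_probe_sites(probe, target, max_mismatches=0):
    """Return (offset, mismatches) for each window within the limit.

    Offsets are 0-based positions in target where the probe aligns with
    at most max_mismatches substitutions.
    """
    probe, target = probe.upper(), target.upper()
    if not probe:
        raise ValueError("probe must not be empty")
    hits = []
    for offset in range(len(target) - len(probe) + 1):
        window = target[offset:offset + len(probe)]
        mismatches = sum(p != t for p, t in zip(probe, window))
        if mismatches <= max_mismatches:
            hits.append((offset, mismatches))
    return hits
```

Raising `max_mismatches` from 0 toward 5 mirrors the range of substitutions mentioned above and shows how quickly additional, possibly non-specific, sites begin to match a short probe.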
- the probes can be single-stranded or double-stranded.
- The polynucleotides can be present in a kit, where the kit includes, e.g., one or more polynucleotides, a desired buffer (e.g., phosphate, tris, etc.), detection compositions, RNA or cDNA from different tissues to be used as controls, libraries, etc.
- the polynucleotide can be labeled or unlabeled, with radioactive or non-radioactive labels as known in the art.
- Kits can comprise one or more pairs of polynucleotides for amplifying nucleic acids specific for human TARPP, e.g., comprising a forward and reverse primer effective in PCR. These include both sense and anti-sense orientations. For instance, in PCR-based methods (such as RT-PCR), a pair of primers is typically used, one having a sense sequence and the other having an antisense sequence.
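The sense and antisense orientation of a primer pair can be made concrete with a short sketch. It assumes the region to be amplified is supplied as the sense strand, 5′ to 3′, and simply takes the ends of that region as primers. The function names and the default primer length of 20 are hypothetical; real primer design would also weigh melting temperature, GC content, and specificity, which are not modeled here.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary DNA strand, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def primer_pair(template, primer_length=20):
    """Return (forward, reverse) primers flanking the whole template.

    The forward primer has the sense sequence of the 5' end; the reverse
    primer is the antisense sequence of the 3' end.
    """
    template = template.upper()
    if len(template) < 2 * primer_length:
        raise ValueError("template too short for two primers")
    forward = template[:primer_length]
    reverse = reverse_complement(template[-primer_length:])
    return forward, reverse
```

With a toy template, `primer_pair("ATGGCCAAATTTGACTGA", primer_length=6)` yields a sense forward primer from the first six bases and an antisense reverse primer derived from the last six.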
- Another aspect of the present invention is a nucleotide sequence that is specific to, or for, a selective polynucleotide.
- the phrases “specific for” or “specific to” a polynucleotide have a functional meaning that the polynucleotide can be used to identify the presence of one or more target genes in a sample. It is specific in the sense that it can be used to detect polynucleotides above background noise (“non-specific binding”).
- a specific sequence is a defined order of nucleotides which occurs in the polynucleotide, e.g., in the nucleotide sequences of SEQ ID NO 1, 3, 5, 7, 9, and others.
- a probe or mixture of probes can comprise a sequence or sequences that are specific to a plurality of target sequences, e.g., where the sequence is a consensus sequence, a functional domain, etc., e.g., capable of recognizing a family of related genes. Such sequences can be used as probes in any of the methods described herein or incorporated by reference. Both sense and antisense nucleotide sequences are included.
- a specific polynucleotide according to the present invention can be determined routinely.
- a polynucleotide comprising a specific sequence can be used as a hybridization probe to identify the presence of, e.g., human or mouse polynucleotide, in a sample comprising a mixture of polynucleotides, e.g., on a Northern blot.
- Hybridization can be performed under high stringency conditions (see above) to select polynucleotides (and their complements which can contain the coding sequence) having at least 90%, 95%, 99%, etc., identity (i.e., complementarity) to the probe, but less stringent conditions can also be used.
- A specific polynucleotide sequence can also be fused in-frame, at either its 5′ or 3′ end, to various nucleotide sequences as mentioned throughout the patent, including coding sequences for enzymes, detectable markers, GFP, etc., expression control sequences, etc.
- a polynucleotide probe can be used in gene detection and hybridization methods as already described.
- a specific polynucleotide probe can be used to detect whether a particular tissue or cell-type is present in a target sample.
- a selective polynucleotide can be chosen which is characteristic of the desired target tissue.
- Such polynucleotide is preferably chosen so that it is expressed or displayed in the target tissue, but not in other tissues which are present in the sample.
- a specific polynucleotide probe can be designed which hybridizes (if hybridization is the basis of the assay) under the hybridization conditions to the selective polynucleotide, whereby the presence of the selective polynucleotide can be determined.
- Probes which are specific for polynucleotides of the present invention can also be prepared using transcription-based systems, e.g., incorporating an RNA polymerase promoter into a selective polynucleotide of the present invention, and then transcribing anti-sense RNA using the polynucleotide as a template. See, e.g., U.S. Pat. No. 5,545,522.
- a polynucleotide according to the present invention can comprise, e.g., DNA, RNA, synthetic polynucleotide, peptide polynucleotide, modified nucleotides, dsDNA, ssDNA, ssRNA, dsRNA, and mixtures thereof.
- a polynucleotide can be single- or double-stranded, triplex, DNA:RNA, duplexes, comprise hairpins, and other secondary structures, etc.
- Nucleotides comprising a polynucleotide can be joined via various known linkages, e.g., ester, sulfamate, sulfamide, phosphorothioate, phosphoramidate, methylphosphonate, carbamate, etc., depending on the desired purpose, e.g., resistance to nucleases, such as RNAse H, improved in vivo stability, etc. See, e.g., U.S. Pat. No. 5,378,825. Any desired nucleotide or nucleotide analog can be incorporated, e.g., 6-mercaptoguanine, 8-oxo-guanine, etc.
- polynucleotides can also be attached to solid supports, e.g., nitrocellulose, magnetic or paramagnetic microspheres (e.g., as described in U.S. Pat. Nos.
- A polynucleotide according to the present invention can be labeled according to any desired method.
- The polynucleotide can be labeled using radioactive tracers such as ³²P, ³⁵S, ³H, or ¹⁴C, to mention some commonly used tracers.
- the radioactive labeling can be carried out according to any method, such as, for example, terminal labeling at the 3′ or 5′ end using a radiolabeled nucleotide, polynucleotide kinase (with or without dephosphorylation with a phosphatase) or a ligase (depending on the end to be labeled).
- a non-radioactive labeling can also be used, combining a polynucleotide of the present invention with residues having immunological properties (antigens, haptens), a specific affinity for certain reagents (ligands), properties enabling detectable enzyme reactions to be completed (enzymes or coenzymes, enzyme substrates, or other substances involved in an enzymatic reaction), or characteristic physical properties, such as fluorescence or the emission or absorption of light at a desired wavelength, etc.
- Another aspect of the present invention relates to methods and processes for detecting human TARPP. Detection methods have a variety of applications, including diagnostic, prognostic, forensic, and research applications.
- a polynucleotide in accordance with the present invention can be used as a “probe.”
- The term “probe” or “polynucleotide probe” has its customary meaning in the art, e.g., a polynucleotide which is effective to identify (e.g., by hybridization), when used in an appropriate process, the presence of a target polynucleotide for which it is designed.
- Identification can involve simply determining presence or absence, or it can be quantitative, e.g., in assessing amounts of a gene or gene transcript present in a sample.
- Probes can be useful in a variety of ways, such as for diagnostic purposes, to identify homologs, and to detect, quantitate, or isolate a polynucleotide of the present invention in a test sample.
- Assays can be utilized which permit quantification and/or presence/absence detection of a target nucleic acid in a sample. Assays can be performed at the single-cell level, or in a sample comprising many cells, where the assay is “averaging” expression over the entire collection of cells and tissue present in the sample. Any suitable assay format can be used, including, but not limited to, e.g., Southern blot analysis, Northern blot analysis, polymerase chain reaction (“PCR”) (e.g., Saiki et al., Science , 241:53, 1988; U.S. Pat. Nos.
- PCR Protocols: A Guide to Methods and Applications, Innis et al., eds., Academic Press, New York, 1990
- reverse transcriptase polymerase chain reaction (“RT-PCR”)
- rapid amplification of cDNA ends (“RACE”)
- ligase chain reaction (“LCR”)
- RNA fingerprinting techniques, nucleic acid sequence based amplification (“NASBA”) and other transcription based amplification systems (e.g., U.S. Pat. Nos. 5,409,818 and 5,554,527; WO 88/10315), polynucleotide arrays (e.g., U.S. Pat. Nos.
- any method suitable for single cell analysis of gene or protein expression can be used, including in situ hybridization, immunocytochemistry, MACS, FACS, flow cytometry, etc.
- Expression products can be measured using antibodies, PCR, or other types of nucleic acid amplification (e.g., Brady et al., Methods Mol. & Cell. Biol. 2, 17-25, 1990; Eberwine et al., Proc. Natl. Acad. Sci., 89, 3010-3014, 1992; U.S. Pat. No. 5,723,290).
- polynucleotide is labeled, or comprises a particular nucleotide type useful for detection.
- the present invention includes such modified polynucleotides that are necessary to carry out such methods.
- polynucleotides can be DNA, RNA, DNA: RNA hybrids, PNA, etc., and can comprise any modification or substituent which is effective to achieve detection.
- Detection can be desirable for a variety of different purposes, including research, diagnostic, prognostic, and forensic.
- For diagnostic purposes, it may be desirable to identify the presence or quantity of a polynucleotide sequence in a sample, where the sample is obtained from tissue, cells, body fluids, etc.
- the present invention relates to a method of detecting a polynucleotide comprising, contacting a target polynucleotide in a test sample with a polynucleotide probe under conditions effective to achieve hybridization between the target and probe; and detecting hybridization.
- Any test sample in which it is desired to identify a polynucleotide or polypeptide thereof can be used, including, e.g., blood, urine, saliva, stool (for extracting nucleic acid, see, e.g., U.S. Pat. No. 6,177,251), swabs comprising tissue, biopsied tissue, tissue sections, cultured cells, etc.
- Detection can be accomplished in combination with polynucleotide probes for other genes, e.g., genes which are expressed in other disease states, tissues, cells, such as brain, heart, kidney, spleen, thymus, liver, stomach, small intestine, colon, muscle, lung, testis, placenta, pituitary, thyroid, skin, adrenal gland, pancreas, salivary gland, uterus, ovary, prostate gland, peripheral blood cells (T-cells, lymphocytes, etc.), embryo, normal breast fat, adult and embryonic stem cells, specific cell-types, such as endothelial, epithelial, myocytes, adipose, luminal epithelial, basoepithelial, myoepithelial, stromal cells, etc.
- Polynucleotides can be used in a wide range of methods and compositions, including for detecting, diagnosing, staging, grading, assessing, prognosticating, etc., diseases and disorders associated with human TARPP, for monitoring or assessing therapeutic and/or preventative measures, in ordered arrays, etc. Any method of detecting genes and polynucleotides of SEQ ID NO 1, 3, 5, 7, 9, and others can be used; certainly, the present invention is not to be limited by how such methods are implemented.
- the present invention relates to methods of detecting human TARPP in a sample comprising nucleic acid.
- Such methods can comprise one or more of the following steps in any effective order, e.g., contacting said sample with a polynucleotide probe under conditions effective for said probe to hybridize specifically to nucleic acid in said sample, and detecting the presence or absence of probe hybridized to nucleic acid in said sample, wherein said probe is a polynucleotide which is SEQ ID NO 1, 3, 5, 7, 9, and others, a polynucleotide having, e.g., about 70%, 80%, 85%, 90%, 95%, 99%, or more sequence identity thereto, effective or specific fragments thereof, or complements thereto.
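- The sequence identity thresholds mentioned above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned to equal length (with "-" marking gaps) and counts identity over all aligned columns; other definitions of percent identity exist, and the function names `percent_identity` and `meets_threshold` are illustrative only, not part of the disclosure.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same base.

    Assumption: the inputs are pre-aligned, equal-length strings using '-'
    for gaps. Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A gap opposite a base counts as a mismatch.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True when the identity is at or above the chosen threshold (in percent)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

- For instance, `meets_threshold(a, b, 95.0)` would test a candidate probe against the 95% level; which threshold is appropriate depends on the application.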
- the detection method can be applied to any sample, e.g., cultured primary, secondary, or established cell lines, tissue biopsy, blood, urine, stool, cerebral spinal fluid, and other bodily fluids, for any purpose.
- Contacting the sample with probe can be carried out by any effective means in any effective environment. It can be accomplished in a solid, liquid, frozen, gaseous, amorphous, solidified, coagulated, or colloid matrix, etc., or in mixtures thereof.
- a probe in an aqueous medium can be contacted with a sample which is also in an aqueous medium, or which is affixed to a solid matrix, or vice-versa.
- the term “effective conditions” means, e.g., the particular milieu in which the desired effect is achieved.
- a milieu includes, e.g., appropriate buffers, oxidizing agents, reducing agents, pH, co-factors, temperature, ion concentrations, suitable age and/or stage of cell (such as, in particular part of the cell cycle, or at a particular stage where particular genes are being expressed) where cells are being used, culture conditions (including substrate, oxygen, carbon dioxide, etc.).
- the probe and sample can be combined such that the resulting conditions are functional for said probe to hybridize specifically to nucleic acid in said sample.
- The phrase “hybridize specifically” indicates that the hybridization between single-stranded polynucleotides is based on nucleotide sequence complementarity.
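- Sequence complementarity can be illustrated computationally. The sketch below is a simplification that models only perfect Watson-Crick pairing of DNA and ignores stringency, mismatches, and thermodynamics; the function names are hypothetical. It locates the positions on a single-stranded target where a probe could pair along its full length.

```python
from typing import List

# Watson-Crick complements for DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_probe_sites(target: str, probe: str) -> List[int]:
    """0-based start positions on `target` where `probe` pairs perfectly.

    A probe pairs with the stretch of the target that equals the probe's
    reverse complement, so that stretch is searched for directly.
    """
    site = reverse_complement(probe).upper()
    target = target.upper()
    positions = []
    start = target.find(site)
    while start != -1:
        positions.append(start)
        start = target.find(site, start + 1)
    return positions
```

- An empty result means the probe has no perfectly complementary site on that strand of the target.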
- The effective conditions are selected such that the probe hybridizes to a preselected and/or definite target nucleic acid in the sample. For instance, if detection of a polynucleotide set forth in SEQ ID NO 1, 3, 5, 7, 9, and others is desired, a probe can be selected which can hybridize to such target gene under highly stringent conditions, without significant hybridization to other genes in the sample.
- the effective hybridization conditions can be less stringent, and/or the probe can comprise codon degeneracy, such that a homolog is detected in the sample.
- the methods can be carried out by any effective process, e.g., by Northern blot analysis, polymerase chain reaction (PCR), reverse transcriptase PCR, RACE PCR, in situ hybridization, etc., as indicated above.
- two or more probes are generally used.
- One probe can be specific for a defined sequence which is characteristic of a selective polynucleotide, but the other probe can be specific for the selective polynucleotide, or specific for a more general sequence, e.g., a sequence such as polyA which is characteristic of mRNA, a sequence which is specific for a promoter, ribosome binding site, or other transcriptional features, a consensus sequence (e.g., representing a functional domain).
- 5′ and 3′ probes e.g., polyA, Kozak, etc.
- the probes can also be referred to as “primers” in that they can prime a DNA polymerase reaction.
- the present invention also relates to determining the amounts at which polynucleotides of the present invention are expressed in sample and determining the differential expression of such polynucleotides in samples.
- Such methods can involve substantially the same steps as described above for presence/absence detection, e.g., contacting with probe, hybridizing, and detecting hybridized probe, but using more quantitative methods and/or comparisons to standards.
- the amount of hybridization between the probe and target can be determined by any suitable methods, e.g., PCR, RT-PCR, RACE PCR, Northern blot, polynucleotide microarrays, Rapid-Scan, etc., and includes both quantitative and qualitative measurements. For further details, see the hybridization methods described above and below. Determining by such hybridization whether the target is differentially expressed (e.g., up-regulated or down-regulated) in the sample can also be accomplished by any effective means. For instance, the target's expression pattern in the sample can be compared to its pattern in a known standard, such as in a normal tissue, or it can be compared to another gene in the same sample.
- a second sample when utilized for the comparison, it can be a sample of normal tissue that is known not to contain diseased cells.
- the comparison can be performed on samples which contain the same amount of RNA (such as polyadenylated RNA or total RNA), or, on RNA extracted from the same amounts of starting tissue.
- Hybridization can also be compared to a second target in the same tissue sample. Experiments can be performed that determine a ratio between the target nucleic acid and a second nucleic acid (a standard or control), e.g., in a normal tissue. When the ratio between the target and control is substantially the same in a normal tissue and in the sample, the sample is determined or diagnosed not to contain cancer cells.
- the sample is determined to contain cancer cells.
- the approaches can be combined, and one or more second samples, or second targets can be used. Any second target nucleic acid can be used as a comparison, including “housekeeping” genes, such as beta-actin, alcohol dehydrogenase, or any other gene whose expression does not vary depending upon the disease status of the cell.
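- The ratio comparison described above can be sketched as follows. This is a minimal illustration, not a validated diagnostic procedure: the signal values are assumed to be hybridization intensities on a linear scale, and the two-fold tolerance used to decide what counts as "substantially the same" is an arbitrary assumption, as are the function names.

```python
from typing import Tuple


def normalized_ratio(target_signal: float, control_signal: float) -> float:
    """Target signal divided by the control (e.g., housekeeping gene) signal."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return target_signal / control_signal


def compare_to_normal(
    sample_target: float,
    sample_control: float,
    normal_target: float,
    normal_control: float,
    tolerance_fold: float = 2.0,  # assumption: illustrative cutoff only
) -> Tuple[float, str]:
    """Return (fold change, call) for a test sample relative to normal tissue."""
    sample_ratio = normalized_ratio(sample_target, sample_control)
    normal_ratio = normalized_ratio(normal_target, normal_control)
    if normal_ratio <= 0:
        raise ValueError("normal target signal must be positive")
    fold_change = sample_ratio / normal_ratio
    if fold_change >= tolerance_fold:
        call = "up-regulated"
    elif fold_change <= 1.0 / tolerance_fold:
        call = "down-regulated"
    else:
        call = "substantially the same"
    return fold_change, call
```

- Normalizing each sample to its own control before comparing samples keeps differences in RNA input or loading from being mistaken for differential expression.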
- Polynucleotides of the present invention can also be utilized to identify mutant alleles, SNPs, gene rearrangements and modifications, and other polymorphisms of the wild-type gene. Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated from cancers that are known, or suspected to have, a genetic component. Identification of such genes can be carried out routinely (see, above for more guidance), e.g., using PCR, hybridization techniques, direct sequencing, mismatch reactions (see, e.g., above), RFLP analysis, SSCP (e.g., Orita et al., Proc. Natl. Acad.
- a polynucleotide having a sequence selected from SEQ ID NO 1, 3, 5, 7, 9, and others is used as a probe.
- The selected mutant alleles, SNPs, polymorphisms, etc., can be used diagnostically to determine whether a subject has, or is susceptible to, a disorder associated with human TARPP, as well as to design therapies and predict the outcome of the disorder.
- Methods involve, e.g., diagnosing a disorder associated with human TARPP or determining susceptibility to a disorder, comprising, detecting the presence of a mutation in a gene represented by a polynucleotide selected from SEQ ID NO 1, 3, 5, 7, 9, and others.
- the detecting can be carried out by any effective method, e.g., obtaining cells from a subject, determining the gene sequence or structure of a target gene (using, e.g., mRNA, cDNA, genomic DNA, etc), comparing the sequence or structure of the target gene to the structure of the normal gene, whereby a difference in sequence or structure indicates a mutation in the gene in the subject.
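- The comparison of a test sequence with the normal sequence can be sketched as below. The example handles only single-base substitutions between sequences of equal length; insertions and deletions would first require an alignment step, which is not shown. The function name and the short example sequences are hypothetical.

```python
from typing import List, Tuple


def find_substitutions(normal: str, test: str) -> List[Tuple[int, str, str]]:
    """List (1-based position, normal base, test base) for each difference.

    Assumption: both sequences cover the same region and have equal length.
    """
    if len(normal) != len(test):
        raise ValueError("sequences must have equal length; align them first")
    return [
        (index + 1, n, t)
        for index, (n, t) in enumerate(zip(normal.upper(), test.upper()))
        if n != t
    ]
```

- An empty list indicates no difference over the compared region; any entry marks a candidate mutation or polymorphism to be confirmed by other means.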
- Polynucleotides can also be used to test for mutations, SNPs, polymorphisms, etc., e.g., using mismatch DNA repair technology as described in U.S. Pat. Nos. 5,683,877; 5,656,430; Wu et al., Proc. Natl. Acad. Sci., 89:8779-8783, 1992.
- the present invention also relates to methods of detecting polymorphisms in human TARPP, comprising, e.g., comparing the structure of: genomic DNA comprising all or part of human TARPP, mRNA comprising all or part of human TARPP, cDNA comprising all or part of human TARPP, or a polypeptide comprising all or part of human TARPP, with the structure of human TARPP set forth in SEQ ID NOS. 1-8.
- the methods can be carried out on a sample from any source, e.g., cells, tissues, body fluids, blood, urine, stool, hair, egg, sperm, cerebral spinal fluid, etc.
- Steps for comparing the structure include, but are not limited to, comparing restriction maps, nucleotide sequences, amino acid sequences, RFLPs, DNase sites, DNA methylation fingerprints (e.g., U.S. Pat. No. 6,214,556), protein cleavage sites, molecular weights, electrophoretic mobilities, charges, ion mobility, etc., between a standard human TARPP and a test human TARPP.
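- A restriction map comparison of the kind listed above can be sketched in silico. The example assumes a linear DNA molecule and uses EcoRI (recognition site GAATTC, cutting after the first G) purely as an illustration; the function names and test sequences are hypothetical and do not represent human TARPP sequence.

```python
from typing import List


def fragment_lengths(seq: str, site: str = "GAATTC", cut_offset: int = 1) -> List[int]:
    """Fragment lengths from digesting a linear sequence, largest first.

    `cut_offset` is the number of bases into the recognition site at which
    the top strand is cut (1 for EcoRI, which cuts G^AATTC).
    """
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    edges = [0] + cuts + [len(seq)]
    return sorted((b - a for a, b in zip(edges, edges[1:])), reverse=True)


def rflp_differs(standard: str, test: str, site: str = "GAATTC") -> bool:
    """True when the two sequences give different fragment patterns."""
    return fragment_lengths(standard, site) != fragment_lengths(test, site)
```

- A base change that destroys or creates a recognition site alters the fragment pattern, which is the basis of an RFLP comparison between a standard and a test sequence.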
- “Structure” can refer to any physical characteristics or configurations which can be used to distinguish between nucleic acids and polypeptides. The methods and instruments used to accomplish the comparing step depend upon the physical characteristics which are to be compared.
- sequencing machines (both amino acid and polynucleotide)
- electrophoresis
- mass spectrometer (U.S. Pat. Nos. 6,093,541, 6,002,127)
- liquid chromatography (HPLC, etc.)
- “all or part” of the gene or polypeptide can be compared. For example, if nucleotide sequencing is utilized, the entire gene can be sequenced, including promoter, introns, and exons, or only parts of it can be sequenced and compared, e.g., exon 1, exon 2, etc.
- Mutated polynucleotide sequences of the present invention are useful for various purposes, e.g., to create mutations of the polypeptides they encode, to identify functional regions of genomic DNA, to produce probes for screening libraries, etc. Mutagenesis can be carried out routinely according to any effective method, e.g., oligonucleotide-directed (Smith, M., Ann. Rev. Genet .
- Desired sequences can also be produced by the assembly of target sequences using mutually priming oligonucleotides (Uhlmann, Gene , 71:29-40, 1988).
- Analysis of the three-dimensional structure of the human TARPP polypeptide can be used to guide and facilitate making mutants which affect polypeptide activity.
- Sites of substrate-enzyme interaction or other biological activities can also be determined by analysis of crystal structure as determined by such techniques as nuclear magnetic resonance, crystallography or photoaffinity labeling. See, for example, de Vos et al., Science 255:306-312, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodawer et al., FEBS Lett. 309:59-64, 1992.
- libraries of human TARPP and fragments thereof can be used for screening and selection of human TARPP variants.
- A library of coding sequences can be generated by treating a double-stranded DNA with a nuclease under conditions where the nicking occurs, e.g., only once per molecule, denaturing the double-stranded DNA, renaturing it to form double-stranded DNA that can include sense/antisense pairs from different nicked products, removing single-stranded portions from reformed duplexes by treatment with S1 nuclease, and ligating the resulting DNAs into an expression vector.
- expression libraries can be made comprising “mutagenized” human TARPP. The entire coding sequence or parts thereof can be used.
- a polynucleotide according to the present invention can be expressed in a variety of different systems, in vitro and in vivo, according to the desired purpose.
- a polynucleotide can be inserted into an expression vector, introduced into a desired host, and cultured under conditions effective to achieve expression of a polypeptide coded for by the polynucleotide, to search for specific binding partners.
- Effective conditions include any culture conditions which are suitable for achieving production of the polypeptide by the host cell, including effective temperatures, pH, medium, additives to the media in which the host cell is cultured (e.g., additives which amplify or induce expression such as butyrate, or methotrexate if the coding polynucleotide is adjacent to a dhfr gene), cycloheximide, cell densities, culture dishes, etc.
- a polynucleotide can be introduced into the cell by any effective method including, e.g., naked DNA, calcium phosphate precipitation, electroporation, injection, DEAE-Dextran mediated transfection, fusion with liposomes, association with agents which enhance its uptake into cells, viral transfection.
- a cell into which a polynucleotide of the present invention has been introduced is a transformed host cell.
- the polynucleotide can be extrachromosomal or integrated into a chromosome(s) of the host cell. It can be stable or transient.
- An expression vector is selected for its compatibility with the host cell.
- Host cells include, mammalian cells, e.g., COS, CV1, BHK, CHO, HeLa, LTK, NIH 3T3, CNS neural stem cells (e.g., U.S. Pat. No.
- S. frugiperda, Drosophila; bacteria, such as E. coli, Streptococcus, Bacillus; yeast, such as Saccharomyces, S. cerevisiae; fungal cells; plant cells; embryonic or adult stem cells (e.g., mammalian, such as mouse or human).
- Expression control sequences are similarly selected for host compatibility and a desired purpose, e.g., high copy number, high amounts, induction, amplification, controlled expression.
- Other sequences which can be employed include enhancers such as from SV40, CMV, RSV, inducible promoters, cell-type specific elements, or sequences which allow selective or specific cell expression.
- Promoters that can be used to drive its expression include, e.g., the endogenous promoter, MMTV, SV40, trp, lac, tac, or T7 promoters for bacterial hosts; or alpha factor, alcohol oxidase, or PGH promoters for yeast.
- RNA promoters can be used to produce RNA transcripts, such as T7 or SP6.
- heterologous means that the gene has been introduced into the cell line by the “hand-of-man.” Introduction of a gene into a cell line is discussed above.
- the transfected (or transformed) cell expressing the gene can be lysed or the cell line can be used intact.
- A polynucleotide can contain codons found in a naturally-occurring gene, transcript, or cDNA, e.g., as set forth in SEQ ID NO 1, 3, 5, 7, 9, and others, or it can contain degenerate codons coding for the same amino acid sequences. For instance, it may be desirable to change the codons in the sequence to optimize the sequence for expression in a desired host. See, e.g., U.S. Pat. Nos. 5,567,600 and 5,567,862.
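- Codon degeneracy can be illustrated with a short sketch that uses the standard genetic code. The `HYPOTHETICAL_PREFERRED` table below is invented for illustration and does not reflect measured codon usage of any host; the function names are likewise assumptions. The sketch shows that swapping synonymous codons changes the nucleotide sequence while leaving the encoded amino acid sequence unchanged.

```python
from itertools import product
from typing import Dict

# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE: Dict[str, str] = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}

# Hypothetical host preferences (amino acid -> codon), for illustration only.
HYPOTHETICAL_PREFERRED: Dict[str, str] = {"A": "GCC", "S": "AGC"}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence into one-letter amino acids ('*' = stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna: str, preferred: Dict[str, str]) -> str:
    """Replace each codon with the preferred synonymous codon, where one is given."""
    dna = dna.upper()
    protein = translate(dna)
    recoded = []
    for index, aa in enumerate(protein):
        original = dna[3 * index:3 * index + 3]
        choice = preferred.get(aa, original)
        if CODON_TABLE.get(choice) != aa:
            raise ValueError(f"{choice!r} does not encode {aa!r}")
        recoded.append(choice)
    return "".join(recoded)
```

- Checking that `translate` gives the same result before and after `recode` is a simple safeguard that only synonymous changes were made.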
- a polypeptide according to the present invention can be recovered from natural sources, transformed host cells (culture medium or cells) according to the usual methods, including, detergent extraction (e.g., non-ionic detergent, Triton X-100, CHAPS, octylglucoside, Igepal CA-630), ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, hydroxyapatite chromatography, lectin chromatography, gel electrophoresis. Protein refolding steps can be used, as necessary, in completing the configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for purification steps.
- Another approach is to express the polypeptide recombinantly with an affinity tag (Flag epitope, HA epitope, myc epitope, 6×His, maltose binding protein, chitinase, etc.) and then purify by anti-tag antibody-conjugated affinity chromatography.
- the present invention also relates to antibodies, and other specific-binding partners, which are specific for polypeptides encoded by polynucleotides of the present invention, e.g., human TARPP.
- Antibodies e.g., polyclonal, monoclonal, recombinant, chimeric, humanized, single-chain, Fab, and fragments thereof, can be prepared according to any desired method. See, also, screening recombinant immunoglobulin libraries (e.g., Orlandi et al., Proc. Natl. Acad.
- the antibodies can be IgM, IgG, subtypes, IgG2a, IgG1, etc.
- Antibodies and immune responses can also be generated by administering naked DNA. See, e.g., U.S. Pat. Nos. 5,703,055; 5,589,466; 5,580,859.
- Antibodies can be used from any source, including, goat, rabbit, mouse, chicken (e.g., IgY; see, Duan, W0/029444 for methods of making antibodies in avian hosts, and harvesting the antibodies from the eggs).
- An antibody specific for a polypeptide means that the antibody recognizes a defined sequence of amino acids within or including the polypeptide.
- Other specific binding partners include, e.g., aptamers and PNA.
- antibodies can be prepared against specific epitopes or domains of human TARPP, e.g., 1-161, 88-161, 267-300, 312-331, comprising amino acid 312, and comprising any of the amino acid differences between mouse and human as shown in FIG. 3.
- Methods for preparing polyclonal antibodies are well-known to those skilled in the art. See, for example, Green et al., Production of Polyclonal Antisera, in IMMUNOCHEMICAL PROTOCOLS (Manson, ed.), pages 1-5 (Humana Press 1992); Coligan et al., Production of Polyclonal Antisera in Rabbits, Rats, Mice and Hamsters, in CURRENT PROTOCOLS IN IMMUNOLOGY, section 2.4.1 (1992). The preparation of monoclonal antibodies likewise is conventional.
- Antibodies can also be humanized, e.g., where they are to be used therapeutically.
- Humanized monoclonal antibodies are produced by transferring mouse complementarity determining regions from heavy and light variable chains of the mouse immunoglobulin into a human variable domain, and then substituting human residues in the framework regions of the murine counterparts.
- the use of antibody components derived from humanized monoclonal antibodies obviates potential problems associated with the immunogenicity of murine constant regions.
- General techniques for cloning murine immunoglobulin variable domains are described, for example, by Orlandi et al., Proc. Nat'l Acad. Sci. USA 86:3833 (1989), which is hereby incorporated in its entirety by reference.
- Antibodies of the invention also may be derived from human antibody fragments isolated from a combinatorial immunoglobulin library. See, for example, Barbas et al., METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, VOL. 2, page 119 (1991); Winter et al., Ann. Rev. Immunol. 12: 433 (1994).
- Cloning and expression vectors that are useful for producing a human immunoglobulin phage library can be obtained commercially, for example, from STRATAGENE Cloning Systems (La Jolla, Calif.).
- antibodies of the present invention may be derived from a human monoclonal antibody.
- Such antibodies are obtained from transgenic mice that have been “engineered” to produce specific human antibodies in response to antigenic challenge.
- elements of the human heavy and light chain loci are introduced into strains of mice derived from embryonic stem cell lines that contain targeted disruptions of the endogenous heavy and light chain loci.
- the transgenic mice can synthesize human antibodies specific for human antigens and can be used to produce human antibody-secreting hybridomas.
- Methods for obtaining human antibodies from transgenic mice are described, e.g., in Green et al., Nature Genet. 7:13 (1994); Lonberg et al., Nature 368:856 (1994); and Taylor et al., Int. Immunol. 6:579 (1994).
- Antibody fragments of the present invention can be prepared by proteolytic hydrolysis of the antibody or by expression in E. coli of nucleic acid encoding the fragment.
- Antibody fragments can be obtained by pepsin or papain digestion of whole antibodies by conventional methods.
- Antibody fragments can be produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment denoted F(ab′)2.
- This fragment can be further cleaved using a thiol reducing agent, and optionally a blocking group for the sulfhydryl groups resulting from cleavage of disulfide linkages, to produce 3.5S Fab′ monovalent fragments.
- An enzymatic cleavage using papain produces two monovalent Fab fragments and an Fc fragment directly.
- These methods are described, for example, by Goldenberg, U.S. Pat. Nos. 4,036,945 and 4,331,647, and references contained therein. These patents are hereby incorporated in their entireties by reference. See also Nisonoff et al., Arch. Biochem. Biophys. 89:230 (1960); Porter, Biochem. J. 73:119 (1959); Edelman et al., METHODS IN ENZYMOLOGY, VOL. 1, page 422 (Academic Press 1967); and Coligan et al. at sections 2.8.1-2.8.10 and 2.10.1-2.10.4.
- Fv fragments comprise an association of VH and VL chains. This association may be noncovalent, as described in Inbar et al., Proc. Nat'l Acad. Sci. USA 69:2659 (1972).
- the variable chains can be linked by an intermolecular disulfide bond or cross-linked by chemicals such as glutaraldehyde. See, e.g., Sandhu, supra.
- The Fv fragments comprise VH and VL chains connected by a peptide linker.
- These single-chain antigen binding proteins are prepared by constructing a structural gene comprising nucleic acid sequences encoding the VH and VL domains connected by an oligonucleotide. The structural gene is inserted into an expression vector, which is subsequently introduced into a host cell such as E. coli. The recombinant host cells synthesize a single polypeptide chain with a linker peptide bridging the two V domains.
- CDR peptides (“minimal recognition units”) can be obtained by constructing genes encoding the CDR of an antibody of interest. Such genes are prepared, for example, by using the polymerase chain reaction to synthesize the variable region from RNA of antibody-producing cells. See, for example, Larrick et al., METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, VOL. 2, page 106 (1991).
- The term “antibody” as used herein includes intact molecules as well as fragments thereof, such as Fab, F(ab′)2, and Fv, which are capable of binding to an epitopic determinant present in BinI polypeptide. Such antibody fragments retain some ability to selectively bind with their antigen or receptor.
- The term “epitopic determinant” refers to an antigenic determinant on an antigen to which the paratope of an antibody binds. Epitopic determinants usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and usually have specific three dimensional structural characteristics, as well as specific charge characteristics. Antibodies can be prepared against specific epitopes or polypeptide domains.
- Antibodies which bind to human TARPP polypeptides of the present invention can be prepared using an intact polypeptide or fragments containing small peptides of interest as the immunizing antigen. For example, it may be desirable to produce antibodies that specifically bind to the N- or C-terminal domains of human TARPP.
- carrier protein if desired.
- Such commonly used carriers which are chemically coupled to the immunizing peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid.
- Polyclonal or monoclonal antibodies can be further purified, for example, by binding to and elution from a matrix to which the polypeptide or a peptide to which the antibodies were raised is bound.
- Those of skill in the art will know of various techniques common in the immunology arts for purification and/or concentration of polyclonal antibodies, as well as monoclonal antibodies (See for example, Coligan, et al., Unit 9, Current Protocols in Immunology, Wiley Interscience, 1994, incorporated by reference).
- Anti-idiotype technology can also be used to produce monoclonal antibodies of the invention which mimic an epitope.
- an anti-idiotypic monoclonal antibody made to a first monoclonal antibody will have a binding domain in the hypervariable region which is the “image” of the epitope bound by the first monoclonal antibody.
- Polypeptides coded for by human TARPP of the present invention can be detected, visualized, determined, quantitated, etc. according to any effective method.
- Useful methods include, but are not limited to, e.g., immunoassays, RIA (radioimmunoassay), ELISA (enzyme-linked immunosorbent assay), immunofluorescence, flow cytometry, histology, electron microscopy, light microscopy, in situ assays, immunoprecipitation, Western blot.
- Immunoassays may be carried out in liquid or on a biological support.
- a sample e.g., blood, stool, urine, cells, tissue, cerebral spinal fluid, body fluids, etc.
- a solid phase support or carrier such as nitrocellulose, or other solid support that is capable of immobilizing cells, cell particles or soluble proteins.
- the support may then be washed with suitable buffers followed by treatment with the detectably labeled human TARPP specific antibody.
- the solid phase support can then be washed with a buffer a second time to remove unbound antibody.
- the amount of bound label on solid support may then be detected by conventional means.
- a “solid phase support or carrier” includes any support capable of binding an antigen, antibody, or other specific binding partner.
- Supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, and magnetite.
- a support material can have any structural or physical configuration.
- the support configuration may be spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external surface of a rod.
- the surface may be flat such as a sheet, test strip, etc.
- Preferred supports include polystyrene beads
- EIA enzyme immunoassay
- the enzyme which is bound to the antibody will react with an appropriate substrate, preferably a chromogenic substrate, in such a manner as to produce a chemical moiety that can be detected, for example, by spectrophotometric, fluorimetric or by visual means.
- Enzymes that can be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, .alpha.-glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta.-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase.
- the detection can be accomplished by colorimetric methods
- Detection may also be accomplished using any of a variety of other immunoassays.
- a radioimmunoassay (RIA)
- the radioactive isotope can be detected by such means as the use of a gamma counter or a scintillation counter or by autoradiography.
- the antibody can also be labeled with a fluorescent compound.
- fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthalaldehyde and fluorescamine.
- the antibody can also be detectably labeled using fluorescence-emitting metals such as those in the lanthanide series. These metals can be attached to the antibody using such metal chelating groups as diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA).
- the antibody also can be detectably labeled by coupling it to a chemiluminescent compound.
- the presence of the chemiluminescent-tagged antibody is then determined by detecting the presence of luminescence that arises during the course of a chemical reaction.
- useful chemiluminescent labeling compounds are luminol, isoluminol, aromatic acridinium ester, imidazole, acridinium salt and oxalate ester.
- Bioluminescence is a type of chemiluminescence found in biological systems in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a bioluminescent protein is determined by detecting the presence of luminescence. Important bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin.
- the present invention also relates to methods and compositions for diagnosing a disorder of nervous or immune (e.g., lymphocyte) tissues, or determining susceptibility to a disorder, using polynucleotides, polypeptides, and specific-binding partners of the present invention to detect, assess, determine, etc., human TARPP.
- the gene can serve as a marker for the disorder, e.g., where the gene, when mutant, is a direct cause of the disorder; where the gene is affected by another gene(s) which is directly responsible for the disorder, e.g., when the gene is part of the same signaling pathway as the directly responsible gene; and, where the gene is chromosomally linked to the gene(s) directly responsible for the disorder, and segregates with it.
- a probe specific for the gene can be employed as described above and below. Any method of detecting and/or assessing the gene can be used, including detecting expression of the gene using polynucleotides, antibodies, or other specific-binding partners.
- the present invention relates to methods of diagnosing a disorder associated with human TARPP, or determining a subject's susceptibility to such disorder, comprising, e.g., assessing the expression of said gene(s) in a tissue sample comprising tissue or cells suspected of having the disorder.
- diagnosis indicates that it is determined whether the sample has the disorder.
- a “disorder” means, e.g., any abnormal condition as in a disease or malady.
- Determining a subject's susceptibility to a disease or disorder indicates that the subject is assessed for whether she is predisposed to get such a disease or disorder, where the predisposition is indicated by abnormal expression of the gene (e.g., gene mutation, gene expression pattern is not normal, etc.). Predisposition or susceptibility to a disease may result when such a disease is influenced by epigenetic, environmental, etc., factors. This includes prenatal screening where samples from the fetus or embryo (e.g., via amniocentesis or chorionic villus (CV) sampling) are analyzed for the expression of the gene.
- By assessing expression of human TARPP, it is meant that the functional status of the gene is evaluated. This includes, but is not limited to, measuring expression levels of said gene, determining the genomic structure of said gene, determining the mRNA structure of transcripts from said gene, or measuring the expression levels of polypeptide coded for by said gene.
- assessing expression includes evaluating all aspects of the transcriptional and translational machinery of the gene.
- a sample can be evaluated (i.e., “assessed”) by looking (e.g., sequencing or restriction mapping) at the promoter sequence in the gene, by detecting transcription products (e.g., RNA), or by detecting translation products (e.g., polypeptide).
- a normal gene e.g., a gene which is not associated with the disorder.
- the nature of the comparison can be determined routinely, depending upon how the assessing is accomplished. If, for example, the mRNA levels of a sample are detected, then the mRNA levels of a normal sample can serve as a comparison, or the mRNA levels of a gene which is known not to be affected by the disorder. Methods of detecting mRNA are well known, and discussed above, e.g., but not limited to, Northern blot analysis, polymerase chain reaction (PCR), reverse transcriptase PCR, RACE PCR, etc.
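The comparison described above can be sketched in code. The following is a minimal illustration, assuming that transcript levels have already been quantified in arbitrary units (for example, band intensities) and that a reference gene unaffected by the disorder is measured in the same samples. The function names and all numeric values are hypothetical and are not taken from the specification.

```python
def relative_expression(target_level, reference_level):
    """Normalize a target transcript level to a reference gene level."""
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return target_level / reference_level


def compare_to_normal(sample, normal):
    """Ratio of normalized expression in a test sample to that in a normal sample.

    Each argument is a (target_level, reference_level) pair in arbitrary units.
    """
    sample_norm = relative_expression(*sample)
    normal_norm = relative_expression(*normal)
    if normal_norm == 0:
        raise ValueError("normalized level of the normal sample must be non-zero")
    return sample_norm / normal_norm


# Hypothetical measurements (arbitrary units), for illustration only.
ratio = compare_to_normal(sample=(300.0, 100.0), normal=(150.0, 100.0))
print(ratio)  # 2.0, i.e. twice the normalized level of the normal sample
```

A ratio well above or below 1 would flag the sample for further evaluation; what threshold counts as abnormal is an experimental decision that this sketch does not address.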
- polypeptide production is used to evaluate the gene
- polypeptide in a normal tissue sample can be used as a comparison, or, polypeptide from a different gene whose expression is known not to be affected by the disorder.
- Changes in the profile can indicate, e.g., drug toxicity, return to a normal level, etc.
- the present invention also relates to methods of monitoring or assessing a therapeutic or preventative measure (e.g., chemotherapy, radiation, anti-neoplastic drugs, antibodies, etc.) in a subject, comprising, e.g., detecting the expression levels of human TARPP.
- a subject can be a cell-based assay system, non-human animal model, human patient, etc. Detecting can be accomplished as described for the methods above and below.
- By therapeutic or preventative intervention, it is meant, e.g., a drug administered to a patient, surgery, radiation, chemotherapy, and other measures taken to prevent, treat, or diagnose a disorder.
- Expression can be assessed in any sample comprising any tissue or cell type, body fluid, etc., as discussed for other methods of the present invention, including cells from the immune or nervous system, such as lymphocytes, neurons, or glia
- the present invention also relates to methods of identifying agents, and the agents themselves, which modulate human TARPP. These agents can be used to modulate the biological activity of the polypeptide encoded by the gene, or the gene itself. Agents which regulate the gene or its product are useful in a variety of different environments, including as medicinal agents to treat or prevent disorders associated with human TARPP and as research reagents to modify the function of tissues and cells.
- Methods of identifying agents generally comprise steps in which an agent is placed in contact with the gene, transcription product, translation product, or other target, and then a determination is performed to assess whether the agent “modulates” the target.
- the specific method utilized will depend upon a number of factors, including, e.g., the target (i.e., is it the gene or polypeptide encoded by it), the environment (e.g., in vitro or in vivo), the composition of the agent, etc.
- a method can comprise, in any effective order, one or more of the following steps, e.g., contacting a human TARPP gene (e.g., in a cell population) with a test agent under conditions effective for said test agent to modulate the expression of human TARPP, and determining whether said test agent modulates said human TARPP.
- An agent can modulate expression of human TARPP at any level, including transcription, translation, and/or perdurance of the nucleic acid (e.g., degradation, stability, etc.) in the cell.
- a method can comprise, in any effective order, one or more of the following steps, e.g., contacting a human TARPP polypeptide (e.g., in a cell, lysate, or isolated) with a test agent under conditions effective for said test agent to modulate the biological activity of said polypeptide, and determining whether said test agent modulates said biological activity.
- Contacting human TARPP with the test agent can be accomplished by any suitable method and/or means that places the agent in a position to functionally control expression or biological activity of human TARPP present in the sample.
- Functional control indicates that the agent can exert its physiological effect on human TARPP through whatever mechanism it works.
- the choice of the method and/or means can depend upon the nature of the agent and the condition and type of environment in which the human TARPP is presented, e.g., lysate, isolated, or in a cell population (such as, in vivo, in vitro, organ explants, etc.). For instance, if the cell population is an in vitro cell culture, the agent can be contacted with the cells by adding it directly into the culture medium.
- If the agent cannot dissolve readily in an aqueous medium, it can be incorporated into liposomes, or another lipophilic carrier, and then administered to the cell culture. Contact can also be facilitated by incorporation of the agent with carriers and delivery molecules and complexes, by injection, by infusion, etc.
- Modulation can be of any type, quality, or quantity, e.g., increase, facilitate, enhance, up-regulate, stimulate, activate, amplify, augment, induce, decrease, down-regulate, diminish, lessen, reduce, etc.
- the modulatory quantity can also encompass any value, e.g., 1%, 5%, 10%, 50%, 75%, 1-fold, 2-fold, 5-fold, 10-fold, 100-fold
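As a worked illustration of how a modulatory quantity might be expressed as a percentage or as a fold value, the sketch below compares a measurement taken without the test agent to one taken with it. The function names and the readings are hypothetical. The convention used here (fold change as a simple ratio, so that 1.0 means no change) is an assumption, since the text does not define how "fold" is calculated.

```python
def percent_change(baseline, treated):
    """Signed percent change of the treated value relative to the baseline."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (treated - baseline) / baseline * 100.0


def fold_change(baseline, treated):
    """Ratio of treated to baseline (assumed convention: 1.0 means no change)."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return treated / baseline


# Hypothetical expression readings without and with a test agent.
up = (percent_change(40.0, 100.0), fold_change(40.0, 100.0))
down = (percent_change(80.0, 20.0), fold_change(80.0, 20.0))
print(up)    # (150.0, 2.5)
print(down)  # (-75.0, 0.25)
```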
- To modulate human TARPP expression means, e.g., that the test agent has an effect on its expression, e.g., to affect the amount of transcription, to affect RNA splicing, to affect translation of the RNA into polypeptide, to affect RNA or polypeptide stability, to affect polyadenylation or other processing of the RNA, to affect post-transcriptional or post-translational processing, etc.
- To modulate biological activity means, e.g., that a functional activity of the polypeptide is changed in comparison to its normal activity in the absence of the agent. This effect includes, increase, decrease, block, inhibit, enhance, etc.
- a test agent can be of any molecular composition, e.g., chemical compounds, biomolecules, such as polypeptides, lipids, nucleic acids (e.g., antisense to a polynucleotide sequence selected from SEQ ID NO 1, 3, 5, 7, 9, and others), carbohydrates, antibodies, ribozymes, double-stranded RNA, aptamers, etc.
- a polypeptide to be modulated is a cell-surface molecule
- a test agent can be an antibody that specifically recognizes it and, e.g., causes the polypeptide to be internalized, leading to its down regulation on the surface of the cell.
- Antibodies can also be used to modulate the biological activity of a polypeptide in a lysate or other cell-free form.
- Antisense human TARPP can also be used as test agents to modulate gene expression.
- Selective polynucleotides, polypeptides, and specific-binding partners thereto can be utilized in therapeutic applications, especially to treat diseases and conditions of the immune and nervous system.
- Useful methods include, but are not limited to, immunotherapy (e.g., using specific-binding partners to polypeptides), vaccination (e.g., using a selective polypeptide or a naked DNA encoding such polypeptide), protein or polypeptide replacement therapy, gene therapy (e.g., germ-line correction, antisense), etc.
- antibody that specifically recognizes a tissue-specific antigen can be used to stimulate the body to destroy or attack the cancer, to cause down-regulation, to produce complement-mediated lysis, to inhibit cell growth, etc., of target cells which display the antigen, e.g., analogously to how c-erbB-2 antibodies are used to treat breast cancer.
- An antibody can be labeled or conjugated to enhance its deleterious effect, e.g., with radionuclides and other energy-emitting entities, toxins, such as ricin, exotoxin A (ETA), and diphtheria toxin, cytotoxic or cytostatic agents, immunomodulators, chemotherapeutic agents, etc. See, e.g., U.S. Pat. No. 6,107,090.
- An antibody or other specific-binding partner can be conjugated to a second molecule, such as a cytotoxic agent, and used for targeting the second molecule to a tissue-antigen positive cell (Vitetta, E. S. et al., 1993, Immunotoxin therapy, in DeVita, Jr., V. T. et al., eds, Cancer: Principles and Practice of Oncology, 4th ed., J. B. Lippincott Co., Philadelphia, 2624-2636).
- cytotoxic agents include, but are not limited to, antimetabolites, alkylating agents, anthracyclines, antibiotics, anti-mitotic agents, radioisotopes and chemotherapeutic agents.
- cytotoxic agents include, but are not limited to, ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, teniposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, 1-dehydrotestosterone, diphtheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, elongation factor-2 and glucocorticoid. Techniques for conjugating therapeutic agents to antibodies are well known.
- polynucleotides and polypeptides can be used as targets for non-immunotherapeutic applications, e.g., using compounds which interfere with function, expression (e.g., antisense as a therapeutic agent), assembly, etc.
- RNA interference can be used in vitro and in vivo to silence human TARPP when its expression contributes to a disease (but also for other purposes, e.g., to identify the gene's function, to change a developmental pathway of a cell, etc.). See, e.g., Sharp and Zamore, Science, 287:2431-2433, 2001; Grishok et al., Science, 287:2494, 2001.
- Therapeutic agents of the present invention can be administered in any form by any effective route, including, e.g., oral, parenteral, enteral, intraperitoneal, topical, transdermal (e.g., using any standard patch), ophthalmic, nasal, local, non-oral, such as aerosol, inhalation, subcutaneous, intramuscular, buccal, sublingual, rectal, vaginal, intra-arterial, and intrathecal, etc. They can be administered alone, or in combination with any ingredient(s), active or inactive.
- the present invention also relates to methods of treating a disease of the immune or nervous system showing altered expression of human TARPP, comprising, e.g., administering to a subject in need thereof a therapeutic agent which is effective for regulating expression of said human TARPP and/or which is effective in treating said disease.
- treating is used conventionally, e.g., the management or care of a subject for the purpose of combating, alleviating, reducing, relieving, improving the condition of, etc., a disease or disorder.
- Diseases or disorders which can be treated in accordance with the present invention include, but are not limited to autoimmune disease, such as multiple sclerosis and rheumatoid arthritis, and allergy.
- By altered expression, it is meant that the disease is associated with a mutation in the gene, or any modification to the gene (or corresponding product) which affects its normal function.
- expression of human TARPP refers to, e.g., transcription, translation, splicing, stability of the mRNA or protein product, activity of the gene product, differential expression, etc.
- Any agent which “treats” the disease can be used.
- Such an agent can be one which regulates the expression of the human TARPP.
- Expression refers to the same acts already mentioned, e.g., transcription, translation, splicing, stability of the mRNA or protein product, activity of the gene product, differential expression, etc. For instance, if the condition was a result of a complete deficiency of the gene product, administration of gene product to a patient would be said to treat the disease and regulate the gene's expression. Many other situations are possible, e.g., where the gene is aberrantly expressed, and the therapeutic agent regulates the aberrant expression by restoring its normal expression pattern.
- Antisense polynucleotide (e.g., RNA) can also be prepared from a polynucleotide according to the present invention, preferably an antisense to a sequence of SEQ ID NO 1, 3, 5, 7, 9, and others.
- Antisense polynucleotide can be used in various ways, such as to regulate or modulate expression of the polypeptides they encode, e.g., inhibit their expression, for in situ hybridization, for therapeutic purposes, for making targeted mutations (in vivo, triplex, etc.) etc.
- anti-sense see, e.g., U.S. Pat. Nos.
- An antisense polynucleotide can be operably linked to an expression control sequence.
- a total length of about 35 bp can be used in cell culture with cationic liposomes to facilitate cellular uptake, but for in vivo use, preferably shorter oligonucleotides are administered, e.g. 25 nucleotides.
- Antisense polynucleotides can comprise modified, nonnaturally-occurring nucleotides and linkages between the nucleotides (e.g., modification of the phosphate-sugar backbone; methyl phosphonate, phosphorothioate, or phosphorodithioate linkages; and 2′-O-methyl ribose sugar units), e.g., to enhance in vivo or in vitro stability, to confer nuclease resistance, to modulate uptake, to modulate cellular distribution and compartmentalization, etc. Any effective nucleotide or modification can be used, including those already mentioned, as known in the art, etc., e.g., disclosed in U.S. Pat. Nos.
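To make the notion of an antisense oligonucleotide concrete, the sketch below derives an antisense RNA as the reverse complement of a window of a target transcript, using the 25-nucleotide length mentioned above as the default. The target sequence is an arbitrary placeholder and is not taken from any SEQ ID NO. The function names are hypothetical, and chemical modifications such as phosphorothioate linkages are not modeled.

```python
# Watson-Crick complement table for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target_rna):
    """Return the reverse complement of an RNA sequence, written 5' to 3'."""
    seq = target_rna.upper()
    invalid = set(seq) - set("ACGU")
    if invalid:
        raise ValueError(f"unexpected bases: {sorted(invalid)}")
    return seq.translate(COMPLEMENT)[::-1]


def antisense_oligo(target_rna, start, length=25):
    """Antisense oligonucleotide against target_rna[start:start + length]."""
    window = target_rna[start:start + length]
    if len(window) < length:
        raise ValueError("target window is shorter than the requested length")
    return antisense_rna(window)


# Placeholder transcript fragment, for illustration only.
target = "AUGGCUACGUUAGCCGAUAGCUAGGCUUAACGGAUCCGAUUAGC"
oligo = antisense_oligo(target, start=0)
print(len(oligo))  # 25
```

Taking the reverse complement of the oligonucleotide returns the targeted window, which is a quick check that the two strands are complementary.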
- the present invention also relates to an ordered array of polynucleotide probes and specific-binding partners (e.g., antibodies) for detecting the expression of human TARPP in a sample, comprising, one or more polynucleotide probes or specific binding partners associated with a solid support, wherein each probe is specific for human TARPP, and the probes comprise a nucleotide sequence of SEQ ID NO 1, 3, 5, 7, 9, and others which is specific for said gene, a nucleotide sequence having sequence identity to SEQ ID NO 1, 3, 5, 7, 9, and others which is specific for said gene or polynucleotide, or complements thereto, or a specific-binding partner which is specific for human TARPP.
- the phrase “ordered array” indicates that the probes are arranged in an identifiable or position-addressable pattern, e.g., such as the arrays disclosed in U.S. Pat. Nos. 6,156,501, 6,077,673, 6,054,270, 5,723,320, 5,700,637, WO09919711, WO00023803.
- the probes are associated with the solid support in any effective way.
- the probes can be bound to the solid support, either by polymerizing the probes on the substrate, or by attaching a probe to the substrate. Association can be covalent, electrostatic, noncovalent, hydrophobic, hydrophilic, coordination, adsorbed, absorbed, polar, etc.
- the probes can fill the hollow orifice, be absorbed into the solid filament, be attached to the surface of the orifice, etc. Probes can be of any effective size, sequence identity, composition, etc., as already discussed.
- Ordered arrays can further comprise polynucleotide probes or specific-binding partners which are specific for other genes, including genes specific for immune or nervous tissues, or genes associated with diseases thereof.
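An ordered, position-addressable array as described above can be modeled as a mapping from grid coordinates to probes. The following is a minimal sketch. The class names, the grid layout, and the probe names are hypothetical and serve only to show how a position identifies a probe and how all positions for a given target can be looked up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Probe:
    name: str    # hypothetical probe identifier
    kind: str    # "polynucleotide" or "antibody"
    target: str  # gene or polypeptide the probe is specific for


class OrderedArray:
    """Grid in which every occupied (row, col) position identifies one probe."""

    def __init__(self, rows, cols):
        self.rows = rows
        self.cols = cols
        self._spots = {}

    def place(self, row, col, probe):
        if not (0 <= row < self.rows and 0 <= col < self.cols):
            raise IndexError("position is outside the array")
        if (row, col) in self._spots:
            raise ValueError("position is already occupied")
        self._spots[(row, col)] = probe

    def probe_at(self, row, col):
        """Return the probe at a position, or None if the spot is empty."""
        return self._spots.get((row, col))

    def positions_for(self, target):
        """Return the sorted positions of all probes specific for a target."""
        return sorted(pos for pos, p in self._spots.items() if p.target == target)


array = OrderedArray(rows=2, cols=2)
array.place(0, 0, Probe("probe-1", "polynucleotide", "human TARPP"))
array.place(0, 1, Probe("antibody-1", "antibody", "human TARPP"))
array.place(1, 0, Probe("probe-2", "polynucleotide", "other gene"))
print(array.positions_for("human TARPP"))  # [(0, 0), (0, 1)]
```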
- the present invention also relates to transgenic animals comprising human TARPP genes.
- genes include, but are not limited to, functionally-disrupted genes, mutated genes, ectopically or selectively-expressed genes, inducible or regulatable genes, etc.
- These transgenic animals can be produced according to any suitable technique or method, including homologous recombination, mutagenesis (e.g., ENU, Rathkolb et al., Exp. Physiol ., 85(6):635-644, 2000), and the tetracycline-regulated gene expression system (e.g., U.S. Pat. No. 6,242,667).
- gene as used herein includes any part of a gene, i.e., regulatory sequences, promoters, enhancers, exons, introns, coding sequences, etc.
- a human TARPP nucleic acid present in the construct or transgene can be naturally-occurring wild-type, polymorphic, or mutated.
- polynucleotides of the present invention can be used to create transgenic animals, e.g. a non-human animal, comprising at least one cell whose genome comprises a functional disruption of human TARPP.
- By functional disruption or “functionally disrupted,” it is meant that the gene does not express a biologically-active product. It can be substantially deficient in at least one functional activity coded for by the gene. Expression of a polypeptide can be substantially absent, i.e., essentially undetectable amounts are made. However, a polypeptide can also be made which is deficient in activity, e.g., where only an amino-terminal portion of the gene product is produced.
- the gene can be disrupted in a specific region, e.g., in the sequence coding for amino acids 1-161 of a human TARPP.
- Cells and/or animals can also have targeted deletions, e.g., deletion of a coding sequence for amino acids 267-300 and/or 312-331 of a human TARPP of SEQ ID NO 1 or 2.
- the transgenic animal can comprise one or more cells. When substantially all its cells contain the engineered gene, it can be referred to as a transgenic animal “whose genome comprises” the engineered gene. This indicates that the endogenous gene locus of the animal has been modified and substantially all cells contain such modification.
- Functional disruption of the gene can be accomplished in any effective way, including, e.g., introduction of a stop codon into any part of the coding sequence such that the resulting polypeptide is biologically inactive (e.g., because it lacks a catalytic domain, a ligand binding domain, etc.), introduction of a mutation into a promoter or other regulatory sequence that is effective to turn it off, or reduce transcription of the gene, insertion of an exogenous sequence into the gene which inactivates it (e.g., which disrupts the production of a biologically-active polypeptide or which disrupts the promoter or other transcriptional machinery), deletion of sequences from the Human TARPP gene, etc.
- transgenic animals having functionally disrupted genes are well known, e.g., as described in U.S. Pat. Nos. 6,239,326, 6,225,525, 6,207,878, 6,194,633, 6,187,992, 6,180,849, 6,177,610, 6,100,445, 6,087,555, 6,080,910, 6,069,297, 6,060,642, 6,028,244, 6,013,858, 5,981,830, 5,866,760, 5,859,314, 5,850,004, 5,817,912, 5,789,654, 5,777,195, and 5,569,824.
- a transgenic animal which comprises the functional disruption can also be referred to as a “knock-out” animal, since the biological activity of its human TARPP genes has been “knocked-out.”
- One or more of the different splice forms, Br137A-E, can also be knocked-out or disrupted, e.g., in cells or whole mammals. Knock-out cells and animals can be homozygous or heterozygous.
- homologous recombination technology is of special interest since it allows specific regions of the genome to be targeted.
- genes can be specifically-inactivated, specific mutations can be introduced, and exogenous sequences can be introduced at specific sites. These methods are well known in the art, e.g., as described in the patents above. See, also, Robertson, Biol. Reproduc ., 44(2):238-245, 1991.
- the genetic engineering is performed in an embryonic stem (ES) cell, or other pluripotent cell line (e.g., adult stem cells, EG cells), and that genetically-modified cell (or nucleus) is used to create a whole organism.
- nuclear transfer can be used in combination with homologous recombination technologies.
- the human TARPP locus can be disrupted in ES cells using a positive-negative selection method (e.g., Mansour et al., Nature, 336:348-352, 1988).
- a targeting vector can be constructed which comprises a part of the gene to be targeted.
- a selectable marker, such as a neomycin resistance gene, can be inserted into a human TARPP exon present in the targeting vector, disrupting it.
- When the vector recombines with the ES cell genome, it disrupts the function of the gene.
- the presence in the cell of the vector can be determined by expression of neomycin resistance. See, e.g., U.S. Pat. No. 6,239,326.
- Cells having at least one functionally disrupted gene can be used to make chimeric and germline animals, e.g., animals having somatic and/or germ cells comprising the engineered gene.
- Homozygous knock-out animals can be obtained from breeding heterozygous knock-out animals. See, e.g., U.S. Pat. No. 6,225,525.
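The breeding step above can be illustrated with a simple Punnett-square calculation. This is a sketch that assumes ordinary Mendelian segregation of a single autosomal locus and equal viability of all genotypes, neither of which is stated in the text. The allele symbols "+" (wild-type) and "-" (disrupted) and the function name are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1, parent2):
    """Expected offspring genotype fractions for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "+-" for a
    heterozygous knock-out.
    """
    counts = Counter("".join(sorted(a + b)) for a, b in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygous x heterozygous cross.
offspring = cross("+-", "+-")
print(offspring["--"])  # 1/4 of offspring expected to be homozygous knock-outs
```

Under these assumptions a quarter of the offspring are expected to be homozygous knock-outs, half heterozygous, and a quarter wild-type.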
- a transgenic animal, or animal cell, lacking one or more functional human TARPP genes (and lacking one or more functional copies of the splice variant) can be useful in a variety of applications, including, as an animal model for diseases of the immune or nervous system, for drug screening assays (e.g., for DNA-binding activities other than those contributed by human TARPP; by making a cell deficient in one or more splice forms of human TARPP, the contribution of other DNA binding activity can be specifically examined), as a source of tissues deficient in human TARPP activity, and any of the utilities mentioned in any issued U.S. Patent on transgenic animals, including, U.S. Pat. Nos.
- a recombinant human TARPP nucleic acid refers to a gene, optionally modified, which has been introduced into a target host cell, such as a cell derived from animals, plants, bacteria, yeast, etc.
- a recombinant human TARPP includes completely synthetic nucleic acid sequences, semi-synthetic nucleic acid sequences, sequences derived from natural sources, and chimeras thereof. “Operable linkage” has the meaning used through the specification, i.e., placed in a functional relationship with another nucleic acid.
- When a gene is operably linked to an expression control sequence, as explained above, it indicates that the gene (e.g., coding sequence) is joined to the expression control sequence (e.g., promoter) in such a way that facilitates transcription and translation of the coding sequence.
- the phrase “genome” indicates that the genome of the cell has been modified. In this case, the recombinant human TARPP has been stably integrated into the genome of the animal.
- the human TARPP nucleic acid in operable linkage with the expression control sequence can also be referred to as a construct or transgene.
- Any expression control sequence can be used depending on the purpose. For instance, if selective expression is desired, then expression control sequences which limit its expression can be selected. These include, e.g., tissue or cell-specific promoters, introns, enhancers, etc. For various methods of cell and tissue-specific expression, see, e.g., U.S. Pat. Nos. 6,215,040, 6,210,736, and 6,153,427. These also include the endogenous promoter, i.e., the coding sequence can be operably linked to its own promoter. Inducible and regulatable promoters can also be utilized.
- the present invention also relates to a transgenic animal which contains a functionally disrupted gene and a transgene stably integrated into the animal's genome.
- Such an animal can be constructed using combinations of any of the above- and below-mentioned methods.
- Such animals have any of the aforementioned uses, including permitting the knock-out of the normal gene and its replacement with a mutated gene.
- Such a transgene can be integrated at the endogenous gene locus so that the functional disruption and “knock-in” are carried out in the same step.
- transgenic animals can be prepared according to known methods, including, e.g., by pronuclear injection of recombinant genes into pronuclei of 1-cell embryos, incorporating an artificial yeast chromosome into embryonic stem cells, gene targeting methods, embryonic stem cell methodology, cloning methods, nuclear transfer methods. See, also, e.g., U.S. Pat. Nos. 4,736,866; 4,873,191; 4,873,316; 5,082,779; 5,304,489; 5,174,986; 5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad.
- Palmiter et al. Cell, 41:343-345, 1985; Palmiter et al., Ann. Rev. Genet., 20:465-499, 1986; Askew et al., Mol. Cell. Bio., 13:4115-4124, 1993; Games et al. Nature, 373:523-527, 1995; Valancius and Smithies, Mol. Cell. Bio., 11: 1402-1408, 1991; Stacey et al., Mol. Cell. Bio., 14:1009-1016, 1994; Hasty et al., Nature, 350:243-246, 1995; Rubinstein et al., Nucl.
- a polynucleotide according to the present invention can be introduced into any non-human animal, including a non-human mammal, mouse (Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory , Cold Spring Harbor, New York, 1986), pig (Hammer et al., Nature, 315:343-345, 1985), sheep (Hammer et al., Nature, 315:343-345, 1985), cattle, rat, or primate. See also, e.g., Church, 1987, Trends in Biotech. 5:13-19; Clark et al., Trends in Biotech.
- Transgenic animals can be produced by the methods described in U.S. Pat. No. 5,994,618, and utilized for any of the utilities described therein.
- the present invention also relates to electronic forms of polynucleotides, polypeptides, etc., of the present invention, including computer-readable medium (e.g., magnetic, optical, etc., stored in any suitable format, such as flat files or hierarchical files) which comprise such sequences, or fragments thereof, e-commerce-related means, etc.
- the present invention relates to methods of retrieving gene sequences from a computer-readable medium, comprising one or more of the following steps in any effective order, e.g., selecting a cell or gene expression profile, e.g., a profile that specifies that said gene is expressed in brain and/or immune cells, and retrieving said expressed gene sequences, where the gene sequences consist of the genes represented by SEQ ID NOs 1-10.
- a “gene expression profile” means the list of tissues, cells, etc., in which a defined gene is expressed (i.e., transcribed and/or translated).
- a “cell expression profile” means the genes which are expressed in the particular cell type. The profile can be a list of the tissues in which the gene is expressed, but can include additional information as well, including level of expression (e.g., a quantity as compared or normalized to a control gene), and information on temporal (e.g., at what point in the cell-cycle or developmental program) and spatial expression.
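The two kinds of profile defined above are two views of the same data: a gene expression profile lists the tissues for a gene, and a cell expression profile lists the genes for a tissue or cell type. The sketch below shows one way to derive the second from the first. The gene name "GENE_X", the tissue names, and the expression levels (taken here as values normalized to a control gene) are hypothetical placeholders.

```python
from collections import defaultdict

# Gene expression profiles: gene -> {tissue: level normalized to a control gene}.
# All entries are illustrative placeholders, not data from the specification.
gene_profiles = {
    "human TARPP": {"brain": 1.8, "lymphocyte": 1.2},
    "GENE_X": {"liver": 2.0, "brain": 0.4},
}


def cell_profiles(profiles, threshold=0.0):
    """Invert gene profiles into cell profiles: tissue -> set of expressed genes.

    A gene counts as expressed in a tissue when its level exceeds the threshold.
    """
    by_tissue = defaultdict(set)
    for gene, tissues in profiles.items():
        for tissue, level in tissues.items():
            if level > threshold:
                by_tissue[tissue].add(gene)
    return dict(by_tissue)


print(sorted(cell_profiles(gene_profiles)["brain"]))       # ['GENE_X', 'human TARPP']
print(sorted(cell_profiles(gene_profiles, 1.0)["brain"]))  # ['human TARPP']
```

Temporal and spatial information could be added as further fields per entry. A single normalized level per tissue is the simplest case.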
- By selecting a gene or cell expression profile, it is meant that a user decides what type of gene or cell expression pattern he is interested in retrieving, e.g., he may require that the gene is differentially expressed in a tissue. Any pattern of expression preferences may be selected.
- the selecting can be performed by any effective method.
- “selecting” refers to the process in which a user forms a query that is used to search a database of gene expression profiles. The step of retrieving involves searching for results in a database that correspond to the query set forth in the selecting step. Any suitable algorithm can be utilized to perform the search query, including algorithms that look for matches, or that perform optimization between query and data.
- the database is information that has been stored in an appropriate storage medium, having a suitable computer-readable format. Once results are retrieved, they can be displayed in any suitable format, such as HTML.
- a query is formed by the user to retrieve the set of genes from the database having the desired gene or cell expression profile. Once the query is inputted into the system, a search algorithm is used to interrogate the database, and retrieve results.
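The selecting and retrieving steps described above can be sketched as a query over a small in-memory database, with the results rendered as HTML. This is an illustration under stated assumptions: the record identifiers, the tissue annotations, and the function names are hypothetical, and a simple exact-match filter stands in for whatever search algorithm an actual system would use.

```python
from html import escape

# Hypothetical database of gene expression profiles.
DATABASE = [
    {"id": "SEQ_A", "tissues": {"brain", "lymphocyte"}},
    {"id": "SEQ_B", "tissues": {"liver"}},
    {"id": "SEQ_C", "tissues": {"brain"}},
]


def retrieve(database, expressed_in_any=(), not_expressed_in=()):
    """Return records expressed in at least one wanted tissue and no excluded one."""
    wanted = set(expressed_in_any)
    excluded = set(not_expressed_in)
    hits = []
    for record in database:
        tissues = record["tissues"]
        if wanted and not (tissues & wanted):
            continue
        if tissues & excluded:
            continue
        hits.append(record)
    return hits


def to_html(records):
    """Render the identifiers of the retrieved records as an HTML list."""
    items = "".join(f"<li>{escape(r['id'])}</li>" for r in records)
    return f"<ul>{items}</ul>"


# Query: genes expressed in brain and/or immune cells.
hits = retrieve(DATABASE, expressed_in_any={"brain", "lymphocyte"})
print(to_html(hits))  # <ul><li>SEQ_A</li><li>SEQ_C</li></ul>
```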
- the present invention also relates to methods of advertising, licensing, selling, purchasing, brokering, etc., genes, polynucleotides, specific-binding partners, antibodies, etc., of the present invention.
- Methods can comprise, e.g., displaying a human TARPP gene, human TARPP polypeptide, or antibody specific for human TARPP in a printed or computer-readable medium (e.g., on the Web or Internet), and accepting an offer to purchase said gene, polypeptide, or antibody.
- a polynucleotide, probe, polypeptide, antibody, specific-binding partner, etc., according to the present invention can be isolated.
- Isolated means that the material is in a form in which it is not found in its original environment or in nature, e.g., more concentrated, more purified, separated from other components, etc.
- An isolated polynucleotide includes, e.g., a polynucleotide having the sequence separated from the chromosomal DNA found in a living animal, e.g., as the complete gene, a transcript, or a cDNA.
- This polynucleotide can be part of a vector or inserted into a chromosome (by specific gene-targeting or by random integration at a position other than its normal position) and still be isolated in that it is not in a form that is found in its natural environment.
- a polynucleotide, polypeptide, etc., of the present invention can also be substantially purified. By substantially purified, it is meant that polynucleotide or polypeptide is separated and is essentially free from other polynucleotides or polypeptides, i.e., the polynucleotide or polypeptide is the primary and active constituent.
- a polynucleotide can also be a recombinant molecule.
- By recombinant, it is meant that the polynucleotide is in an arrangement or form which does not occur in nature.
- a recombinant molecule comprising a promoter sequence would not encompass the naturally-occurring gene, but would include the promoter operably linked to a coding sequence not associated with it in nature, e.g., a reporter gene, or a truncation of the normal coding sequence.
- a marker is used herein to indicate a means for detecting or labeling a target.
- a marker can be a polynucleotide (usually referred to as a “probe”), polypeptide (e.g., an antibody conjugated to a detectable label), PNA, or any effective material.
Abstract
The present invention relates to all facets of novel polynucleotides, the polypeptides they encode, antibodies and specific binding partners thereto, and their applications to research, diagnosis, drug discovery, therapy, clinical medicine, forensic science and medicine, etc. The polynucleotides are highly expressed in brain, pituitary, muscle, and thymus, and are therefore useful in a variety of ways, including, but not limited to, as molecular markers, as drug targets, and for detecting, diagnosing, staging, monitoring, prognosticating, preventing or treating, determining predisposition to, etc., diseases and conditions relating to such tissues. The genes and polypeptides can also be used as markers for immature T-cells.
Description
- FIG. 1 shows the amino acid alignments of the different splice variants of human TARPP, Br137A (SEQ ID NO 4), B (SEQ ID NO 6), C (SEQ ID NO 8), D (SEQ ID NO 10; SEQ ID NO 13, NM_016300), and E (SEQ ID NO 2), and partial clone AL133109 (SEQ ID NO 13).
- FIG. 2 is a schematic drawing showing the differences between the various forms of human TARPP.
- FIG. 3 shows amino acid alignments of the different splice variants of human TARPP (Br137A, B, C, D, and E) with mouse TARPP (NM_033264; SEQ ID NO 11).
- The present invention relates to all facets of human TARPP (also known as human Br137), polypeptides encoded by it, antibodies and specific binding partners thereto, and their applications to research, diagnosis, drug discovery, therapy, clinical medicine, forensic science and medicine, etc. Human TARPP polynucleotides, polypeptides, antibodies, etc., are useful in a variety of ways, including, but not limited to, as molecular markers, as drug targets, and for detecting, diagnosing, staging, monitoring, prognosticating, preventing or treating, determining predisposition to, etc., diseases and conditions relating to T-cells and dopaminergic pathways. The identification of specific genes, and groups of genes, expressed in pathways physiologically relevant to these conditions permits the definition of functional and disease pathways, and the delineation of targets in these pathways which are useful in diagnostic, therapeutic, and clinical applications. The present invention also relates to methods of using the polynucleotides and related products (proteins, antibodies, etc.) in business and computer-related methods, e.g., advertising, displaying, offering, selling, etc., such products for sale, commercial use, licensing, etc.
- Human TARPP (thymocyte cyclic AMP regulated phosphoprotein, or Br137A, B, C, D, and E) is represented by a family of alternative splice variants. FIGS. 1 and 2 summarize the differences between the multiple forms. Br137E is an 847 amino acid polypeptide. Its nucleotide and amino acid sequences are shown in SEQ ID NOS 1 and 2. Br137B (SEQ ID NOS 5 and 6) has a deletion of amino acids 267-300, Br137A (SEQ ID NOS 3 and 4) has a deletion of amino acids 312-331, and Br137C (SEQ ID NOS 7 and 8) has a deletion of both these domains. Br137D contains only the first 87 amino acids followed by a two-amino acid C-terminus which differs from the other forms. A partial clone, AL133109 as shown in FIG. 1, is missing the first 161 amino acids of Br137E, as well as having an amino acid difference at position 312 (SEQ ID NO 2).
- Br137E contains a nuclear localization signal at about amino acids 107-124, an R3H domain (single-stranded nucleic acid binding domain) at about amino acids 147-224, and a proline rich region at about amino acids 476-682. These domains are also present in the A-C splice forms, but at different amino acid positions. Human TARPP has nucleic acid binding activity conferred by the corresponding binding domain, indicating that it can bind nucleic acids, preferably single-stranded DNA or RNA. This binding activity can be assayed routinely, e.g., using gel electrophoresis band shift assays (as carried out in, e.g., U.S. Pat. Nos. 6,333,407 and 5,789,538), ELISA-based assays (e.g., Mercury™ TransFactor Kit from Clontech), and other assays which detect DNA-protein interactions.
- The Br137 family represents the human homologs of murine TARPP (thymocyte ARPP) (NM_033264; SEQ ID NO 11; “Mouse” in FIG. 3). Br137E has about 83% amino acid identity and 87% homology with it (calculated using the BLAST algorithm). See, FIG. 3 (NM_033264 is murine TARPP). In addition to amino acid sequence differences between the two proteins, human TARPP has an insertion at about amino acid positions 549-572 of SEQ ID NO 2 which is not present in the mouse protein. See, FIG. 3.
- Originally, a 21 kDa polypeptide was isolated from rat basal ganglia based on its phosphorylation by cAMP-dependent protein kinase (PKA). Williams et al., J. Neurosci., 9:3631-3637, 1989. It was named ARPP-21 (cAMP-regulated phosphoprotein). Activation of dopamine receptors resulted in an increase in the phosphorylation of ARPP-21. Caporaso et al., Neuropharm., 39:1637-1644, 2000. Human ARPP-21 (Br137D) contains 89 amino acids (NM_016300; SEQ ID NO 13).
- A high molecular weight form of ARPP-21 was subsequently identified in T-cells and named TARPP. Kisielow et al., Eur. J. Immunol., 31:1141-1149, 2001. This polypeptide contains ARPP-21 sequence at its 5′ end, but a novel 3′ end coding for more than 700 additional amino acids (for a total of 807 amino acids). Murine TARPP appears to be involved in the regulation of thymocyte maturation and TCR rearrangement. Expression of TARPP is down-regulated after TCR signals are delivered. It is highly expressed in immature thymocytes and is associated with the commitment to the T-cell lineage, making it a highly selective marker for T-cell commitment. See, Kisielow, ibid. After commitment to the T-cell lineage during positive selection, its expression is turned off.
- There appear to be several members of the human TARPP family. KIAA0029 is a hypothetical protein that shares about 45% amino acid sequence identity and 59% homology with Br137E. KIAA1002, a second hypothetical protein, has about 42% amino acid identity and 54% homology to it.
- Human TARPP is highly expressed in brain, pituitary, muscle, and thymus. It is expressed at lower levels in adrenal gland, bone marrow, heart, small intestine, kidney, liver, ovary, prostate, stomach, testis, and thyroid. There was virtually no detectable expression in colon, lung, lymph node, peripheral lymphocytes, mammary gland, pancreas, and uterus.
- As indicated by its expression pattern, human TARPP is involved in the maturation of T-cells, especially in the rearrangement of the TCR. For this reason, it can be used to modulate T-cells, e.g., in allergy, autoimmune disease (e.g., rheumatoid arthritis and multiple sclerosis), and graft-versus-host disease. It can also be used as a marker to determine the index of mature versus immature T-cells, where human TARPP is a marker of immature T-cells. Additionally, human TARPP is phosphorylated upon dopamine receptor activation, indicating an involvement in dopamine pathways. Consequently, it is a target for diseases that involve dopamine, including, e.g., schizophrenia, substance abuse and addiction, anxiety, Parkinson's disease, and other dopaminergic diseases and conditions.
- Human TARPP is localized to chromosomal band 3p21.33. There are several disorders genetically mapped to this region, including, e.g., retinal vasculopathy with cerebral leukodystrophy (OMIM 192315), deafness, neurosensory, autosomal recessive 6 (OMIM 600971), and lung cancer. Nucleic acids of the present invention can be used as linkage markers, diagnostic targets, therapeutic targets, for any of the mentioned disorders, as well as any disorders or genes mapping in proximity to it.
- Nucleic Acids
- A mammalian polynucleotide, or fragment thereof, of the present invention is a polynucleotide having a nucleotide sequence obtainable from a natural source. It therefore includes naturally-occurring normal, naturally-occurring mutant, and naturally-occurring polymorphic alleles (e.g., SNPs), differentially-spliced transcripts, splice-variants, etc. By the term “naturally-occurring,” it is meant that the polynucleotide is obtainable from a natural source, e.g., animal tissue and cells, body fluids, tissue culture cells, forensic samples. Natural sources include, e.g., living cells obtained from tissues and whole organisms, tumors, cultured cell lines, including primary and immortalized cell lines. Naturally-occurring mutations can include deletions (e.g., a truncated amino- or carboxy-terminus), substitutions, inversions, or additions of nucleotide sequence. These genes can be detected and isolated by polynucleotide hybridization according to methods which one skilled in the art would know, e.g., as discussed below.
- A polynucleotide according to the present invention can be obtained from a variety of different sources. It can be obtained from DNA or RNA, such as polyadenylated mRNA or total RNA, e.g., isolated from tissues, cells, or whole organisms. The polynucleotide can be obtained directly from DNA or RNA, from a cDNA library, from a genomic library, etc. The polynucleotide can be obtained from a cell or tissue (e.g., from embryonic or adult tissues) at a particular stage of development, having a desired genotype, phenotype, disease status, etc. A polynucleotide which “codes without interruption” refers to a polynucleotide having a continuous open reading frame (“ORF”) as compared to an ORF which is interrupted by introns or other noncoding sequences.
- Polynucleotides and polypeptides (including any part of human TARPP) can be excluded as compositions from the present invention if, e.g., listed in a publicly available database on the day this application was filed and/or disclosed in a patent application having an earlier filing or priority date than this application and/or conceived and/or reduced to practice earlier than a polynucleotide in this application.
- As described herein, the phrase “an isolated polynucleotide which is SEQ ID NO,” or “an isolated polynucleotide which is selected from SEQ ID NO,” refers to an isolated nucleic acid molecule from which the recited sequence was derived (e.g., a cDNA derived from mRNA; cDNA derived from genomic DNA). Because of sequencing errors, typographical errors, etc., the actual naturally-occurring sequence may differ from a SEQ ID listed herein. Thus, the phrase indicates the specific molecule from which the sequence was derived, rather than a molecule having that exact recited nucleotide sequence, analogously to how a culture depository number refers to a specific cloned fragment in a cryotube.
- As explained in more detail below, a polynucleotide sequence of the invention can contain the complete sequence as shown in SEQ ID NO 1, 3, 5, 7, 9, and others, degenerate sequences thereof, anti-sense, muteins thereof, genes comprising said sequences, full-length cDNAs comprising said sequences, complete genomic sequences, fragments thereof, homologs, primers, nucleic acid molecules which hybridize thereto, derivatives thereof, etc.
- Genomic
- The present invention also relates to genomic DNA from which the polynucleotides of the present invention can be derived. A genomic DNA coding for a human, mouse, or other mammalian polynucleotide, can be obtained routinely, for example, by screening a genomic library (e.g., a YAC library) with a polynucleotide of the present invention, or by searching nucleotide databases, such as GenBank and EMBL, for matches. Promoter and other regulatory regions (including both 5′ and 3′ regions) can be identified upstream or downstream of coding and expressed RNAs, and assayed routinely for activity, e.g., by joining to a reporter gene (e.g., CAT, GFP, alkaline phosphatase, luciferase, galactosidase). A promoter obtained from a gene can be used, e.g., in gene therapy to obtain tissue-specific expression of a heterologous gene (e.g., coding for a therapeutic product or cytotoxin). 3′-untranslated sequences (as well as introns) can be used, e.g., to stabilize transcripts, to target transcripts, etc.
- Constructs
- A polynucleotide of the present invention can comprise additional polynucleotide sequences, e.g., sequences to enhance expression, detection, uptake, cataloging, tagging, etc. A polynucleotide can include only coding sequence; a coding sequence and additional non-naturally occurring or heterologous coding sequence (e.g., sequences coding for leader, signal, secretory, targeting, enzymatic, fluorescent, antibiotic resistance, and other functional or diagnostic peptides); coding sequences and non-coding sequences, e.g., untranslated sequences at either a 5′ or 3′ end, or dispersed in the coding sequence, e.g., introns.
- A polynucleotide according to the present invention also can comprise an expression control sequence operably linked to a polynucleotide as described above. The phrase “expression control sequence” means a polynucleotide sequence that regulates expression of a polypeptide coded for by a polynucleotide to which it is functionally (“operably”) linked. Expression can be regulated at the level of the mRNA or polypeptide. Thus, the expression control sequence includes mRNA-related elements and protein-related elements. Such elements include promoters, enhancers (viral or cellular), ribosome binding sequences, transcriptional terminators, etc. An expression control sequence is operably linked to a nucleotide coding sequence when the expression control sequence is positioned in such a manner to effect or achieve expression of the coding sequence. For example, when a promoter is operably linked 5′ to a coding sequence, expression of the coding sequence is driven by the promoter. Expression control sequences can include an initiation codon and additional nucleotides to place a partial nucleotide sequence of the present invention in-frame in order to produce a polypeptide (e.g., pET vectors from Promega have been designed to permit a molecule to be inserted into all three reading frames to identify the one that results in polypeptide expression). Expression control sequences can be heterologous or endogenous to the normal gene.
- A polynucleotide of the present invention can also comprise nucleic acid vector sequences, e.g., for cloning, expression, amplification, selection, etc. Any effective vector can be used. A vector is, e.g., a polynucleotide molecule which can replicate autonomously in a host cell, e.g., containing an origin of replication. Vectors can be useful to perform manipulations, to propagate, and/or obtain large quantities of the recombinant molecule in a desired host. A skilled worker can select a vector depending on the purpose desired, e.g., to propagate the recombinant molecule in bacteria, yeast, insect, or mammalian cells. The following vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, Phagescript, phiX174, pBK Phagemid, pNH8A, pNH16a, pNH18Z, pNH46A (Stratagene); Bluescript KS+II (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: PWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene), pSVK3, PBPV, PMSG, pSVL (Pharmacia), pCR2.1/TOPO, pCRII/TOPO, pCR4/TOPO, pTrcHisB, pCMV6-XL4, etc. However, any other vector, e.g., plasmids, viruses, or parts thereof, may be used as long as they are replicable and viable in the desired host. The vector can also comprise sequences which enable it to replicate in the host whose genome is to be modified.
- Hybridization
- Polynucleotide hybridization, as discussed in more detail below, is useful in a variety of applications, including, in gene detection methods, for identifying mutations, for making mutations, to identify homologs in the same and different species, to identify related members of the same gene family, in diagnostic and prognostic assays, in therapeutic applications (e.g., where an antisense polynucleotide is used to inhibit expression), etc.
- The ability of two single-stranded polynucleotide preparations to hybridize together is a measure of their nucleotide sequence complementarity, e.g., base-pairing between nucleotides, such as A-T, G-C, etc. The invention thus also relates to polynucleotides, and their complements, which hybridize to a polynucleotide comprising a nucleotide sequence as set forth in SEQ ID NO 1, 3, 5, 7, 9, and others and genomic sequences thereof. A nucleotide sequence hybridizing to the latter sequence will have a complementary polynucleotide strand, or act as a template for one in the presence of a polymerase (i.e., an appropriate polynucleotide synthesizing enzyme). The present invention includes both strands of polynucleotide, e.g., a sense strand and an anti-sense strand.
- Hybridization conditions can be chosen to select polynucleotides which have a desired amount of nucleotide complementarity with the nucleotide sequences set forth in SEQ ID NO 1, 3, 5, 7, 9, and others and genomic sequences thereof. A polynucleotide capable of hybridizing to such sequence, preferably, possesses, e.g., about 70%, 75%, 80%, 85%, 87%, 90%, 92%, 95%, 97%, 99%, or 100% complementarity, between the sequences. The present invention particularly relates to polynucleotide sequences which hybridize to the nucleotide sequences set forth in SEQ ID NO 1, 3, 5, 7, 9, and others or genomic sequences thereof, under low or high stringency conditions. These conditions can be used, e.g., to select corresponding homologs in non-human species.
- Polynucleotides which hybridize to polynucleotides of the present invention can be selected in various ways. Filter-type blots (i.e., matrices containing polynucleotide, such as nitrocellulose), glass chips, and other matrices and substrates comprising polynucleotides (short or long) of interest, can be incubated in a prehybridization solution (e.g., 6×SSC, 0.5% SDS, 100 μg/ml denatured salmon sperm DNA, 5× Denhardt's solution, and 50% formamide), at 22-68° C., overnight, and then hybridized with a detectable polynucleotide probe under conditions appropriate to achieve the desired stringency. In general, when high homology or sequence identity is desired, a high temperature can be used (e.g., 65° C.). As the homology drops, lower washing temperatures are used. For salt concentrations, the lower the salt concentration, the higher the stringency. The length of the probe is another consideration. Very short probes (e.g., less than 100 base pairs) are washed at lower temperatures, even if the homology is high. With short probes, formamide can be omitted. See, e.g., Current Protocols in Molecular Biology, Chapter 6, Screening of Recombinant Libraries; Sambrook et al., Molecular Cloning, 1989, Chapter 9.
- For instance, high stringency conditions can be achieved by incubating the blot overnight (e.g., at least 12 hours) with a long polynucleotide probe in a hybridization solution containing, e.g., about 5×SSC, 0.5% SDS, 100 μg/ml denatured salmon sperm DNA and 50% formamide, at 42° C. Blots can be washed at high stringency conditions that allow, e.g., for less than 5% bp mismatch (e.g., wash twice in 0.1×SSC and 0.1% SDS for 30 min at 65° C.), i.e., selecting sequences having 95% or greater sequence identity.
- Other non-limiting examples of high stringency conditions include a final wash at 65° C. in aqueous buffer containing 30 mM NaCl and 0.5% SDS. Another example of high stringency conditions is hybridization in 7% SDS, 0.5 M NaPO4, pH 7, 1 mM EDTA at 50° C., e.g., overnight, followed by one or more washes with a 1% SDS solution at 42° C. Whereas high stringency washes can allow for less than 5% mismatch, reduced or low stringency conditions can permit up to 20% nucleotide mismatch. Hybridization at low stringency can be accomplished as above, but using lower formamide conditions, lower temperatures and/or higher salt concentrations, as well as longer periods of incubation time.
- Hybridization can also be based on a calculation of melting temperature (Tm) of the hybrid formed between the probe and its target, as described in Sambrook et al. Generally, the temperature Tm at which a short oligonucleotide (containing 18 nucleotides or fewer) will melt from its target sequence is given by the following equation: Tm = (number of A's and T's) × 2° C. + (number of C's and G's) × 4° C. For longer molecules, Tm = 81.5 + 16.6 log10[Na+] + 0.41(% GC) − 600/N, where [Na+] is the molar concentration of sodium ions, % GC is the percentage of GC base pairs in the probe, and N is the length. Hybridization can be carried out at several degrees below this temperature to ensure that the probe and target can hybridize. Mismatches can be allowed for by lowering the temperature even further.
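The two melting temperature estimates given above can be worked through in a short sketch. The function names and the example sequences are hypothetical and chosen only for illustration; the formulas are those stated in the text, with the sodium concentration in mol/L and the result in degrees Celsius.

```python
import math


def tm_short_oligo(seq: str) -> float:
    """Rule for oligonucleotides of 18 nucleotides or fewer:
    2 degrees C per A or T plus 4 degrees C per G or C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2.0 * at + 4.0 * gc


def tm_long(seq: str, na_molar: float) -> float:
    """Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 600/N for longer molecules."""
    seq = seq.upper()
    n = len(seq)
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 600.0 / n


# Hypothetical 16-nt oligonucleotide with 8 A/T and 8 G/C: 8*2 + 8*4 = 48 degrees C.
short_tm = tm_short_oligo("ATGCATGCATGCATGC")

# Hypothetical 200-nt probe with 50% GC content at 1.0 M and 0.1 M sodium.
probe = "ATGC" * 50
long_tm_1m = tm_long(probe, na_molar=1.0)     # 81.5 + 0 + 20.5 - 3
long_tm_01m = tm_long(probe, na_molar=0.1)    # 81.5 - 16.6 + 20.5 - 3

print(short_tm, round(long_tm_1m, 1), round(long_tm_01m, 1))
```

A hybridization temperature would then be set several degrees below the computed value, and lowered further to tolerate mismatches.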
- Stringent conditions can be selected to isolate sequences, and their complements, which have, e.g., at least about 90%, 95%, or 97%, nucleotide complementarity between the probe (e.g., a short polynucleotide of SEQ ID NO 1, 3, 5, 7, 9, and others or genomic sequences thereof) and a target polynucleotide.
- Other homologs of polynucleotides of the present invention can be obtained from mammalian and non-mammalian sources according to various methods. For example, hybridization with a polynucleotide can be employed to select homologs, e.g., as described in Sambrook et al., Molecular Cloning, Chapter 11, 1989. Such homologs can have varying amounts of nucleotide and amino acid sequence identity and similarity to such polynucleotides of the present invention. Mammalian organisms include, e.g., mice, rats, monkeys, pigs, cows, etc. Non-mammalian organisms include, e.g., vertebrates, invertebrates, zebra fish, chicken, Drosophila, C. elegans, Xenopus, yeast such as S. pombe, S. cerevisiae, roundworms, prokaryotes, plants, Arabidopsis, artemia, viruses, etc. The degree of nucleotide sequence identity between human and mouse can be about, e.g. 70% or more, 85% or more for open reading frames, etc.
- Alignment
- Alignments can be accomplished by using any effective algorithm. For pairwise alignments of DNA sequences, the methods described by Wilbur-Lipman (e.g., Wilbur and Lipman, Proc. Natl. Acad. Sci., 80:726-730, 1983) or Martinez/Needleman-Wunsch (e.g., Martinez, Nucleic Acid Res., 11:4629-4634, 1983) can be used. For instance, if the Martinez/Needleman-Wunsch DNA alignment is applied, the minimum match can be set at 9, gap penalty at 1.10, and gap length penalty at 0.33. The results can be calculated as a similarity index, equal to the sum of the matching residues divided by the sum of all residues and gap characters, and then multiplied by 100 to express as a percent. Similarity index for related genes at the nucleotide level in accordance with the present invention can be greater than 70%, 80%, 85%, 90%, 95%, 99%, or more. Pairs of protein sequences can be aligned by the Lipman-Pearson method (e.g., Lipman and Pearson, Science, 227:1435-1441, 1985) with k-tuple set at 2, gap penalty set at 4, and gap length penalty set at 12. Results can be expressed as percent similarity index, where related genes at the amino acid level in accordance with the present invention can be greater than 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or more. Various commercial and free sources of alignment programs are available, e.g., MegAlign by DNA Star, BLAST (National Center for Biotechnology Information), BCM (Baylor College of Medicine) Launcher, etc. BLAST can be used to calculate amino acid sequence identity, amino acid sequence homology, and nucleotide sequence identity.
- Percent sequence identity can also be determined by other conventional methods, e.g., as described in Altschul et al., Bull. Math. Bio. 48: 603-616, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-10919, 1992.
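The similarity index described in the Alignment paragraph (matching residues divided by all residues and gap characters, times 100) can be sketched as follows. This is only an illustration under stated assumptions: the two sequences are assumed to be already aligned and of equal length, gaps are written as "-", and the denominator is taken to be the number of alignment columns, which is one reading of "all residues and gap characters". The function name and the example sequences are hypothetical.

```python
def similarity_index(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent similarity index for a pairwise alignment.

    Assumption: the denominator is the number of alignment columns
    (residues plus gap characters); a column counts as a match only
    when both positions hold the same non-gap residue.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    return 100.0 * matches / len(aligned_a)


# Hypothetical 10-column alignment: 8 matching columns, 1 gap, 1 substitution.
index = similarity_index("ACGT-ACGTA", "ACGTTACCTA")
print(index)
```

Producing the alignment itself (e.g., by the Needleman-Wunsch or Lipman-Pearson methods cited above) is a separate step that this sketch does not perform.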
- Specific Polynucleotide Probes
- A polynucleotide of the present invention can comprise any continuous nucleotide sequence of SEQ ID NO 1, 3, 5, 7, 9, and others, sequences which share sequence identity thereto, or complements thereof. The term “probe” refers to any substance that can be used to detect, identify, isolate, etc., another substance. A polynucleotide probe is comprised of nucleic acid and can be used to detect, identify, etc., other nucleic acids, such as DNA and RNA.
- These polynucleotides can be of any desired size that is effective to achieve the specificity desired. For example, a probe can be from about 7 or 8 nucleotides to several thousand nucleotides, depending upon its use and purpose. For instance, a probe used as a PCR primer can be shorter than a probe used in an ordered array of polynucleotide probes. Probe sizes vary, and the invention is not limited in any way by their size, e.g., probes can be from about 7-2000 nucleotides, 7-1000, 8-700, 8-600, 8-500, 8-400, 8-300, 8-150, 8-100, 8-75, 7-50, 10-25, 14-16, at least about 8, at least about 10, at least about 15, at least about 25, etc. The polynucleotides can have non-naturally-occurring nucleotides, e.g., inosine, AZT, 3TC, etc. The polynucleotides can have 100% sequence identity or complementarity to a sequence of SEQ ID NO 1, 3, 5, 7, 9, and others, or they can have mismatches or nucleotide substitutions, e.g., 1, 2, 3, 4, or 5 substitutions. The probes can be single-stranded or double-stranded.
- In accordance with the present invention, a polynucleotide can be present in a kit, where the kit includes, e.g., one or more polynucleotides, a desired buffer (e.g., phosphate, tris, etc.), detection compositions, RNA or cDNA from different tissues to be used as controls, libraries, etc. The polynucleotide can be labeled or unlabeled, with radioactive or non-radioactive labels as known in the art. Kits can comprise one or more pairs of polynucleotides for amplifying nucleic acids specific for human TARPP, e.g., comprising a forward and reverse primer effective in PCR. These include both sense and anti-sense orientations. For instance, in PCR-based methods (such as RT-PCR), a pair of primers is typically used, one having a sense sequence and the other having an antisense sequence.
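Two ideas from the paragraphs above, probes that tolerate a small number of substitutions and primer pairs in sense and antisense orientation, can be illustrated with a minimal sketch. The target and probe sequences are invented for illustration and are not taken from any SEQ ID NO; the function names are likewise hypothetical. Matching here is a simple window-by-window comparison, not a model of hybridization thermodynamics.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the antisense (reverse-complement) sequence of a DNA strand."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_probe_sites(target: str, probe: str, max_mismatches: int = 0):
    """Return (position, mismatches) for every window of the target that
    matches the probe with at most max_mismatches substitutions."""
    target, probe = target.upper(), probe.upper()
    sites = []
    for i in range(len(target) - len(probe) + 1):
        window = target[i:i + len(probe)]
        mismatches = sum(1 for a, b in zip(window, probe) if a != b)
        if mismatches <= max_mismatches:
            sites.append((i, mismatches))
    return sites


# Hypothetical 21-nt target (sense strand) and 8-nt probes.
TARGET = "GGATCCATGGCTAGCTTAAGC"
exact_sites = find_probe_sites(TARGET, "ATGGCTAG")                         # perfect match
one_sub_sites = find_probe_sites(TARGET, "ATGGCAAG", max_mismatches=1)     # one substitution
antisense_probe = reverse_complement("ATGGCTAG")                           # antisense orientation

print(exact_sites, one_sub_sites, antisense_probe)
```

In a primer pair, the forward primer would carry a sense sequence as in `exact_sites` above, while the reverse primer would be the reverse complement of a downstream stretch of the sense strand.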
- Another aspect of the present invention is a nucleotide sequence that is specific to, or for, a selective polynucleotide. The phrases “specific for” or “specific to” a polynucleotide have a functional meaning that the polynucleotide can be used to identify the presence of one or more target genes in a sample. It is specific in the sense that it can be used to detect polynucleotides above background noise (“non-specific binding”). A specific sequence is a defined order of nucleotides which occurs in the polynucleotide, e.g., in the nucleotide sequences of SEQ ID NO 1, 3, 5, 7, 9, and others. A probe or mixture of probes can comprise a sequence or sequences that are specific to a plurality of target sequences, e.g., where the sequence is a consensus sequence, a functional domain, etc., e.g., capable of recognizing a family of related genes. Such sequences can be used as probes in any of the methods described herein or incorporated by reference. Both sense and antisense nucleotide sequences are included. A specific polynucleotide according to the present invention can be determined routinely.
- A polynucleotide comprising a specific sequence can be used as a hybridization probe to identify the presence of, e.g., human or mouse polynucleotide, in a sample comprising a mixture of polynucleotides, e.g., on a Northern blot. Hybridization can be performed under high stringency conditions (see, above) to select polynucleotides (and their complements which can contain the coding sequence) having at least 90%, 95%, 99%, etc., identity (i.e., complementarity) to the probe, but less stringent conditions can also be used. A specific polynucleotide sequence can also be fused in-frame, at either its 5′ or 3′ end, to various nucleotide sequences as mentioned throughout the patent, including coding sequences for enzymes, detectable markers, GFP, etc., expression control sequences, etc.
- A polynucleotide probe, especially one that is specific to a polynucleotide of the present invention, can be used in gene detection and hybridization methods as already described. In one embodiment, a specific polynucleotide probe can be used to detect whether a particular tissue or cell-type is present in a target sample. To carry out such a method, a selective polynucleotide can be chosen which is characteristic of the desired target tissue. Such polynucleotide is preferably chosen so that it is expressed or displayed in the target tissue, but not in other tissues which are present in the sample. Starting from the selective polynucleotide, a specific polynucleotide probe can be designed which hybridizes (if hybridization is the basis of the assay) under the hybridization conditions to the selective polynucleotide, whereby the presence of the selective polynucleotide can be determined.
- Probes which are specific for polynucleotides of the present invention can also be prepared using transcription-based systems, e.g., incorporating an RNA polymerase promoter into a selective polynucleotide of the present invention, and then transcribing anti-sense RNA using the polynucleotide as a template. See, e.g., U.S. Pat. No. 5,545,522.
- Polynucleotide Composition
- A polynucleotide according to the present invention can comprise, e.g., DNA, RNA, synthetic polynucleotide, peptide polynucleotide, modified nucleotides, dsDNA, ssDNA, ssRNA, dsRNA, and mixtures thereof. A polynucleotide can be single- or double-stranded, triplex, DNA:RNA duplexes, comprise hairpins and other secondary structures, etc. Nucleotides comprising a polynucleotide can be joined via various known linkages, e.g., ester, sulfamate, sulfamide, phosphorothioate, phosphoramidate, methylphosphonate, carbamate, etc., depending on the desired purpose, e.g., resistance to nucleases, such as RNAse H, improved in vivo stability, etc. See, e.g., U.S. Pat. No. 5,378,825. Any desired nucleotide or nucleotide analog can be incorporated, e.g., 6-mercaptoguanine, 8-oxo-guanine, etc.
- Various modifications can be made to the polynucleotides, such as attaching detectable markers (avidin, biotin, radioactive elements, fluorescent tags and dyes, energy transfer labels, energy-emitting labels, binding partners, etc.) or moieties which improve hybridization, detection, and/or stability. The polynucleotides can also be attached to solid supports, e.g., nitrocellulose, magnetic or paramagnetic microspheres (e.g., as described in U.S. Pat. Nos. 5,411,863; 5,543,289; for instance, comprising ferromagnetic, supermagnetic, paramagnetic, superparamagnetic, iron oxide and polysaccharide), nylon, agarose, diazotized cellulose, latex solid microspheres, polyacrylamides, etc., according to a desired method. See, e.g., U.S. Pat. Nos. 5,470,967, 5,476,925, and 5,478,893.
- A polynucleotide according to the present invention can be labeled according to any desired method. The polynucleotide can be labeled using radioactive tracers such as 32P, 35S, 3H, or 14C, to mention some commonly used tracers. The radioactive labeling can be carried out according to any method, such as, for example, terminal labeling at the 3′ or 5′ end using a radiolabeled nucleotide, polynucleotide kinase (with or without dephosphorylation with a phosphatase) or a ligase (depending on the end to be labeled). Non-radioactive labeling can also be used, combining a polynucleotide of the present invention with residues having immunological properties (antigens, haptens), a specific affinity for certain reagents (ligands), properties enabling detectable enzyme reactions to be completed (enzymes or coenzymes, enzyme substrates, or other substances involved in an enzymatic reaction), or characteristic physical properties, such as fluorescence or the emission or absorption of light at a desired wavelength, etc.
- Nucleic Acid Detection Methods
- Another aspect of the present invention relates to methods and processes for detecting human TARPP. Detection methods have a variety of applications, including for diagnostic, prognostic, forensic, and research applications. To accomplish gene detection, a polynucleotide in accordance with the present invention can be used as a “probe.” The term “probe” or “polynucleotide probe” has its customary meaning in the art, e.g., a polynucleotide which is effective to identify (e.g., by hybridization), when used in an appropriate process, the presence of a target polynucleotide to which it is designed. Identification can involve simply determining presence or absence, or it can be quantitative, e.g., in assessing amounts of a gene or gene transcript present in a sample. Probes can be useful in a variety of ways, such as for diagnostic purposes, to identify homologs, and to detect, quantitate, or isolate a polynucleotide of the present invention in a test sample.
- Assays can be utilized which permit quantification and/or presence/absence detection of a target nucleic acid in a sample. Assays can be performed at the single-cell level, or in a sample comprising many cells, where the assay is “averaging” expression over the entire collection of cells and tissue present in the sample. Any suitable assay format can be used, including, but not limited to, e.g., Southern blot analysis, Northern blot analysis, polymerase chain reaction (“PCR”) (e.g., Saiki et al., Science, 241:53, 1988; U.S. Pat. Nos. 4,683,195, 4,683,202, and 6,040,166; PCR Protocols: A Guide to Methods and Applications, Innis et al., eds., Academic Press, New York, 1990), reverse transcriptase polymerase chain reaction (“RT-PCR”), anchored PCR, rapid amplification of cDNA ends (“RACE”) (e.g., Schaefer in Gene Cloning and Analysis: Current Innovations, Pages 99-115, 1997), ligase chain reaction (“LCR”) (EP 320 308), one-sided PCR (Ohara et al., Proc. Natl. Acad. Sci., 86:5673-5677, 1989), indexing methods (e.g., U.S. Pat. No. 5,508,169), in situ hybridization, differential display (e.g., Liang et al., Nucl. Acid. Res., 21:3269-3275, 1993; U.S. Pat. Nos. 5,262,311, 5,599,672 and 5,965,409; WO97/18454; Prashar and Weissman, Proc. Natl. Acad. Sci., 93:659-663, and U.S. Pat. Nos. 6,010,850 and 5,712,126; Welsh et al., Nucleic Acid Res., 20:4965-4970, 1992, and U.S. Pat. No. 5,487,985) and other RNA fingerprinting techniques, nucleic acid sequence based amplification (“NASBA”) and other transcription based amplification systems (e.g., U.S. Pat. Nos. 5,409,818 and 5,554,527; WO 88/10315), polynucleotide arrays (e.g., U.S. Pat. Nos. 5,143,854, 5,424,186; 5,700,637, 5,874,219, and 6,054,270; PCT WO 92/10092; PCT WO 90/15070), Qbeta Replicase (PCT/US87/00880), Strand Displacement Amplification (“SDA”), Repair Chain Reaction (“RCR”), nuclease protection assays, subtraction-based methods, Rapid-Scan™, etc. Additional useful methods include, but are not limited to, e.g., template-based amplification methods, competitive PCR (e.g., U.S. Pat. No. 5,747,251), redox-based assays (e.g., U.S. Pat. No. 5,871,918), Taqman-based assays (e.g., Holland et al., Proc. Natl. Acad. Sci., 88:7276-7280, 1991; U.S. Pat. Nos. 5,210,015 and 5,994,063), real-time fluorescence-based monitoring (e.g., U.S. Pat. No. 5,928,907), molecular energy transfer labels (e.g., U.S. Pat. Nos. 5,348,853, 5,532,129, 5,565,322, 6,030,787, and 6,117,635; Tyagi and Kramer, Nature Biotech., 14:303-309, 1996). Any method suitable for single cell analysis of gene or protein expression can be used, including in situ hybridization, immunocytochemistry, MACS, FACS, flow cytometry, etc. For single cell assays, expression products can be measured using antibodies, PCR, or other types of nucleic acid amplification (e.g., Brady et al., Methods Mol. & Cell. Biol. 2, 17-25, 1990; Eberwine et al., Proc. Natl. Acad. Sci., 89, 3010-3014, 1992; U.S. Pat. No. 5,723,290). These and other methods can be carried out conventionally, e.g., as described in the mentioned publications.
- Many such methods may require that the polynucleotide is labeled, or comprises a particular nucleotide type useful for detection. The present invention includes such modified polynucleotides that are necessary to carry out such methods. Thus, polynucleotides can be DNA, RNA, DNA:RNA hybrids, PNA, etc., and can comprise any modification or substituent which is effective to achieve detection.
- Detection can be desirable for a variety of different purposes, including research, diagnostic, prognostic, and forensic. For diagnostic purposes, it may be desirable to identify the presence or quantity of a polynucleotide sequence in a sample, where the sample is obtained from tissue, cells, body fluids, etc. In a preferred method as described in more detail below, the present invention relates to a method of detecting a polynucleotide comprising, contacting a target polynucleotide in a test sample with a polynucleotide probe under conditions effective to achieve hybridization between the target and probe; and detecting hybridization.
- Any test sample in which it is desired to identify a polynucleotide or polypeptide thereof can be used, including, e.g., blood, urine, saliva, stool (for extracting nucleic acid, see, e.g., U.S. Pat. No. 6,177,251), swabs comprising tissue, biopsied tissue, tissue sections, cultured cells, etc.
- Detection can be accomplished in combination with polynucleotide probes for other genes, e.g., genes which are expressed in other disease states, tissues, cells, such as brain, heart, kidney, spleen, thymus, liver, stomach, small intestine, colon, muscle, lung, testis, placenta, pituitary, thyroid, skin, adrenal gland, pancreas, salivary gland, uterus, ovary, prostate gland, peripheral blood cells (T-cells, lymphocytes, etc.), embryo, normal breast fat, adult and embryonic stem cells, specific cell-types, such as endothelial, epithelial, myocytes, adipose, luminal epithelial, basoepithelial, myoepithelial, stromal cells, etc.
- Polynucleotides can be used in a wide range of methods and compositions, including for detecting, diagnosing, staging, grading, assessing, prognosticating, etc. diseases and disorders associated with human TARPP, for monitoring or assessing therapeutic and/or preventative measures, in ordered arrays, etc. Any method of detecting genes and polynucleotides of
SEQ ID NO 1, 3, 5, 7, 9, and others can be used; certainly, the present invention is not to be limited by how such methods are implemented.
- Along these lines, the present invention relates to methods of detecting human TARPP in a sample comprising nucleic acid. Such methods can comprise one or more of the following steps in any effective order, e.g., contacting said sample with a polynucleotide probe under conditions effective for said probe to hybridize specifically to nucleic acid in said sample, and detecting the presence or absence of probe hybridized to nucleic acid in said sample, wherein said probe is a polynucleotide which is
SEQ ID NO 1, 3, 5, 7, 9, and others, a polynucleotide having, e.g., about 70%, 80%, 85%, 90%, 95%, 99%, or more sequence identity thereto, effective or specific fragments thereof, or complements thereto. The detection method can be applied to any sample, e.g., cultured primary, secondary, or established cell lines, tissue biopsy, blood, urine, stool, cerebral spinal fluid, and other bodily fluids, for any purpose.
- Contacting the sample with probe can be carried out by any effective means in any effective environment. It can be accomplished in a solid, liquid, frozen, gaseous, amorphous, solidified, coagulated, or colloid matrix, etc., or in mixtures thereof. For instance, a probe in an aqueous medium can be contacted with a sample which is also in an aqueous medium, or which is affixed to a solid matrix, or vice-versa.
- Generally, as used throughout the specification, the term “effective conditions” means, e.g., the particular milieu in which the desired effect is achieved. Such a milieu, includes, e.g., appropriate buffers, oxidizing agents, reducing agents, pH, co-factors, temperature, ion concentrations, suitable age and/or stage of cell (such as, in particular part of the cell cycle, or at a particular stage where particular genes are being expressed) where cells are being used, culture conditions (including substrate, oxygen, carbon dioxide, etc.). When hybridization is the chosen means of achieving detection, the probe and sample can be combined such that the resulting conditions are functional for said probe to hybridize specifically to nucleic acid in said sample.
- The phrase “hybridize specifically” indicates that the hybridization between single-stranded polynucleotides is based on nucleotide sequence complementarity. The effective conditions are selected such that the probe hybridizes to a preselected and/or definite target nucleic acid in the sample. For instance, if detection of a polynucleotide set forth in
SEQ ID NO 1, 3, 5, 7, 9, and others is desired, a probe can be selected which can hybridize to such target gene under high stringency conditions, without significant hybridization to other genes in the sample. To detect homologs of a polynucleotide set forth in SEQ ID NO 1, 3, 5, 7, 9, and others, the effective hybridization conditions can be less stringent, and/or the probe can comprise codon degeneracy, such that a homolog is detected in the sample.
- As already mentioned, the methods can be carried out by any effective process, e.g., by Northern blot analysis, polymerase chain reaction (PCR), reverse transcriptase PCR, RACE PCR, in situ hybridization, etc., as indicated above. When PCR based techniques are used, two or more probes are generally used. One probe can be specific for a defined sequence which is characteristic of a selective polynucleotide, but the other probe can be specific for the selective polynucleotide, or specific for a more general sequence, e.g., a sequence such as polyA which is characteristic of mRNA, a sequence which is specific for a promoter, ribosome binding site, or other transcriptional features, a consensus sequence (e.g., representing a functional domain). For the former aspects, 5′ and 3′ probes (e.g., polyA, Kozak, etc.) are preferred which are capable of specifically hybridizing to the ends of transcripts. When PCR is utilized, the probes can also be referred to as “primers” in that they can prime a DNA polymerase reaction.
- In addition to testing for the presence or absence of polynucleotides, the present invention also relates to determining the amounts at which polynucleotides of the present invention are expressed in a sample and determining the differential expression of such polynucleotides in samples. Such methods can involve substantially the same steps as described above for presence/absence detection, e.g., contacting with probe, hybridizing, and detecting hybridized probe, but using more quantitative methods and/or comparisons to standards.
- The amount of hybridization between the probe and target can be determined by any suitable method, e.g., PCR, RT-PCR, RACE PCR, Northern blot, polynucleotide microarrays, Rapid-Scan, etc., and includes both quantitative and qualitative measurements. For further details, see the hybridization methods described above and below. Determining by such hybridization whether the target is differentially expressed (e.g., up-regulated or down-regulated) in the sample can also be accomplished by any effective means. For instance, the target's expression pattern in the sample can be compared to its pattern in a known standard, such as in a normal tissue, or it can be compared to another gene in the same sample. When a second sample is utilized for the comparison, it can be a sample of normal tissue that is known not to contain diseased cells. The comparison can be performed on samples which contain the same amount of RNA (such as polyadenylated RNA or total RNA), or on RNA extracted from the same amounts of starting tissue. Such a second sample can also be referred to as a control or standard. Hybridization can also be compared to a second target in the same tissue sample. Experiments can be performed that determine a ratio between the target nucleic acid and a second nucleic acid (a standard or control), e.g., in a normal tissue. When the ratio between the target and control is substantially the same in the normal tissue and in the sample, the sample is determined or diagnosed not to contain cancer cells. However, if the ratio is different between the normal and sample tissues, the sample is determined to contain cancer cells. The approaches can be combined, and one or more second samples, or second targets can be used. Any second target nucleic acid can be used as a comparison, including “housekeeping” genes, such as beta-actin, alcohol dehydrogenase, or any other gene whose expression does not vary depending upon the disease status of the cell.
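- By way of a non-limiting sketch, the ratio comparison described above can be expressed as follows. The function names, the signal values, and the two-fold cutoff are assumptions chosen for illustration; the specification does not define what counts as "substantially the same".

```python
# Illustrative sketch only: names, values, and the 2-fold cutoff are hypothetical.

def expression_ratio(target_signal: float, control_signal: float) -> float:
    """Ratio of the target signal to a control (e.g., housekeeping gene) signal."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return target_signal / control_signal


def is_differentially_expressed(sample_ratio: float,
                                normal_ratio: float,
                                fold_cutoff: float = 2.0) -> bool:
    """True when the sample ratio differs from the normal ratio by the cutoff or more."""
    if normal_ratio <= 0 or fold_cutoff <= 1:
        raise ValueError("normal ratio must be positive and cutoff greater than 1")
    fold_change = sample_ratio / normal_ratio
    # Up-regulation and down-regulation are treated symmetrically.
    return fold_change >= fold_cutoff or fold_change <= 1.0 / fold_cutoff


# Hypothetical hybridization signals (arbitrary units).
normal = expression_ratio(target_signal=50.0, control_signal=100.0)
sample = expression_ratio(target_signal=200.0, control_signal=100.0)
print(is_differentially_expressed(sample, normal))
```

Normalizing to a control measured in the same sample corrects for differences in the amount of RNA loaded, which is why the ratio, rather than the raw target signal, is compared.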
- Methods of Identifying Polymorphisms, Mutations, etc., of Human TARPP
- Polynucleotides of the present invention can also be utilized to identify mutant alleles, SNPs, gene rearrangements and modifications, and other polymorphisms of the wild-type gene. Mutant alleles, polymorphisms, SNPs, etc., can be identified and isolated from cancers that are known, or suspected to have, a genetic component. Identification of such genes can be carried out routinely (see, above for more guidance), e.g., using PCR, hybridization techniques, direct sequencing, mismatch reactions (see, e.g., above), RFLP analysis, SSCP (e.g., Orita et al., Proc. Natl. Acad. Sci., 86:2766, 1992), etc., where a polynucleotide having a sequence selected from
SEQ ID NO 1, 3, 5, 7, 9, and others is used as a probe. The selected mutant alleles, SNPs, polymorphisms, etc., can be used diagnostically to determine whether a subject has, or is susceptible to a disorder associated with human TARPP, as well as to design therapies and predict the outcome of the disorder. Methods involve, e.g., diagnosing a disorder associated with human TARPP or determining susceptibility to a disorder, comprising, detecting the presence of a mutation in a gene represented by a polynucleotide selected from SEQ ID NO 1, 3, 5, 7, 9, and others. The detecting can be carried out by any effective method, e.g., obtaining cells from a subject, determining the gene sequence or structure of a target gene (using, e.g., mRNA, cDNA, genomic DNA, etc.), comparing the sequence or structure of the target gene to the structure of the normal gene, whereby a difference in sequence or structure indicates a mutation in the gene in the subject. Polynucleotides can also be used to test for mutations, SNPs, polymorphisms, etc., e.g., using mismatch DNA repair technology as described in U.S. Pat. Nos. 5,683,877; 5,656,430; Wu et al., Proc. Natl. Acad. Sci., 89:8779-8783, 1992.
- The present invention also relates to methods of detecting polymorphisms in human TARPP, comprising, e.g., comparing the structure of: genomic DNA comprising all or part of human TARPP, mRNA comprising all or part of human TARPP, cDNA comprising all or part of human TARPP, or a polypeptide comprising all or part of human TARPP, with the structure of human TARPP set forth in SEQ ID NOS. 1-8. The methods can be carried out on a sample from any source, e.g., cells, tissues, body fluids, blood, urine, stool, hair, egg, sperm, cerebral spinal fluid, etc.
- These methods can be implemented in many different ways. For example, “comparing the structure” steps include, but are not limited to, comparing restriction maps, nucleotide sequences, amino acid sequences, RFLPs, Dnase sites, DNA methylation fingerprints (e.g., U.S. Pat. No. 6,214,556), protein cleavage sites, molecular weights, electrophoretic mobilities, charges, ion mobility, etc., between a standard human TARPP and a test human TARPP. The term “structure” can refer to any physical characteristics or configurations which can be used to distinguish between nucleic acids and polypeptides. The methods and instruments used to accomplish the comparing step depends upon the physical characteristics which are to be compared. Thus, various techniques are contemplated, including, e.g., sequencing machines (both amino acid and polynucleotide), electrophoresis, mass spectrometer (U.S. Pat. Nos. 6,093,541, 6,002,127), liquid chromatography, HPLC, etc.
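- Where the structures being compared are nucleotide or amino acid sequences, the comparing step can be sketched as a position-by-position comparison of a standard and a test sequence. The function name and the example sequences below are hypothetical, and the sketch assumes the two sequences are already aligned and of equal length, so that only substitutions (not insertions or deletions) are reported.

```python
# Illustrative sketch only: names and sequences are hypothetical, and the
# inputs are assumed to be pre-aligned and of equal length.

def sequence_differences(standard: str, test: str) -> list[tuple[int, str, str]]:
    """List (1-based position, standard residue, test residue) for each mismatch."""
    if len(standard) != len(test):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (position, std_residue, test_residue)
        for position, (std_residue, test_residue)
        in enumerate(zip(standard.upper(), test.upper()), start=1)
        if std_residue != test_residue
    ]


print(sequence_differences("ATGGCC", "ATGACC"))  # prints [(4, 'G', 'A')]
```

An empty list indicates no difference over the compared region; sequences of unequal length would first need an alignment step, which is outside this sketch.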
- To carry out such methods, “all or part” of the gene or polypeptide can be compared. For example, if nucleotide sequencing is utilized, the entire gene can be sequenced, including promoter, introns, and exons, or only parts of it can be sequenced and compared, e.g., exon 1, exon 2, etc.
- Mutagenesis
- Mutated polynucleotide sequences of the present invention are useful for various purposes, e.g., to create mutations of the polypeptides they encode, to identify functional regions of genomic DNA, to produce probes for screening libraries, etc. Mutagenesis can be carried out routinely according to any effective method, e.g., oligonucleotide-directed (Smith, M., Ann. Rev. Genet. 19:423-463, 1985), degenerate oligonucleotide-directed (Hill et al., Methods in Enzymology, 155:558-568, 1987), region-specific (Myers et al., Science, 229:242-246, 1985; Derbyshire et al., Gene, 46:145, 1986; Ner et al., DNA, 7:127, 1988), linker-scanning (McKnight and Kingsbury, Science, 217:316-324, 1982), directed using PCR, recursive ensemble mutagenesis (Arkin and Youvan, Proc. Natl. Acad. Sci., 89:7811-7815, 1992), random mutagenesis (e.g., U.S. Pat. Nos. 5,096,815; 5,198,346; and 5,223,409), site-directed mutagenesis (e.g., Walder et al., Gene, 42:133, 1986; Bauer et al., Gene, 37:73, 1985; Craik, BioTechniques, Jan. 12-19, 1985; Smith et al., Genetic Engineering: Principles and Methods, Plenum Press, 1981), phage display (e.g., Lowman et al., Biochem. 30:10832-10837, 1991; Ladner et al., U.S. Pat. No. 5,223,409; Huse, WIPO Publication WO 92/06204), etc. Desired sequences can also be produced by the assembly of target sequences using mutually priming oligonucleotides (Uhlmann, Gene, 71:29-40, 1988). For directed mutagenesis methods, analysis of the three-dimensional structure of the human TARPP polypeptide can be used to guide and facilitate making mutants which affect polypeptide activity. Sites of substrate-enzyme interaction or other biological activities can also be determined by analysis of crystal structure as determined by such techniques as nuclear magnetic resonance, crystallography or photoaffinity labeling. See, for example, de Vos et al., Science 255:306-312, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodawer et al., FEBS Lett. 309:59-64, 1992.
- In addition, libraries of human TARPP and fragments thereof can be used for screening and selection of human TARPP variants. For instance, a library of coding sequences can be generated by treating a double-stranded DNA with a nuclease under conditions where the nicking occurs, e.g., only once per molecule, denaturing the double-stranded DNA, renaturing it to form double-stranded DNA that can include sense/antisense pairs from different nicked products, removing single-stranded portions from reformed duplexes by treatment with S1 nuclease, and ligating the resulting DNAs into an expression vector. By this method, expression libraries can be made comprising “mutagenized” human TARPP. The entire coding sequence or parts thereof can be used.
- Polynucleotide Expression, Polypeptides Produced Thereby, and Specific-binding Partners thereto.
- A polynucleotide according to the present invention can be expressed in a variety of different systems, in vitro and in vivo, according to the desired purpose. For example, a polynucleotide can be inserted into an expression vector, introduced into a desired host, and cultured under conditions effective to achieve expression of a polypeptide coded for by the polynucleotide, to search for specific binding partners. Effective conditions include any culture conditions which are suitable for achieving production of the polypeptide by the host cell, including effective temperatures, pH, medium, additives to the media in which the host cell is cultured (e.g., additives which amplify or induce expression such as butyrate, or methotrexate if the coding polynucleotide is adjacent to a dhfr gene), cycloheximide, cell densities, culture dishes, etc. A polynucleotide can be introduced into the cell by any effective method including, e.g., naked DNA, calcium phosphate precipitation, electroporation, injection, DEAE-Dextran mediated transfection, fusion with liposomes, association with agents which enhance its uptake into cells, viral transfection. A cell into which a polynucleotide of the present invention has been introduced is a transformed host cell. The polynucleotide can be extrachromosomal or integrated into a chromosome(s) of the host cell. It can be stable or transient. An expression vector is selected for its compatibility with the host cell. Host cells include, mammalian cells, e.g., COS, CV1, BHK, CHO, HeLa, LTK, NIH 3T3, CNS neural stem cells (e.g., U.S. Pat. No. 
6,103,530), IMR-32, A172 (ATCC CRL-1620), T98G (ATCC CRL-1690), CCF-STTG1 (ATCC CRL-1718), DBTRG-05MG (ATCC CRL-2020), PFSK-1 (ATCC CRL-2060), SK-N-AS and other SK cell lines (ATCC CRL-2137), CHP-212 (ATCC CRL-2273), RG2 (ATCC CRL-2433), HCN-2 (ATCC CRL-10742), U-87 MG and other U MG cell lines (ATCC HTB-14), D283 Med (ATCC HTB-185), PC 12, Neuro-2a (ATCC CCL-131), HH (ATCC CRL 2105), MOLT-4 (ATCC CRL 1582), MJ (ATCC CRL-8294), SK7 (ATCC HB-8584), SK8 (ATCC HB-8585), HM1 (HB-8586), H9 (ATCC HTB-176), HuT 78 (ATCC TIB-161), HuT 102 (ATCC TIB-162), Jurkat, insect cells, such as Sf9 (S. frugiperda) and Drosophila, bacteria, such as E. coli, Streptococcus, Bacillus, yeast, such as Saccharomyces, S. cerevisiae, fungal cells, plant cells, embryonic or adult stem cells (e.g., mammalian, such as mouse or human).
- Expression control sequences are similarly selected for host compatibility and a desired purpose, e.g., high copy number, high amounts, induction, amplification, controlled expression. Other sequences which can be employed include enhancers such as from SV40, CMV, RSV, inducible promoters, cell-type specific elements, or sequences which allow selective or specific cell expression. Promoters that can be used to drive its expression, include, e.g., the endogenous promoter, MMTV, SV40, trp, lac, tac, or T7 promoters for bacterial hosts; or alpha factor, alcohol oxidase, or PGH promoters for yeast. RNA promoters can be used to produce RNA transcripts, such as T7 or SP6. See, e.g., Melton et al., Nucleic Acids Res., 12(18):7035-7056, 1984; Dunn and Studier, J. Mol. Bio., 166:477-535, 1984; U.S. Pat. No. 5,891,636; Studier et al., Gene Expression Technology, Methods in Enzymology, 85:60-89, 1987. In addition, as discussed above, translational signals (including in-frame insertions) can be included.
- When a polynucleotide is expressed as a heterologous gene in a transfected cell line, the gene is introduced into a cell as described above, under effective conditions in which the gene is expressed. The term “heterologous” means that the gene has been introduced into the cell line by the “hand-of-man.” Introduction of a gene into a cell line is discussed above. The transfected (or transformed) cell expressing the gene can be lysed or the cell line can be used intact.
- For expression and other purposes, a polynucleotide can contain codons found in a naturally-occurring gene, transcript, or cDNA, e.g., as set forth in
SEQ ID NO 1, 3, 5, 7, 9, and others, or it can contain degenerate codons coding for the same amino acid sequences. For instance, it may be desirable to change the codons in the sequence to optimize the sequence for expression in a desired host. See, e.g., U.S. Pat. Nos. 5,567,600 and 5,567,862.
- A polypeptide according to the present invention can be recovered from natural sources, transformed host cells (culture medium or cells) according to the usual methods, including, detergent extraction (e.g., non-ionic detergent, Triton X-100, CHAPS, octylglucoside, Igepal CA-630), ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, hydroxyapatite chromatography, lectin chromatography, gel electrophoresis. Protein refolding steps can be used, as necessary, in completing the configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for purification steps. Another approach is to express the polypeptide recombinantly with an affinity tag (Flag epitope, HA epitope, myc epitope, 6×His, maltose binding protein, chitinase, etc.) and then purify by anti-tag antibody-conjugated affinity chromatography.
- The present invention also relates to antibodies, and other specific-binding partners, which are specific for polypeptides encoded by polynucleotides of the present invention, e.g., human TARPP. Antibodies, e.g., polyclonal, monoclonal, recombinant, chimeric, humanized, single-chain, Fab, and fragments thereof, can be prepared according to any desired method. See, also, screening recombinant immunoglobulin libraries (e.g., Orlandi et al., Proc. Natl. Acad. Sci., 86:3833-3837, 1989; Huse et al., Science, 256:1275-1281, 1989); in vitro stimulation of lymphocyte populations; Winter and Milstein, Nature, 349: 293-299, 1991. The antibodies can be IgM, IgG, subtypes, IgG2a, IgG1, etc. Antibodies, and immune responses, can also be generated by administering naked DNA. See, e.g., U.S. Pat. Nos. 5,703,055; 5,589,466; 5,580,859. Antibodies can be used from any source, including, goat, rabbit, mouse, chicken (e.g., IgY; see, Duan, W0/029444 for methods of making antibodies in avian hosts, and harvesting the antibodies from the eggs). An antibody specific for a polypeptide means that the antibody recognizes a defined sequence of amino acids within or including the polypeptide. Other specific binding partners include, e.g., aptamers and PNA. Antibodies can be prepared against specific epitopes or domains of human TARPP, e.g., 1-161, 88-161, 267-300, 312-331, comprising
amino acid 312, and comprising any of the amino acid differences between mouse and human as shown in FIG. 3. - The preparation of polyclonal antibodies is well-known to those skilled in the art. See, for example, Green et al., Production of Polyclonal Antisera, in IMMUNOCHEMICAL PROTOCOLS (Manson, ed.), pages 1-5 (Humana Press 1992); Coligan et al., Production of Polyclonal Antisera in Rabbits, Rats, Mice and Hamsters, in CURRENT PROTOCOLS IN IMMUNOLOGY, section 2.4.1 (1992). The preparation of monoclonal antibodies likewise is conventional. See, for example, Kohler & Milstein, Nature 256:495 (1975); Coligan et al., sections 2.5.1-2.6.7; and Harlow et al., ANTIBODIES: A LABORATORY MANUAL, page 726 (Cold Spring Harbor Pub. 1988).
- Antibodies can also be humanized, e.g., where they are to be used therapeutically. Humanized monoclonal antibodies are produced by transferring mouse complementarity determining regions from heavy and light variable chains of the mouse immunoglobulin into a human variable domain, and then substituting human residues in the framework regions of the murine counterparts. The use of antibody components derived from humanized monoclonal antibodies obviates potential problems associated with the immunogenicity of murine constant regions. General techniques for cloning murine immunoglobulin variable domains are described, for example, by Orlandi et al., Proc. Nat'l Acad. Sci. USA 86:3833 (1989), which is hereby incorporated in its entirety by reference. Techniques for producing humanized monoclonal antibodies are described, for example, in U.S. Pat. No. 6,054,297, Jones et al., Nature 321: 522 (1986); Riechmann et al., Nature 332: 323 (1988); Verhoeyen et al., Science 239: 1534 (1988); Carter et al., Proc. Nat'l Acad. Sci. USA 89: 4285 (1992); Sandhu, Crit. Rev. Biotech. 12: 437 (1992); and Singer et al., J. Immunol. 150: 2844 (1993).
- Antibodies of the invention also may be derived from human antibody fragments isolated from a combinatorial immunoglobulin library. See, for example, Barbas et al., METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, VOL. 2, page 119 (1991); Winter et al., Ann. Rev. Immunol. 12: 433 (1994). Cloning and expression vectors that are useful for producing a human immunoglobulin phage library can be obtained commercially, for example, from STRATAGENE Cloning Systems (La Jolla, Calif.).
- In addition, antibodies of the present invention may be derived from a human monoclonal antibody. Such antibodies are obtained from transgenic mice that have been “engineered” to produce specific human antibodies in response to antigenic challenge. In this technique, elements of the human heavy and light chain loci are introduced into strains of mice derived from embryonic stem cell lines that contain targeted disruptions of the endogenous heavy and light chain loci. The transgenic mice can synthesize human antibodies specific for human antigens and can be used to produce human antibody-secreting hybridomas. Methods for obtaining human antibodies from transgenic mice are described, e.g., in Green et al., Nature Genet. 7:13 (1994); Lonberg et al., Nature 368:856 (1994); and Taylor et al., Int. Immunol. 6:579 (1994).
- Antibody fragments of the present invention can be prepared by proteolytic hydrolysis of the antibody or by expression in E. coli of nucleic acid encoding the fragment. Antibody fragments can be obtained by pepsin or papain digestion of whole antibodies by conventional methods. For example, antibody fragments can be produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment denoted F(ab′)2. This fragment can be further cleaved using a thiol reducing agent, and optionally a blocking group for the sulfhydryl groups resulting from cleavage of disulfide linkages, to produce 3.5S Fab′ monovalent fragments. Alternatively, an enzymatic cleavage using pepsin produces two monovalent Fab′ fragments and an Fc fragment directly. These methods are described, for example, by Goldenberg, U.S. Pat. Nos. 4,036,945 and 4,331,647, and references contained therein. These patents are hereby incorporated in their entireties by reference. See also Nisonoff et al., Arch. Biochem. Biophys. 89:230 (1960); Porter, Biochem. J. 73:119 (1959); Edelman et al., METHODS IN ENZYMOLOGY, VOL. 1, page 422 (Academic Press 1967); and Coligan et al. at sections 2.8.1-2.8.10 and 2.10.1-2.10.4.
- Other methods of cleaving antibodies, such as separation of heavy chains to form monovalent light-heavy chain fragments, further cleavage of fragments, or other enzymatic, chemical, or genetic techniques can also be used. For example, Fv fragments comprise an association of VH and VL chains. This association may be noncovalent, as described in Inbar et al., Proc. Nat'l Acad. Sci. USA 69:2659 (1972). Alternatively, the variable chains can be linked by an intermolecular disulfide bond or cross-linked by chemicals such as glutaraldehyde. See, e.g., Sandhu, supra. Preferably, the Fv fragments comprise VH and VL chains connected by a peptide linker. These single-chain antigen binding proteins (sFv) are prepared by constructing a structural gene comprising nucleic acid sequences encoding the VH and VL domains connected by an oligonucleotide. The structural gene is inserted into an expression vector, which is subsequently introduced into a host cell such as E. coli. The recombinant host cells synthesize a single polypeptide chain with a linker peptide bridging the two V domains. Methods for producing sFvs are described, for example, by Whitlow et al., METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, VOL. 2, page 97 (1991); Bird et al., Science 242:423-426 (1988); Ladner et al., U.S. Pat. No. 4,946,778; Pack et al., Bio/Technology 11:1271-77 (1993); and Sandhu, supra.
- Another form of an antibody fragment is a peptide coding for a single complementarity-determining region (CDR). CDR peptides (“minimal recognition units”) can be obtained by constructing genes encoding the CDR of an antibody of interest. Such genes are prepared, for example, by using the polymerase chain reaction to synthesize the variable region from RNA of antibody-producing cells. See, for example, Larrick et al., METHODS: A COMPANION TO METHODS IN ENZYMOLOGY, VOL. 2, page 106 (1991).
- The term “antibody” as used herein includes intact molecules as well as fragments thereof, such as Fab, F(ab′)2, and Fv, which are capable of binding to an epitopic determinant present in a human TARPP polypeptide. Such antibody fragments retain some ability to selectively bind with their antigen or receptor. The term “epitope” refers to an antigenic determinant on an antigen to which the paratope of an antibody binds. Epitopic determinants usually consist of chemically active surface groupings of molecules such as amino acids or sugar side chains and usually have specific three dimensional structural characteristics, as well as specific charge characteristics. Antibodies can be prepared against specific epitopes or polypeptide domains.
- Antibodies which bind to human TARPP polypeptides of the present invention can be prepared using an intact polypeptide or fragments containing small peptides of interest as the immunizing antigen. For example, it may be desirable to produce antibodies that specifically bind to the N- or C-terminal domains of human TARPP. The polypeptide or peptide used to immunize an animal can be derived from translated cDNA or chemically synthesized, and can be conjugated to a carrier protein, if desired. Commonly used carriers which are chemically coupled to the immunizing peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), and tetanus toxoid.
- Polyclonal or monoclonal antibodies can be further purified, for example, by binding to and elution from a matrix to which the polypeptide or a peptide to which the antibodies were raised is bound. Those of skill in the art will know of various techniques common in the immunology arts for purification and/or concentration of polyclonal antibodies, as well as monoclonal antibodies (See for example, Coligan, et al., Unit 9, Current Protocols in Immunology, Wiley Interscience, 1994, incorporated by reference).
- Anti-idiotype technology can also be used to produce invention monoclonal antibodies which mimic an epitope. For example, an anti-idiotypic monoclonal antibody made to a first monoclonal antibody will have a binding domain in the hypervariable region which is the “image” of the epitope bound by the first monoclonal antibody.
- Methods of Detecting Polypeptides
- Polypeptides coded for by human TARPP of the present invention can be detected, visualized, determined, quantitated, etc. according to any effective method. Useful methods include, but are not limited to, immunoassays, RIA (radioimmunoassay), ELISA (enzyme-linked immunosorbent assay), immunofluorescence, flow cytometry, histology, electron microscopy, light microscopy, in situ assays, immunoprecipitation, and Western blot.
- Immunoassays may be carried out in liquid or on a biological support. For instance, a sample (e.g., blood, stool, urine, cells, tissue, cerebral spinal fluid, body fluids, etc.) can be brought in contact with and immobilized onto a solid phase support or carrier such as nitrocellulose, or other solid support that is capable of immobilizing cells, cell particles or soluble proteins. The support may then be washed with suitable buffers followed by treatment with the detectably labeled human TARPP specific antibody. The solid phase support can then be washed with a buffer a second time to remove unbound antibody. The amount of bound label on the solid support may then be detected by conventional means.
- A “solid phase support or carrier” includes any support capable of binding an antigen, antibody, or other specific binding partner. Supports or carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, and magnetite. A support material can have any structural or physical configuration. Thus, the support configuration may be spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. Preferred supports include polystyrene beads.
- One of the many ways in which a gene peptide-specific antibody can be detectably labeled is by linking it to an enzyme and using it in an enzyme immunoassay (EIA). See, e.g., Voller, A., “The Enzyme Linked Immunosorbent Assay (ELISA),” 1978, Diagnostic Horizons 2, 1-7, Microbiological Associates Quarterly Publication, Walkersville, Md.; Voller, A. et al., 1978, J. Clin. Pathol. 31, 507-520; Butler, J. E., 1981, Meth. Enzymol. 73, 482-523; Maggio, E. (ed.), 1980, Enzyme Immunoassay, CRC Press, Boca Raton, Fla. The enzyme which is bound to the antibody will react with an appropriate substrate, preferably a chromogenic substrate, in such a manner as to produce a chemical moiety that can be detected, for example, by spectrophotometric, fluorimetric or visual means. Enzymes that can be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, α-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, β-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished by colorimetric methods that employ a chromogenic substrate for the enzyme. Detection may also be accomplished by visual comparison of the extent of enzymatic reaction of a substrate in comparison with similarly prepared standards.
- Detection may also be accomplished using any of a variety of other immunoassays. For example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect human TARPP peptides through the use of a radioimmunoassay (RIA). See, e.g., Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986. The radioactive isotope can be detected by such means as the use of a gamma counter or a scintillation counter or by autoradiography.
- It is also possible to label the antibody with a fluorescent compound. When the fluorescently labeled antibody is exposed to light of the proper wavelength, its presence can then be detected due to fluorescence. Among the most commonly used fluorescent labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and fluorescamine. The antibody can also be detectably labeled using fluorescence emitting metals such as those in the lanthanide series. These metals can be attached to the antibody using such metal chelating groups as diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA).
- The antibody also can be detectably labeled by coupling it to a chemiluminescent compound. The presence of the chemiluminescent-tagged antibody is then determined by detecting the presence of luminescence that arises during the course of a chemical reaction. Examples of useful chemiluminescent labeling compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester.
- Likewise, a bioluminescent compound may be used to label the antibody of the present invention. Bioluminescence is a type of chemiluminescence found in biological systems in which a catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a bioluminescent protein is determined by detecting the presence of luminescence. Important bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin.
- Diagnostic
- The present invention also relates to methods and compositions for diagnosing a disorder of nervous or immune (e.g., lymphocyte) tissues, or determining susceptibility to a disorder, using polynucleotides, polypeptides, and specific-binding partners of the present invention to detect, assess, determine, etc., human TARPP. In such methods, the gene can serve as a marker for the disorder, e.g., where the gene, when mutant, is a direct cause of the disorder; where the gene is affected by another gene(s) which is directly responsible for the disorder, e.g., when the gene is part of the same signaling pathway as the directly responsible gene; and, where the gene is chromosomally linked to the gene(s) directly responsible for the disorder, and segregates with it. Many other situations are possible. To detect, assess, determine, etc., a probe specific for the gene can be employed as described above and below. Any method of detecting and/or assessing the gene can be used, including detecting expression of the gene using polynucleotides, antibodies, or other specific-binding partners.
- The present invention relates to methods of diagnosing a disorder associated with human TARPP, or determining a subject's susceptibility to such disorder, comprising, e.g., assessing the expression of said gene(s) in a tissue sample comprising tissue or cells suspected of having the disorder. The phrase “diagnosing” indicates that it is determined whether the sample has the disorder. A “disorder” means, e.g., any abnormal condition as in a disease or malady. “Determining a subject's susceptibility to a disease or disorder” indicates that the subject is assessed for whether she is predisposed to get such a disease or disorder, where the predisposition is indicated by abnormal expression of the gene (e.g., gene mutation, gene expression pattern is not normal, etc.). Predisposition or susceptibility to a disease may result when such a disease is influenced by epigenetic, environmental, etc., factors. This includes prenatal screening where samples from the fetus or embryo (e.g., via amniocentesis or CV sampling) are analyzed for the expression of the gene.
- By the phrase “assessing expression of human TARPP,” it is meant that the functional status of the gene is evaluated. This includes, but is not limited to, measuring expression levels of said gene, determining the genomic structure of said gene, determining the mRNA structure of transcripts from said gene, or measuring the expression levels of polypeptide coded for by said gene. Thus, the term “assessing expression” includes evaluating all aspects of the transcriptional and translational machinery of the gene. For instance, if a promoter defect causes, or is suspected of causing, the disorder, then a sample can be evaluated (i.e., “assessed”) by looking (e.g., sequencing or restriction mapping) at the promoter sequence in the gene, by detecting transcription products (e.g., RNA), or by detecting translation product (e.g., polypeptide). Any measure of whether the gene is functional can be used, including polypeptide, polynucleotide, and functional assays for the gene's biological activity.
- In making the assessment, it can be useful to compare the results to a normal gene, e.g., a gene which is not associated with the disorder. The nature of the comparison can be determined routinely, depending upon how the assessing is accomplished. If, for example, the mRNA levels of a sample are detected, then the mRNA levels of a normal sample can serve as a comparison, or the mRNA levels of a gene which is known not to be affected by the disorder. Methods of detecting mRNA are well known, and discussed above, e.g., but not limited to, Northern blot analysis, polymerase chain reaction (PCR), reverse transcriptase PCR, RACE PCR, etc. Similarly, if polypeptide production is used to evaluate the gene, then the polypeptide in a normal tissue sample can be used as a comparison, or polypeptide from a different gene whose expression is known not to be affected by the disorder. These are only examples of how such a method could be carried out.
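- As an illustration only, the comparison described above can be sketched in Python. The function name, the argument names, and the signal values are hypothetical and are not taken from this disclosure; the sketch simply assumes that a target signal (e.g., an mRNA or polypeptide measurement) and a reference-gene signal are available as positive numbers for both the test sample and a normal sample.

```python
def relative_expression(target_sample, reference_sample,
                        target_normal, reference_normal):
    """Return the sample-to-normal ratio of a target signal.

    Each target signal is first divided by the signal of a reference gene
    (one assumed not to be affected by the disorder) measured in the same
    specimen, so that loading differences cancel out.
    """
    values = (target_sample, reference_sample, target_normal, reference_normal)
    if min(values) <= 0:
        raise ValueError("all signal values must be positive")
    normalized_sample = target_sample / reference_sample
    normalized_normal = target_normal / reference_normal
    return normalized_sample / normalized_normal


# Hypothetical signal intensities, for illustration only.
ratio = relative_expression(target_sample=300.0, reference_sample=100.0,
                            target_normal=150.0, reference_normal=100.0)
print(ratio)
```

A ratio near 1 would suggest expression comparable to the normal sample, while a ratio well above or below 1 would suggest altered expression; what counts as a meaningful difference is left to the particular assay.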
- Assessing the effects of therapeutic and preventative interventions (e.g., administration of a drug, chemotherapy, radiation, etc.) on nervous and immune disorders is a major effort in drug discovery, clinical medicine, and pharmacogenomics. The evaluation of therapeutic and preventative measures, whether experimental or already in clinical use, has broad applicability, e.g., in clinical trials, for monitoring the status of a patient, for analyzing and assessing animal models, and in any scenario involving cancer treatment and prevention. Analyzing the expression profiles of polynucleotides of the present invention can be utilized as a parameter by which interventions are judged and measured. Treatment of a disorder can change the expression profile in some manner which is prognostic or indicative of the drug's effect on it. Changes in the profile can indicate, e.g., drug toxicity, return to a normal level, etc. Accordingly, the present invention also relates to methods of monitoring or assessing a therapeutic or preventative measure (e.g., chemotherapy, radiation, anti-neoplastic drugs, antibodies, etc.) in a subject, comprising, e.g., detecting the expression levels of human TARPP. A subject can be a cell-based assay system, non-human animal model, human patient, etc. Detecting can be accomplished as described for the methods above and below. By “therapeutic or preventative intervention,” it is meant, e.g., a drug administered to a patient, surgery, radiation, chemotherapy, and other measures taken to prevent, treat, or diagnose a disorder.
- Expression can be assessed in any sample comprising any tissue or cell type, body fluid, etc., as discussed for other methods of the present invention, including cells from the immune or nervous system, such as lymphocytes, neurons, or glia.
- Identifying Agent Methods
- The present invention also relates to methods of identifying agents, and the agents themselves, which modulate human TARPP. These agents can be used to modulate the biological activity of the polypeptide encoded by the gene, or the gene itself. Agents which regulate the gene or its product are useful in a variety of different environments, including as medicinal agents to treat or prevent disorders associated with human TARPP and as research reagents to modify the function of tissues and cells.
- Methods of identifying agents generally comprise steps in which an agent is placed in contact with the gene, transcription product, translation product, or other target, and then a determination is performed to assess whether the agent “modulates” the target. The specific method utilized will depend upon a number of factors, including, e.g., the target (i.e., is it the gene or polypeptide encoded by it), the environment (e.g., in vitro or in vivo), the composition of the agent, etc.
- For modulating the expression of human TARPP gene, a method can comprise, in any effective order, one or more of the following steps, e.g., contacting a human TARPP gene (e.g., in a cell population) with a test agent under conditions effective for said test agent to modulate the expression of human TARPP, and determining whether said test agent modulates said human TARPP. An agent can modulate expression of human TARPP at any level, including transcription, translation, and/or perdurance of the nucleic acid (e.g., degradation, stability, etc.) in the cell.
- For modulating the biological activity of human TARPP polypeptides, a method can comprise, in any effective order, one or more of the following steps, e.g., contacting a human TARPP polypeptide (e.g., in a cell, lysate, or isolated) with a test agent under conditions effective for said test agent to modulate the biological activity of said polypeptide, and determining whether said test agent modulates said biological activity.
- Contacting human TARPP with the test agent can be accomplished by any suitable method and/or means that places the agent in a position to functionally control expression or biological activity of human TARPP present in the sample. Functional control indicates that the agent can exert its physiological effect on human TARPP through whatever mechanism it works. The choice of the method and/or means can depend upon the nature of the agent and the condition and type of environment in which the human TARPP is presented, e.g., lysate, isolated, or in a cell population (such as, in vivo, in vitro, organ explants, etc.). For instance, if the cell population is an in vitro cell culture, the agent can be contacted with the cells by adding it directly into the culture medium. If the agent cannot dissolve readily in an aqueous medium, it can be incorporated into liposomes, or another lipophilic carrier, and then administered to the cell culture. Contact can also be facilitated by incorporation of agent with carriers and delivery molecules and complexes, by injection, by infusion, etc.
- After the agent has been administered in such a way that it can gain access to human TARPP, it can be determined whether the test agent modulates human TARPP expression or biological activity. Modulation can be of any type, quality, or quantity, e.g., increase, facilitate, enhance, up-regulate, stimulate, activate, amplify, augment, induce, decrease, down-regulate, diminish, lessen, reduce, etc. The modulatory quantity can also encompass any value, e.g., 1%, 5%, 10%, 50%, 75%, 1-fold, 2-fold, 5-fold, 10-fold, 100-fold, etc. To modulate human TARPP expression means, e.g., that the test agent has an effect on its expression, e.g., to affect the amount of transcription, to affect RNA splicing, to affect translation of the RNA into polypeptide, to affect RNA or polypeptide stability, to affect polyadenylation or other processing of the RNA, to affect post-transcriptional or post-translational processing, etc. To modulate biological activity means, e.g., that a functional activity of the polypeptide is changed in comparison to its normal activity in the absence of the agent. This effect includes increase, decrease, block, inhibit, enhance, etc. Biological activities of human TARPP include, e.g., nucleic acid binding activity.
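- The modulatory quantities mentioned above (percentages and fold values) can be computed from a pair of measurements. The following Python sketch is illustrative only: the function name and the example readings are hypothetical, and it assumes that expression or activity has been measured as a positive number both without and with the test agent.

```python
def modulation(baseline, treated):
    """Return (percent_change, fold_change) of treated relative to baseline.

    baseline: measurement in the absence of the test agent (must be > 0).
    treated:  measurement in the presence of the test agent.
    """
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    percent_change = (treated - baseline) / baseline * 100.0
    fold_change = treated / baseline
    return percent_change, fold_change


# Hypothetical readings: an agent that reduces the signal from 200 to 50.
percent, fold = modulation(baseline=200.0, treated=50.0)
print(percent, fold)
```

A negative percent change (fold value below 1) corresponds to down-regulation, and a positive percent change (fold value above 1) corresponds to up-regulation.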
- A test agent can be of any molecular composition, e.g., chemical compounds, biomolecules, such as polypeptides, lipids, nucleic acids (e.g., antisense to a polynucleotide sequence selected from SEQ ID NO 1, 3, 5, 7, 9, and others), carbohydrates, antibodies, ribozymes, double-stranded RNA, aptamers, etc. For example, if a polypeptide to be modulated is a cell-surface molecule, a test agent can be an antibody that specifically recognizes it and, e.g., causes the polypeptide to be internalized, leading to its down-regulation on the surface of the cell. Such an effect does not have to be permanent, but can require the presence of the antibody to continue the down-regulatory effect. Antibodies can also be used to modulate the biological activity of a polypeptide in a lysate or other cell-free form. Antisense human TARPP can also be used as a test agent to modulate gene expression.
- Therapeutics
- Selective polynucleotides, polypeptides, and specific-binding partners thereto, can be utilized in therapeutic applications, especially to treat diseases and conditions of the immune and nervous system. Useful methods include, but are not limited to, immunotherapy (e.g., using specific-binding partners to polypeptides), vaccination (e.g., using a selective polypeptide or a naked DNA encoding such polypeptide), protein or polypeptide replacement therapy, gene therapy (e.g., germ-line correction, antisense), etc.
- Various immunotherapeutic approaches can be used. For instance, unlabeled antibody that specifically recognizes a tissue-specific antigen can be used to stimulate the body to destroy or attack the cancer, to cause down-regulation, to produce complement-mediated lysis, to inhibit cell growth, etc., of target cells which display the antigen, e.g., analogously to how c-erbB-2 antibodies are used to treat breast cancer. In addition, antibody can be labeled or conjugated to enhance its deleterious effect, e.g., with radionuclides and other energy emitting entities, toxins, such as ricin, exotoxin A (ETA), and diphtheria, cytotoxic or cytostatic agents, immunomodulators, chemotherapeutic agents, etc. See, e.g., U.S. Pat. No. 6,107,090.
- An antibody or other specific-binding partner can be conjugated to a second molecule, such as a cytotoxic agent, and used for targeting the second molecule to a tissue-antigen positive cell (Vitetta, E. S. et al., 1993, Immunotoxin therapy, in DeVita, Jr., V. T. et al., eds, Cancer: Principles and Practice of Oncology, 4th ed., J. B. Lippincott Co., Philadelphia, 2624-2636). Examples of cytotoxic agents include, but are not limited to, antimetabolites, alkylating agents, anthracyclines, antibiotics, anti-mitotic agents, radioisotopes and chemotherapeutic agents. Further examples of cytotoxic agents include, but are not limited to, ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, teniposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, 1-dehydrotestosterone, diphtheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, elongation factor-2 and glucocorticoid. Techniques for conjugating therapeutic agents to antibodies are well known.
- In addition to immunotherapy, polynucleotides and polypeptides can be used as targets for non-immunotherapeutic applications, e.g., using compounds which interfere with function, expression (e.g., antisense as a therapeutic agent), assembly, etc. RNA interference can be used in vitro and in vivo to silence human TARPP when its expression contributes to a disease (but also for other purposes, e.g., to identify the gene's function, to change a developmental pathway of a cell, etc.). See, e.g., Sharp and Zamore, Science, 287:2431-2433, 2001; Grishok et al., Science, 287:2494, 2001.
- Delivery of therapeutic agents can be achieved according to any effective method, including liposomes, viruses, plasmid vectors, bacterial delivery systems, orally, systemically, etc. Therapeutic agents of the present invention can be administered in any form by any effective route, including, e.g., oral, parenteral, enteral, intraperitoneal, topical, transdermal (e.g., using any standard patch), ophthalmic, nasal, local, non-oral, such as aerosol, inhalation, subcutaneous, intramuscular, buccal, sublingual, rectal, vaginal, intra-arterial, and intrathecal, etc. They can be administered alone, or in combination with any ingredient(s), active or inactive.
- In addition to therapeutics, per se, the present invention also relates to methods of treating a disease of the immune or nervous system showing altered expression of human TARPP, comprising, e.g., administering to a subject in need thereof a therapeutic agent which is effective for regulating expression of said human TARPP and/or which is effective in treating said disease. The term “treating” is used conventionally, e.g., the management or care of a subject for the purpose of combating, alleviating, reducing, relieving, improving the condition of, etc., a disease or disorder. Diseases or disorders which can be treated in accordance with the present invention include, but are not limited to, autoimmune disease, such as multiple sclerosis and rheumatoid arthritis, and allergy. By the phrase “altered expression,” it is meant that the disease is associated with a mutation in the gene, or any modification to the gene (or corresponding product) which affects its normal function. Thus, expression of human TARPP refers to, e.g., transcription, translation, splicing, stability of the mRNA or protein product, activity of the gene product, differential expression, etc.
- Any agent which “treats” the disease can be used. Such an agent can be one which regulates the expression of the human TARPP. Expression refers to the same acts already mentioned, e.g., transcription, translation, splicing, stability of the mRNA or protein product, activity of the gene product, differential expression, etc. For instance, if the condition was a result of a complete deficiency of the gene product, administration of gene product to a patient would be said to treat the disease and regulate the gene's expression. Many other situations are possible, e.g., where the gene is aberrantly expressed, and the therapeutic agent regulates the aberrant expression by restoring its normal expression pattern.
- Antisense
- Antisense polynucleotide (e.g., RNA) can also be prepared from a polynucleotide according to the present invention, preferably an anti-sense to a sequence of SEQ ID NO 1, 3, 5, 7, 9, and others. Antisense polynucleotide can be used in various ways, such as to regulate or modulate expression of the polypeptides they encode, e.g., inhibit their expression, for in situ hybridization, for therapeutic purposes, for making targeted mutations (in vivo, triplex, etc.) etc. For guidance on administering and designing anti-sense, see, e.g., U.S. Pat. Nos. 6,200,960, 6,200,807, 6,197,584, 6,190,869, 6,190,661, 6,187,587, 6,168,950, 6,153,595, 6,150,162, 6,133,246, 6,117,847, 6,096,722, 6,087,343, 6,040,296, 6,005,095, 5,998,383, 5,994,230, 5,891,725, 5,885,970, and 5,840,708. Antisense polynucleotides can be operably linked to an expression control sequence. A total length of about 35 bp can be used in cell culture with cationic liposomes to facilitate cellular uptake, but for in vivo use, preferably shorter oligonucleotides are administered, e.g., 25 nucleotides.
- Antisense polynucleotides can comprise modified, nonnaturally-occurring nucleotides and linkages between the nucleotides (e.g., modification of the phosphate-sugar backbone; methyl phosphonate, phosphorothioate, or phosphorodithioate linkages; and 2′-O-methyl ribose sugar units), e.g., to enhance in vivo or in vitro stability, to confer nuclease resistance, to modulate uptake, to modulate cellular distribution and compartmentalization, etc. Any effective nucleotide or modification can be used, including those already mentioned, as known in the art, etc., e.g., disclosed in U.S. Pat. Nos. 6,133,438; 6,127,533; 6,124,445; 6,121,437; 5,218,103 (e.g., nucleoside thiophosphoramidites); 4,973,679; Sproat et al., “2′-O-Methyloligoribonucleotides: synthesis and applications,” Oligonucleotides and Analogs A Practical Approach, Eckstein (ed.), IRL Press, Oxford, 1991, 49-86; Iribarren et al., “2′-O-Alkyl Oligoribonucleotides as Antisense Probes,” Proc. Natl. Acad. Sci. USA, 1990, 87, 7747-7751; Cotton et al., “2′-O-methyl, 2′-O-ethyl oligoribonucleotides and phosphorothioate oligodeoxyribonucleotides as inhibitors of the in vitro U7 snRNP-dependent mRNA processing event,” Nucl. Acids Res., 1991, 19, 2629-2635.
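- By way of illustration only, an antisense sequence of a chosen length (e.g., 25 nucleotides) can be derived from a sense-strand sequence by taking the reverse complement of a window. The Python sketch below is not part of the disclosed methods; the function name and the short example sequence are hypothetical, and the example sequence is not taken from any SEQ ID NO.

```python
# Translation table mapping each DNA base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_oligo(sense, start, length=25):
    """Return the DNA antisense (reverse complement) of a sense-strand window.

    sense:  sense-strand DNA sequence (A, C, G, T only).
    start:  0-based start position of the target window.
    length: number of nucleotides in the oligonucleotide.
    """
    window = sense[start:start + length]
    if start < 0 or len(window) < length:
        raise ValueError("window extends beyond the sequence")
    if set(window.upper()) - set("ACGT"):
        raise ValueError("sequence contains non-ACGT characters")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return window.translate(_COMPLEMENT)[::-1]


# Hypothetical 10-nt sense sequence, for illustration only.
oligo = antisense_oligo("ATGGCCAAGT", start=0, length=6)
print(oligo)
```

Choosing which window of a real transcript to target, and which backbone modifications to use, depends on the considerations discussed in the references cited above.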
- Arrays
- The present invention also relates to an ordered array of polynucleotide probes and specific-binding partners (e.g., antibodies) for detecting the expression of human TARPP in a sample, comprising one or more polynucleotide probes or specific-binding partners associated with a solid support, wherein each probe is specific for human TARPP, and the probes comprise a nucleotide sequence of SEQ ID NO 1, 3, 5, 7, 9, and others which is specific for said gene, a nucleotide sequence having sequence identity to SEQ ID NO 1, 3, 5, 7, 9, and others which is specific for said gene or polynucleotide, or complements thereto, or a specific-binding partner which is specific for human TARPP.
- The phrase “ordered array” indicates that the probes are arranged in an identifiable or position-addressable pattern, e.g., such as the arrays disclosed in U.S. Pat. Nos. 6,156,501, 6,077,673, 6,054,270, 5,723,320, 5,700,637, WO09919711, WO00023803. The probes are associated with the solid support in any effective way. For instance, the probes can be bound to the solid support, either by polymerizing the probes on the substrate, or by attaching a probe to the substrate. Association can be covalent, electrostatic, noncovalent, hydrophobic, hydrophilic, coordination, adsorbed, absorbed, polar, etc. When fibers or hollow filaments are utilized for the array, the probes can fill the hollow orifice, be absorbed into the solid filament, be attached to the surface of the orifice, etc. Probes can be of any effective size, sequence identity, composition, etc., as already discussed.
- Ordered arrays can further comprise polynucleotide probes or specific-binding partners which are specific for other genes, including genes specific for immune or nervous tissues, or genes associated with diseases thereof.
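- The notion of a position-addressable pattern can be pictured as a mapping from a grid position to the probe placed there. The following Python sketch is purely illustrative: the class names, the probe names, and the grid dimensions are hypothetical and do not describe any particular array of the cited patents.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Probe:
    name: str   # hypothetical label for the probe
    kind: str   # e.g., "polynucleotide" or "antibody"


class OrderedArray:
    """A grid in which every probe is identified by its (row, column) position."""

    def __init__(self, rows, cols):
        self.rows = rows
        self.cols = cols
        self._spots = {}

    def place(self, row, col, probe):
        """Associate a probe with one position; each position holds one probe."""
        if not (0 <= row < self.rows and 0 <= col < self.cols):
            raise IndexError("position is outside the array")
        if (row, col) in self._spots:
            raise ValueError("position is already occupied")
        self._spots[(row, col)] = probe

    def probe_at(self, row, col):
        """Return the probe at a position, or None if the spot is empty."""
        return self._spots.get((row, col))


# Hypothetical layout: a target-specific probe next to a control-gene probe.
array = OrderedArray(rows=2, cols=2)
array.place(0, 0, Probe("TARPP-specific probe", "polynucleotide"))
array.place(0, 1, Probe("control-gene probe", "polynucleotide"))
print(array.probe_at(0, 0))
```

Because each signal is read out by position, a result at a given spot can be traced back to the probe that was placed there.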
- Transgenic Animals
- The present invention also relates to transgenic animals comprising human TARPP genes. Such genes, as discussed in more detail below, include, but are not limited to, functionally-disrupted genes, mutated genes, ectopically or selectively-expressed genes, inducible or regulatable genes, etc. These transgenic animals can be produced according to any suitable technique or method, including homologous recombination, mutagenesis (e.g., ENU, Rathkolb et al., Exp. Physiol., 85(6):635-644, 2000), and the tetracycline-regulated gene expression system (e.g., U.S. Pat. No. 6,242,667). The term “gene” as used herein includes any part of a gene, i.e., regulatory sequences, promoters, enhancers, exons, introns, coding sequences, etc. A human TARPP nucleic acid present in the construct or transgene can be naturally-occurring wild-type, polymorphic, or mutated.
- Along these lines, polynucleotides of the present invention can be used to create transgenic animals, e.g. a non-human animal, comprising at least one cell whose genome comprises a functional disruption of human TARPP. By the phrases “functional disruption” or “functionally disrupted,” it is meant that the gene does not express a biologically-active product. It can be substantially deficient in at least one functional activity coded for by the gene. Expression of a polypeptide can be substantially absent, i.e., essentially undetectable amounts are made. However, a polypeptide can also be made which is deficient in activity, e.g., where only an amino-terminal portion of the gene product is produced. For example, the gene can be disrupted in a specific region, e.g., in the sequence coding for amino acids 1-161 of a human TARPP. Cells and/or animals can also have targeted deletions, e.g., deletion of a coding sequence for amino acids 267-300 and/or 312-331 of a human TARPP of SEQ ID NO 1 or 2.
- The transgenic animal can comprise one or more cells. When substantially all its cells contain the engineered gene, it can be referred to as a transgenic animal “whose genome comprises” the engineered gene. This indicates that the endogenous gene locus of the animal has been modified and substantially all cells contain such modification.
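- The amino acid ranges named above for disruption or deletion can be related to nucleotide positions in SEQ ID NO 1 by simple codon arithmetic. The following is a minimal sketch, assuming only the CDS start position (250) annotated in the sequence listing for SEQ ID NO 1 and an uninterrupted reading frame; the function name `residue_range_to_cds_coords` is hypothetical and is not part of the application.

```python
def residue_range_to_cds_coords(cds_start, first_aa, last_aa):
    """Map a 1-based amino acid range to 1-based nucleotide coordinates.

    cds_start is the position of the first base of the start codon.
    Assumes a continuous coding sequence (three bases per residue, no introns).
    """
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("residue range must satisfy 1 <= first_aa <= last_aa")
    nt_start = cds_start + 3 * (first_aa - 1)   # first base of the first codon
    nt_end = cds_start + 3 * last_aa - 1        # last base of the last codon
    return nt_start, nt_end


# CDS start taken from the SEQ ID NO 1 annotation "CDS (250)..(2793)".
SEQ1_CDS_START = 250

for first, last in [(1, 161), (267, 300), (312, 331)]:
    start, end = residue_range_to_cds_coords(SEQ1_CDS_START, first, last)
    print(f"amino acids {first}-{last}: nucleotides {start}-{end}")
```

As a consistency check, mapping residues 1-848 (the 847 amino acids of SEQ ID NO 2 plus the stop codon) reproduces the annotated CDS end position of 2793.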
- Functional disruption of the gene can be accomplished in any effective way, including, e.g., introduction of a stop codon into any part of the coding sequence such that the resulting polypeptide is biologically inactive (e.g., because it lacks a catalytic domain, a ligand binding domain, etc.), introduction of a mutation into a promoter or other regulatory sequence that is effective to turn it off, or reduce transcription of the gene, insertion of an exogenous sequence into the gene which inactivates it (e.g., which disrupts the production of a biologically-active polypeptide or which disrupts the promoter or other transcriptional machinery), deletion of sequences from the human TARPP gene, etc. Examples of transgenic animals having functionally disrupted genes are well known, e.g., as described in U.S. Pat. Nos. 6,239,326, 6,225,525, 6,207,878, 6,194,633, 6,187,992, 6,180,849, 6,177,610, 6,100,445, 6,087,555, 6,080,910, 6,069,297, 6,060,642, 6,028,244, 6,013,858, 5,981,830, 5,866,760, 5,859,314, 5,850,004, 5,817,912, 5,789,654, 5,777,195, and 5,569,824. A transgenic animal which comprises the functional disruption can also be referred to as a “knock-out” animal, since the biological activity of its human TARPP genes has been “knocked-out.” One or more of the different splice forms, Br137A-E, can also be knocked-out or disrupted, e.g., in cells or whole mammals. Knock-out cells and animals can be homozygous or heterozygous.
- For creating functionally disrupted genes, and other gene mutations, homologous recombination technology is of special interest since it allows specific regions of the genome to be targeted. Using homologous recombination methods, genes can be specifically inactivated, specific mutations can be introduced, and exogenous sequences can be introduced at specific sites. These methods are well known in the art, e.g., as described in the patents above. See, also, Robertson, Biol. Reproduc., 44(2):238-245, 1991. Generally, the genetic engineering is performed in an embryonic stem (ES) cell, or other pluripotent cell line (e.g., adult stem cells, EG cells), and that genetically-modified cell (or nucleus) is used to create a whole organism. Nuclear transfer can be used in combination with homologous recombination technologies.
- For example, the human TARPP locus can be disrupted in ES cells using a positive-negative selection method (e.g., Mansour et al., Nature, 336:348-352, 1988). In this method, a targeting vector can be constructed which comprises a part of the gene to be targeted. A selectable marker, such as a neomycin resistance gene, can be inserted into a human TARPP exon present in the targeting vector, disrupting it. When the vector recombines with the ES cell genome, it disrupts the function of the gene. The presence of the vector in the cell can be determined by expression of neomycin resistance. See, e.g., U.S. Pat. No. 6,239,326. Cells having at least one functionally disrupted gene can be used to make chimeric and germline animals, e.g., animals having somatic and/or germ cells comprising the engineered gene. Homozygous knock-out animals can be obtained from breeding heterozygous knock-out animals. See, e.g., U.S. Pat. No. 6,225,525.
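- The breeding step mentioned above, in which homozygous knock-out animals are obtained from heterozygous parents, can be illustrated with a small calculation of expected genotype fractions. This is a sketch only: it assumes simple Mendelian segregation of a single disrupted allele with no effect on viability, and the allele symbols `+` (wild-type) and `-` (disrupted) and the function name `cross` are illustrative choices, not terms from the application.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a, parent_b):
    """Return the expected genotype fractions among offspring of two parents.

    Each parent is a two-character string of alleles, e.g. "+-" for a
    heterozygote. Every combination of one allele from each parent is
    treated as equally likely (simple Mendelian segregation).
    """
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Heterozygous x heterozygous cross.
for genotype, fraction in sorted(cross("+-", "+-").items()):
    print(genotype, fraction)
```

Under these assumptions, one quarter of the offspring of a heterozygous cross are expected to be homozygous for the disrupted allele; observed ratios can differ if the disruption affects viability.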
- A transgenic animal, or animal cell, lacking one or more functional human TARPP genes (and lacking one or more functional copies of the splice variant) can be useful in a variety of applications, including, as an animal model for diseases of the immune or nervous system, for drug screening assays (e.g., for DNA-binding activities other than those contributed by human TARPP; by making a cell deficient in one or more splice forms of human TARPP, the contribution of other DNA binding activity can be specifically examined), as a source of tissues deficient in human TARPP activity, and any of the utilities mentioned in any issued U.S. Patent on transgenic animals, including, U.S. Pat. Nos. 6,239,326, 6,225,525, 6,207,878, 6,194,633, 6,187,992, 6,180,849, 6,177,610, 6,100,445, 6,087,555, 6,080,910, 6,069,297, 6,060,642, 6,028,244, 6,013,858, 5,981,830, 5,866,760, 5,859,314, 5,850,004, 5,817,912, 5,789,654, 5,777,195, and 5,569,824. The individual contributions of the different forms of human TARPP can be assessed by disrupting specific regions of it.
- A recombinant human TARPP nucleic acid refers to a gene which has been introduced into a target host cell, such as cells derived from animals, plants, bacteria, yeast, etc., and optionally modified. A recombinant human TARPP includes completely synthetic nucleic acid sequences, semi-synthetic nucleic acid sequences, sequences derived from natural sources, and chimeras thereof. “Operable linkage” has the meaning used throughout the specification, i.e., placed in a functional relationship with another nucleic acid. When a gene is operably linked to an expression control sequence, as explained above, it indicates that the gene (e.g., coding sequence) is joined to the expression control sequence (e.g., promoter) in such a way that facilitates transcription and translation of the coding sequence. As described above, the phrase “genome” indicates that the genome of the cell has been modified. In this case, the recombinant human TARPP has been stably integrated into the genome of the animal. The human TARPP nucleic acid in operable linkage with the expression control sequence can also be referred to as a construct or transgene.
- Any expression control sequence can be used depending on the purpose. For instance, if selective expression is desired, then expression control sequences which limit its expression can be selected. These include, e.g., tissue or cell-specific promoters, introns, enhancers, etc. For various methods of cell and tissue-specific expression, see, e.g., U.S. Pat. Nos. 6,215,040, 6,210,736, and 6,153,427. These also include the endogenous promoter, i.e., the coding sequence can be operably linked to its own promoter. Inducible and regulatable promoters can also be utilized.
- The present invention also relates to a transgenic animal which contains a functionally disrupted gene and a transgene stably integrated into the animal's genome. Such an animal can be constructed using combinations of any of the above- and below-mentioned methods. Such animals have any of the aforementioned uses, including permitting the knock-out of the normal gene and its replacement with a mutated gene. Such a transgene can be integrated at the endogenous gene locus so that the functional disruption and “knock-in” are carried out in the same step.
- In addition to the methods mentioned above, transgenic animals can be prepared according to known methods, including, e.g., by pronuclear injection of recombinant genes into pronuclei of 1-cell embryos, incorporating a yeast artificial chromosome into embryonic stem cells, gene targeting methods, embryonic stem cell methodology, cloning methods, and nuclear transfer methods. See, also, e.g., U.S. Pat. Nos. 4,736,866; 4,873,191; 4,873,316; 5,082,779; 5,304,489; 5,174,986; 5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad. Sci., 77:7380-7384, 1980; Palmiter et al., Cell, 41:343-345, 1985; Palmiter et al., Ann. Rev. Genet., 20:465-499, 1986; Askew et al., Mol. Cell. Bio., 13:4115-4124, 1993; Games et al., Nature, 373:523-527, 1995; Valancius and Smithies, Mol. Cell. Bio., 11:1402-1408, 1991; Stacey et al., Mol. Cell. Bio., 14:1009-1016, 1994; Hasty et al., Nature, 350:243-246, 1995; Rubinstein et al., Nucl. Acid Res., 21:2613-2617, 1993; Cibelli et al., Science, 280:1256-1258, 1998. For guidance on recombinase excision systems, see, e.g., U.S. Pat. Nos. 5,626,159, 5,527,695, and 5,434,066. See also, Orban, P. C., et al., “Tissue- and Site-Specific DNA Recombination in Transgenic Mice,” Proc. Natl. Acad. Sci. USA, 89:6861-6865 (1992); O'Gorman, S., et al., “Recombinase-Mediated Gene Activation and Site-Specific Integration in Mammalian Cells,” Science, 251:1351-1355 (1991); Sauer, B., et al., “Cre-stimulated recombination at loxP-Containing DNA sequences placed into the mammalian genome,” Nucleic Acids Research, 17(1):147-161 (1989); Gagneten, S. et al. (1997) Nucl. Acids Res. 25:3326-3331; Xiao and Weaver (1997) Nucl. Acids Res. 25:2985-2991; Agah, R. et al. (1997) J. Clin. Invest. 100:169-179; Barlow, C. et al. (1997) Nucl. Acids Res. 25:2543-2545; Araki, K. et al. (1997) Nucl. Acids Res. 25:868-872; Mortensen, R. N. et al. (1992) Mol. Cell. Biol. 12:2391-2395 (G418 escalation method); Lakhlani, P. P. et al. (1997) Proc. Natl. Acad. Sci. USA 94:9950-9955 (“hit and run”); Westphal and Leder (1997) Curr. Biol. 7:530-533 (transposon-generated “knock-out” and “knock-in”); Templeton, N. S. et al. (1997) Gene Ther. 4:700-709 (methods for efficient gene targeting, allowing for a high frequency of homologous recombination events, e.g., without selectable markers); PCT International Publication WO 93/22443 (functionally-disrupted).
- A polynucleotide according to the present invention can be introduced into any non-human animal, including a non-human mammal, mouse (Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1986), pig (Hammer et al., Nature, 315:343-345, 1985), sheep (Hammer et al., Nature, 315:343-345, 1985), cattle, rat, or primate. See also, e.g., Church, Trends in Biotech. 5:13-19, 1987; Clark et al., Trends in Biotech. 5:20-24, 1987; and DePamphilis et al., BioTechniques, 6:662-680, 1988. Transgenic animals can be produced by the methods described in U.S. Pat. No. 5,994,618, and utilized for any of the utilities described therein.
- Database
- The present invention also relates to electronic forms of polynucleotides, polypeptides, etc., of the present invention, including computer-readable media (e.g., magnetic, optical, etc., stored in any suitable format, such as flat files or hierarchical files) which comprise such sequences, or fragments thereof, e-commerce-related means, etc. Along these lines, the present invention relates to methods of retrieving gene sequences from a computer-readable medium, comprising one or more of the following steps in any effective order: selecting a cell or gene expression profile, e.g., a profile that specifies that said gene is expressed in brain and/or immune cells, and retrieving said expressed gene sequences, where the gene sequences consist of the genes represented by SEQ ID Nos 1-10.
- A “gene expression profile” means the list of tissues, cells, etc., in which a defined gene is expressed (i.e., transcribed and/or translated). A “cell expression profile” means the genes which are expressed in the particular cell type. The profile can be a list of the tissues in which the gene is expressed, but can include additional information as well, including level of expression (e.g., a quantity as compared or normalized to a control gene), and information on temporal (e.g., at what point in the cell-cycle or developmental program) and spatial expression. By the phrase “selecting a gene or cell expression profile,” it is meant that a user decides what type of gene or cell expression pattern he is interested in retrieving, e.g., he may require that the gene is differentially expressed in a tissue. Any pattern of expression preferences may be selected. The selecting can be performed by any effective method. In general, “selecting” refers to the process in which a user forms a query that is used to search a database of gene expression profiles. The step of retrieving involves searching for results in a database that correspond to the query set forth in the selecting step. Any suitable algorithm can be utilized to perform the search query, including algorithms that look for matches, or that perform optimization between query and data. The database is information that has been stored in an appropriate storage medium, having a suitable computer-readable format. Once results are retrieved, they can be displayed in any suitable format, such as HTML. A query is formed by the user to retrieve the set of genes from the database having the desired gene or cell expression profile. Once the query is inputted into the system, a search algorithm is used to interrogate the database, and retrieve results.
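- The selecting and retrieving steps described above can be sketched in a few lines of Python. This is a non-authoritative illustration under stated assumptions: the record layout, the function names `select_profile` and `retrieve`, the tissue names, and the expression levels are all hypothetical placeholders and do not come from the application; a simple match-based search stands in for "any suitable algorithm."

```python
from dataclasses import dataclass


@dataclass
class ProfileRecord:
    """One database entry: a sequence identifier plus its gene expression profile."""
    seq_id: str
    # tissue name -> expression level normalized to a control gene (placeholder values)
    expression: dict


def select_profile(tissues, min_level=0.0, require_all=False):
    """The 'selecting' step: the user forms a query describing the desired profile."""
    return {"tissues": tuple(tissues), "min_level": min_level, "require_all": require_all}


def retrieve(database, query):
    """The 'retrieving' step: return identifiers of records matching the query."""
    combine = all if query["require_all"] else any
    hits = []
    for record in database:
        # A tissue matches when the recorded level exceeds the query threshold.
        matches = (record.expression.get(t, 0.0) > query["min_level"]
                   for t in query["tissues"])
        if combine(matches):
            hits.append(record.seq_id)
    return hits


# Hypothetical records; identifiers and levels are placeholders, not measured data.
DATABASE = [
    ProfileRecord("gene_A", {"brain": 2.5, "thymus": 1.2}),
    ProfileRecord("gene_B", {"liver": 3.0}),
    ProfileRecord("gene_C", {"brain": 0.4}),
]

# Query: expressed above the threshold in brain and/or an immune tissue.
query = select_profile(["brain", "thymus"], min_level=0.5)
print(retrieve(DATABASE, query))
```

Setting `require_all=True` turns the "and/or" query into a strict "and" query; richer profiles (temporal or spatial expression) would need additional fields in the record.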
- Advertising, Licensing, etc., Methods
- The present invention also relates to methods of advertising, licensing, selling, purchasing, brokering, etc., genes, polynucleotides, specific-binding partners, antibodies, etc., of the present invention. Methods can comprise, e.g., displaying a human TARPP gene, human TARPP polypeptide, or antibody specific for human TARPP in a printed or computer-readable medium (e.g., on the Web or Internet), and accepting an offer to purchase said gene, polypeptide, or antibody.
- Other
- A polynucleotide, probe, polypeptide, antibody, specific-binding partner, etc., according to the present invention can be isolated. The term “isolated” means that the material is in a form in which it is not found in its original environment or in nature, e.g., more concentrated, more purified, separated from components, etc. An isolated polynucleotide includes, e.g., a polynucleotide having the sequence separated from the chromosomal DNA found in a living animal, e.g., as the complete gene, a transcript, or a cDNA. This polynucleotide can be part of a vector or inserted into a chromosome (by specific gene-targeting or by random integration at a position other than its normal position) and still be isolated in that it is not in a form that is found in its natural environment. A polynucleotide, polypeptide, etc., of the present invention can also be substantially purified. By substantially purified, it is meant that the polynucleotide or polypeptide is separated and is essentially free from other polynucleotides or polypeptides, i.e., the polynucleotide or polypeptide is the primary and active constituent. A polynucleotide can also be a recombinant molecule. By “recombinant,” it is meant that the polynucleotide is an arrangement or form which does not occur in nature. For instance, a recombinant molecule comprising a promoter sequence would not encompass the naturally-occurring gene, but would include the promoter operably linked to a coding sequence not associated with it in nature, e.g., a reporter gene, or a truncation of the normal coding sequence.
- The term “marker” is used herein to indicate a means for detecting or labeling a target. A marker can be a polynucleotide (usually referred to as a “probe”), polypeptide (e.g., an antibody conjugated to a detectable label), PNA, or any effective material.
- The topic headings set forth above are meant as guidance where certain information can be found in the application, but are not intended to be the only source in the application where information on such topic can be found.
- Reference materials
- For other aspects of the polynucleotides, reference is made to standard textbooks of molecular biology. See, e.g., Hames et al., Nucleic Acid Hybridization, IRL Press, 1985; Davis et al., Basic Methods in Molecular Biology, Elsevier Science Publishing, Inc., New York, 1986; Sambrook et al., Molecular Cloning, CSH Press, 1989; Howe, Gene Cloning and Manipulation, Cambridge University Press, 1995; Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., 1994-1998.
- The preceding preferred specific embodiments are merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever. The entire disclosures of all applications, patents and publications cited above and in the figures are hereby incorporated by reference in their entirety.
1 15 1 3369 DNA Homo sapiens CDS (250)..(2793) 1 gctggatcaa gctgtgaacg tgatttgctg gaagctggtt gacgatgtgt cacactgtgt 60 aagggaatcg catggagatg ggcattccga actgttaatg gggacatggg actccagttg 120 tctctgatca cttgtgtgga ttttcctggc gtagaacgac agaagccgct agtaagtcgc 180 caagacctac agcaggaatt ctgcaccaaa gggcataaaa tcttgttatt ttaatttgca 240 tctgggaga atg tct gag caa gga gac ctg aat cag gca ata gca gag gaa 291 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu 1 5 10 gga ggg act gag cag gag acg gcc act cca gag aac ggc att gtt aaa 339 Gly Gly Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys 15 20 25 30 tca gaa agt ctg gat gaa gag gag aaa ctg gaa ctg cag agg cgg ctg 387 Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu 35 40 45 gag gct cag aat caa gaa aga aga aaa tcc aag tca gga gca gga aaa 435 Glu Ala Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys 50 55 60 ggt aaa ctg act cgc agt ctt gct gtc tgt gag gaa tct tct gcc aga 483 Gly Lys Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg 65 70 75 cca gga ggt gaa agt ctt cag gat cag gaa tca att cat tta cag ctt 531 Pro Gly Gly Glu Ser Leu Gln Asp Gln Glu Ser Ile His Leu Gln Leu 80 85 90 tcc agt ttt tcc agc ctg caa gag gag gat aaa tct agg aaa gat gac 579 Ser Ser Phe Ser Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp 95 100 105 110 tct gaa aga gaa aaa gaa aag gat aaa aac aaa gat aaa acc tct gaa 627 Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu 115 120 125 aaa ccc aag atc aga atg tta tca aaa gat tgc agc caa gaa tac acg 675 Lys Pro Lys Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr 130 135 140 gat tct aca ggc ata gac tta cac gag ttt ctg att aac aca tta aag 723 Asp Ser Thr Gly Ile Asp Leu His Glu Phe Leu Ile Asn Thr Leu Lys 145 150 155 aat aat tcc agg gac agg atg ata ctt ttg aaa atg gag cag gaa att 771 Asn Asn Ser Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile 160 165 170 att gat ttc att gct gac aac aat aat cat tat aaa aag ttc cct cag 819 Ile Asp Phe Ile Ala Asp Asn Asn Asn His 
Tyr Lys Lys Phe Pro Gln 175 180 185 190 atg tca tcg tat cag agg atg ctt gtc cat cga gtg gca gct tat ttt 867 Met Ser Ser Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe 195 200 205 gga ttg gat cac aat gtg gat caa aca gga aaa tct gtt atc atc aac 915 Gly Leu Asp His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn 210 215 220 aag acc agc agc acc aga ata cca gag caa agg ttt tgt gaa cat tta 963 Lys Thr Ser Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu 225 230 235 aaa gat gaa aaa ggt gaa gaa tcc cag aag cgg ttt atc ttg aag cga 1011 Lys Asp Glu Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg 240 245 250 gat aac tct agt att gat aaa gaa gac aat cag caa aac aga atg cat 1059 Asp Asn Ser Ser Ile Asp Lys Glu Asp Asn Gln Gln Asn Arg Met His 255 260 265 270 cca ttt aga gat gac aga cga agt aaa tca att gaa gag aga gaa gag 1107 Pro Phe Arg Asp Asp Arg Arg Ser Lys Ser Ile Glu Glu Arg Glu Glu 275 280 285 gaa tat cag aga gtg agg gag aga ata ttt gca cac gat tca gtt tgc 1155 Glu Tyr Gln Arg Val Arg Glu Arg Ile Phe Ala His Asp Ser Val Cys 290 295 300 tcc cag gaa agc ctt ttt gtg gaa aac agt agg ctc ttg gaa gac agt 1203 Ser Gln Glu Ser Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser 305 310 315 aac ata tgc aat gag acc tat aag aaa aga cag ctc ttt cgg ggc aac 1251 Asn Ile Cys Asn Glu Thr Tyr Lys Lys Arg Gln Leu Phe Arg Gly Asn 320 325 330 aga gat ggc tca ggg aga aca tct ggg agt cga cag agc agc tca gaa 1299 Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu 335 340 345 350 aat gaa ctc aag tgg tct gac cac caa agg gcc tgg agc agc aca gac 1347 Asn Glu Leu Lys Trp Ser Asp His Gln Arg Ala Trp Ser Ser Thr Asp 355 360 365 tcc gac agt tcc aac cgc aat cta aag ccc gcc atg acc aag acg gcg 1395 Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala 370 375 380 agt ttt ggg ggc atc acg gtg ctg acc agg ggt gac agc act tcc agt 1443 Ser Phe Gly Gly Ile Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser 385 390 395 act agg agt acc ggg aag ctg tcc aaa gca ggt tcc gag tct tcc agc 1491 
Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser 400 405 410 agt gca ggc tcc tca gga tcg ctg tcc cgc acc cat cca cct ctc cag 1539 Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gln 415 420 425 430 agc aca ccc cta gtc tca ggt gtg gca gct ggc tct cca ggc tgt gtg 1587 Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val 435 440 445 cct tat cca gag aat gga ata ggg ggc cag gtt gct ccc agc agc acc 1635 Pro Tyr Pro Glu Asn Gly Ile Gly Gly Gln Val Ala Pro Ser Ser Thr 450 455 460 agc tac atc ctc ctt cca ctt gaa gct gca aca ggc atc ccg cct gga 1683 Ser Tyr Ile Leu Leu Pro Leu Glu Ala Ala Thr Gly Ile Pro Pro Gly 465 470 475 agc atc ctt ctt aat cca cac aca ggc cag ccc ttt gtg aat ccc gat 1731 Ser Ile Leu Leu Asn Pro His Thr Gly Gln Pro Phe Val Asn Pro Asp 480 485 490 gga act cct gca ata tac aac cca ccc acc agt cag cag ccc ctg cga 1779 Gly Thr Pro Ala Ile Tyr Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg 495 500 505 510 agc gcc atg gtg ggg cag tcc caa cag cag cca cca cag cag cag ccc 1827 Ser Ala Met Val Gly Gln Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro 515 520 525 tcc ccg cag ccc caa cag cag gtc cag cca ccg cag cca cag atg gca 1875 Ser Pro Gln Pro Gln Gln Gln Val Gln Pro Pro Gln Pro Gln Met Ala 530 535 540 ggc cct ctg gtc act cag tct gtc cag ggg ctg cag gct tcc tcc cag 1923 Gly Pro Leu Val Thr Gln Ser Val Gln Gly Leu Gln Ala Ser Ser Gln 545 550 555 tca gtg caa tat cca gca gtc tct ttt cct ccc cag cac ctc cta cct 1971 Ser Val Gln Tyr Pro Ala Val Ser Phe Pro Pro Gln His Leu Leu Pro 560 565 570 gtg tct cca acg cag cac ttt ccc atg aga gat gat gtg gca aca cag 2019 Val Ser Pro Thr Gln His Phe Pro Met Arg Asp Asp Val Ala Thr Gln 575 580 585 590 ttt ggc cag atg acc ctg agc cgg cag tcc tcg ggg gag act cct gaa 2067 Phe Gly Gln Met Thr Leu Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu 595 600 605 ccc cca tca ggt cct gtc tac cca tcc tcc ctt atg cca cag ccg gcc 2115 Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala 610 615 620 cag cag ccc agc tat gta 
atc gcc tct aca ggc cag cag ctt cct aca 2163 Gln Gln Pro Ser Tyr Val Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr 625 630 635 gga gga ttc tca ggc tct ggc cct ccc atc tcc cag cag gtc ctc cag 2211 Gly Gly Phe Ser Gly Ser Gly Pro Pro Ile Ser Gln Gln Val Leu Gln 640 645 650 ccc cct ccc tca cca cag gga ttt gtg caa cag cct ccg cct gca cag 2259 Pro Pro Pro Ser Pro Gln Gly Phe Val Gln Gln Pro Pro Pro Ala Gln 655 660 665 670 atg cct gta tat tat tac cca tct ggt cag tac cct acc tca acc acg 2307 Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gln Tyr Pro Thr Ser Thr Thr 675 680 685 caa cag tac cgg ccc atg gcc ccg gtt cag tac aac gct cag agg agt 2355 Gln Gln Tyr Arg Pro Met Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser 690 695 700 caa cag atg cca cag gca gca cag caa gca ggt tac cag cca gtc ttg 2403 Gln Gln Met Pro Gln Ala Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu 705 710 715 tct ggt caa cag gga ttc caa ggc cta ata gga gtg cag cag cca cct 2451 Ser Gly Gln Gln Gly Phe Gln Gly Leu Ile Gly Val Gln Gln Pro Pro 720 725 730 cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg caa agc 2499 Gln Ser Gln Asn Val Ile Asn Asn Gln Gln Gly Thr Pro Val Gln Ser 735 740 745 750 gtg atg gtt tcc tac cca aca atg tct tct tat cag gtg cca atg acc 2547 Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gln Val Pro Met Thr 755 760 765 cag ggt tct caa gga ctg ccc cag cag tca tac caa cag cca atc atg 2595 Gln Gly Ser Gln Gly Leu Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met 770 775 780 cta cct aac cag gca ggt caa ggg tca ctc cca gcc act gga atg cct 2643 Leu Pro Asn Gln Ala Gly Gln Gly Ser Leu Pro Ala Thr Gly Met Pro 785 790 795 gtt tac tgt aat gtc aca ccg ccc acc cct cag aac aac ctt agg ctg 2691 Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu 800 805 810 att ggc cca cac tgc ccc tcc agc act gtc cca gtg atg tca gct agc 2739 Ile Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser 815 820 825 830 tgc aga aca aac tgt gca agt atg agc aat gct ggt tgg cag gtc aaa 2787 Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gln 
Val Lys 835 840 845 ttc tga gagctctggc tgtggtacat ttcttcagat atttctcatg gcctttgatg 2843 Phe gaagaggaac aaggtgggaa aactggctga ggacttaagt attcactcaa cactcaaatg 2903 attgctgctg gtattctgta aaaaataaac aaagactaat atacacgtta gctggttaat 2963 ggtgcatatt tctgtcatgt ctgctaggta tgcctttata gcttagctag tgacatgaat 3023 tcatcaaggt aagattttct cctaccactg aataccactg tgtagattat aatatcccta 3083 atttggatta gttttgtact ttgtgttgag tttgtgatgc taaaagtatt taaaaattat 3143 atactaaatc acattgtacc aaagctgtaa tggaaaagca aagaagaatt gatgaattga 3203 aggaataatt tatatacatt atagagtttt cttttttaat ggatatatac tgtattgtag 3263 tgtttaatca aaataaaact atttgacctt atggaggaag gtcatgtttt taccaccaaa 3323 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3369 2 847 PRT Homo sapiens 2 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp Gln Glu Ser Ile His Leu Gln Leu Ser Ser 85 90 95 Phe Ser Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 100 105 110 Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 115 120 125 Lys Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr Asp Ser 130 135 140 Thr Gly Ile Asp Leu His Glu Phe Leu Ile Asn Thr Leu Lys Asn Asn 145 150 155 160 Ser Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp 165 170 175 Phe Ile Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser 180 185 190 Ser Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 195 200 205 Asp His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr 210 215 220 Ser Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp 225 230 235 240 Glu Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn 245 250 255 Ser Ser Ile Asp Lys Glu Asp Asn Gln Gln Asn Arg 
Met His Pro Phe 260 265 270 Arg Asp Asp Arg Arg Ser Lys Ser Ile Glu Glu Arg Glu Glu Glu Tyr 275 280 285 Gln Arg Val Arg Glu Arg Ile Phe Ala His Asp Ser Val Cys Ser Gln 290 295 300 Glu Ser Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser Asn Ile 305 310 315 320 Cys Asn Glu Thr Tyr Lys Lys Arg Gln Leu Phe Arg Gly Asn Arg Asp 325 330 335 Gly Ser Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu Asn Glu 340 345 350 Leu Lys Trp Ser Asp His Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp 355 360 365 Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe 370 375 380 Gly Gly Ile Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg 385 390 395 400 Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala 405 410 415 Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gln Ser Thr 420 425 430 Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr 435 440 445 Pro Glu Asn Gly Ile Gly Gly Gln Val Ala Pro Ser Ser Thr Ser Tyr 450 455 460 Ile Leu Leu Pro Leu Glu Ala Ala Thr Gly Ile Pro Pro Gly Ser Ile 465 470 475 480 Leu Leu Asn Pro His Thr Gly Gln Pro Phe Val Asn Pro Asp Gly Thr 485 490 495 Pro Ala Ile Tyr Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg Ser Ala 500 505 510 Met Val Gly Gln Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro 515 520 525 Gln Pro Gln Gln Gln Val Gln Pro Pro Gln Pro Gln Met Ala Gly Pro 530 535 540 Leu Val Thr Gln Ser Val Gln Gly Leu Gln Ala Ser Ser Gln Ser Val 545 550 555 560 Gln Tyr Pro Ala Val Ser Phe Pro Pro Gln His Leu Leu Pro Val Ser 565 570 575 Pro Thr Gln His Phe Pro Met Arg Asp Asp Val Ala Thr Gln Phe Gly 580 585 590 Gln Met Thr Leu Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu Pro Pro 595 600 605 Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala Gln Gln 610 615 620 Pro Ser Tyr Val Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr Gly Gly 625 630 635 640 Phe Ser Gly Ser Gly Pro Pro Ile Ser Gln Gln Val Leu Gln Pro Pro 645 650 655 Pro Ser Pro Gln Gly Phe Val Gln Gln Pro Pro Pro Ala Gln Met Pro 660 665 670 Val Tyr Tyr Tyr Pro Ser Gly Gln Tyr Pro Thr Ser Thr 
Thr Gln Gln 675 680 685 Tyr Arg Pro Met Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser Gln Gln 690 695 700 Met Pro Gln Ala Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu Ser Gly 705 710 715 720 Gln Gln Gly Phe Gln Gly Leu Ile Gly Val Gln Gln Pro Pro Gln Ser 725 730 735 Gln Asn Val Ile Asn Asn Gln Gln Gly Thr Pro Val Gln Ser Val Met 740 745 750 Val Ser Tyr Pro Thr Met Ser Ser Tyr Gln Val Pro Met Thr Gln Gly 755 760 765 Ser Gln Gly Leu Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met Leu Pro 770 775 780 Asn Gln Ala Gly Gln Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr 785 790 795 800 Cys Asn Val Thr Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu Ile Gly 805 810 815 Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg 820 825 830 Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gln Val Lys Phe 835 840 845 3 3374 DNA Homo sapiens CDS (329)..(2812) 3 gtctattttt aatgctattt aatgaaggag cgagcgcctc actcagcaat aaaagaagca 60 tgagggaaga cagagcagtg catggttatg gatactggac aaggatattt ggaaaggttg 120 acgatgtgtc acactgtgta agggaatcgc atggagatgg gcattccgaa ctgttaatgg 180 ggacatggga ctccagttgt ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 240 gaagccgcta gtaagtcgcc aagacctaca gcaggaattc tgcaccaaag ggcataaaat 300 cttgttattt taatttgcat ctgggaga atg tct gag caa gga gac ctg aat 352 Met Ser Glu Gln Gly Asp Leu Asn 1 5 cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 400 Gln Ala Ile Ala Glu Glu Gly Gly Thr Glu Gln Glu Thr Ala Thr Pro 10 15 20 gag aac ggc att gtt aaa tca gaa agt ctg gat gaa gag gag aaa ctg 448 Glu Asn Gly Ile Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 25 30 35 40 gaa ctg cag agg cgg ctg gag gct cag aat caa gaa aga aga aaa tcc 496 Glu Leu Gln Arg Arg Leu Glu Ala Gln Asn Gln Glu Arg Arg Lys Ser 45 50 55 aag tca gga gca gga aaa ggt aaa ctg act cgc agt ctt gct gtc tgt 544 Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 60 65 70 gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 592 Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gln Asp Gln Glu 75 80 85 tca att 
cat tta cag ctt tcc agt ttt tcc agc ctg caa gag gag gat 640 Ser Ile His Leu Gln Leu Ser Ser Phe Ser Ser Leu Gln Glu Glu Asp 90 95 100 aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 688 Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 105 110 115 120 aaa gat aaa acc tct gaa aaa ccc aag atc aga atg tta tca aaa gat 736 Lys Asp Lys Thr Ser Glu Lys Pro Lys Ile Arg Met Leu Ser Lys Asp 125 130 135 tgc agc caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 784 Cys Ser Gln Glu Tyr Thr Asp Ser Thr Gly Ile Asp Leu His Glu Phe 140 145 150 ctg att aac aca tta aag aat aat tcc agg gac agg atg ata ctt ttg 832 Leu Ile Asn Thr Leu Lys Asn Asn Ser Arg Asp Arg Met Ile Leu Leu 155 160 165 aaa atg gag cag gaa att att gat ttc att gct gac aac aat aat cat 880 Lys Met Glu Gln Glu Ile Ile Asp Phe Ile Ala Asp Asn Asn Asn His 170 175 180 tat aaa aag ttc cct cag atg tca tcg tat cag agg atg ctt gtc cat 928 Tyr Lys Lys Phe Pro Gln Met Ser Ser Tyr Gln Arg Met Leu Val His 185 190 195 200 cga gtg gca gct tat ttt gga ttg gat cac aat gtg gat caa aca gga 976 Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gln Thr Gly 205 210 215 aaa tct gtt atc atc aac aag acc agc agc acc aga ata cca gag caa 1024 Lys Ser Val Ile Ile Asn Lys Thr Ser Ser Thr Arg Ile Pro Glu Gln 220 225 230 agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tcc cag aag 1072 Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu Ser Gln Lys 235 240 245 cgg ttt atc ttg aag cga gat aac tct agt att gat aaa gaa gac aat 1120 Arg Phe Ile Leu Lys Arg Asp Asn Ser Ser Ile Asp Lys Glu Asp Asn 250 255 260 cag caa aac aga atg cat cca ttt aga gat gac aga cga agt aaa tca 1168 Gln Gln Asn Arg Met His Pro Phe Arg Asp Asp Arg Arg Ser Lys Ser 265 270 275 280 att gaa gag aga gaa gag gaa tat cag aga gtg agg gag aga ata ttt 1216 Ile Glu Glu Arg Glu Glu Glu Tyr Gln Arg Val Arg Glu Arg Ile Phe 285 290 295 gca cac gat tca gtt tgc tcc cag gaa agc ctt ttt gtg gaa aac agg 1264 Ala His Asp Ser Val Cys Ser Gln Glu Ser Leu Phe Val 
Glu Asn Arg 300 305 310 ggc aac aga gat ggc tca ggg aga aca tct ggg agt cga cag agc agc 1312 Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser 315 320 325 tca gaa aat gaa ctc aag tgg tct gac cac caa agg gcc tgg agc agc 1360 Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gln Arg Ala Trp Ser Ser 330 335 340 aca gac tcc gac agt tcc aac cgc aat cta aag ccc gcc atg acc aag 1408 Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys 345 350 355 360 acg gcg agt ttt ggg ggc atc acg gtg ctg acc agg ggt gac agc act 1456 Thr Ala Ser Phe Gly Gly Ile Thr Val Leu Thr Arg Gly Asp Ser Thr 365 370 375 tcc agt act agg agt acc ggg aag ctg tcc aaa gca ggt tcc gag tct 1504 Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser 380 385 390 tcc agc agt gca ggc tcc tca gga tcg ctg tcc cgc acc cat cca cct 1552 Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro 395 400 405 ctc cag agc aca ccc cta gtc tca ggt gtg gca gct ggc tct cca ggc 1600 Leu Gln Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly 410 415 420 tgt gtg cct tat cca gag aat gga ata ggg ggc cag gtt gct ccc agc 1648 Cys Val Pro Tyr Pro Glu Asn Gly Ile Gly Gly Gln Val Ala Pro Ser 425 430 435 440 agc acc agc tac atc ctc ctt cca ctt gaa gct gca aca ggc atc ccg 1696 Ser Thr Ser Tyr Ile Leu Leu Pro Leu Glu Ala Ala Thr Gly Ile Pro 445 450 455 cct gga agc atc ctt ctt aat cca cac aca ggc cag ccc ttt gtg aat 1744 Pro Gly Ser Ile Leu Leu Asn Pro His Thr Gly Gln Pro Phe Val Asn 460 465 470 ccc gat gga act cct gca ata tac aac cca ccc acc agt cag cag ccc 1792 Pro Asp Gly Thr Pro Ala Ile Tyr Asn Pro Pro Thr Ser Gln Gln Pro 475 480 485 ctg cga agc gcc atg gtg ggg cag tcc caa cag cag ccg cca cag cag 1840 Leu Arg Ser Ala Met Val Gly Gln Ser Gln Gln Gln Pro Pro Gln Gln 490 495 500 cag ccc tcc ccg cag ccc caa cag cag gtc cag cca ccg cag cca cag 1888 Gln Pro Ser Pro Gln Pro Gln Gln Gln Val Gln Pro Pro Gln Pro Gln 505 510 515 520 atg gca ggc cct ctg gtc act cag tct gtc cag ggg ctg cag gct tcc 1936 Met Ala 
Gly Pro Leu Val Thr Gln Ser Val Gln Gly Leu Gln Ala Ser 525 530 535 tcc cag tca gtg caa tat ccg gca gtc tct ttt cct ccc cag cac ctc 1984 Ser Gln Ser Val Gln Tyr Pro Ala Val Ser Phe Pro Pro Gln His Leu 540 545 550 cta cct gtg tct cca acg cag cac ttt ccc atg aga gat gat gtg gca 2032 Leu Pro Val Ser Pro Thr Gln His Phe Pro Met Arg Asp Asp Val Ala 555 560 565 aca cag ttt ggc cag atg acc ctg agc cgg cag tcc tcg ggg gag act 2080 Thr Gln Phe Gly Gln Met Thr Leu Ser Arg Gln Ser Ser Gly Glu Thr 570 575 580 cct gaa ccc cca tca ggt cct gtc tac cca tcc tcc ctt atg cca cag 2128 Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gln 585 590 595 600 ccg gcc cag cag ccc agc tat gta atc gcc tct aca ggc cag cag ctt 2176 Pro Ala Gln Gln Pro Ser Tyr Val Ile Ala Ser Thr Gly Gln Gln Leu 605 610 615 cct aca gga gga ttc tca ggc tct ggc cct ccc atc tcc cag cag gtc 2224 Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro Ile Ser Gln Gln Val 620 625 630 ctc cag ccc cct ccc tca cca cag gga ttt gtg caa cag cct ccg cct 2272 Leu Gln Pro Pro Pro Ser Pro Gln Gly Phe Val Gln Gln Pro Pro Pro 635 640 645 gca cag atg cct gta tat tat tac cca tct ggt cag tac cct acc tca 2320 Ala Gln Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gln Tyr Pro Thr Ser 650 655 660 acc acg caa cag tac cgg ccc atg gcc ccg gtt cag tac aac gct cag 2368 Thr Thr Gln Gln Tyr Arg Pro Met Ala Pro Val Gln Tyr Asn Ala Gln 665 670 675 680 agg agt caa cag atg cca cag gca gca cag caa gca ggt tac cag cca 2416 Arg Ser Gln Gln Met Pro Gln Ala Ala Gln Gln Ala Gly Tyr Gln Pro 685 690 695 gtc ttg tct ggt caa cag gga ttc caa ggc cta ata gga gtg cag cag 2464 Val Leu Ser Gly Gln Gln Gly Phe Gln Gly Leu Ile Gly Val Gln Gln 700 705 710 cca cct cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg 2512 Pro Pro Gln Ser Gln Asn Val Ile Asn Asn Gln Gln Gly Thr Pro Val 715 720 725 caa agc gtg atg gtt tcc tac cca aca atg tct tct tat cag gtg cca 2560 Gln Ser Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gln Val Pro 730 735 740 atg acc cag ggt tct caa gga ctg ccc 
cag cag tca tac caa cag cca 2608 Met Thr Gln Gly Ser Gln Gly Leu Pro Gln Gln Ser Tyr Gln Gln Pro 745 750 755 760 atc atg cta cct aac cag gca ggt caa ggg tca ctc cca gcc act gga 2656 Ile Met Leu Pro Asn Gln Ala Gly Gln Gly Ser Leu Pro Ala Thr Gly 765 770 775 atg cct gtt tac tgt aat gtc aca ccg ccc acc cct cag aac aac ctt 2704 Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gln Asn Asn Leu 780 785 790 agg ctg att ggc cca cac tgc ccc tcc agc act gtc cca gtg atg tca 2752 Arg Leu Ile Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser 795 800 805 gct agc tgc aga aca aac tgt gca agt atg agc aat gct ggt tgg cag 2800 Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gln 810 815 820 gtc aaa ttc tga gagctctggc tgtggtacat ttcttcagat atttctcatg 2852 Val Lys Phe 825 gcctttgatg gaagaggaac aaggtgggaa aactggctga ggacttaagt attcactcaa 2912 cactcaaatg attgctgctg gtattctgta aaaaataaac aaagactaat atacacgtta 2972 gctggttaat ggtgcatatt tctgtcatgt ctgctaggta tgcctttata gcttagctag 3032 tgacatgaat tcatcaaggt aagattttct cctaccactg aataccactg tgtagattat 3092 aatatcccta atttggatta gttttgtact ttgtgttgag tttgtgatgc taaaagtatt 3152 taaaaattat atactaaatc acattgtacc aaagctgtaa tggaaaagca aagaagaatt 3212 gatgaattga aggaataatt tatatacatt atagagtttt cttttttaat ggatatatac 3272 tgtattgtag tgtttaatca aaataaaact atttgacctt atggaggaag gtcatgtttt 3332 taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3374 4 827 PRT Homo sapiens 4 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp Gln Glu Ser Ile His Leu Gln Leu Ser Ser 85 90 95 Phe Ser Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 100 105 110 Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys 
Pro 115 120 125 Lys Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr Asp Ser 130 135 140 Thr Gly Ile Asp Leu His Glu Phe Leu Ile Asn Thr Leu Lys Asn Asn 145 150 155 160 Ser Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp 165 170 175 Phe Ile Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser 180 185 190 Ser Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 195 200 205 Asp His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr 210 215 220 Ser Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp 225 230 235 240 Glu Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn 245 250 255 Ser Ser Ile Asp Lys Glu Asp Asn Gln Gln Asn Arg Met His Pro Phe 260 265 270 Arg Asp Asp Arg Arg Ser Lys Ser Ile Glu Glu Arg Glu Glu Glu Tyr 275 280 285 Gln Arg Val Arg Glu Arg Ile Phe Ala His Asp Ser Val Cys Ser Gln 290 295 300 Glu Ser Leu Phe Val Glu Asn Arg Gly Asn Arg Asp Gly Ser Gly Arg 305 310 315 320 Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser 325 330 335 Asp His Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg 340 345 350 Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly Ile Thr 355 360 365 Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys 370 375 380 Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly 385 390 395 400 Ser Leu Ser Arg Thr His Pro Pro Leu Gln Ser Thr Pro Leu Val Ser 405 410 415 Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly 420 425 430 Ile Gly Gly Gln Val Ala Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro 435 440 445 Leu Glu Ala Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro 450 455 460 His Thr Gly Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala Ile Tyr 465 470 475 480 Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg Ser Ala Met Val Gly Gln 485 490 495 Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln 500 505 510 Gln Val Gln Pro Pro Gln Pro Gln Met Ala Gly Pro Leu Val Thr Gln 515 520 525 Ser Val Gln Gly Leu Gln Ala Ser Ser Gln Ser Val Gln Tyr Pro Ala 
530 535 540 Val Ser Phe Pro Pro Gln His Leu Leu Pro Val Ser Pro Thr Gln His 545 550 555 560 Phe Pro Met Arg Asp Asp Val Ala Thr Gln Phe Gly Gln Met Thr Leu 565 570 575 Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val 580 585 590 Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala Gln Gln Pro Ser Tyr Val 595 600 605 Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr Gly Gly Phe Ser Gly Ser 610 615 620 Gly Pro Pro Ile Ser Gln Gln Val Leu Gln Pro Pro Pro Ser Pro Gln 625 630 635 640 Gly Phe Val Gln Gln Pro Pro Pro Ala Gln Met Pro Val Tyr Tyr Tyr 645 650 655 Pro Ser Gly Gln Tyr Pro Thr Ser Thr Thr Gln Gln Tyr Arg Pro Met 660 665 670 Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser Gln Gln Met Pro Gln Ala 675 680 685 Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln Gly Phe 690 695 700 Gln Gly Leu Ile Gly Val Gln Gln Pro Pro Gln Ser Gln Asn Val Ile 705 710 715 720 Asn Asn Gln Gln Gly Thr Pro Val Gln Ser Val Met Val Ser Tyr Pro 725 730 735 Thr Met Ser Ser Tyr Gln Val Pro Met Thr Gln Gly Ser Gln Gly Leu 740 745 750 Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met Leu Pro Asn Gln Ala Gly 755 760 765 Gln Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr 770 775 780 Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu Ile Gly Pro His Cys Pro 785 790 795 800 Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala 805 810 815 Ser Met Ser Asn Ala Gly Trp Gln Val Lys Phe 820 825 5 3332 DNA Homo sapiens CDS (329)..(2770) 5 gtctattttt aatgctattt aatgaaggag cgagcgcctc actcagcaat aaaagaagca 60 tgagggaaga cagagcagtg catggttatg gatactggac aaggatattt ggaaaggttg 120 acgatgtgtc acactgtgta agggaatcgc atggagatgg gcattccgaa ctgttaatgg 180 ggacatggga ctccagttgt ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 240 gaagccgcta gtaagtcgcc aagacctaca gcaggaattc tgcaccaaag ggcataaaat 300 cttgttattt taatttgcat ctgggaga atg tct gag caa gga gac ctg aat 352 Met Ser Glu Gln Gly Asp Leu Asn 1 5 cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 400 Gln Ala Ile Ala Glu Glu Gly Gly Thr Glu Gln Glu Thr Ala Thr Pro 
10 15 20 gag aac ggc att gtt aaa tca gaa agt ctg gat gaa gag gag aaa ctg 448 Glu Asn Gly Ile Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 25 30 35 40 gaa ctg cag agg cgg ctg gag gct cag aat caa gaa aga aga aaa tcc 496 Glu Leu Gln Arg Arg Leu Glu Ala Gln Asn Gln Glu Arg Arg Lys Ser 45 50 55 aag tca gga gca gga aaa ggt aaa ctg act cgc agt ctt gct gtc tgt 544 Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 60 65 70 gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 592 Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gln Asp Gln Glu 75 80 85 tca att cat tta cag ctt tcc agt ttt tcc agc ctg caa gag gag gat 640 Ser Ile His Leu Gln Leu Ser Ser Phe Ser Ser Leu Gln Glu Glu Asp 90 95 100 aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 688 Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 105 110 115 120 aaa gat aaa acc tct gaa aaa ccc aag atc aga atg tta tca aaa gat 736 Lys Asp Lys Thr Ser Glu Lys Pro Lys Ile Arg Met Leu Ser Lys Asp 125 130 135 tgc agc caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 784 Cys Ser Gln Glu Tyr Thr Asp Ser Thr Gly Ile Asp Leu His Glu Phe 140 145 150 ctg att aac aca tta aag aat aat tcc agg gac agg atg ata ctt ttg 832 Leu Ile Asn Thr Leu Lys Asn Asn Ser Arg Asp Arg Met Ile Leu Leu 155 160 165 aaa atg gag cag gaa att att gat ttc att gct gac aac aat aat cat 880 Lys Met Glu Gln Glu Ile Ile Asp Phe Ile Ala Asp Asn Asn Asn His 170 175 180 tat aaa aag ttc cct cag atg tca tcg tat cag agg atg ctt gtc cat 928 Tyr Lys Lys Phe Pro Gln Met Ser Ser Tyr Gln Arg Met Leu Val His 185 190 195 200 cga gtg gca gct tat ttt gga ttg gat cac aat gtg gat caa aca gga 976 Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gln Thr Gly 205 210 215 aaa tct gtt atc atc aac aag acc agc agc acc aga ata cca gag caa 1024 Lys Ser Val Ile Ile Asn Lys Thr Ser Ser Thr Arg Ile Pro Glu Gln 220 225 230 agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tcc cag aag 1072 Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu 
Ser Gln Lys 235 240 245 cgg ttt atc ttg aag cga gat aac tct agt att gat aaa gaa gac aat 1120 Arg Phe Ile Leu Lys Arg Asp Asn Ser Ser Ile Asp Lys Glu Asp Asn 250 255 260 cag tca gtt tgc tcc cag gaa agc ctt ttt gtg gaa aac agt agg ctc 1168 Gln Ser Val Cys Ser Gln Glu Ser Leu Phe Val Glu Asn Ser Arg Leu 265 270 275 280 ttg gaa gac agt aac ata tgc aat gag acc tat aag aaa aga cag ctc 1216 Leu Glu Asp Ser Asn Ile Cys Asn Glu Thr Tyr Lys Lys Arg Gln Leu 285 290 295 ttt cgg ggc aac aga gat ggc tca ggg aga aca tct ggg agt cga cag 1264 Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gln 300 305 310 agc agc tca gaa aat gaa ctc aag tgg tct gac cac caa agg gcc tgg 1312 Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gln Arg Ala Trp 315 320 325 agc agc aca gac tcc gac agt tcc aac cgc aat cta aag ccc gcc atg 1360 Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met 330 335 340 acc aag acg gcg agt ttt ggg ggc atc acg gtg ctg acc agg ggt gac 1408 Thr Lys Thr Ala Ser Phe Gly Gly Ile Thr Val Leu Thr Arg Gly Asp 345 350 355 360 agc act tcc agt act agg agt acc ggg aag ctg tcc aaa gca ggt tcc 1456 Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser 365 370 375 gag tct tcc agc agt gca ggc tcc tca gga tcg ctg tcc cgc acc cat 1504 Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His 380 385 390 cca cct ctc cag agc aca ccc cta gtc tca ggt gtg gca gct ggc tct 1552 Pro Pro Leu Gln Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser 395 400 405 cca ggc tgt gtg cct tat cca gag aat gga ata ggg ggc cag gtt gct 1600 Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly Ile Gly Gly Gln Val Ala 410 415 420 ccc agc agc acc agc tac atc ctc ctt cca ctt gaa gct gca aca ggc 1648 Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro Leu Glu Ala Ala Thr Gly 425 430 435 440 atc ccg cct gga agc atc ctt ctt aat cca cac aca ggc cag ccc ttt 1696 Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro His Thr Gly Gln Pro Phe 445 450 455 gtg aat ccc gat gga act cct gca ata tac aac cca ccc acc agt cag 1744 Val Asn 
Pro Asp Gly Thr Pro Ala Ile Tyr Asn Pro Pro Thr Ser Gln 460 465 470 cag ccc ctg cga agc gcc atg gtg ggg cag tcc caa cag cag ccg cca 1792 Gln Pro Leu Arg Ser Ala Met Val Gly Gln Ser Gln Gln Gln Pro Pro 475 480 485 cag cag cag ccc tcc ccg cag ccc caa cag cag gtc cag cca ccg cag 1840 Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln Gln Val Gln Pro Pro Gln 490 495 500 cca cag atg gca ggc cct ctg gtc act cag tct gtc cag ggg ctg cag 1888 Pro Gln Met Ala Gly Pro Leu Val Thr Gln Ser Val Gln Gly Leu Gln 505 510 515 520 gct tcc tcc cag tca gtg caa tat ccg gca gtc tct ttt cct ccc cag 1936 Ala Ser Ser Gln Ser Val Gln Tyr Pro Ala Val Ser Phe Pro Pro Gln 525 530 535 cac ctc cta cct gtg tct cca acg cag cac ttt ccc atg aga gat gat 1984 His Leu Leu Pro Val Ser Pro Thr Gln His Phe Pro Met Arg Asp Asp 540 545 550 gtg gca aca cag ttt ggc cag atg acc ctg agc cgg cag tcc tcg ggg 2032 Val Ala Thr Gln Phe Gly Gln Met Thr Leu Ser Arg Gln Ser Ser Gly 555 560 565 gag act cct gaa ccc cca tca ggt cct gtc tac cca tcc tcc ctt atg 2080 Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met 570 575 580 cca cag ccg gcc cag cag ccc agc tat gta atc gcc tct aca ggc cag 2128 Pro Gln Pro Ala Gln Gln Pro Ser Tyr Val Ile Ala Ser Thr Gly Gln 585 590 595 600 cag ctt cct aca gga gga ttc tca ggc tct ggc cct ccc atc tcc cag 2176 Gln Leu Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro Ile Ser Gln 605 610 615 cag gtc ctc cag ccc cct ccc tca cca cag gga tty gtg caa cag cct 2224 Gln Val Leu Gln Pro Pro Pro Ser Pro Gln Gly Phe Val Gln Gln Pro 620 625 630 ccg cct gca cag atg cct gta tat tat tac cca tct ggt cag tac cct 2272 Pro Pro Ala Gln Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gln Tyr Pro 635 640 645 acc tca acc acg caa cag tac cgg ccc atg gcc ccg gtt cag tac aac 2320 Thr Ser Thr Thr Gln Gln Tyr Arg Pro Met Ala Pro Val Gln Tyr Asn 650 655 660 gct cag agg agt caa cag atg cca cag gca gca cag caa gca ggt tac 2368 Ala Gln Arg Ser Gln Gln Met Pro Gln Ala Ala Gln Gln Ala Gly Tyr 665 670 675 680 cag cca gtc ttg tct ggt caa cag 
gga ttc caa ggc cta ata gga gtg 2416 Gln Pro Val Leu Ser Gly Gln Gln Gly Phe Gln Gly Leu Ile Gly Val 685 690 695 cag cag cca cct cag agt cag aac gtg ata aat aac caa caa gga act 2464 Gln Gln Pro Pro Gln Ser Gln Asn Val Ile Asn Asn Gln Gln Gly Thr 700 705 710 ccg gtg caa agc gtg atg gtt tcc tac cca aca atg tct tct tat cag 2512 Pro Val Gln Ser Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gln 715 720 725 gtg cca atg acc cag ggt tct caa gga ctg ccc cag cag tca tac caa 2560 Val Pro Met Thr Gln Gly Ser Gln Gly Leu Pro Gln Gln Ser Tyr Gln 730 735 740 cag cca atc atg cta cct aac cag gca ggt caa ggg tca ctc cca gcc 2608 Gln Pro Ile Met Leu Pro Asn Gln Ala Gly Gln Gly Ser Leu Pro Ala 745 750 755 760 act gga atg cct gtt tac tgt aat gtc aca ccg ccc acc cct cag aac 2656 Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gln Asn 765 770 775 aac ctt agg ctg att ggc cca cac tgc ccc tcc agc act gtc cca gtg 2704 Asn Leu Arg Leu Ile Gly Pro His Cys Pro Ser Ser Thr Val Pro Val 780 785 790 atg tca gct agc tgc aga aca aac tgt gca agt atg agc aat gct ggt 2752 Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly 795 800 805 tgg cag gtc aaa ttc tga gagctctggc tgtggtacat ttcttcagat 2800 Trp Gln Val Lys Phe 810 atttctcatg gcctttgatg gaagaggaac aaggtgggaa aactggctga ggacttaagt 2860 attcactcaa cactcaaatg attgctgctg gtattctgta aaaartaaac aaagactaat 2920 atacacgtta gctggttaat ggtgcatatt tctgtcatgt ctgctaggta tgcctttata 2980 gcttagctag tgacatgaat tcatcaaggt aagattytct cctaccactg aataccactg 3040 tgtagattat aatatcccta atttggatta gttttgtact ttgtgttgag tttgtgatgc 3100 taaaagtatt taaaaattat atactaaatc acattgtacc aaagctgtaa tggaaaagca 3160 aagaagaayt gatgaattga aggaataatt tatatacatt atagagtttt cttttttaat 3220 ggatatatac tgtattgtag tgtttaatca aaataaaact atttgacctt atggaggaag 3280 gtcatgtttt taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3332 6 813 PRT Homo sapiens 6 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val 
Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp Gln Glu Ser Ile His Leu Gln Leu Ser Ser 85 90 95 Phe Ser Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 100 105 110 Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 115 120 125 Lys Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr Asp Ser 130 135 140 Thr Gly Ile Asp Leu His Glu Phe Leu Ile Asn Thr Leu Lys Asn Asn 145 150 155 160 Ser Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp 165 170 175 Phe Ile Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser 180 185 190 Ser Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 195 200 205 Asp His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr 210 215 220 Ser Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp 225 230 235 240 Glu Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn 245 250 255 Ser Ser Ile Asp Lys Glu Asp Asn Gln Ser Val Cys Ser Gln Glu Ser 260 265 270 Leu Phe Val Glu Asn Ser Arg Leu Leu Glu Asp Ser Asn Ile Cys Asn 275 280 285 Glu Thr Tyr Lys Lys Arg Gln Leu Phe Arg Gly Asn Arg Asp Gly Ser 290 295 300 Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu Asn Glu Leu Lys 305 310 315 320 Trp Ser Asp His Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser 325 330 335 Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly 340 345 350 Ile Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr 355 360 365 Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser 370 375 380 Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gln Ser Thr Pro Leu 385 390 395 400 Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu 405 410 415 Asn Gly Ile Gly Gly Gln Val Ala Pro Ser Ser Thr Ser Tyr Ile Leu 420 425 430 Leu Pro Leu Glu Ala Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu 435 440 
445 Asn Pro His Thr Gly Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala 450 455 460 Ile Tyr Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg Ser Ala Met Val 465 470 475 480 Gly Gln Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro 485 490 495 Gln Gln Gln Val Gln Pro Pro Gln Pro Gln Met Ala Gly Pro Leu Val 500 505 510 Thr Gln Ser Val Gln Gly Leu Gln Ala Ser Ser Gln Ser Val Gln Tyr 515 520 525 Pro Ala Val Ser Phe Pro Pro Gln His Leu Leu Pro Val Ser Pro Thr 530 535 540 Gln His Phe Pro Met Arg Asp Asp Val Ala Thr Gln Phe Gly Gln Met 545 550 555 560 Thr Leu Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly 565 570 575 Pro Val Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala Gln Gln Pro Ser 580 585 590 Tyr Val Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr Gly Gly Phe Ser 595 600 605 Gly Ser Gly Pro Pro Ile Ser Gln Gln Val Leu Gln Pro Pro Pro Ser 610 615 620 Pro Gln Gly Phe Val Gln Gln Pro Pro Pro Ala Gln Met Pro Val Tyr 625 630 635 640 Tyr Tyr Pro Ser Gly Gln Tyr Pro Thr Ser Thr Thr Gln Gln Tyr Arg 645 650 655 Pro Met Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser Gln Gln Met Pro 660 665 670 Gln Ala Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln 675 680 685 Gly Phe Gln Gly Leu Ile Gly Val Gln Gln Pro Pro Gln Ser Gln Asn 690 695 700 Val Ile Asn Asn Gln Gln Gly Thr Pro Val Gln Ser Val Met Val Ser 705 710 715 720 Tyr Pro Thr Met Ser Ser Tyr Gln Val Pro Met Thr Gln Gly Ser Gln 725 730 735 Gly Leu Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met Leu Pro Asn Gln 740 745 750 Ala Gly Gln Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn 755 760 765 Val Thr Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu Ile Gly Pro His 770 775 780 Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn 785 790 795 800 Cys Ala Ser Met Ser Asn Ala Gly Trp Gln Val Lys Phe 805 810 7 3272 DNA Homo sapiens CDS (329)..(2710) 7 gtctattttt aatgctattt aatgaaggag cgagcgcctc actcagcaat aaaagaagca 60 tgagggaaga cagagcagtg catggttatg gatactggac aaggatattt ggaaaggttg 120 acgatgtgtc acactgtgta agggaatcgc atggagatgg gcattccgaa 
ctgttaatgg 180 ggacatggga ctccagttgt ctctgatcac ttgtgtggat tttcctggcg tagaacgaca 240 gaagccgcta gtaagtcgcc aagacctaca gcaggaattc tgcaccaaag ggcataaaat 300 cttgttattt taatttgcat ctgggaga atg tct gag caa gga gac ctg aat 352 Met Ser Glu Gln Gly Asp Leu Asn 1 5 cag gca ata gca gag gaa gga ggg act gag cag gag acg gcc act cca 400 Gln Ala Ile Ala Glu Glu Gly Gly Thr Glu Gln Glu Thr Ala Thr Pro 10 15 20 gag aac ggc att gtt aaa tca gaa agt ctg gat gaa gag gag aaa ctg 448 Glu Asn Gly Ile Val Lys Ser Glu Ser Leu Asp Glu Glu Glu Lys Leu 25 30 35 40 gaa ctg cag agg cgg ctg gag gct cag aat caa gaa aga aga aaa tcc 496 Glu Leu Gln Arg Arg Leu Glu Ala Gln Asn Gln Glu Arg Arg Lys Ser 45 50 55 aag tca gga gca gga aaa ggt aaa ctg act cgc agt ctt gct gtc tgt 544 Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu Ala Val Cys 60 65 70 gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag gat cag gaa 592 Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gln Asp Gln Glu 75 80 85 tca att cat tta cag ctt tcc agt ttt tcc agc ctg caa gag gag gat 640 Ser Ile His Leu Gln Leu Ser Ser Phe Ser Ser Leu Gln Glu Glu Asp 90 95 100 aaa tct agg aaa gat gac tct gaa aga gaa aaa gaa aag gat aaa aac 688 Lys Ser Arg Lys Asp Asp Ser Glu Arg Glu Lys Glu Lys Asp Lys Asn 105 110 115 120 aaa gat aaa acc tct gaa aaa ccc aag atc aga atg tta tca aaa gat 736 Lys Asp Lys Thr Ser Glu Lys Pro Lys Ile Arg Met Leu Ser Lys Asp 125 130 135 tgc agc caa gaa tac acg gat tct aca ggc ata gac tta cac gag ttt 784 Cys Ser Gln Glu Tyr Thr Asp Ser Thr Gly Ile Asp Leu His Glu Phe 140 145 150 ctg att aac aca tta aag aat aat tcc agg gac agg atg ata ctt ttg 832 Leu Ile Asn Thr Leu Lys Asn Asn Ser Arg Asp Arg Met Ile Leu Leu 155 160 165 aaa atg gag cag gaa att att gat ttc att gct gac aac aat aat cat 880 Lys Met Glu Gln Glu Ile Ile Asp Phe Ile Ala Asp Asn Asn Asn His 170 175 180 tat aaa aag ttc cct cag atg tca tcg tat cag agg atg ctt gtc cat 928 Tyr Lys Lys Phe Pro Gln Met Ser Ser Tyr Gln Arg Met Leu Val His 185 190 195 200 cga gtg gca gct tat 
ttt gga ttg gat cac aat gtg gat caa aca gga 976 Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val Asp Gln Thr Gly 205 210 215 aaa tct gtt atc atc aac aag acc agc agc acc aga ata cca gag caa 1024 Lys Ser Val Ile Ile Asn Lys Thr Ser Ser Thr Arg Ile Pro Glu Gln 220 225 230 agg ttt tgt gaa cat tta aaa gat gaa aaa ggt gaa gaa tcc cag aag 1072 Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu Glu Ser Gln Lys 235 240 245 cgg ttt atc ttg aag cga gat aac tct agt att gat aaa gaa gac aat 1120 Arg Phe Ile Leu Lys Arg Asp Asn Ser Ser Ile Asp Lys Glu Asp Asn 250 255 260 cag tca gtt tgc tcc cag gaa agc ctt ttt gtg gaa aac agg ggc aac 1168 Gln Ser Val Cys Ser Gln Glu Ser Leu Phe Val Glu Asn Arg Gly Asn 265 270 275 280 aga gat ggc tca ggg aga aca tct ggg agt cga cag agc agc tca gaa 1216 Arg Asp Gly Ser Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu 285 290 295 aat gaa ctc aag tgg tct gac cac caa agg gcc tgg agc agc aca gac 1264 Asn Glu Leu Lys Trp Ser Asp His Gln Arg Ala Trp Ser Ser Thr Asp 300 305 310 tcc gac agt tcc aac cgc aat cta aag ccc gcc atg acc aag acg gcg 1312 Ser Asp Ser Ser Asn Arg Asn Leu Lys Pro Ala Met Thr Lys Thr Ala 315 320 325 agt ttt ggg ggc atc acg gtg ctg acc agg ggt gac agc act tcc agt 1360 Ser Phe Gly Gly Ile Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser 330 335 340 act agg agt acc ggg aag ctg tcc aaa gca ggt tcc gag tct tcc agc 1408 Thr Arg Ser Thr Gly Lys Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser 345 350 355 360 agt gca ggc tcc tca gga tcg ctg tcc cgc acc cat cca cct ctc cag 1456 Ser Ala Gly Ser Ser Gly Ser Leu Ser Arg Thr His Pro Pro Leu Gln 365 370 375 agc aca ccc cta gtc tca ggt gtg gca gct ggc tct cca ggc tgt gtg 1504 Ser Thr Pro Leu Val Ser Gly Val Ala Ala Gly Ser Pro Gly Cys Val 380 385 390 cct tat cca gag aat gga ata ggg ggc cag gtt gct ccc agc agc acc 1552 Pro Tyr Pro Glu Asn Gly Ile Gly Gly Gln Val Ala Pro Ser Ser Thr 395 400 405 agc tac atc ctc ctt cca ctt gaa gct gca aca ggc atc ccg cct gga 1600 Ser Tyr Ile Leu Leu Pro Leu Glu Ala Ala Thr Gly Ile Pro 
Pro Gly 410 415 420 agc atc ctt ctt aat cca cac aca ggc cag ccc ttt gtg aat ccc gat 1648 Ser Ile Leu Leu Asn Pro His Thr Gly Gln Pro Phe Val Asn Pro Asp 425 430 435 440 gga act cct gca ata tac aac cca ccc acc agt cag cag ccc ctg cga 1696 Gly Thr Pro Ala Ile Tyr Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg 445 450 455 agc gcc atg gtg ggg cag tcc caa cag cag ccg cca cag cag cag ccc 1744 Ser Ala Met Val Gly Gln Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro 460 465 470 tcc ccg cag ccc caa cag cag gtc cag cca ccg cag cca cag atg gca 1792 Ser Pro Gln Pro Gln Gln Gln Val Gln Pro Pro Gln Pro Gln Met Ala 475 480 485 ggc cct ctg gtc act cag tct gtc cag ggg ctg cag gct tcc tcc cag 1840 Gly Pro Leu Val Thr Gln Ser Val Gln Gly Leu Gln Ala Ser Ser Gln 490 495 500 tca gtg caa tat ccg gca gtc tct ttt cct ccc cag cac ctc cta cct 1888 Ser Val Gln Tyr Pro Ala Val Ser Phe Pro Pro Gln His Leu Leu Pro 505 510 515 520 gtg tct cca acg cag cac ttt ccc atg aga gat gat gtg gca aca cag 1936 Val Ser Pro Thr Gln His Phe Pro Met Arg Asp Asp Val Ala Thr Gln 525 530 535 ttt ggc cag atg acc ctg agc cgg cag tcc tcg ggg gag act cct gaa 1984 Phe Gly Gln Met Thr Leu Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu 540 545 550 ccc cca tca ggt cct gtc tac cca tcc tcc ctt atg cca cag ccg gcc 2032 Pro Pro Ser Gly Pro Val Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala 555 560 565 cag cag ccc agc tat gta atc gcc tct aca ggc cag cag ctt cct aca 2080 Gln Gln Pro Ser Tyr Val Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr 570 575 580 gga gga ttc tca ggc tct ggc cct ccc atc tcc cag cag gtc ctc cag 2128 Gly Gly Phe Ser Gly Ser Gly Pro Pro Ile Ser Gln Gln Val Leu Gln 585 590 595 600 ccc cct ccc tca cca cag gga tty gtg caa cag cct ccg cct gca cag 2176 Pro Pro Pro Ser Pro Gln Gly Phe Val Gln Gln Pro Pro Pro Ala Gln 605 610 615 atg cct gta tat tat tac cca tct ggt cag tac cct acc tca acc acg 2224 Met Pro Val Tyr Tyr Tyr Pro Ser Gly Gln Tyr Pro Thr Ser Thr Thr 620 625 630 caa cag tac cgg ccc atg gcc ccg gtt cag tac aac gct cag agg agt 2272 Gln Gln Tyr 
Arg Pro Met Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser 635 640 645 caa cag atg cca cag gca gca cag caa gca ggt tac cag cca gtc ttg 2320 Gln Gln Met Pro Gln Ala Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu 650 655 660 tct ggt caa cag gga ttc caa ggc cta ata gga gtg cag cag cca cct 2368 Ser Gly Gln Gln Gly Phe Gln Gly Leu Ile Gly Val Gln Gln Pro Pro 665 670 675 680 cag agt cag aac gtg ata aat aac caa caa gga act ccg gtg caa agc 2416 Gln Ser Gln Asn Val Ile Asn Asn Gln Gln Gly Thr Pro Val Gln Ser 685 690 695 gtg atg gtt tcc tac cca aca atg tct tct tat cag gtg cca atg acc 2464 Val Met Val Ser Tyr Pro Thr Met Ser Ser Tyr Gln Val Pro Met Thr 700 705 710 cag ggt tct caa gga ctg ccc cag cag tca tac caa cag cca atc atg 2512 Gln Gly Ser Gln Gly Leu Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met 715 720 725 cta cct aac cag gca ggt caa ggg tca ctc cca gcc act gga atg cct 2560 Leu Pro Asn Gln Ala Gly Gln Gly Ser Leu Pro Ala Thr Gly Met Pro 730 735 740 gtt tac tgt aat gtc aca ccg ccc acc cct cag aac aac ctt agg ctg 2608 Val Tyr Cys Asn Val Thr Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu 745 750 755 760 att ggc cca cac tgc ccc tcc agc act gtc cca gtg atg tca gct agc 2656 Ile Gly Pro His Cys Pro Ser Ser Thr Val Pro Val Met Ser Ala Ser 765 770 775 tgc aga aca aac tgt gca agt atg agc aat gct ggt tgg cag gtc aaa 2704 Cys Arg Thr Asn Cys Ala Ser Met Ser Asn Ala Gly Trp Gln Val Lys 780 785 790 ttc tga gagctctggc tgtggtacat ttcttcagat atttctcatg gcctttgatg 2760 Phe gaagaggaac aaggtgggaa aactggctga ggacttaagt attcactcaa cactcaaatg 2820 attgctgctg gtattctgta aaaartaaac aaagactaat atacacgtta gctggttaat 2880 ggtgcatatt tctgtcatgt ctgctaggta tgcctttata gcttagctag tgacatgaat 2940 tcatcaaggt aagattytct cctaccactg aataccactg tgtagattat aatatcccta 3000 atttggatta gttttgtact ttgtgttgag tttgtgatgc taaaagtatt taaaaattat 3060 atactaaatc acattgtacc aaagctgtaa tggaaaagca aagaagaayt gatgaattga 3120 aggaataatt tatatacatt atagagtttt cttttttaat ggatatatac tgtattgtag 3180 tgtttaatca aaataaaact atttgacctt atggaggaag 
gtcatgtttt taaaaaaaaa 3240 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3272 8 793 PRT Homo sapiens 8 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp Gln Glu Ser Ile His Leu Gln Leu Ser Ser 85 90 95 Phe Ser Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu 100 105 110 Arg Glu Lys Glu Lys Asp Lys Asn Lys Asp Lys Thr Ser Glu Lys Pro 115 120 125 Lys Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr Asp Ser 130 135 140 Thr Gly Ile Asp Leu His Glu Phe Leu Ile Asn Thr Leu Lys Asn Asn 145 150 155 160 Ser Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp 165 170 175 Phe Ile Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser 180 185 190 Ser Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu 195 200 205 Asp His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr 210 215 220 Ser Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp 225 230 235 240 Glu Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn 245 250 255 Ser Ser Ile Asp Lys Glu Asp Asn Gln Ser Val Cys Ser Gln Glu Ser 260 265 270 Leu Phe Val Glu Asn Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser 275 280 285 Gly Ser Arg Gln Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser Asp His 290 295 300 Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu 305 310 315 320 Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly Ile Thr Val Leu 325 330 335 Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser 340 345 350 Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu 355 360 365 Ser Arg Thr His Pro Pro Leu Gln Ser Thr Pro Leu Val Ser Gly Val 370 375 380 Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly Ile Gly 385 390 395 400 Gly 
Gln Val Ala Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro Leu Glu 405 410 415 Ala Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro His Thr 420 425 430 Gly Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala Ile Tyr Asn Pro 435 440 445 Pro Thr Ser Gln Gln Pro Leu Arg Ser Ala Met Val Gly Gln Ser Gln 450 455 460 Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln Gln Val 465 470 475 480 Gln Pro Pro Gln Pro Gln Met Ala Gly Pro Leu Val Thr Gln Ser Val 485 490 495 Gln Gly Leu Gln Ala Ser Ser Gln Ser Val Gln Tyr Pro Ala Val Ser 500 505 510 Phe Pro Pro Gln His Leu Leu Pro Val Ser Pro Thr Gln His Phe Pro 515 520 525 Met Arg Asp Asp Val Ala Thr Gln Phe Gly Gln Met Thr Leu Ser Arg 530 535 540 Gln Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro 545 550 555 560 Ser Ser Leu Met Pro Gln Pro Ala Gln Gln Pro Ser Tyr Val Ile Ala 565 570 575 Ser Thr Gly Gln Gln Leu Pro Thr Gly Gly Phe Ser Gly Ser Gly Pro 580 585 590 Pro Ile Ser Gln Gln Val Leu Gln Pro Pro Pro Ser Pro Gln Gly Phe 595 600 605 Val Gln Gln Pro Pro Pro Ala Gln Met Pro Val Tyr Tyr Tyr Pro Ser 610 615 620 Gly Gln Tyr Pro Thr Ser Thr Thr Gln Gln Tyr Arg Pro Met Ala Pro 625 630 635 640 Val Gln Tyr Asn Ala Gln Arg Ser Gln Gln Met Pro Gln Ala Ala Gln 645 650 655 Gln Ala Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln Gly Phe Gln Gly 660 665 670 Leu Ile Gly Val Gln Gln Pro Pro Gln Ser Gln Asn Val Ile Asn Asn 675 680 685 Gln Gln Gly Thr Pro Val Gln Ser Val Met Val Ser Tyr Pro Thr Met 690 695 700 Ser Ser Tyr Gln Val Pro Met Thr Gln Gly Ser Gln Gly Leu Pro Gln 705 710 715 720 Gln Ser Tyr Gln Gln Pro Ile Met Leu Pro Asn Gln Ala Gly Gln Gly 725 730 735 Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro 740 745 750 Thr Pro Gln Asn Asn Leu Arg Leu Ile Gly Pro His Cys Pro Ser Ser 755 760 765 Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met 770 775 780 Ser Asn Ala Gly Trp Gln Val Lys Phe 785 790 9 1006 DNA Homo sapiens CDS (280)..(549) 9 gggcagcttg agacaggtgg agctggatca agctgtgaac gtgatttgct ggaagctggt 60 
cattagtgtt gacgatgtgt cacactgtgt aagggaatcg catggagatg ggcattccga 120 actgttaatg gggacatggg actccagttg tctctgatca cttgtgtgga ttttcctggc 180 gtagaacgac agaagccgct agtaagtcgc caagacctac agcaggaatt ctgcaccaaa 240 gggcataaaa tcttgttatt ttaatttgca tctgggaga atg tct gag caa gga 294 Met Ser Glu Gln Gly 1 5 gac ctg aat cag gca ata gca gag gaa gga ggg act gag cag gag acg 342 Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly Thr Glu Gln Glu Thr 10 15 20 gcc act cca gag aac ggc att gtt aaa tca gaa agt ctg gat gaa gag 390 Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu Ser Leu Asp Glu Glu 25 30 35 gag aaa ctg gaa ctg cag agg cgg ctg gag gct cag aat caa gaa aga 438 Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala Gln Asn Gln Glu Arg 40 45 50 aga aaa tcc aag tca gga gca gga aaa ggt aaa ctg act cgc agt ctt 486 Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu Thr Arg Ser Leu 55 60 65 gct gtc tgt gag gaa tct tct gcc aga cca gga ggt gaa agt ctt cag 534 Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly Gly Glu Ser Leu Gln 70 75 80 85 gat cag act ctc tga aaactgcaaa tggaaaggaa ttcaaaagaa tttagattaa 589 Asp Gln Thr Leu aagttaaata aaaagtaggc acagtagtgc tgaattttcc tcaaaggctc tcttttgata 649 aggctgaacc aaatataatc ccaagtatcc tctctccttc cttgttggag atgtcttacc 709 tctcagctcc caaaatgcac ttgcctataa gaaacacaat tgctggttca tatgaaactt 769 wagaaatagt gaataaggtg catttaactt tggagaaata cttttatgsc tttggtggag 829 atttctcaat actgcaaaag ttgtccagaa atgaatctga gctgatggtg actttaagtt 889 aatattatta atatatcact gcatattttt acccttattt ttgctcctta cagcaagatt 949 agtaggttat aaaaatttaa atttaaacaa aattatttca tgacaaaatg ggaaact 1006 10 89 PRT Homo sapiens 10 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp 
Gln Thr Leu 85 11 807 PRT Mus musculus 11 Met Ser Glu Gln Gly Gly Leu Thr Pro Thr Ile Leu Glu Glu Gly Gln 1 5 10 15 Thr Glu Pro Glu Ser Ala Pro Glu Asn Gly Ile Leu Lys Ser Glu Ser 20 25 30 Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Ala Ala Gln 35 40 45 Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu 50 55 60 Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Ser Gly Gly 65 70 75 80 Glu Ser His Gln Asp Gln Glu Ser Ile His Leu Gln Leu Ser Ser Phe 85 90 95 Pro Ser Leu Gln Glu Glu Asp Lys Ser Arg Lys Asp Asp Ser Glu Arg 100 105 110 Glu Lys Glu Lys Asp Lys Asn Arg Glu Lys Leu Ser Glu Arg Pro Lys 115 120 125 Ile Arg Met Leu Ser Lys Asp Cys Ser Gln Glu Tyr Thr Asp Ser Thr 130 135 140 Gly Ile Asp Leu His Gly Phe Leu Ile Asn Thr Leu Lys Asn Asn Ser 145 150 155 160 Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Met Ile Asp Phe 165 170 175 Ile Ala Asp Ser Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser Ser 180 185 190 Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp 195 200 205 His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr Ser 210 215 220 Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp Glu 225 230 235 240 Lys Ser Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn Ser 245 250 255 Ser Ile Asp Lys Glu Asp Asn Gln Asn Arg Met His Pro Phe Arg Asp 260 265 270 Asp Arg Arg Ser Lys Ser Ile Glu Glu Arg Glu Glu Glu Tyr Gln Arg 275 280 285 Val Arg Glu Arg Ile Phe Ala His Asp Ser Val Cys Ser Gln Glu Ser 290 295 300 Leu Phe Leu Asp Asn Ser Arg Leu Gln Glu Asp Met His Ile Cys Asn 305 310 315 320 Glu Thr Tyr Lys Lys Arg Gln Leu Phe Arg Ala His Arg Asp Ser Ser 325 330 335 Gly Arg Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu Thr Glu Leu Arg 340 345 350 Trp Pro Asp His Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser 355 360 365 Asn Arg Asn Leu Lys Pro Thr Met Thr Lys Thr Ala Ser Phe Gly Gly 370 375 380 Ile Thr Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Ala 385 390 395 400 Gly Lys Leu Ser Lys Thr Gly Ser Glu Ser Ser Ser Ser Ala 
Gly Ser 405 410 415 Ser Gly Ser Leu Ser Arg Thr His Pro Gln Ser Thr Ala Leu Thr Ser 420 425 430 Ser Val Ala Ala Gly Ser Pro Gly Cys Met Ala Tyr Ser Glu Asn Gly 435 440 445 Met Gly Gly Gln Val Pro Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro 450 455 460 Leu Glu Ser Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro 465 470 475 480 His Thr Gly Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala Ile Tyr 485 490 495 Asn Pro Pro Gly Ser Gln Gln Thr Leu Arg Gly Thr Val Gly Gly Gln 500 505 510 Pro Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln Gln 515 520 525 Val Gln Ala Ser Gln Pro Gln Met Ala Gly Pro Leu Val Thr Gln Arg 530 535 540 Glu Glu Leu Ala Ala Gln Phe Ser Gln Leu Ser Met Ser Arg Gln Ser 545 550 555 560 Ser Gly Asp Thr Pro Glu Pro Pro Ser Gly Thr Val Tyr Pro Ala Ser 565 570 575 Leu Leu Pro Gln Thr Ala Gln Pro Gln Ser Tyr Val Ile Thr Ser Ala 580 585 590 Gly Gln Gln Leu Ser Thr Gly Gly Phe Ser Asp Ser Gly Pro Pro Ile 595 600 605 Ser Gln Gln Val Leu Gln Ala Pro Pro Ser Pro Gln Gly Phe Val Gln 610 615 620 Gln Pro Pro Pro Ala Gln Met Ser Val Tyr Tyr Tyr Pro Ser Gly Gln 625 630 635 640 Tyr Pro Thr Ser Thr Ser Gln Gln Tyr Arg Pro Leu Ala Ser Val Gln 645 650 655 Tyr Ser Ala Gln Arg Ser Gln Gln Ile Pro Gln Thr Thr Gln Gln Ala 660 665 670 Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln Gly Phe Gln Gly Met Met 675 680 685 Gly Val Gln Gln Ser Ala His Ser Gln Gly Val Met Ser Ser Gln Gln 690 695 700 Gly Ala Pro Val His Gly Val Met Val Ser Tyr Pro Thr Met Ser Ser 705 710 715 720 Tyr Gln Val Pro Met Thr Gln Gly Ser Gln Ala Val Pro Gln Gln Thr 725 730 735 Tyr Gln Pro Pro Ile Met Leu Pro Ser Gln Ala Gly Gln Gly Ser Leu 740 745 750 Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Asn Pro 755 760 765 Gln Asn Asn Leu Arg Leu Met Gly Pro His Cys Pro Ser Ser Thr Val 770 775 780 Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Gly Asn Val Ser Asn 785 790 795 800 Ala Gly Trp Gln Val Lys Phe 805 12 648 PRT Homo sapien 12 Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp Phe Ile Ala Asp 1 5 10 15 Asn 
Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser Ser Tyr Gln Arg 20 25 30 Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp His Asn Val 35 40 45 Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr Ser Ser Thr Arg 50 55 60 Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp Glu Lys Gly Glu 65 70 75 80 Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn Ser Ser Ile Asp 85 90 95 Lys Glu Asp Asn Gln Ser Val Cys Ser Gln Glu Ser Leu Phe Val Glu 100 105 110 Asn Arg Leu Leu Glu Asp Ser Asn Ile Cys Asn Glu Thr Tyr Lys Lys 115 120 125 Arg Gln Leu Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg Thr Ser Gly 130 135 140 Ser Arg Gln Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser Asp His Gln 145 150 155 160 Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg Asn Leu Lys 165 170 175 Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly Ile Thr Val Leu Thr 180 185 190 Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly Lys Leu Ser Lys 195 200 205 Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly Ser Leu Ser 210 215 220 Arg Thr His Pro Pro Leu Gln Ser Thr Pro Leu Val Ser Gly Val Ala 225 230 235 240 Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly Ile Gly Gly 245 250 255 Gln Val Ala Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro Leu Glu Ala 260 265 270 Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro His Thr Gly 275 280 285 Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala Ile Tyr Asn Pro Pro 290 295 300 Thr Ser Gln Gln Pro Leu Arg Ser Ala Met Val Gly Gln Ser Gln Gln 305 310 315 320 Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln Gln Val Gln 325 330 335 Pro Pro Gln Pro Gln Met Ala Gly Pro Leu Val Thr Gln Ser Val Gln 340 345 350 Gly Leu Gln Ala Ser Ser Gln Ser Val Gln Tyr Pro Ala Val Ser Phe 355 360 365 Pro Pro Gln His Leu Leu Pro Val Ser Pro Thr Gln His Phe Pro Met 370 375 380 Arg Asp Asp Val Ala Thr Gln Phe Gly Gln Met Thr Leu Ser Arg Gln 385 390 395 400 Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val Tyr Pro Ser 405 410 415 Ser Leu Met Pro Gln Pro Ala Gln Gln Pro Ser Tyr Val Ile Ala Ser 420 425 430 Thr Gly Gln Gln Leu Pro 
Thr Gly Gly Phe Ser Gly Ser Gly Pro Pro 435 440 445 Ile Ser Gln Gln Val Leu Gln Pro Pro Pro Ser Pro Gln Gly Phe Val 450 455 460 Gln Gln Pro Pro Pro Ala Gln Met Pro Val Tyr Tyr Tyr Pro Ser Gly 465 470 475 480 Gln Tyr Pro Thr Ser Thr Thr Gln Gln Tyr Arg Pro Met Ala Pro Val 485 490 495 Gln Tyr Asn Ala Gln Arg Ser Gln Gln Met Pro Gln Ala Ala Gln Gln 500 505 510 Ala Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln Gly Phe Gln Gly Leu 515 520 525 Ile Gly Val Gln Gln Pro Pro Gln Ser Gln Asn Val Ile Asn Asn Gln 530 535 540 Gln Gly Thr Pro Val Gln Ser Val Met Val Ser Tyr Pro Thr Met Ser 545 550 555 560 Ser Tyr Gln Val Pro Met Thr Gln Gly Ser Gln Gly Leu Pro Gln Gln 565 570 575 Ser Tyr Gln Gln Pro Ile Met Leu Pro Asn Gln Ala Gly Gln Gly Ser 580 585 590 Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr Pro Pro Thr 595 600 605 Pro Gln Asn Asn Leu Arg Leu Ile Gly Pro His Cys Pro Ser Ser Thr 610 615 620 Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala Ser Met Ser 625 630 635 640 Asn Ala Gly Trp Gln Val Lys Phe 645 13 651 PRT Homo sapien 13 Arg Asp Arg Met Ile Leu Leu Lys Met Glu Gln Glu Ile Ile Asp Phe 1 5 10 15 Ile Ala Asp Asn Asn Asn His Tyr Lys Lys Phe Pro Gln Met Ser Ser 20 25 30 Tyr Gln Arg Met Leu Val His Arg Val Ala Ala Tyr Phe Gly Leu Asp 35 40 45 His Asn Val Asp Gln Thr Gly Lys Ser Val Ile Ile Asn Lys Thr Ser 50 55 60 Ser Thr Arg Ile Pro Glu Gln Arg Phe Cys Glu His Leu Lys Asp Glu 65 70 75 80 Lys Gly Glu Glu Ser Gln Lys Arg Phe Ile Leu Lys Arg Asp Asn Ser 85 90 95 Ser Ile Asp Lys Glu Asp Asn Gln Ser Val Cys Ser Gln Glu Ser Leu 100 105 110 Phe Val Glu Asn Arg Leu Leu Glu Asp Ser Asn Ile Cys Asn Glu Thr 115 120 125 Tyr Lys Lys Arg Gln Leu Phe Arg Gly Asn Arg Asp Gly Ser Gly Arg 130 135 140 Thr Ser Gly Ser Arg Gln Ser Ser Ser Glu Asn Glu Leu Lys Trp Ser 145 150 155 160 Asp His Gln Arg Ala Trp Ser Ser Thr Asp Ser Asp Ser Ser Asn Arg 165 170 175 Asn Leu Lys Pro Ala Met Thr Lys Thr Ala Ser Phe Gly Gly Ile Thr 180 185 190 Val Leu Thr Arg Gly Asp Ser Thr Ser Ser Thr Arg Ser Thr Gly 
Lys 195 200 205 Leu Ser Lys Ala Gly Ser Glu Ser Ser Ser Ser Ala Gly Ser Ser Gly 210 215 220 Ser Leu Ser Arg Thr His Pro Pro Leu Gln Ser Thr Pro Leu Val Ser 225 230 235 240 Gly Val Ala Ala Gly Ser Pro Gly Cys Val Pro Tyr Pro Glu Asn Gly 245 250 255 Ile Gly Gly Gln Val Ala Pro Ser Ser Thr Ser Tyr Ile Leu Leu Pro 260 265 270 Leu Glu Ala Ala Thr Gly Ile Pro Pro Gly Ser Ile Leu Leu Asn Pro 275 280 285 His Thr Gly Gln Pro Phe Val Asn Pro Asp Gly Thr Pro Ala Ile Tyr 290 295 300 Asn Pro Pro Thr Ser Gln Gln Pro Leu Arg Ser Ala Met Val Gly Gln 305 310 315 320 Ser Gln Gln Gln Pro Pro Gln Gln Gln Pro Ser Pro Gln Pro Gln Gln 325 330 335 Gln Val Gln Pro Pro Gln Pro Gln Met Ala Gly Pro Leu Val Thr Gln 340 345 350 Ser Val Gln Gly Leu Gln Ala Ser Ser Gln Ser Val Gln Tyr Pro Ala 355 360 365 Val Ser Phe Pro Pro Gln His Leu Leu Pro Val Ser Pro Thr Gln His 370 375 380 Phe Pro Met Arg Asp Asp Val Ala Thr Gln Phe Gly Gln Met Thr Leu 385 390 395 400 Ser Arg Gln Ser Ser Gly Glu Thr Pro Glu Pro Pro Ser Gly Pro Val 405 410 415 Tyr Pro Ser Ser Leu Met Pro Gln Pro Ala Gln Gln Pro Ser Tyr Val 420 425 430 Ile Ala Ser Thr Gly Gln Gln Leu Pro Thr Gly Gly Phe Ser Gly Ser 435 440 445 Gly Pro Pro Ile Ser Gln Gln Val Leu Gln Pro Pro Pro Ser Pro Gln 450 455 460 Gly Phe Val Gln Gln Pro Pro Pro Ala Gln Met Pro Val Tyr Tyr Tyr 465 470 475 480 Pro Ser Gly Gln Tyr Pro Thr Ser Thr Thr Gln Gln Tyr Arg Pro Met 485 490 495 Ala Pro Val Gln Tyr Asn Ala Gln Arg Ser Gln Gln Met Pro Gln Ala 500 505 510 Ala Gln Gln Ala Gly Tyr Gln Pro Val Leu Ser Gly Gln Gln Gly Phe 515 520 525 Gln Gly Leu Ile Gly Val Gln Gln Pro Pro Gln Ser Gln Asn Val Ile 530 535 540 Asn Asn Gln Gln Gly Thr Pro Val Gln Ser Val Met Val Ser Tyr Pro 545 550 555 560 Thr Met Ser Ser Tyr Gln Val Pro Met Thr Gln Gly Ser Gln Gly Leu 565 570 575 Pro Gln Gln Ser Tyr Gln Gln Pro Ile Met Leu Pro Asn Gln Ala Gly 580 585 590 Gln Gly Ser Leu Pro Ala Thr Gly Met Pro Val Tyr Cys Asn Val Thr 595 600 605 Pro Pro Thr Pro Gln Asn Asn Leu Arg Leu Ile Gly Pro His Cys Pro 
610 615 620 Ser Ser Thr Val Pro Val Met Ser Ala Ser Cys Arg Thr Asn Cys Ala 625 630 635 640 Ser Met Ser Asn Ala Gly Trp Gln Val Lys Phe 645 650 14 89 PRT Homo sapien 14 Met Ser Glu Gln Gly Asp Leu Asn Gln Ala Ile Ala Glu Glu Gly Gly 1 5 10 15 Thr Glu Gln Glu Thr Ala Thr Pro Glu Asn Gly Ile Val Lys Ser Glu 20 25 30 Ser Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Glu Ala 35 40 45 Gln Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys 50 55 60 Leu Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Pro Gly 65 70 75 80 Gly Glu Ser Leu Gln Asp Gln Thr Leu 85 15 88 PRT Mus musculus 15 Met Ser Glu Gln Gly Gly Leu Thr Pro Thr Ile Leu Glu Glu Gly Gln 1 5 10 15 Thr Glu Pro Glu Ser Ala Pro Glu Asn Gly Ile Leu Lys Ser Glu Ser 20 25 30 Leu Asp Glu Glu Glu Lys Leu Glu Leu Gln Arg Arg Leu Ala Ala Gln 35 40 45 Asn Gln Glu Arg Arg Lys Ser Lys Ser Gly Ala Gly Lys Gly Lys Leu 50 55 60 Thr Arg Ser Leu Ala Val Cys Glu Glu Ser Ser Ala Arg Ser Gly Gly 65 70 75 80 Glu Ser His Gln Asp Gln Thr Leu 85
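The listing for SEQ ID NO 9 annotates its coding region as CDS (280)..(549) and prints a running nucleotide count at the end of each row (294 after the fifth codon, 342 after the twenty-first). The following is a minimal sketch, not part of the application, of how those coordinates relate to residue numbers. The codon table is deliberately partial and assumes the standard genetic code; the names `translate` and `residue_to_nt` are hypothetical helpers introduced only for illustration.

```python
# Partial codon table: only the codons that occur in the excerpt below.
# Assumption: standard genetic code (a complete table has 64 entries).
CODONS = {
    "atg": "Met", "tct": "Ser", "gag": "Glu", "caa": "Gln", "gga": "Gly",
    "gac": "Asp", "ctg": "Leu", "aat": "Asn", "cag": "Gln", "gca": "Ala",
    "ata": "Ile", "gaa": "Glu", "ggg": "Gly", "act": "Thr", "acg": "Thr",
}

# "CDS (280)..(549)" from the SEQ ID NO 9 header: 1-based, inclusive.
CDS_START, CDS_END = 280, 549

# First 21 codons of that CDS, copied from the listing (positions 280-342).
excerpt = (
    "atg tct gag caa gga "
    "gac ctg aat cag gca ata gca gag gaa gga ggg act gag cag gag acg"
).replace(" ", "")


def translate(nt):
    """Translate an in-frame nucleotide string into three-letter residues."""
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return [CODONS[nt[i:i + 3]] for i in range(0, usable, 3)]


def residue_to_nt(first, last, cds_start):
    """Map residues first..last (1-based) to 1-based nucleotide positions."""
    return cds_start + 3 * (first - 1), cds_start + 3 * last - 1


if __name__ == "__main__":
    print(" ".join(translate(excerpt)))
    print(residue_to_nt(1, 5, CDS_START))    # (280, 294), the row count above
    print((CDS_END - CDS_START + 1) // 3)    # 90 codons: 89 residues + stop
```

Under these assumptions the 270-nucleotide CDS holds 90 codons, which agrees with the 89-residue length given for SEQ ID NO 10 plus one stop codon.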
Claims (24)
1. An isolated polynucleotide which codes without interruption for a human TARPP polypeptide having an amino acid sequence set forth in SEQ ID NO 2 (Br137E), SEQ ID NO 4 (Br137A), SEQ ID NO 6 (Br137B), SEQ ID NO 8 (Br137C), or a complement thereto.
2. An isolated polynucleotide of claim 1, having the nucleotide sequence set forth in SEQ ID NO: 1, 3, 5, or 7.
3. An isolated polynucleotide comprising,
a polynucleotide sequence having 99% or more sequence identity to the polynucleotide sequence set forth in SEQ ID NO 2 (Br137E), SEQ ID NO 4 (Br137A), SEQ ID NO 6 (Br137B), or SEQ ID NO 8 (Br137C), which codes without interruption for a human TARPP, or a complement thereto, and which has nucleic acid binding activity.
4. An isolated polynucleotide of a human TARPP of claim 1 consisting essentially of,
a polynucleotide sequence coding for amino acids 267-300, 312-331, 1-161, 88-161, effective specific fragments thereof, or complements thereto.
5. An isolated polynucleotide of claim 4, wherein said fragment is effective in a polymerase chain reaction.
6. An isolated polypeptide coding for human TARPP having an amino acid sequence set forth in SEQ ID NO 2 (Br137E), SEQ ID NO 4 (Br137A), SEQ ID NO 6 (Br137B), and SEQ ID NO 8 (Br137C).
7. An isolated polypeptide consisting essentially of a polypeptide coded for by a polynucleotide sequence of claim 4.
8. An isolated polypeptide comprising an amino acid sequence having 99% or more sequence identity to a human TARPP of claim 1 and having the amino acid sequence set forth in SEQ ID NO 2 (Br137E), SEQ ID NO 4 (Br137A), SEQ ID NO 6 (Br137B), or SEQ ID NO 8 (Br137C).
9. A method of modulating T-cells, comprising,
contacting T-cells with an agent which is effective for regulating a human TARPP gene of claim 1 expressed in said cells, or for modulating the biological activity of a polypeptide encoded thereby.
10. A method of claim 9, wherein said agent is an antibody or an antisense polynucleotide effective to inhibit translation of said gene.
11. A method of treating a disease of the immune or nervous system, comprising,
administering to a subject in need thereof an amount of an agent effective for modulating the expression of a human TARPP of claim 1, or for modulating the biological activity of a polypeptide encoded thereby.
12. A method of detecting expression of a gene coding for human TARPP, comprising,
contacting a sample comprising nucleic acid with a polynucleotide probe specific for a human TARPP of claim 1 under conditions effective for said probe to hybridize specifically with said human TARPP, and
detecting hybridization between said probe and said human TARPP.
13. A method of claim 12, wherein said detecting is performed by:
Northern blot analysis, polymerase chain reaction (PCR), reverse transcriptase PCR, RACE PCR, or in situ hybridization.
14. A method of assessing a therapeutic or preventative intervention in a subject having a disease of the immune or nervous system, comprising,
determining the expression levels of a human TARPP of claim 1 in a sample comprising immune or neuronal cells.
15. A method for identifying an agent that modulates a human TARPP gene in cells expressing said gene, comprising,
contacting cells expressing human TARPP of claim 1 with a test agent under conditions effective for said test agent to modulate the expression of a gene coding for said human TARPP, and
determining whether said test agent modulates said human TARPP.
16. A method of claim 15, wherein said agent is an antisense polynucleotide to a target polynucleotide sequence selected from SEQ ID NO. 1 (Br137E), 3 (Br137A), 5 (Br137B), or 7 (Br137C), and which is effective to inhibit translation of said human TARPP.
17. A method for identifying an agent that modulates the biological activity of a human TARPP polypeptide in cells expressing said polypeptide, comprising,
contacting cells expressing a human TARPP polynucleotide of claim 1 with a test agent under conditions effective for said test agent to modulate the biological activity of a human TARPP polypeptide coded for by said polynucleotide, and
determining whether said test agent modulates said human TARPP.
18. A method of claim 17, wherein said agent is a polynucleotide which binds to said polypeptide.
19. A method of detecting polymorphisms in human TARPP comprising,
comparing the structure of: genomic DNA comprising all or part of human TARPP, mRNA comprising all or part of human TARPP, cDNA comprising all or part of human TARPP, or a polypeptide comprising all or part of human TARPP, with the structure of human TARPP of claim 1.
20. A method of claim 19, wherein said polymorphism is a nucleotide deletion, substitution, inversion, or transposition.
21. A human cell whose genome comprises a functional disruption of human TARPP in the region comprising the coding sequence for amino acids 1-161 of a human TARPP of claim 1.
22. A human cell whose genome comprises a deletion of a coding sequence for amino acids 267-300 and/or 312-331 of a human TARPP of claim 1.
23. A method of advertising human TARPP for sale, commercial use, or licensing, comprising,
displaying in a computer-readable medium a polynucleotide or amino acid sequence for a human TARPP of claim 1, effective specific fragments thereof, or complements thereto.
24. An antibody which is specific for a human TARPP, said antibody which is specific for an epitope present in amino acid sequences 1-161, 88-161, 267-300, 312-331, or a polypeptide comprising amino acid 312, of a human TARPP of claim 1.
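Several claims above turn on two sequence operations: taking a "complement thereto" (claims 1, 3, 4) and meeting a "99% or more sequence identity" threshold (claims 3, 8). This section does not state how identity is to be computed, so the following is only an illustrative sketch under stated assumptions: the two sequences are already aligned, have equal length, and contain no gaps. The function names are hypothetical, and the one-letter residue strings were converted by hand from the three-letter listings for SEQ ID NO 14 (human) and SEQ ID NO 15 (mouse).

```python
# Watson-Crick complement for unambiguous bases only. Ambiguity codes such
# as the 'w' and 's' that appear in SEQ ID NO 9 are left unchanged here.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def percent_identity(a, b):
    """Percent of identical positions in two pre-aligned, ungapped strings."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def max_mismatches(length, min_percent=99):
    """Largest number of mismatches that still meets min_percent identity."""
    return length * (100 - min_percent) // 100


# First 15 nucleotides of the SEQ ID NO 9 coding region.
CDS_5PRIME = "atgtctgagcaagga"

# Residues 1-16 of SEQ ID NO 14 (human) and SEQ ID NO 15 (mouse).
HUMAN_N16 = "MSEQGDLNQAIAEEGG"
MOUSE_N16 = "MSEQGGLTPTILEEGQ"

if __name__ == "__main__":
    print(reverse_complement(CDS_5PRIME))          # tccttgctcagacat
    print(percent_identity(HUMAN_N16, MOUSE_N16))  # 62.5
    print(max_mismatches(793))                     # 7
```

With this simple definition, a 793-residue polypeptide such as SEQ ID NO 8 could differ at no more than 7 positions and still reach 99% identity, while an 89-residue sequence could not differ at all. A gapped alignment or a different identity definition would give different counts.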
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/112,372 US20030186249A1 (en) | 2002-04-01 | 2002-04-01 | Human TARPP genes and polypeptides |
| US10/164,717 US7115393B2 (en) | 2002-04-01 | 2002-06-10 | Melanocortin-1 receptor and methods of use |
| US10/167,631 US20030232339A1 (en) | 2002-04-01 | 2002-06-13 | Human TRPCC cation channel and uses |
| US10/177,917 US20030235826A1 (en) | 2002-04-01 | 2002-06-24 | Gene and protein specific for excitable tissues |
| PCT/US2003/009921 WO2003085095A2 (en) | 2002-04-01 | 2003-04-01 | Novel expressed genes |
| AU2003218483A AU2003218483A1 (en) | 2002-04-01 | 2003-04-01 | Novel expressed genes |
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/112,372 US20030186249A1 (en) | 2002-04-01 | 2002-04-01 | Human TARPP genes and polypeptides |
| US38261402P | 2002-05-24 | 2002-05-24 | |
| US10/164,717 US7115393B2 (en) | 2002-04-01 | 2002-06-10 | Melanocortin-1 receptor and methods of use |
| US10/167,631 US20030232339A1 (en) | 2002-04-01 | 2002-06-13 | Human TRPCC cation channel and uses |
| US10/177,917 US20030235826A1 (en) | 2002-04-01 | 2002-06-24 | Gene and protein specific for excitable tissues |
| US39912502P | 2002-07-30 | 2002-07-30 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20030186249A1 (en) | 2003-10-02 |
Family
ID=28795387
Family Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/112,372 Abandoned US20030186249A1 (en) | 2002-04-01 | 2002-04-01 | Human TARPP genes and polypeptides |
| US10/164,717 Expired - Lifetime US7115393B2 (en) | 2002-04-01 | 2002-06-10 | Melanocortin-1 receptor and methods of use |
| US10/167,631 Abandoned US20030232339A1 (en) | 2002-04-01 | 2002-06-13 | Human TRPCC cation channel and uses |
| US10/177,917 Abandoned US20030235826A1 (en) | 2002-04-01 | 2002-06-24 | Gene and protein specific for excitable tissues |
Family Applications After (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/164,717 Expired - Lifetime US7115393B2 (en) | 2002-04-01 | 2002-06-10 | Melanocortin-1 receptor and methods of use |
| US10/167,631 Abandoned US20030232339A1 (en) | 2002-04-01 | 2002-06-13 | Human TRPCC cation channel and uses |
| US10/177,917 Abandoned US20030235826A1 (en) | 2002-04-01 | 2002-06-24 | Gene and protein specific for excitable tissues |
Country Status (3)
| Country | Link |
|---|---|
| US (4) | US20030186249A1 (en) |
| AU (1) | AU2003218483A1 (en) |
| WO (1) | WO2003085095A2 (en) |
Families Citing this family (44)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1589103A4 (en) | 2003-01-17 | 2006-07-19 | Astellas Pharma Inc | Method of screening agent for improving memory and learning ability |
| US7344882B2 (en) * | 2003-05-12 | 2008-03-18 | Bristol-Myers Squibb Company | Polynucleotides encoding variants of the TRP channel family member, LTRPC3 |
| WO2005106484A1 (en) * | 2004-04-30 | 2005-11-10 | Bayer Healthcare Ag | Diagnostics and therapeutics for diseases associated with melanocortin 1 receptor (mc1r) |
| US9274099B2 (en) | 2005-07-22 | 2016-03-01 | The Board Of Trustees Of The Leland Stanford Junior University | Screening test drugs to identify their effects on cell membrane voltage-gated ion channel |
| US10052497B2 (en) | 2005-07-22 | 2018-08-21 | The Board Of Trustees Of The Leland Stanford Junior University | System for optical stimulation of target cells |
| EP2465925A1 (en) * | 2005-07-22 | 2012-06-20 | The Board Of Trustees Of The Leland | Light-activated cation channel and uses thereof |
| US9238150B2 (en) | 2005-07-22 | 2016-01-19 | The Board Of Trustees Of The Leland Stanford Junior University | Optical tissue interface method and apparatus for stimulating cells |
| US8926959B2 (en) | 2005-07-22 | 2015-01-06 | The Board Of Trustees Of The Leland Stanford Junior University | System for optical stimulation of target cells |
| WO2008086470A1 (en) | 2007-01-10 | 2008-07-17 | The Board Of Trustees Of The Leland Stanford Junior University | System for optical stimulation of target cells |
| WO2008101128A1 (en) | 2007-02-14 | 2008-08-21 | The Board Of Trustees Of The Leland Stanford Junior University | System, method and applications involving identification of biological circuits such as neurological characteristics |
| WO2008106694A2 (en) | 2007-03-01 | 2008-09-04 | The Board Of Trustees Of The Leland Stanford Junior University | Systems, methods and compositions for optical stimulation of target cells |
| FR2916977A1 (en) * | 2007-06-06 | 2008-12-12 | Engelhard Lyon Sa | STIMULATION OF SYNTHESIS OF MCR1, MCR2 AND μ OPIOID RECEPTORS. |
| US10035027B2 (en) | 2007-10-31 | 2018-07-31 | The Board Of Trustees Of The Leland Stanford Junior University | Device and method for ultrasonic neuromodulation via stereotactic frame based technique |
| US10434327B2 (en) | 2007-10-31 | 2019-10-08 | The Board Of Trustees Of The Leland Stanford Junior University | Implantable optical stimulators |
| JP5801188B2 (en) | 2008-04-23 | 2015-10-28 | The Board Of Trustees Of The Leland Stanford Junior University | Systems, methods, and compositions for photostimulating target cells |
| EP2857519B1 (en) | 2008-05-29 | 2016-09-14 | The Board of Trustees of the Leland Stanford Junior University | Cell line, system and method for optical control of secondary messengers |
| EP3192562B1 (en) | 2008-06-17 | 2020-03-04 | The Board of Trustees of the Leland Stanford Junior University | Devices for optical stimulation of target cells using an optical transmission element |
| US10711242B2 (en) | 2008-06-17 | 2020-07-14 | The Board Of Trustees Of The Leland Stanford Junior University | Apparatus and methods for controlling cellular development |
| US9101759B2 (en) | 2008-07-08 | 2015-08-11 | The Board Of Trustees Of The Leland Stanford Junior University | Materials and approaches for optical stimulation of the peripheral nervous system |
| NZ602416A (en) | 2008-11-14 | 2014-08-29 | Univ Leland Stanford Junior | Optically-based stimulation of target cells and modifications thereto |
| CA2781402C (en) | 2009-11-23 | 2017-03-21 | Palatin Technologies, Inc. | Melanocortin-1 receptor-specific cyclic peptides |
| CA2781405A1 (en) | 2009-11-23 | 2011-05-26 | Palatin Technologies, Inc. | Melanocortin-1 receptor-specific linear peptides |
| SG10201505162QA (en) | 2010-03-17 | 2015-08-28 | Univ Leland Stanford Junior | Light-sensitive ion-passing molecules |
| US9634855B2 (en) | 2010-05-13 | 2017-04-25 | Alexander Poltorak | Electronic personal interactive device that determines topics of interest using a conversational agent |
| CN103384469B (en) | 2010-11-05 | 2016-06-15 | The Board Of Trustees Of The Leland Stanford Junior University | Light-controlled CNS dysfunction |
| EP2635108B1 (en) | 2010-11-05 | 2019-01-23 | The Board of Trustees of the Leland Stanford Junior University | Light-activated chimeric opsins and methods of using the same |
| JP6002140B2 (en) | 2010-11-05 | 2016-10-05 | The Board Of Trustees Of The Leland Stanford Junior University | Stabilized step function opsin protein and method of use thereof |
| CN106376525A (en) | 2010-11-05 | 2017-02-08 | The Board Of Trustees Of The Leland Stanford Junior University | Control and characterization of memory function |
| AU2011323235B2 (en) | 2010-11-05 | 2015-10-29 | The Board Of Trustees Of The Leland Stanford Junior University | Optogenetic control of reward-related behaviors |
| CN103313752B (en) | 2010-11-05 | 2016-10-19 | The Board Of Trustees Of The Leland Stanford Junior University | Upconversion of Light for Optogenetic Approaches |
| US8696722B2 (en) | 2010-11-22 | 2014-04-15 | The Board Of Trustees Of The Leland Stanford Junior University | Optogenetic magnetic resonance imaging |
| CN107936097A (en) | 2011-12-16 | 2018-04-20 | The Board Of Trustees Of The Leland Stanford Junior University | Opsin polypeptide and its application method |
| CN104363961B (en) | 2012-02-21 | 2017-10-03 | The Board Of Trustees Of The Leland Stanford Junior University | Composition and method for treating pelvic floor neurogenic disorders |
| US9636380B2 (en) | 2013-03-15 | 2017-05-02 | The Board Of Trustees Of The Leland Stanford Junior University | Optogenetic control of inputs to the ventral tegmental area |
| ES2742492T3 (en) | 2013-03-15 | 2020-02-14 | Univ Leland Stanford Junior | Optogenetic control of behavioral status |
| US10220092B2 (en) | 2013-04-29 | 2019-03-05 | The Board Of Trustees Of The Leland Stanford Junior University | Devices, systems and methods for optogenetic modulation of action potentials in target cells |
| US10307609B2 (en) | 2013-08-14 | 2019-06-04 | The Board Of Trustees Of The Leland Stanford Junior University | Compositions and methods for controlling pain |
| KR102726294B1 (en) | 2014-01-31 | 2024-11-06 | Factor Bioscience Inc. | Methods and products for nucleic acid production and delivery |
| EP3543339A1 (en) | 2015-02-13 | 2019-09-25 | Factor Bioscience Inc. | Nucleic acid products and methods of administration thereof |
| WO2016209654A1 (en) | 2015-06-22 | 2016-12-29 | The Board Of Trustees Of The Leland Stanford Junior University | Methods and devices for imaging and/or optogenetic control of light-responsive neurons |
| JP2019528284A (en) | 2016-08-17 | 2019-10-10 | Factor Bioscience Inc. | Nucleic acid product and method of administration thereof |
| US11294165B2 (en) | 2017-03-30 | 2022-04-05 | The Board Of Trustees Of The Leland Stanford Junior University | Modular, electro-optical device for increasing the imaging field of view using time-sequential capture |
| CN114450265B (en) | 2019-07-03 | 2024-12-24 | Factor Bioscience Inc. | Cationic lipids and their uses |
| US10501404B1 (en) | 2019-07-30 | 2019-12-10 | Factor Bioscience Inc. | Cationic lipids and transfection methods |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5874283A (en) * | 1995-05-30 | 1999-02-23 | John Joseph Harrington | Mammalian flap-specific endonuclease |
| EP1140968B9 (en) * | 1998-12-23 | 2007-05-09 | Merck & Co., Inc. | Dna molecules encoding splice variants of the human melanocortin 1 receptor protein |
2002
- 2002-04-01: US, application US10/112,372, publication US20030186249A1, status Abandoned
- 2002-06-10: US, application US10/164,717, publication US7115393B2, status Expired - Lifetime
- 2002-06-13: US, application US10/167,631, publication US20030232339A1, status Abandoned
- 2002-06-24: US, application US10/177,917, publication US20030235826A1, status Abandoned

2003
- 2003-04-01: AU, application AU2003218483A, publication AU2003218483A1, status Abandoned
- 2003-04-01: WO, application PCT/US2003/009921, publication WO2003085095A2, status Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| WO2003085095A3 (en) | 2005-07-21 |
| AU2003218483A8 (en) | 2003-10-20 |
| WO2003085095A2 (en) | 2003-10-16 |
| US20030232339A1 (en) | 2003-12-18 |
| AU2003218483A1 (en) | 2003-10-20 |
| US7115393B2 (en) | 2006-10-03 |
| US20030228658A1 (en) | 2003-12-11 |
| US20030235826A1 (en) | 2003-12-25 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| US20030186249A1 (en) | Human TARPP genes and polypeptides | |
| US20040249144A1 (en) | Regulated breast cancer genes | |
| US20050069886A1 (en) | Prostate cancer genes | |
| WO2003063773A2 (en) | Differentially-regulated prostate cancer genes | |
| WO2002081638A2 (en) | Prostate cancer expression profiles | |
| US6455292B1 (en) | Full-length serine protein kinase in brain and pancreas | |
| US20050055733A1 (en) | Small intestine and colon genes | |
| US20060241015A1 (en) | Cancer genes | |
| US6635481B1 (en) | Tbx3 gene and methods of using it | |
| US20050120393A1 (en) | Full-length prostate selective polynucleotides and polypeptides | |
| US20030219748A1 (en) | Regulated prostate cancer genes | |
| US6780595B2 (en) | Human Tbx20 gene and uses | |
| US20030078199A1 (en) | Human EphA6 gene and polypeptide | |
| US20030148334A1 (en) | Differentially-expressed genes and polypeptides in angiogenesis | |
| US20050106579A1 (en) | Regulated angiogenesis genes and polypeptides | |
| US20030170639A1 (en) | Liver transmembrane protein gene | |
| US7053193B2 (en) | Breast cancer transcription factor gene and uses | |
| US20030180728A1 (en) | Human BCU399 gene, polypeptide, and uses | |
| US20030082548A1 (en) | Brain selective transmembrane receptor gene | |
| US20030190625A1 (en) | Human kidins220Pc | |
| WO2003066831A2 (en) | Angiogenesis genes | |
| US6953673B2 (en) | Histamine H2 receptor and uses | |
| US20030148407A1 (en) | Human dehydrogenase gene and polypeptide | |
| US20040248116A1 (en) | Prostate cancer expression profiles | |
| US20030215809A1 (en) | Regulated breast cancer genes |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AS | Assignment | Owner name: ORIGENE TECHNOLOGIES, INC., MARYLAND. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: SUN, ZAIREN; FAN, WUFANG; KOVACS, KARL F., IV; AND OTHERS; REEL/FRAME: 015889/0232; SIGNING DATES FROM 20020328 TO 20021014 |
| | STCB | Information on status: application discontinuation | Free format text: ABANDONMENT FOR FAILURE TO CORRECT DRAWINGS/OATH/NONPUB REQUEST |