MXPA99004356A - Human proteins having transmembrane domains and dnas encoding these proteins - Google Patents
Human proteins having transmembrane domains and dnas encoding these proteinsInfo
- Publication number
- MXPA99004356A MXPA99004356A MXPA/A/1999/004356A MX9904356A MXPA99004356A MX PA99004356 A MXPA99004356 A MX PA99004356A MX 9904356 A MX9904356 A MX 9904356A MX PA99004356 A MXPA99004356 A MX PA99004356A
- Authority
- MX
- Mexico
- Prior art keywords
- leu
- gly
- val
- ser
- wing
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 472
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 447
- 108090000144 Human Proteins Proteins 0.000 title abstract description 19
- 102000003839 Human Proteins Human genes 0.000 title abstract description 19
- 108020004414 DNA Proteins 0.000 claims abstract description 38
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 2
- 239000002299 complementary DNA Substances 0.000 claims description 185
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 13
- 241000282414 Homo sapiens Species 0.000 abstract description 118
- 108020004635 Complementary DNA Proteins 0.000 abstract description 13
- 235000018102 proteins Nutrition 0.000 description 439
- 210000004027 cell Anatomy 0.000 description 181
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 124
- 230000014616 translation Effects 0.000 description 104
- 238000013519 translation Methods 0.000 description 103
- 108700026244 Open Reading Frames Proteins 0.000 description 88
- 150000001413 amino acids Chemical class 0.000 description 80
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 78
- 238000000034 method Methods 0.000 description 77
- 241000894007 species Species 0.000 description 69
- 125000000539 amino acid group Chemical group 0.000 description 64
- 230000000694 effects Effects 0.000 description 59
- 108020004999 messenger RNA Proteins 0.000 description 47
- 102000039446 nucleic acids Human genes 0.000 description 46
- 108020004707 nucleic acids Proteins 0.000 description 46
- 150000007523 nucleic acids Chemical class 0.000 description 46
- 108010050848 glycylleucine Proteins 0.000 description 45
- 210000001519 tissue Anatomy 0.000 description 41
- 230000001413 cellular effect Effects 0.000 description 40
- 238000012512 characterization method Methods 0.000 description 39
- 230000015572 biosynthetic process Effects 0.000 description 38
- 102000036639 antigens Human genes 0.000 description 36
- 108091007433 antigens Proteins 0.000 description 36
- 239000000427 antigen Substances 0.000 description 35
- 238000011282 treatment Methods 0.000 description 35
- 210000001744 T-lymphocyte Anatomy 0.000 description 34
- 239000012634 fragment Substances 0.000 description 34
- 238000000338 in vitro Methods 0.000 description 34
- 241000880493 Leptailurus serval Species 0.000 description 33
- 108010052285 Membrane Proteins Proteins 0.000 description 32
- 239000000047 product Substances 0.000 description 31
- 235000001014 amino acid Nutrition 0.000 description 30
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 26
- 239000000243 solution Substances 0.000 description 26
- 208000005718 Stomach Neoplasms Diseases 0.000 description 25
- 206010017758 gastric cancer Diseases 0.000 description 25
- 201000011549 stomach cancer Diseases 0.000 description 25
- 102000018697 Membrane Proteins Human genes 0.000 description 24
- 238000003556 assay Methods 0.000 description 24
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 24
- 230000006870 function Effects 0.000 description 24
- 239000003446 ligand Substances 0.000 description 24
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 22
- 102000005962 receptors Human genes 0.000 description 21
- 108020003175 receptors Proteins 0.000 description 21
- 201000008808 Fibrosarcoma Diseases 0.000 description 20
- 239000012528 membrane Substances 0.000 description 20
- 210000004379 membrane Anatomy 0.000 description 19
- 108090000765 processed proteins & peptides Proteins 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 18
- 201000010099 disease Diseases 0.000 description 18
- 239000013604 expression vector Substances 0.000 description 18
- 210000002435 tendon Anatomy 0.000 description 18
- 102000003990 Urokinase-type plasminogen activator Human genes 0.000 description 17
- 108090000435 Urokinase-type plasminogen activator Proteins 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 17
- 206010028980 Neoplasm Diseases 0.000 description 16
- 108010076504 Protein Sorting Signals Proteins 0.000 description 16
- 230000012010 growth Effects 0.000 description 16
- 210000003041 ligament Anatomy 0.000 description 16
- 101100221606 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) COS7 gene Proteins 0.000 description 15
- 206010041823 squamous cell carcinoma Diseases 0.000 description 15
- 229960005356 urokinase Drugs 0.000 description 15
- 102000004127 Cytokines Human genes 0.000 description 14
- 108090000695 Cytokines Proteins 0.000 description 14
- 206010025323 Lymphomas Diseases 0.000 description 14
- 230000028993 immune response Effects 0.000 description 14
- 210000004185 liver Anatomy 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 13
- 229940079593 drug Drugs 0.000 description 13
- 239000003814 drug Substances 0.000 description 13
- 230000003993 interaction Effects 0.000 description 13
- 230000035755 proliferation Effects 0.000 description 13
- 230000000903 blocking effect Effects 0.000 description 12
- 230000003248 secreting effect Effects 0.000 description 12
- 108010026333 seryl-proline Proteins 0.000 description 12
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 11
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 11
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 11
- 230000004069 differentiation Effects 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 11
- 239000001963 growth medium Substances 0.000 description 11
- 102000040430 polynucleotide Human genes 0.000 description 11
- 108091033319 polynucleotide Proteins 0.000 description 11
- 239000002157 polynucleotide Substances 0.000 description 11
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- 108010073969 valyllysine Proteins 0.000 description 11
- 239000013598 vector Substances 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 10
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 10
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 10
- 108010013835 arginine glutamate Proteins 0.000 description 10
- 238000003745 diagnosis Methods 0.000 description 10
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 10
- 108010077515 glycylproline Proteins 0.000 description 10
- 230000001900 immune effect Effects 0.000 description 10
- 230000002401 inhibitory effect Effects 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 108010077112 prolyl-proline Proteins 0.000 description 10
- 238000010361 transduction Methods 0.000 description 10
- 230000026683 transduction Effects 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- 108091081024 Start codon Proteins 0.000 description 9
- 210000000988 bone and bone Anatomy 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 210000004698 lymphocyte Anatomy 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 201000008968 osteosarcoma Diseases 0.000 description 9
- 108010051242 phenylalanylserine Proteins 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 230000002265 prevention Effects 0.000 description 9
- 108010079317 prolyl-tyrosine Proteins 0.000 description 9
- 210000004881 tumor cell Anatomy 0.000 description 9
- 102100026445 A-kinase anchor protein 17A Human genes 0.000 description 8
- 108010078791 Carrier Proteins Proteins 0.000 description 8
- 101000718019 Homo sapiens A-kinase anchor protein 17A Proteins 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 8
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 8
- 108091034057 RNA (poly(A)) Proteins 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 8
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 210000003719 b-lymphocyte Anatomy 0.000 description 8
- 239000003153 chemical reaction reagent Substances 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 230000006378 damage Effects 0.000 description 8
- 102000037865 fusion proteins Human genes 0.000 description 8
- 108020001507 fusion proteins Proteins 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010081551 glycylphenylalanine Proteins 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 230000006698 induction Effects 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 230000028327 secretion Effects 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 7
- 241000282412 Homo Species 0.000 description 7
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 7
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 7
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 7
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 7
- 230000005856 abnormality Effects 0.000 description 7
- 230000001419 dependent effect Effects 0.000 description 7
- 108010054813 diprotin B Proteins 0.000 description 7
- 108010085325 histidylproline Proteins 0.000 description 7
- 230000002209 hydrophobic effect Effects 0.000 description 7
- 230000001939 inductive effect Effects 0.000 description 7
- 108010017391 lysylvaline Proteins 0.000 description 7
- 238000005259 measurement Methods 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 230000008439 repair process Effects 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 208000023275 Autoimmune disease Diseases 0.000 description 6
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 6
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 6
- 101000596234 Homo sapiens T-cell surface protein tactile Proteins 0.000 description 6
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 6
- 108010004250 Inhibins Proteins 0.000 description 6
- 102000002746 Inhibins Human genes 0.000 description 6
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 6
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 6
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 6
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 6
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 6
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 6
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 6
- 102100035268 T-cell surface protein tactile Human genes 0.000 description 6
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 6
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 6
- 230000004913 activation Effects 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 230000003399 chemotactic effect Effects 0.000 description 6
- 230000007547 defect Effects 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 6
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 6
- 210000005260 human cell Anatomy 0.000 description 6
- 238000001727 in vivo Methods 0.000 description 6
- 239000000893 inhibin Substances 0.000 description 6
- 239000003112 inhibitor Substances 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 230000036961 partial effect Effects 0.000 description 6
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 210000000130 stem cell Anatomy 0.000 description 6
- 230000000638 stimulation Effects 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- 108010020532 tyrosyl-proline Proteins 0.000 description 6
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 6
- 102000005606 Activins Human genes 0.000 description 5
- 108010059616 Activins Proteins 0.000 description 5
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 5
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 5
- 101710181478 Envelope glycoprotein GP350 Proteins 0.000 description 5
- 102000012673 Follicle Stimulating Hormone Human genes 0.000 description 5
- 108010079345 Follicle Stimulating Hormone Proteins 0.000 description 5
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 5
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 5
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 5
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 5
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 5
- 208000009329 Graft vs Host Disease Diseases 0.000 description 5
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 5
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 5
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 5
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 5
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 5
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 5
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 5
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 5
- 108091054437 MHC class I family Proteins 0.000 description 5
- 108010065395 Neuropep-1 Proteins 0.000 description 5
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 5
- 238000000636 Northern blotting Methods 0.000 description 5
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 5
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 5
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 5
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 5
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 5
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 5
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 5
- 208000027418 Wounds and injury Diseases 0.000 description 5
- 239000000488 activin Substances 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010068380 arginylarginine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 210000000845 cartilage Anatomy 0.000 description 5
- 230000007812 deficiency Effects 0.000 description 5
- 208000035475 disorder Diseases 0.000 description 5
- 229940028334 follicle stimulating hormone Drugs 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010087823 glycyltyrosine Proteins 0.000 description 5
- 208000024908 graft versus host disease Diseases 0.000 description 5
- 239000008187 granular material Substances 0.000 description 5
- 208000014674 injury Diseases 0.000 description 5
- 210000003734 kidney Anatomy 0.000 description 5
- 210000000265 leukocyte Anatomy 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 108010068488 methionylphenylalanine Proteins 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 210000000056 organ Anatomy 0.000 description 5
- 108010073101 phenylalanylleucine Proteins 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 230000001737 promoting effect Effects 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 238000002054 transplantation Methods 0.000 description 5
- 108010038745 tryptophylglycine Proteins 0.000 description 5
- 108010087967 type I signal peptidase Proteins 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 4
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 4
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 4
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 4
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 4
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 4
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 4
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 4
- 101000912618 Homo sapiens C-type lectin domain family 2 member B Proteins 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 4
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 4
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 4
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 4
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 4
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 4
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 4
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 4
- 102000043131 MHC class II family Human genes 0.000 description 4
- 108091054438 MHC class II family Proteins 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 4
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 4
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 241001529936 Murinae Species 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- 102000004868 N-Methyl-D-Aspartate Receptors Human genes 0.000 description 4
- 108090001041 N-Methyl-D-Aspartate Receptors Proteins 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 4
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 4
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 4
- 241000700159 Rattus Species 0.000 description 4
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 4
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 4
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 4
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 4
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 4
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 4
- CJDZKZFMAXGUOJ-IHRRRGAJSA-N Val-Cys-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CJDZKZFMAXGUOJ-IHRRRGAJSA-N 0.000 description 4
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 4
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 4
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 4
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 4
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 206010052428 Wound Diseases 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000024245 cell differentiation Effects 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 238000002512 chemotherapy Methods 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 238000012869 ethanol precipitation Methods 0.000 description 4
- 230000035558 fertility Effects 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 230000003394 haemopoietic effect Effects 0.000 description 4
- 230000035876 healing Effects 0.000 description 4
- 230000011132 hemopoiesis Effects 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 230000001506 immunosuppresive effect Effects 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 238000000099 in vitro assay Methods 0.000 description 4
- 208000015181 infectious disease Diseases 0.000 description 4
- 230000004054 inflammatory process Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 238000011835 investigation Methods 0.000 description 4
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 4
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 4
- 230000007774 longterm Effects 0.000 description 4
- 239000011259 mixed solution Substances 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 125000003729 nucleotide group Chemical group 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 230000008929 regeneration Effects 0.000 description 4
- 238000011069 regeneration method Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 210000003491 skin Anatomy 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000009885 systemic effect Effects 0.000 description 4
- 230000008733 trauma Effects 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 230000004614 tumor growth Effects 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 3
- OXXJZDJLYSMGIQ-ZRDIBKRKSA-N 8-[2-[(e)-3-hydroxypent-1-enyl]-5-oxocyclopent-3-en-1-yl]octanoic acid Chemical compound CCC(O)\C=C\C1C=CC(=O)C1CCCCCCCC(O)=O OXXJZDJLYSMGIQ-ZRDIBKRKSA-N 0.000 description 3
- 108091006112 ATPases Proteins 0.000 description 3
- 102000057290 Adenosine Triphosphatases Human genes 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- 101100477846 Arabidopsis thaliana SNE gene Proteins 0.000 description 3
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 3
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 3
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 3
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 3
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 3
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 3
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 3
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 3
- 108700007413 Bos taurus DMBT1 Proteins 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 3
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 3
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 3
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 3
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 102100021468 Equilibrative nucleoside transporter 2 Human genes 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 3
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 3
- SIGGQAHUPUBWNF-BQBZGAKWSA-N Gln-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O SIGGQAHUPUBWNF-BQBZGAKWSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 3
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 3
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 3
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 3
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 3
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 3
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 3
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 3
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 3
- 101000822017 Homo sapiens Equilibrative nucleoside transporter 2 Proteins 0.000 description 3
- 101001057699 Homo sapiens Inorganic pyrophosphatase Proteins 0.000 description 3
- 206010062016 Immunosuppression Diseases 0.000 description 3
- 102100027050 Inorganic pyrophosphatase Human genes 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 3
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 3
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 3
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 3
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 3
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 3
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 3
- OAPNERBWQWUPTI-YUMQZZPRSA-N Lys-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O OAPNERBWQWUPTI-YUMQZZPRSA-N 0.000 description 3
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 3
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 3
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 3
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 3
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 3
- 102000043129 MHC class I family Human genes 0.000 description 3
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 3
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 3
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 3
- BJFJQOMZCSHBMY-YUMQZZPRSA-N Met-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O BJFJQOMZCSHBMY-YUMQZZPRSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 102000007999 Nuclear Proteins Human genes 0.000 description 3
- 108010089610 Nuclear Proteins Proteins 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 3
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 3
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 3
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 3
- 102000016202 Proteolipids Human genes 0.000 description 3
- 108010010974 Proteolipids Proteins 0.000 description 3
- 101100365175 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SEC22 gene Proteins 0.000 description 3
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 3
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 3
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 3
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 3
- 206010051379 Systemic Inflammatory Response Syndrome Diseases 0.000 description 3
- 230000005867 T cell response Effects 0.000 description 3
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 3
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 3
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 description 3
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 3
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 3
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 3
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 3
- MFEVVAXTBZELLL-GGVZMXCHSA-N Tyr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MFEVVAXTBZELLL-GGVZMXCHSA-N 0.000 description 3
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 3
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 3
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 3
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 3
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 3
- 240000004922 Vigna radiata Species 0.000 description 3
- 235000010721 Vigna radiata var radiata Nutrition 0.000 description 3
- 235000011469 Vigna radiata var sublobata Nutrition 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 230000003110 anti-inflammatory effect Effects 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 230000000747 cardiac effect Effects 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 230000001659 chemokinetic effect Effects 0.000 description 3
- 230000035605 chemotaxis Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 230000001332 colony forming effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- RDYMFSUJUZBWLH-UHFFFAOYSA-N endosulfan Chemical compound C12COS(=O)OCC2C2(Cl)C(Cl)=C(Cl)C1(Cl)C2(Cl)Cl RDYMFSUJUZBWLH-UHFFFAOYSA-N 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 210000002865 immune cell Anatomy 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 230000033001 locomotion Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000035800 maturation Effects 0.000 description 3
- 108010034507 methionyltryptophan Proteins 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 210000001616 monocyte Anatomy 0.000 description 3
- 210000000944 nerve tissue Anatomy 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- 239000001103 potassium chloride Substances 0.000 description 3
- 235000011164 potassium chloride Nutrition 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 108010078070 scavenger receptors Proteins 0.000 description 3
- 102000014452 scavenger receptors Human genes 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 230000004936 stimulating effect Effects 0.000 description 3
- 230000008093 supporting effect Effects 0.000 description 3
- 230000001629 suppression Effects 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 230000002537 thrombolytic effect Effects 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 2
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 2
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 2
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 2
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 2
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 2
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 2
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 2
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- QXNGSPZMGFEZNO-QRTARXTBSA-N Asn-Val-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QXNGSPZMGFEZNO-QRTARXTBSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 241000282465 Canis Species 0.000 description 2
- 108010078140 Cation Transport Proteins Proteins 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- 102000019034 Chemokines Human genes 0.000 description 2
- 108010012236 Chemokines Proteins 0.000 description 2
- 206010010356 Congenital anomaly Diseases 0.000 description 2
- 241000557626 Corvus corax Species 0.000 description 2
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 2
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 2
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 2
- BUXAPSQPMALTOY-WHFBIAKZSA-N Cys-Glu Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BUXAPSQPMALTOY-WHFBIAKZSA-N 0.000 description 2
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 2
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 2
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 2
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 2
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 2
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N Cysteine Chemical compound SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 102100025137 Early activation antigen CD69 Human genes 0.000 description 2
- 102000009123 Fibrin Human genes 0.000 description 2
- 108010073385 Fibrin Proteins 0.000 description 2
- BWGVNKXGVNDBDI-UHFFFAOYSA-N Fibrin monomer Chemical compound CNC(=O)CNC(=O)CN BWGVNKXGVNDBDI-UHFFFAOYSA-N 0.000 description 2
- 206010017533 Fungal infection Diseases 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- SSHIXEILTLPAQT-WHFBIAKZSA-N Gln-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSHIXEILTLPAQT-WHFBIAKZSA-N 0.000 description 2
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- OWOFCNWTMWOOJJ-WDSKDSINSA-N Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OWOFCNWTMWOOJJ-WDSKDSINSA-N 0.000 description 2
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 2
- ARPVSMCNIDAQBO-YUMQZZPRSA-N Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ARPVSMCNIDAQBO-YUMQZZPRSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- CLSDNFWKGFJIBZ-YUMQZZPRSA-N Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O CLSDNFWKGFJIBZ-YUMQZZPRSA-N 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 2
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 2
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- MPZWMIIOPAPAKE-BQBZGAKWSA-N Glu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MPZWMIIOPAPAKE-BQBZGAKWSA-N 0.000 description 2
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 2
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 2
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 2
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 2
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- 102100029966 HLA class II histocompatibility antigen, DP alpha 1 chain Human genes 0.000 description 2
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 2
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 2
- QQQHYJFKDLDUNK-CIUDSAMLSA-N His-Asp-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QQQHYJFKDLDUNK-CIUDSAMLSA-N 0.000 description 2
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 2
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- 102000008949 Histocompatibility Antigens Class I Human genes 0.000 description 2
- 101000934374 Homo sapiens Early activation antigen CD69 Proteins 0.000 description 2
- 101000864089 Homo sapiens HLA class II histocompatibility antigen, DP alpha 1 chain Proteins 0.000 description 2
- 101000930802 Homo sapiens HLA class II histocompatibility antigen, DQ alpha 1 chain Proteins 0.000 description 2
- 101000968032 Homo sapiens HLA class II histocompatibility antigen, DR beta 3 chain Proteins 0.000 description 2
- 206010061218 Inflammation Diseases 0.000 description 2
- 102000004310 Ion Channels Human genes 0.000 description 2
- 108090000862 Ion Channels Proteins 0.000 description 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 2
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 2
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 2
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 2
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 2
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 2
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 2
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 2
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 2
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 2
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- XBZOQGHZGQLEQO-IUCAKERBSA-N Lys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN XBZOQGHZGQLEQO-IUCAKERBSA-N 0.000 description 2
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 2
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- DRINJBAHUGXNFC-DCAQKATOSA-N Met-Asp-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O DRINJBAHUGXNFC-DCAQKATOSA-N 0.000 description 2
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 2
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 2
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 2
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 2
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 2
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 2
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 2
- 208000031888 Mycoses Diseases 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 208000016113 North Carolina macular dystrophy Diseases 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 2
- KLAONOISLHWJEE-QWRGUYRKSA-N Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KLAONOISLHWJEE-QWRGUYRKSA-N 0.000 description 2
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 2
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 2
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 2
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 2
- RFCVXVPWSPOMFJ-STQMWFEESA-N Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-STQMWFEESA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 2
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 2
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 2
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 2
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 2
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 2
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108010031271 Saccharomyces cerevisiae Proteins Proteins 0.000 description 2
- 102000040739 Secretory proteins Human genes 0.000 description 2
- 108091058545 Secretory proteins Proteins 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- UQTNIFUCMBFWEJ-IWGUZYHVSA-N Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-IWGUZYHVSA-N 0.000 description 2
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 2
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 2
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 2
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 2
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 2
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 2
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- 208000007536 Thrombosis Diseases 0.000 description 2
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 2
- UYKREHOKELZSPB-JTQLQIEISA-N Trp-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(O)=O)=CNC2=C1 UYKREHOKELZSPB-JTQLQIEISA-N 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 2
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 2
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 2
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 2
- IMMPMHKLUUZKAZ-WMZOPIPTSA-N Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 IMMPMHKLUUZKAZ-WMZOPIPTSA-N 0.000 description 2
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 2
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 2
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 2
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 2
- FGJWNBBFAUHBEP-IHPCNDPISA-N Tyr-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FGJWNBBFAUHBEP-IHPCNDPISA-N 0.000 description 2
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 2
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 2
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 2
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 2
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 2
- 208000025865 Ulcer Diseases 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 239000000556 agonist Substances 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 229940030225 antihemorrhagics Drugs 0.000 description 2
- 230000006907 apoptotic process Effects 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 208000006673 asthma Diseases 0.000 description 2
- 230000001363 autoimmune Effects 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 230000003915 cell function Effects 0.000 description 2
- 238000001516 cell proliferation assay Methods 0.000 description 2
- 210000003169 central nervous system Anatomy 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 238000002316 cosmetic surgery Methods 0.000 description 2
- 230000016396 cytokine production Effects 0.000 description 2
- 238000004163 cytometry Methods 0.000 description 2
- 210000004443 dendritic cell Anatomy 0.000 description 2
- 206010012601 diabetes mellitus Diseases 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 210000003038 endothelium Anatomy 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 230000029142 excretion Effects 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 229950003499 fibrin Drugs 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 102000035122 glycosylated proteins Human genes 0.000 description 2
- 108091005608 glycosylated proteins Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 230000000025 haemostatic effect Effects 0.000 description 2
- 230000002439 hemostatic effect Effects 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 239000012678 infectious agent Substances 0.000 description 2
- 230000028709 inflammatory response Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 230000008611 intercellular interaction Effects 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 206010025135 lupus erythematosus Diseases 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 230000003228 microsomal effect Effects 0.000 description 2
- 238000007799 mixed lymphocyte reaction assay Methods 0.000 description 2
- 210000000066 myeloid cell Anatomy 0.000 description 2
- 210000005036 nerve Anatomy 0.000 description 2
- 230000000926 neurological effect Effects 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 201000001119 neuropathy Diseases 0.000 description 2
- 230000007823 neuropathy Effects 0.000 description 2
- 210000000440 neutrophil Anatomy 0.000 description 2
- 235000016709 nutrition Nutrition 0.000 description 2
- QYSGYZVSCZSLHT-UHFFFAOYSA-N octafluoropropane Chemical compound FC(F)(F)C(F)(F)C(F)(F)F QYSGYZVSCZSLHT-UHFFFAOYSA-N 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 210000001428 peripheral nervous system Anatomy 0.000 description 2
- 208000033808 peripheral neuropathy Diseases 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 210000001995 reticulocyte Anatomy 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000017423 tissue regeneration Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108091005703 transmembrane proteins Proteins 0.000 description 2
- 102000035160 transmembrane proteins Human genes 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 230000000472 traumatic effect Effects 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 231100000397 ulcer Toxicity 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 230000029663 wound healing Effects 0.000 description 2
- UBWXUGDQUBIEIZ-UHFFFAOYSA-N (13-methyl-3-oxo-2,6,7,8,9,10,11,12,14,15,16,17-dodecahydro-1h-cyclopenta[a]phenanthren-17-yl) 3-phenylpropanoate Chemical compound CC12CCC(C3CCC(=O)C=C3CC3)C3C1CCC2OC(=O)CCC1=CC=CC=C1 UBWXUGDQUBIEIZ-UHFFFAOYSA-N 0.000 description 1
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- OPCHFPHZPIURNA-MFERNQICSA-N (2s)-2,5-bis(3-aminopropylamino)-n-[2-(dioctadecylamino)acetyl]pentanamide Chemical compound CCCCCCCCCCCCCCCCCCN(CC(=O)NC(=O)[C@H](CCCNCCCN)NCCCN)CCCCCCCCCCCCCCCCCC OPCHFPHZPIURNA-MFERNQICSA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- JFOWDKWFHZIMTR-RUCXOUQFSA-N (2s)-2-aminopentanedioic acid;(2s)-2,5-diamino-5-oxopentanoic acid Chemical compound OC(=O)[C@@H](N)CCC(N)=O.OC(=O)[C@@H](N)CCC(O)=O JFOWDKWFHZIMTR-RUCXOUQFSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- QVOBNSFUVPLVPE-ROUUACIJSA-N 2-[[(2s)-2-[[2-[[(2s)-2-amino-3-phenylpropanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 QVOBNSFUVPLVPE-ROUUACIJSA-N 0.000 description 1
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- 241000014654 Adna Species 0.000 description 1
- 101100178313 Aedes aegypti HP-I gene Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 206010001488 Aggression Diseases 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 206010027654 Allergic conditions Diseases 0.000 description 1
- 102000040717 Alpha family Human genes 0.000 description 1
- 108091071248 Alpha family Proteins 0.000 description 1
- 102100022524 Alpha-1-antichymotrypsin Human genes 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 206010002198 Anaphylactic reaction Diseases 0.000 description 1
- 208000032467 Aplastic anaemia Diseases 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- JSLGXODUIAFWCF-WDSKDSINSA-N Arg-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O JSLGXODUIAFWCF-WDSKDSINSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- OSASDIVHOSJVII-WDSKDSINSA-N Arg-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N OSASDIVHOSJVII-WDSKDSINSA-N 0.000 description 1
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 1
- YHSNASXGBPAHRL-BPUTZDHNSA-N Arg-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N YHSNASXGBPAHRL-BPUTZDHNSA-N 0.000 description 1
- QQJSJIBESHAJPM-IHRRRGAJSA-N Arg-Cys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QQJSJIBESHAJPM-IHRRRGAJSA-N 0.000 description 1
- PMGDADKJMCOXHX-BQBZGAKWSA-N Arg-Gln Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PMGDADKJMCOXHX-BQBZGAKWSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- XKDYWGLNSCNRGW-WDSOQIARSA-N Arg-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)CCCCN)C(O)=O)=CNC2=C1 XKDYWGLNSCNRGW-WDSOQIARSA-N 0.000 description 1
- ROWCTNFEMKOIFQ-YUMQZZPRSA-N Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N ROWCTNFEMKOIFQ-YUMQZZPRSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- PYDIIVKGTBRIEL-SZMVWBNQSA-N Arg-Trp-Pro Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(O)=O PYDIIVKGTBRIEL-SZMVWBNQSA-N 0.000 description 1
- WAEWODAAWLGLMK-OYDLWJJNSA-N Arg-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WAEWODAAWLGLMK-OYDLWJJNSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- NPDLYUOYAGBHFB-WDSKDSINSA-N Asn-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NPDLYUOYAGBHFB-WDSKDSINSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- TWXZVVXRRRRSLT-IMJSIDKUSA-N Asn-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O TWXZVVXRRRRSLT-IMJSIDKUSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- FJIRXKVEDFLLOQ-SRVKXCTJSA-N Asn-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N FJIRXKVEDFLLOQ-SRVKXCTJSA-N 0.000 description 1
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- SNAKIVFVLVUCKB-UHFFFAOYSA-N Asn-Glu-Ala-Lys Natural products NCCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(N)CC(N)=O SNAKIVFVLVUCKB-UHFFFAOYSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- IQTUDDBANZYMAR-WDSKDSINSA-N Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O IQTUDDBANZYMAR-WDSKDSINSA-N 0.000 description 1
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 1
- KTDWFWNZLLFEFU-KKUMJFAQSA-N Asn-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KTDWFWNZLLFEFU-KKUMJFAQSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- ICTXFVKYAGQURS-UBHSHLNASA-N Asp-Asn-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ICTXFVKYAGQURS-UBHSHLNASA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- WYOSXGYAKZQPGF-SRVKXCTJSA-N Asp-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N WYOSXGYAKZQPGF-SRVKXCTJSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- OOXKFYNWRVGYFM-XIRDDKMYSA-N Asp-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC(=O)O)N OOXKFYNWRVGYFM-XIRDDKMYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- DWOSGXZMLQNDBN-FXQIFTODSA-N Asp-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O DWOSGXZMLQNDBN-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- NVMMUAUTQCWYHD-ABHRYQDASA-N Asp-Val-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 NVMMUAUTQCWYHD-ABHRYQDASA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 241000726103 Atta Species 0.000 description 1
- 208000034172 Autoimmune Experimental Myasthenia Gravis Diseases 0.000 description 1
- 208000030767 Autoimmune encephalitis Diseases 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- 102100027314 Beta-2-microglobulin Human genes 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 208000010392 Bone Fractures Diseases 0.000 description 1
- 206010065553 Bone marrow failure Diseases 0.000 description 1
- 102000001902 CC Chemokines Human genes 0.000 description 1
- 108010040471 CC Chemokines Proteins 0.000 description 1
- 108050006947 CXC Chemokine Proteins 0.000 description 1
- 102000019388 CXC chemokine Human genes 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 206010007134 Candida infections Diseases 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 108010067225 Cell Adhesion Molecules Proteins 0.000 description 1
- 102000016289 Cell Adhesion Molecules Human genes 0.000 description 1
- 108091092236 Chimeric RNA Proteins 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 208000003044 Closed Fractures Diseases 0.000 description 1
- 206010053567 Coagulopathies Diseases 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 206010010099 Combined immunodeficiency Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 208000032170 Congenital Abnormalities Diseases 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 208000011231 Crohn disease Diseases 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 1
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- TULNGKSILXCZQT-IMJSIDKUSA-N Cys-Asp Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O TULNGKSILXCZQT-IMJSIDKUSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- XELISBQUZZAPQK-CIUDSAMLSA-N Cys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XELISBQUZZAPQK-CIUDSAMLSA-N 0.000 description 1
- NXTYATMDWQYLGJ-BQBZGAKWSA-N Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CS NXTYATMDWQYLGJ-BQBZGAKWSA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 1
- KVGPYKUIHZJWGA-BQBZGAKWSA-N Cys-Met-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O KVGPYKUIHZJWGA-BQBZGAKWSA-N 0.000 description 1
- CNBIWHCVAZHRBI-IHRRRGAJSA-N Cys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N CNBIWHCVAZHRBI-IHRRRGAJSA-N 0.000 description 1
- XZFYRXDAULDNFX-UWVGGRQHSA-N Cys-Phe Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UWVGGRQHSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- WYVKPHCYMTWUCW-YUPRTTJUSA-N Cys-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)N)O WYVKPHCYMTWUCW-YUPRTTJUSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- UKHNKRGNFKSHCG-CUJWVEQBSA-N Cys-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N)O UKHNKRGNFKSHCG-CUJWVEQBSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 1
- 206010011968 Decreased immune responsiveness Diseases 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 208000020401 Depressive disease Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 101900063352 Escherichia coli DNA ligase Proteins 0.000 description 1
- 101900234631 Escherichia coli DNA polymerase I Proteins 0.000 description 1
- 208000009386 Experimental Arthritis Diseases 0.000 description 1
- 206010015866 Extravasation Diseases 0.000 description 1
- 206010015943 Eye inflammation Diseases 0.000 description 1
- 108010049003 Fibrinogen Proteins 0.000 description 1
- 102000008946 Fibrinogen Human genes 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- LTLXPHKSQQILNF-CIUDSAMLSA-N Gln-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N LTLXPHKSQQILNF-CIUDSAMLSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- DXJZITDUDUPINW-WHFBIAKZSA-N Gln-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O DXJZITDUDUPINW-WHFBIAKZSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 1
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HKTRDWYCAUTRRL-YUMQZZPRSA-N Glu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 HKTRDWYCAUTRRL-YUMQZZPRSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MFBYPDKTAJXHNI-VKHMYHEASA-N Gly-Cys Chemical compound [NH3+]CC(=O)N[C@@H](CS)C([O-])=O MFBYPDKTAJXHNI-VKHMYHEASA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- YIWFXZNIBQBFHR-LURJTMIESA-N Gly-His Chemical compound [NH3+]CC(=O)N[C@H](C([O-])=O)CC1=CN=CN1 YIWFXZNIBQBFHR-LURJTMIESA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 208000035895 Guillain-Barré syndrome Diseases 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- OOFLZRMKTMLSMH-UHFFFAOYSA-N H4atta Chemical compound OC(=O)CN(CC(O)=O)CC1=CC=CC(C=2N=C(C=C(C=2)C=2C3=CC=CC=C3C=C3C=CC=CC3=2)C=2N=C(CN(CC(O)=O)CC(O)=O)C=CC=2)=N1 OOFLZRMKTMLSMH-UHFFFAOYSA-N 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 102100031547 HLA class II histocompatibility antigen, DO alpha chain Human genes 0.000 description 1
- 208000030836 Hashimoto thyroiditis Diseases 0.000 description 1
- 206010019196 Head injury Diseases 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000009292 Hemophilia A Diseases 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- QQJMARNOLHSJCQ-DCAQKATOSA-N His-Cys-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N QQJMARNOLHSJCQ-DCAQKATOSA-N 0.000 description 1
- IDQKGZWUPVOGPZ-GUBZILKMSA-N His-Cys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IDQKGZWUPVOGPZ-GUBZILKMSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 1
- LNVILFYCPVOHPV-IHPCNDPISA-N His-Trp-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O LNVILFYCPVOHPV-IHPCNDPISA-N 0.000 description 1
- HTOOKGDPMXSJSY-STQMWFEESA-N His-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 HTOOKGDPMXSJSY-STQMWFEESA-N 0.000 description 1
- VLDVBZICYBVQHB-IUCAKERBSA-N His-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 VLDVBZICYBVQHB-IUCAKERBSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 101000924577 Homo sapiens Adenomatous polyposis coli protein Proteins 0.000 description 1
- 101000678026 Homo sapiens Alpha-1-antichymotrypsin Proteins 0.000 description 1
- 101000866278 Homo sapiens HLA class II histocompatibility antigen, DO alpha chain Proteins 0.000 description 1
- 101001010568 Homo sapiens Interleukin-11 Proteins 0.000 description 1
- 101001076408 Homo sapiens Interleukin-6 Proteins 0.000 description 1
- 101001055216 Homo sapiens Interleukin-9 Proteins 0.000 description 1
- 101000957437 Homo sapiens Mitochondrial carnitine/acylcarnitine carrier protein Proteins 0.000 description 1
- 101000658574 Homo sapiens Transmembrane 4 L6 family member 1 Proteins 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 206010061216 Infarction Diseases 0.000 description 1
- 208000022559 Inflammatory bowel disease Diseases 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 1
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- 206010065433 Ligament rupture Diseases 0.000 description 1
- 206010024774 Localised infection Diseases 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- AIPHUKOBUXJNKM-KKUMJFAQSA-N Lys-Cys-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AIPHUKOBUXJNKM-KKUMJFAQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- IGRMTQMIDNDFAA-UWVGGRQHSA-N Lys-His Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IGRMTQMIDNDFAA-UWVGGRQHSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- MNNKPHGAPRUKMW-BPUTZDHNSA-N Met-Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MNNKPHGAPRUKMW-BPUTZDHNSA-N 0.000 description 1
- NCFZHKMKRCYQBJ-CIUDSAMLSA-N Met-Cys-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NCFZHKMKRCYQBJ-CIUDSAMLSA-N 0.000 description 1
- PTYVBBNIAQWUFV-DCAQKATOSA-N Met-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N PTYVBBNIAQWUFV-DCAQKATOSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 1
- KKXGLCPUAWODHF-GUBZILKMSA-N Met-Met-Cys Chemical compound N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(O)=O KKXGLCPUAWODHF-GUBZILKMSA-N 0.000 description 1
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- PESQCPHRXOFIPX-RYUDHWBXSA-N Met-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-RYUDHWBXSA-N 0.000 description 1
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 1
- MFDDVIJCQYOOES-GUBZILKMSA-N Met-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N MFDDVIJCQYOOES-GUBZILKMSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 206010049567 Miller Fisher syndrome Diseases 0.000 description 1
- 102100038738 Mitochondrial carnitine/acylcarnitine carrier protein Human genes 0.000 description 1
- 101000686985 Mouse mammary tumor virus (strain C3H) Protein PR73 Proteins 0.000 description 1
- 208000001089 Multiple system atrophy Diseases 0.000 description 1
- 101001076414 Mus musculus Interleukin-6 Proteins 0.000 description 1
- 101001055215 Mus musculus Interleukin-9 Proteins 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 208000002565 Open Fractures Diseases 0.000 description 1
- 208000001132 Osteoporosis Diseases 0.000 description 1
- 101150012195 PREB gene Proteins 0.000 description 1
- 208000000733 Paroxysmal Hemoglobinuria Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- OZILORBBPKKGRI-RYUDHWBXSA-N Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 OZILORBBPKKGRI-RYUDHWBXSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 1
- MRNRMSDVVSKPGM-AVGNSLFASA-N Phe-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRNRMSDVVSKPGM-AVGNSLFASA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- QPQDWBAJWOGAMJ-IHPCNDPISA-N Phe-Asp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 QPQDWBAJWOGAMJ-IHPCNDPISA-N 0.000 description 1
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 1
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 1
- HQVPQHLNOVTLDD-IHRRRGAJSA-N Phe-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N HQVPQHLNOVTLDD-IHRRRGAJSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- JMCOUWKXLXDERB-WMZOPIPTSA-N Phe-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 JMCOUWKXLXDERB-WMZOPIPTSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- NWVMQNAELALJFW-RNXOBYDBSA-N Phe-Trp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NWVMQNAELALJFW-RNXOBYDBSA-N 0.000 description 1
- MJOJSHOTYWABPR-WIRXVTQYSA-N Phe-Trp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MJOJSHOTYWABPR-WIRXVTQYSA-N 0.000 description 1
- BTAIJUBAGLVFKQ-BVSLBCMMSA-N Phe-Trp-Val Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C(C)C)C(O)=O)C1=CC=CC=C1 BTAIJUBAGLVFKQ-BVSLBCMMSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 102100036050 Phosphatidylinositol N-acetylglucosaminyltransferase subunit A Human genes 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 208000013544 Platelet disease Diseases 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- NPYPAHLBTDXSSS-UHFFFAOYSA-N Potassium ion Chemical compound [K+] NPYPAHLBTDXSSS-UHFFFAOYSA-N 0.000 description 1
- 101800001357 Potential peptide Proteins 0.000 description 1
- 102400000745 Potential peptide Human genes 0.000 description 1
- 208000004210 Pressure Ulcer Diseases 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 108010009413 Pyrophosphatases Proteins 0.000 description 1
- 102000009609 Pyrophosphatases Human genes 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 206010063837 Reperfusion injury Diseases 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 239000012722 SDS sample buffer Substances 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 102000003800 Selectins Human genes 0.000 description 1
- 108090000184 Selectins Proteins 0.000 description 1
- 206010053879 Sepsis syndrome Diseases 0.000 description 1
- 206010040070 Septic Shock Diseases 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 1
- FFOKMZOAVHEWET-IMJSIDKUSA-N Ser-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(O)=O FFOKMZOAVHEWET-IMJSIDKUSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 1
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 208000009106 Shy-Drager Syndrome Diseases 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- FKNQFGJONOIPTF-UHFFFAOYSA-N Sodium cation Chemical compound [Na+] FKNQFGJONOIPTF-UHFFFAOYSA-N 0.000 description 1
- 208000029033 Spinal Cord disease Diseases 0.000 description 1
- 208000006011 Stroke Diseases 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 1
- 208000002847 Surgical Wound Diseases 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 208000000491 Tendinopathy Diseases 0.000 description 1
- 206010043255 Tendonitis Diseases 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- KAFKKRJQHOECGW-JCOFBHIZSA-N Thr-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(O)=O)=CNC2=C1 KAFKKRJQHOECGW-JCOFBHIZSA-N 0.000 description 1
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- CDPXXGFRDZVVGF-OYDLWJJNSA-N Trp-Arg-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CDPXXGFRDZVVGF-OYDLWJJNSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 1
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 1
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 1
- OFTGYORHQMSPAI-PJODQICGSA-N Trp-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O OFTGYORHQMSPAI-PJODQICGSA-N 0.000 description 1
- TUUXFNQXSFNFLX-XIRDDKMYSA-N Trp-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N TUUXFNQXSFNFLX-XIRDDKMYSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 1
- WGBFZZYIWFSYER-BVSLBCMMSA-N Trp-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N WGBFZZYIWFSYER-BVSLBCMMSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 1
- QZOSVNLXLSNHQK-UWVGGRQHSA-N Tyr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QZOSVNLXLSNHQK-UWVGGRQHSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- XOVDRAVPGHTYLP-JYJNAYRXSA-N Tyr-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O XOVDRAVPGHTYLP-JYJNAYRXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- DTWMJYGOUWNWEC-IHPCNDPISA-N Tyr-Trp-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 DTWMJYGOUWNWEC-IHPCNDPISA-N 0.000 description 1
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 206010054880 Vascular insufficiency Diseases 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 108091029474 Y RNA Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 201000009961 allergic asthma Diseases 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 235000005550 amino acid supplement Nutrition 0.000 description 1
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000036783 anaphylactic response Effects 0.000 description 1
- 208000003455 anaphylaxis Diseases 0.000 description 1
- 208000007502 anemia Diseases 0.000 description 1
- 230000033115 angiogenesis Effects 0.000 description 1
- 230000002491 angiogenic effect Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000003172 anti-dna Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000007416 antiviral immune response Effects 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 206010003246 arthritis Diseases 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000005784 autoimmunity Effects 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 230000003542 behavioural effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010081355 beta 2-Microglobulin Proteins 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 208000015294 blood coagulation disease Diseases 0.000 description 1
- 230000037237 body shape Effects 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000005013 brain tissue Anatomy 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 201000003984 candidiasis Diseases 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 208000003295 carpal tunnel syndrome Diseases 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 210000000080 chela (arthropods) Anatomy 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 230000027288 circadian rhythm Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 230000009852 coagulant defect Effects 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 208000018631 connective tissue disease Diseases 0.000 description 1
- 239000003433 contraceptive agent Substances 0.000 description 1
- 230000002254 contraceptive effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000004940 costimulation Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000005860 defense response to virus Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 206010014599 encephalitis Diseases 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 210000003989 endothelium vascular Anatomy 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 230000002327 eosinophilic effect Effects 0.000 description 1
- 210000000267 erythroid cell Anatomy 0.000 description 1
- 230000000925 erythroid effect Effects 0.000 description 1
- 210000003013 erythroid precursor cell Anatomy 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000001400 expression cloning Methods 0.000 description 1
- 230000036251 extravasation Effects 0.000 description 1
- 229940012952 fibrinogen Drugs 0.000 description 1
- 230000020764 fibrinolysis Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 230000003176 fibrotic effect Effects 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 235000012041 food component Nutrition 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000010448 genetic screening Methods 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Chemical group OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010017446 glycyl-prolyl-arginyl-proline Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 230000037308 hair color Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 208000037584 hereditary sensory and autonomic neuropathy Diseases 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 210000003701 histiocyte Anatomy 0.000 description 1
- 230000013632 homeostatic process Effects 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 102000049885 human IL11 Human genes 0.000 description 1
- 102000052627 human IL9 Human genes 0.000 description 1
- 229940116886 human interleukin-6 Drugs 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 230000003463 hyperproliferative effect Effects 0.000 description 1
- 230000009610 hypersensitivity Effects 0.000 description 1
- 230000008105 immune reaction Effects 0.000 description 1
- 230000008629 immune suppression Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 229960003444 immunosuppressant agent Drugs 0.000 description 1
- 230000001861 immunosuppressant effect Effects 0.000 description 1
- 239000003018 immunosuppressive agent Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007574 infarction Effects 0.000 description 1
- 208000000509 infertility Diseases 0.000 description 1
- 230000036512 infertility Effects 0.000 description 1
- 231100000535 infertility Toxicity 0.000 description 1
- 230000004968 inflammatory condition Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 108010067471 inhibin A Proteins 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 229940028885 interleukin-4 Drugs 0.000 description 1
- 210000002570 interstitial cell Anatomy 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 208000028867 ischemia Diseases 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 231100000516 lung damage Toxicity 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 230000003526 lymphopoietic effect Effects 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000013160 medical therapy Methods 0.000 description 1
- 210000003593 megakaryocyte Anatomy 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 210000001589 microsome Anatomy 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 206010028417 myasthenia gravis Diseases 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 201000008383 nephritis Diseases 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000000771 oncological effect Effects 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 230000001599 osteoclastic effect Effects 0.000 description 1
- 230000002188 osteogenic effect Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 229940094443 oxytocics prostaglandins Drugs 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 201000003045 paroxysmal nocturnal hemoglobinuria Diseases 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 210000000578 peripheral nerve Anatomy 0.000 description 1
- 208000027232 peripheral nervous system disease Diseases 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000005554 pickling Methods 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 230000001817 pituitary effect Effects 0.000 description 1
- 229910001414 potassium ion Inorganic materials 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 235000005974 protein supplement Nutrition 0.000 description 1
- 229940116540 protein supplement Drugs 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 208000005069 pulmonary fibrosis Diseases 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022983 regulation of cell cycle Effects 0.000 description 1
- 238000002271 resection Methods 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 238000009938 salting Methods 0.000 description 1
- 239000012488 sample solution Substances 0.000 description 1
- 230000002000 scavenging effect Effects 0.000 description 1
- 230000036303 septic shock Effects 0.000 description 1
- 239000003352 sequestering agent Substances 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 210000002325 somatostatin-secreting cell Anatomy 0.000 description 1
- 230000021595 spermatogenesis Effects 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 210000004988 splenocyte Anatomy 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 231100000617 superantigen Toxicity 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 201000004415 tendinitis Diseases 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- 229960003766 thrombin (human) Drugs 0.000 description 1
- 206010043554 thrombocytopenia Diseases 0.000 description 1
- 230000009772 tissue formation Effects 0.000 description 1
- 230000008467 tissue growth Effects 0.000 description 1
- 208000037816 tissue injury Diseases 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 230000001875 tumorinhibitory effect Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 208000023577 vascular insufficiency disease Diseases 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
Abstract
Proteins containing any of the amino acid sequences represented by Sequence No. 1 to Sequence No. 2 or by Sequence No. 4 to Sequence No. 25 and DNAs encoding said proteins exemplified by cDNAs containing any of the base sequences represented by Sequence No. 26 to Sequence No. 50. Said proteins can be provided by expressing cDNAs encoding human proteins having transmembrane domains and recombinants of these human cDNAs.
Description
HUMAN PROTEINS THAT HAVE TRANSMEMBRANE DOMAINS AND DNAs THAT CODIFY FOR THESE PROTEINS
Field of the Invention The present invention relates to human proteins having transmembrane domains, DNAs encoding these proteins and eukaryotic cells expressing those DNAs. The proteins of the present invention can be used as pharmaceutical compounds or as antigens for the preparation of antibodies against said proteins. The cDNAs of the present invention can be used as probes for the diagnosis of genes and as sources of genes for gene therapy. In addition, cDNAs can be used as gene sources for the large-scale production of the proteins encoded by said cDNAs. Moreover, cells introduced with DNAs encoding transmembrane proteins and expressing transmembrane proteins in large amounts can be used for the detection of the corresponding ligands as well as for the screening of new lower molecular drugs.
Background of the Invention Membrane proteins play important roles, such as signal receptors, ion channels, transporters, etc., for the transport of material and the transmission of information that are mediated by the cell membrane. Their examples include receptors for a variety of cytokines, ion channels for the sodium ion, potassium ion, chlorine ion, etc., transporters for saccharides and amino acids and the like, wherein the genes for many of them have already been cloned. It has been clarified that the abnormalities of these membrane proteins are related to a number of diseases up to now cryptogenic. For example, a gene for a membrane protein having 12 transmembrane domains was identified as the gene responsible for cystic fibrosis [Romens, J. M. et al., Science 245: 1059-1065
(1989)]. In addition, it has been clarified that several membrane proteins act as receptors when a virus infects the cells. For example, it has been revealed that HIV-1 infects cells through the mediation of a 1-membrane protein, a membrane protein in the T-cell membrane, which has a CD-4 antigen and 7 domains of transmembrane [Feng, Y. Et al., Science 272: 872-877 (1996)]. Therefore, it is anticipated that the discovery of a new membrane protein leads to the clarification of the causes of many diseases, which is why the isolation of a new gene coding for the membrane protein has been desired. Until now, due to the difficulty in purification, many of the membrane proteins have been isolated through an approach coming from the side of the genes. A general method is the so-called expression cloning, which comprises the transfection of a cDNA library in animal cells to express the cDNA and the subsequent detection of the cells expressing the target membrane protein in the membrane by a immunological technique using an antibody or a biological technique for changing the permeability of the membrane. However, this method is applicable only to the cloning of a gene for a membrane protein with a known function. In general, membrane proteins possess hydrophobic transmembrane domains within the proteins that are synthesized in the ribosome and then remain in the phospholipid that will be trapped in the membrane. Accordingly, cDNA evidence for membrane protein coding is provided by determining the entire base sequence of a full-length cDNA followed by the detection of highly hydrophobic transmembrane domains in the amino acid sequence of the protein encoded by said cDNA.
OBJECTIVES OF THE INVENTION The objective of the present invention is to provide new human proteins having transmembrane domains, DNAs coding for said proteins and transformed eukaryotic cells capable of expressing said DNAs.
Brief Description of the Drawings Figure 1 is an illustration of the structure of the detection vector pSSD3 of the secretory signal sequence.
Figure 2 is an illustration of the hydrophobicity / hydrophilicity profile of the protein
• encoded by clone HP00442. Figure 3 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP00804. Figure 4 shows the result of the northern-blot hybridization of clone HP00804. Figure 5 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP01098. Figure 6 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP01148. 10 Figure 7 shows the result of the northern-blot hybridization of clone HP01148. Figure 8 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP01293. Figure 9 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10013. Figure 10 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP 10034. Figure 11 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10050. Figure 12 is an illustration of the hydrophobicity / hydrophilicity profile of protein 20 encoded by clone HP 10071. Figure 13 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by HP 10076 clone. Figure 14 is a illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by HP 10085 clone. Figure 15 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10122. Figure 16 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10136. Figure 17 is an illustration of the hydrophobicity / hydrophilicity profile of the protein 30 encoded by clone HP10175.
Figure 18 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10179. Figure 19 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by HP 10196 clone. Figure 20 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP10235. Figure 21 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP 10297. Figure 22 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP 10299. Figure 23 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by the HP 10301 clone. Figure 24 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by the HP 10302 clone. Figure 25 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP 10304. Figure 26 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by clone HP 10305. Figure 27 is an illustration of the hydrophobicity / hydrophilicity profile of the protein encoded by the HP 10306 clone. Figure 28 is an illustration of the hydrophobicity / hydrophilicity profile of the encoded protein for the HP 10328 clone.
Detailed Description of the Preferred Modalities As a result of intensive studies, the present inventors succeeded in cloning cDNAs having transmembrane domains, from a full-length human cDNA library, thereby completing the present invention. That is, the present invention provides proteins that contain any of the amino acid sequences represented by Sequence No. 1 to Sequence No. 2 or Sequence No. 4 to Sequence No. 25 which are human proteins having amino acid domains. transmembrane The present invention also provides DNAs encoding said proteins such as cDNAs containing any of the base sequences represented by Sequence No. 26 to Sequence No. 50 and transformed eukaryotic cells capable of expressing said DNAs. Each of the proteins of the present invention can be obtained, for example, by a method for isolation from human organs, cell lines, etc., a method for the preparation of the peptide by chemical synthesis based on the amino acid sequence of the present invention, or a production method with recombinant DNA technology using the DNA encoding the transmembrane domains of the present invention, wherein the method of obtaining by recombinant DNA technology is preferably employed. For example, an in vitro expression can be achieved by the preparation of an RNA by in vitro transcription from a vector having the cDNA of the present invention, followed by in vitro translation using this RNA as a model. Also, recombination of the translation domain to an appropriate expression vector by the method known in the art leads to the expression of a large amount of the encoded protein by the use of prokaryotic cells (e.g., Escherichia coli, Bacillus subtilis) or eukaryotic cells (e.g., yeast, insect cells and animal cells).
In the case where a protein of the present invention is expressed by a microorganism such as Escherichia coli, the translation region of the cDNA of the present invention is constructed in an expression vector having an origin, a promoter, site ( s) of ribosome binding, cDNA cloning site (s), a termination, etc. which can be replicated in the microorganism and, after transformation of the host cells with said expression vector, the transformant thus obtained is incubated, whereby the protein encoded by said cDNA can be produced on a large scale in the microorganism. In that case, a protein fragment containing an optional region can be obtained by performing the expression with the insertion of an initiation codon and a stop codon before and after the optional translation region. Alternatively, a fusion protein with another protein can be expressed. Only a portion of the protein encoding said cDNA can be obtained by dividing said fusion protein with an appropriate protease.
In the case where a protein of the present invention is to be produced in eukaryotic cells the translation region of said cDNA can be subjected to recombination to an expression vector for eukaryotic cells having a promoter, a splicing domain, an Poly (A) addition site, etc. and transfected into eukaryotic cells so that the protein is produced as a membrane protein on the surface of the cell membrane. Examples of the expression vector are pKAl, pCDM8, pSVK3, pMSG, pSVL, pBK-CMV, pBK-RSV, EBV vector, pRS, pYES2, etc. Examples of eukaryotic cells are cells from cultures of mammalian animals (e.g., COS7 monkey kidney cells, Chinese CHO hamster ovary cells), blast yeast, cleavage yeast, silkworm yeast, toad oocyte with pincer South African, etc. However, any eukaryotic cells can be used as long as the protein of the invention can be expressed on the surface of the cell membrane. In order to introduce the expression vector into the eukaryotic cells any conventional method per se such as the electrophoresis method, the calcium phosphate method, the liposome method, or the DEAE dextran method can be used. For the separation and purification of the protein of the invention from the culture medium after the expression of said protein in prokaryotic cells or eukaryotic cells, conventional separation operations may be adopted, if necessary, in their appropriate combinations. Examples of conventional separation operations are treatment with a denaturing agent (e.g., urea) or a surfactant, ultrasonic treatment, enzymatic digestion, salting, solvent precipitation, dialysis, centrifugation, ultrafiltration, gel filtration, SDS-PAGE, Isoelectric point electrophoresis, ion exchange chromatography, hydrophobic chromatography, affinity chromatography, reverse phase chromatography, etc. The proteins of the present invention include peptide fragments (residues of more than 5 amino acids) containing any partial amino acid sequence of the amino acid sequence represented by Sequence No. 1 to Sequence No. 2 or Sequence No. 4 to Sequence No. 25. These fragments can be used as antigens for the preparation of the antibodies. Also, proteins of the present invention having signal sequences appear in the form of maturation proteins on the cell surface, after the signal sequences are eliminated. Therefore, these maturation proteins will fall within the scope of the present invention. The N-terminus amino acid sequences of the maturation proteins can be easily identified using the method for determination of the cleavage site in a signal sequence [Japanese Patent Kokai Publication No. 1996-187100]. further, many membrane proteins are subjected to processing on the cell surface to be converted to secretory forms. These peptides or secretory proteins will fall within the scope of the present invention. When glycosylation sites are present in the amino acid sequences, expression in appropriate animal cells will provide glycosylated proteins. Therefore, these glycosylated proteins or peptides will also fall within the scope of the present invention. The DNAs of the present invention include all the DNAs that code for the aforementioned proteins. Said DNAs can be obtained using the chemical synthesis method, the DNA cloning method and the like. Each of the cDNAs of the present invention can be cloned from, for example, a cDNA library of the human cell origin. The cDNA is synthesized using a poly (A) + RNA extracted from human cells as a model. Human cells can be cells released from the human body, for example, by operation or they can be cultured cells. The cDNA can be synthesized using any method selected from the Okayama-Berg method [Okayama, H. and Berg, P. Mol. Cell. Biol. 2: 161-170 (1982)], the Gubler-Hoffman method [Gubler, U. and Hoffman, J. Gene 25: 263-269 (1983)] and the like, but the capping method is preferred [Kato , S. et al., Gene 150: 243-250 (1994)] as illustrated in the Examples in order to obtain a full-length clone in an effective form. The primary selection of a cDNA encoding a human protein having a transmembrane domain (s) is performed by sequencing a partial base sequence of the cDNA clone randomly selected from the cDNA library, sequencing the sequence of amino acids encoded by the base sequence, and recognition of the presence or absence of hydrophobic site (s) in the region of the amino acid sequence with resultant N-terminus. Next, the secondary selection is carried out by the determination of the complete base sequence by sequencing and the expression protein by in vitro translation. The investigation of the cDNA of the present invention for the coding of the protein having the sequence of the secretory signal is carried out using the method of detection of the signal sequence [Yokoyama-Kobayashi, M. cl al., Gene 163: 193- 196 (1995)]. In other words, the inquiry for the coding portion of the inserted cDNA fragment to function as a signal sequence is provided by fusing a cDNA fragment encoding the N terminus of the target protein with a cDNA encoding the urokinase protease domain. and subsequently expressing the resulting cDNA in COS7 cells to detect urokinase activity in the cell culture medium. On the other hand, it is judged that the termination region N remains in the membrane in the case where urokinase activity is not detected in the cell culture medium. The cDNAs of the present invention are characterized as containing any of the base sequences represented by Sequence No. 26 to Sequence No. 50 or by any of the base sequences represented by Sequence No. 51 to Sequence No. 75 Table I summarizes the number of clone (HP number), the cells that provide the cDNA, the number of total bases of the cDNA and the number of amino acid residues of the encoded protein, for each of the DNAs.
Table 1
Accordingly, the same clone as any of the cDNAs of the present invention can be easily obtained by screening the cDNA library constructed from the human cell line or tissue used in the present invention, by the use of a DNA probe. oligonucleotide synthesized on the basis of the corresponding cDNA base sequence described in Sequence No. 51 Sequence No. 75. In general, polymorphism due to individual difference is frequently observed in human genes. Therefore, any cDNA that is subjected to insertion or deletion of one or more nucleotides and / or substitution with other nucleotides in Sequence No. 50 to Sequence No. 75 will fall within the scope of the present invention. In a similar manner, any protein that is produced by these modifications comprising the insertion or deletion of one or multiple nucleotides and / or substitution with other nucleotides will fall within the scope of the present invention, as long as said protein possesses the activity of the corresponding protein having the amino acid sequence represented by Sequence No. 1 to Sequence No. 2 or Sequence No. 4 to Sequence No. 25. The cDNAs of the present invention include cDNA fragments (greater than 10 bp) containing any partial base sequence of the base sequence represented by Sequence No. 26 to No. 50 or of the base sequence represented by Sequence No. 51 to No. 75. Also, DNA fragments consisting of a chain of sense and a chain of anti-sense will fall within this scope. These cDNA fragments can be used as probes for the diagnosis of genes.
BEST METHOD FOR CARRYING OUT THE INVENTION EXAMPLE The present invention is embodied in greater detail by the following examples but this embodiment is not intended to restrict the present invention. The basic operations and enzyme reactions in relation to DNA recombination are carried out according to the literature [Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, 1989]. Unless otherwise specified, the restriction enzymes and a variety of modifying enzymes to be used were those available from Takara Shuzo Co., Ltd. The manufacturer's instructions were used for the regulatory compositions as well as for the reaction conditions , in each of the enzyme reactions. The cDNA synthesis was carried out according to the literature [Kato, S. et al., Gene 150: 243-250 (1994)].
(1) Preparation of Poly (A) RNA + The HT-1080 fibrosarcoma cell line (ATCC CCL 121), the epidermoid carcinoma cell line KB (ATCC CRL 17), the histiocyte lymphoma cell line U937 (ATCC CRL 1593) , osteosarcoma U-2 OS (ATCC HTB 96), a leukocyte isolated from peripheral blood, stomach cancer tissues obtained by operation, and liver were used for the extraction of mRNAs from human cells. Each of the cell lines was cultured by a conventional procedure. After approximately 1 g of human tissue was homogenized in 20 ml of 5.5 M guanidine thiocyanate solution, total mRNAs were prepared according to the literature [Okayama, H. et al., "Methods in Enzymology", vol. 164, Academic Press, 1987]. These mRNAs were subjected to chromatography using an oligo (dT) -cellulose column washed with 20 mM Tris-hydrochloric acid buffer (pH 7.6), 0.5 M NaCl, and 1 mM EDTA to obtain a po! Y RNA ( A) + according to the aforementioned literature.
(2) Construction of a cDNA Library To a solution of 10 μg of the aforementioned poly (A) + RNA in Tris buffer solution of 100 mM hydrochloric acid (pH 8) was added an alkaline phosphatase unit of bacterial origin and free of RNase and the resulting solution was allowed to react at 37 ° C for one hour. Then the reaction solution experienced phenol extraction followed by ethanol precipitation, the granules obtained were dissolved in a mixed solution of 50 mM sodium acetate (pH 6), 1 mM EDTA, 0.1% 2-mercaptoethanol and 0.01% Triton X-100. To this was added one unit of a pyrophosphatase originating from tobacco (Epicenter Technologies) and the resulting solution in a total volume of 100 μl was allowed to react at 37 ° C for one hour. After the reaction solution experienced phenol extraction followed by precipitation with ethanol, the granules thus obtained were dissolved in water to obtain a solution of pickled poly (A) + RNA. To a solution of poly (A) + RNA pickling and 3 nmol of a DNA-RNA chimeric oligonucleotide (5'-dG-dG-dG-dG-dA-dA-dT-dT-dC-dG-dA-GGA- 3 ') in a mixed aqueous solution of buffer solution of Tris-hyhloric acid 50 mM (pH 7.5), 0.5 mM ATP, 5 mM MgCl2, 10 mM 2-mercaptoethanol and 25% polyethylene glycol were added 50 units of T4 RNA ligase and the resulting solution in a total volume of 30 μl was allowed to react at 20 ° C for 12 hours. After the reaction solution experienced phenol extraction followed by ethanol precipitation, the granules thus obtained were dissolved in water to obtain a poly (A) + oligo-caped chimeric RNA. After the pKAl vector developed by the present inventors (Japanese Patent Kokai Publication No. 1992-1 17292) was digested with Kpnl, a tail of approximately 60 dT was inserted by a terminal transferase. This product was digested with EcoRV to eliminate the dT tail on one side and the resulting molecule was used as a vector primary.
After 6 μg of the previously prepared chimeric psi (A) + oligo-caped RNA was annealed with 1.2 μg of the vector primary, the product was dissolved in a mixed solution of 50 mM Tris-hydrochloric acid buffer (pH 8.3) , 75 mM KCl, 3 mM MgCl 2, 10 mM dithiothreitol and 1.25 mM dNTP (dATP + dCTP + dGTP + dTTP), mixed with 200 units of a reverse transferase (GIBCO-BRL) and the resulting solution in a total volume of 20 μl it was allowed to react at 42 ° C for one hour. After the reaction solution experienced phenol extraction followed by ethanol precipitation, the granules obtained in this manner were dissolved in a mixed solution of 50 mM Tris-hydrochloric acid buffer (pH 7.5), 100 mM NaCl, 10 mM MgCl 2 and 1 mM dithiothreitol. To this, 100 units of EcoRI were added and the resulting solution in a total volume of 20 μl was allowed to react at 37 ° C for one hour. After the reaction solution experienced phenol extraction followed by ethanol precipitation, the granules thus obtained were dissolved in a mixed solution of Tris buffer-20 mM hydrochloric acid (pH 7.5), 100 mM KCl, MgCl 2 4 mM, 10 mM (NH) 2 SO 4 and bovine serum albumin 50 μg / ml. To this was added 60 units of Escherichia coli DNA ligase and the resulting solution was allowed to react at 16 ° C for 16 hours. To the reaction solution were added 2 μl of 2 mM dNTP, 4 units of Escherichia coli DNA polymerase I and 0.1 units of Escherichia coli DNase H, and the resulting solution was allowed to react at 12 ° C for one hour and subsequently at 22 ° C for one hour. Next, the solution of the cDNA synthesis reaction was used to transform
DH12S from Escherichia coli (GIBCO-BRL). The transformation was carried out by the electrophoration method. A portion of the transformant was inoculated in a culture medium of 2xYT agar containing 100 μg / ml of ampicillin, which was incubated at 37 ° C overnight. A growth of the colony in the culture medium was taken randomly and inoculated in 2 ml of the 2xYT culture medium containing 100 μg / ml of ampicillin, which was incubated at 37 ° C overnight. The culture medium was centrifuged to separate the cells, from which a plasmid DNA was prepared by the alkaline lysis method. After the plasmid DNA was digested twice with EcoRI and NotI, the product was subjected to electrophoresis on an 8% agarose gel to determine the size of the cDNA insert. In addition, by using the plasmid obtained as a model, the sequence reaction using universal primary M13 labeled with a fluorescent dye and Taq polymerase (a team from Applied Biosistems Inc.) was carried out and the product was analyzed by a sequencer. of fluorescent DNA (Applied Biosystems Inc.) to determine the base sequence of the 5 'terminus of the approximately 400 bp cDNA. The sequence information was presented as a database of a homo-protein cDNA library.
(3) Selection of DNAs that Code for Proteins that Have Transmembrane Domains. The base sequence recorded in the homo-protein cDNA library was converted to three frames of amino acid sequences and the presence or absence of an open reading frame (ORF) starting from the initiation codon. Subsequently, the selection was made for the presence of a signal sequence that is characteristic for a protein secreted at the N-terminus of the portion encoded by ORF. These clones were sequenced from both 5 'and 3' directions using the deletion method to determine the complete base sequence. The hydrophobicity / hydrophilicity profiles were obtained for proteins encoded by ORF by the Kyte-Doolittle method (Kyte, J. &Doolittle, RF, J. Mol. Biol. 157: 105-132 (1982)] to examine the presence or absence of a hydrophobic region In the case where there is a hydrophobic region of putative transmembrane domain (s) in the amino acid sequence of an encoded protein, this protein was considered as a membrane protein.
(4) Construction of a Vector pSSD3 for Secretory Signal Detection. A microgram of pSSDl carrying the SV40 promoter and a cDNA coding for the urokinase protease domain (Yokoyama-Kobayashi, M. et al., Gene 163: 193-196 (1995)] was digested with 5 units of BglII and 5 EcoRV units Then, after dephosphorylation at the 5 'end by the CIP treatment, a DNA fragment of approximately 4.2 kbp was purified by cutting from agarose gel electrophoresis.
Two oligo ^ Ll DNA linkers (5'-GATCCCGGGTCACGTGGGAT-3 ') and L2 (5'-ATCCCACGTGACCCGG-3') were synthesized and phosphorylated by T4 polynucleotide kinase. After annealing of both linkers, followed by binding with the pSSdl fragment previously prepared by T4 DNA ligase, Escherichia coli JM109 was transformed. A plasmid pSSD3 was prepared from the transformant and the target recombinant was confirmed by determining the base sequence of the fragment inserted into the linker. Figure 1 illustrates the structure of the plasmid thus obtained. The present plasmid vector carries three types of restriction enzyme sites with blunt end formation, Smal, PmaCl and EcoRV. Since these cleavage sites are placed in succession at a range of 7 bp, the selection of an appropriate site in combination with three types of frames for cDNA insertion allows the construction of a vector that expresses a fusion protein.
(5) Functional Verification of Secretory Signal Sequence If the hydrophobic region with N-terminus in the secreted protein clone candidate obtained in the above-mentioned steps works as the secretory signal sequence was verified by the method described in the literature [Yokoyama- Kobayashi, M. et al., Gene 163: 193-196 (1995)]. First, the plasmid containing the target cDNA was divided into an appropriate restriction enzyme site that existed downstream of the portion expected for coding for the secretory signal sequence. In the case where this restriction enzyme site was a 5 'terminus protruding, the site was made blunt ended by the Klenow treatment or mung bean nuclease treatment. Digestion with HindIII was further carried out and a DNA fragment containing the SV40 promoter and a cDNA encoding the secretory sequence downstream of the promoter was separated by agarose gel electrophoresis. This fragment was inserted between the HindIII site of pSSD3 and a selected restriction enzyme site in order to match the urokinase coding frame, thereby constructing a vector expressing a fusion protein of the secretory signal portion of the cDNA. target and the protease domain of urokinase.
After the Escherichia coli (host: JM109) carrying the fusion protein expression vector was incubated at 37 ° C for 2 hours in 2 ml of the 2xYT culture medium containing 100 μg / ml of ampicillin, the helper phage M13KO7 ( 50 μl) was added and the incubation was continued at 37 ° C overnight. A supernatant separated by centrifugation was subjected to precipitation with polyethylene glycol to obtain single-stranded phage particles. These particles were suspended in 100 μl of 1 mM Tris-0.1 mM EDTA, pH 8 (TE). Also, a suspension of single-stranded particles prepared in the same manner as that of the pKAl-UPA vector containing pSSD3 and a full-length urokinase cDNA was used as a control [Yokoyama-Kobayashi, M. et al., Gene 163: 193-196 (1995)]. The cells from the simian kidney-derived culture, COS7, were incubated at 37 ° C in the presence of 5% CO2 in the Dulbecco's modified Eagle culture medium (DMEM) containing 10% fetal bovine albumin. In a 6-well dish (Nunc Inc., 3 cm well diameter), 1 x 105 COS7 cells were inoculated and the incubation was carried out at 37 ° C for 22 hours in the presence of 5% CO2. After the culture medium was removed, the cell surface was washed with a phosphate buffer and subsequently washed again with DMEM containing 50 mM Tris-hydrochloric acid (pH 7.5) (TDMEM). To the cells were added 1 μl of the single-chain phage suspension, 0.6 ml of DMEM culture medium, and 3 μl of TRANSFECTAM ™ (IBF Inc.) and the resulting mixture was incubated at 37 ° C for 3 hours. in the presence of 5% CO2. After the sample solution was removed, the surface of the cells was washed with TDMEM, 2 ml per well of DMEM containing 10% fetal bovine albumin were added, and the incubation was carried out at 37 ° C for 2 days in the presence of 5% CO2. To 10 ml of 50 mM phosphate buffer (pH 7.4) containing 2% bovine fibrinogen (Miles Inc.), 0.5% agarose and 1 mM potassium chloride were added 10 units of human thrombin (Mochida Pharmaceutical Co., Ltd .) and the resulting mixture was solidified in a dish 9 cm in diameter to prepare a fibrin plate. Ten microliters of the culture medium supernatant of the transfected COS7 cells were dispersed on the fibrin plate, which was incubated at 37 ° C for 15 hours. The diameter of the clear circle thus obtained was taken as an index to determine the urokinase activity. In the case where a cDNA fragment codes for the amino acid sequence that functions as a secretory signal sequence, a fusion protein is secreted to form a clear circle by its urokinase activity. Therefore, in the case where a clear circle was not formed, the fusion protein remains trapped in the membrane and the cDNA fragment is considered to encode a transmembrane domain.
(6) Protein Synthesis by In Vitro Translation The plasmid vector carrying the cDNA of the present invention was used for in vitro transcription / translation by the rabbit reticulocyte lysate TNT (Promega Biotec). In this case, [35 S] methionine was added and the expression product was labeled with the radioisotope. All the reactions were carried out following the protocols attached to the equipment. Two micrograms of the plasmid were allowed to react at 30 ° C for 90 minutes in 25 ml total of a reaction solution containing 12.5 μl of the rabbit reticulocyte lysate TNT, 0.5 μl of the buffer solution (attached to the kit), 2 μl of a mixture of amino acids (methionine-free), 2 μl (0.37 MBq / μl) of [35 S] methionine (Amersham Corporation), 0.5 μl of T7 RNA polymerase and 20 U of RNsin. To 3 μl of the reaction solution were added 2 μl of an SDS sample buffer (Tris buffer solution 125 mM hydrochloric acid, pH 6.8, 120 mM 2-mercaptoethanol, 2% SDS solution, 0.025% bromophenol blue and 20% glycerol) and the resulting solution was heated at 95 ° C for 3 minutes and then subjected to SDS-polyacrylamide gel electrophoresis. The molecular weight of the translation product was determined by carrying out the autoradiography.
(7) Northern Blot Hybridization Northern Blot hybridization was carried out in order to examine the expression pattern in human tissues. The membranes on which poly (A) + RNAs were scattered isolated from each human tissue were stained and acquired at Clontech Inc. The cDNA fragments that were cut from the target clones with appropriate restriction enzymes were subjected to separation by electrophoresis. on agarose gel followed by [32P] dCPT (Amersham Corporation) labeling using the Random Primer Labeling Equipment (Takara Shuzo Co., Ltd.). Hybridization was carried out using a solution attached to the stained membrane according to the protocol.
(8) Expression in COS7 Escherichia coli possessing an expression vector of the protein of the invention was infected with helper phage M13KO7 and single-stranded phage was obtained by the above method. Using the phage thus obtained, the expression vector was introduced into cultured COS7 cells of monkey kidney origin according to the above method. The culture was carried out at 37 ° C in the presence of 5% CO2 for 2 hours and then in a medium containing [35S] cysteine for 1 hour. The cells were harvested, dissolved and subjected to SDS-PAGE, whereby a band corresponding to a protein was revealed as an expression product, which was not present in the COS cells.
(9) Clone Examples < HP00442 > (Sequences Nos. 1, 26, 51) The determination of the complete base sequence for the cDNA insert of clone HP00442 obtained from the cDNA libraries of the human fibrosarcoma cell line HT-1080 revealed the structure consisting of a 5 'non-translation region of 81 bp, an ORF of 618 bp, and a 3' non-translation region of 287 bp. The ORF codes for a protein consisting of 205 amino acid residues with 5 transmembrane domains, Figure 2 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The result of the in vitro translation did not reveal the formation of different bands for the translation products and revealed the formation of fuzzy bands in the high molecular weight position. Investigation of the protein database using the amino acid sequence of the present invention revealed that the protein was analogous to the proteolipid protein PPA1 of baker's yeast proton ATPase (SWISS-PROT Accession No. P23968). Table 2 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the proteolipid protein PPA1 of baker's proton ATPase (PL). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins had a homology of 56.8% in the entire region except for the N-terminus.
Table 2
HP MTGI_ALLYS6VFVAF ACALAVGVCYTIF-DLGFR DVAW LTETSPFMWS * _. *. *. ** ***. **, PL MN ESIO) DDMSLGKFSFSHFLYYLVLIVVIVYGLYKLFTGHGSDINFGKFLLRTSPYMWA HP NLGIGLAISLSVVGAAWGIYITGSSIIGGGVKAPRI TrarLVSIIFCEAVAIYGI3_MAIV ****. *,. **********. *****. **, **. ** **. ****. ****** "*****. *. *** PL NI? IALCVGLSVVGAAWGIFITGSSMIGAGVRAPRITTKNLISIIFCEVVAIYGLIIAIV
HP ISNMAEPFSATDPKAIGHRNYHAGYSMFGAGLTVGLSNLFCGVCVGIVGSGAALADAQNP. * ,. ** *,. ***. * **. *** ***. **. ***. * .. ** .. ** ... PL FSSKL- TVATAENMYSKSNLYTGYSLFWAGITVGASNLICGIAVGITGATAAISDAADS HP SLFVKILIVEIFGSAIGLFGVIVAILQTSRVKMGD. ****** .. *****. **. *. ** .. * ... PL ALFVKILVIEIFGSILGLLGLIVGLUMAGKASEFQ
In addition, the GenBank research using the present ANDc base sequence revealed that there were some ESTs possessing 90% or more homology and that they also contained the initiation codon (eg, Accession No. H87379), but the protein present it can not be predicted from this sequence. The proteolipid protein PPA1 of baker's yeast proton ATPase is an essential membrane protein for cell growth [Apperson, M. et al., Biochem. Biophys. Res. Commun. 168: 574-579 (1990)]. Accordingly, the protein of the present invention, which is homologous to said protein, is considered essential for the growth of human cells and can be used for the diagnosis and treatment of diseases caused by the abnormality of the present protein.
< HP00804 > (Sequences Nos. 2, 27, 52) The determination of the complete base sequence for the cDNA insert of clone HP00804 obtained from the cDNA libraries of the human leukocyte cell revealed the structure consisting of a non-translation region. 5 'of 132 bp, an ORF of 1116 bp, and a 3' non-translation region of 576 bp. The ORF codes for a protein consisting of 371 amino acid residues with 7 transmembrane domains. Figure 3 illustrates the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The result of the in vitro translation did not reveal the formation of different bands for the translation products. Examination of the expression pattern in tissues by hybridization by northem blot using the cDNA fragment of the present invention revealed that the expression occurred in all tissues examined as shown in Figure 4. Thus, the protein of the present invention invention is considered a homemade protein. Investigation in the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the rat-glutamate NMDA receptor binding subunit (GenBank Accession No. S61973). Table 3 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the NMDA receptor binding subunit of rat-glutamate (RN). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. This subunit consists of 516 amino acid residues and a glutamine region at position 68 to arginine at position 342 possessed 92.6% homology with the 270 amino acid residue of the C terminus in the protein of the present invention. However, no homology was observed in the region of the N-terminus. Accordingly, a characteristic repeat sequence that is rich in proline, tyrosine and glycine was observed in the N-terminus region of the protein of the present invention.
Table 3
HP MSHEKSFLVSGDNYPPPNPGYPGGPQPPMPPYAQPPYPGAPYPQPPFQPSPYGQPGYPHG
RN MKRVSWSLGTAILPQTLAILWGHKPLCLPMFSLPTLG
HP PSPYPQGGYP GPYPQGGYPQGPYPQEGYPQGPYPQGGYPQGPYPQSPFPPNPYGQPQVF **. **************. *
RN PHTHRPLSSPLPMVNQGIPMVPVPITR LPIJDLIJCEATHQGHYPQSPFPPNPYGqPPPF
HP PGQDPDSPQHGNYQEEGPPSYYDNQDFPATNWDDKSIRQAFIRKVFLVLTLQLSVTLSTV ***. ********************** .. ** ****************** *********
RN -QDPGSPQHGNYQEEGPPSYYDNQDFPSVNW-DKSIRQAFIRKVFLVLTLQLSVTLSTV HP SVFTFVAEV GITIuO p? YVSYAVFFISLIVLSCCGDFRRCT. , ****. *******. **********. ***************. ********* *. ** ****
RN AIFTFVGEV GFVRANV TYYVSYAIFFISLIVLSCCGDFRKKHPWNLVALSILTISLSY
HP WGMIASFYNTEAVIMAVGITTAVCFTVVIFSM TRYDFTSCMGVLLVSMVVLFIFAILC **************************************** ** **********
RN MVGMIASFYNTEAVIMAVGITTAVCFTWIFSMQTRYDFTSCMGVLLVSVWLFIFAILC
HP IFIRNRILEIVYASLGALLFTCFLAVDTQLLLGNKQLSLSPEEYVFAALNLYTDIINIFL
********************************************** **********
RN IFIRNRILEIVYASLGALLFTCFLAVDTQLLLGNKQLSLSPEEYVFAALNLYTDIINIFL HP YILTIIGRAKE ********. , RN YILTIIGRSQGIGQAPAQVAWAQTHAPAMTLPSVLPPLOTPAMAWSRGSPSRPRVCTLQ In addition, the search in GenBank using the base sequence of the present cDNA revealed that there are some ESTs that have 90% or more homology (for example, Access No. W25936), but none of them was shorter than the present cDNA and did not contain the initiation codon. It has been found that the binding subunit of the rat-glutamate NMDA receptor is one of the subunits of the NMDA receptor complex that exists specifically in the brain [Kumar. K. N. et al., Nature 354: 70-73 (1991)]. Despite a high homology with the protein of the present invention, the subunit shows different patterns of expression in the N-terminus sequence and tissues, so both molecules are considered to have different functions. Since the protein of the present invention possesses 7 transmembrane domains that are characteristic for channels and transporters, this protein is considered to play a role as a channel and a transporter. Because the protein of the present invention is an essential home protein for cells, the present protein can be used for the diagnosis and treatment of diseases caused by the abnormality of this protein.
< HP01098 > (Sequences Nos. 3, 28, 53) The determination of the complete base sequence for the cDNA insert of the clone
HP01098 obtained from human stomach cancer cDNA libraries revealed the structure consisting of a 5 'non-translation region of 61 bp, an ORF of 540 bp, and a 3' non-translation region of 475 bp. The ORF codes for a protein consisting of 179 amino acid residues with a transmembrane domain. Figure 5 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method.
The in vitro translation resulted in the formation of a translation product of 20 kDa that was almost consistent with the molecular weight of 20,625 predicted from ORF. Searching for the protein in the database using the amino acid sequence of the present protein revealed that the protein was completely identical with an 18 kDa subunit of the canine microsomal signal peptidase (SWISS-PROT, Accession Number P21378). It was therefore verified that the cDNA of the present invention codes for the human homolog of the 18 kDa subunit of the microsomal signal peptidase.
Searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs possessing the homology of 90% or more (eg, Accession No. T60549), but many sequences were not distinct and the same ORF was not identified as that in the cDNA present. The 18 kDa subunit of the canine micromosomal signal peptidase has been found as one of the subunits of the signal peptidase complex that exist in the microsome [Schelness, G. S. & Blobel, G., J. Biol. Chem. 265: 9512-9519 (1990)]. The signal peptidase is an enzyme that divides the signal sequence during the secretion of a secretory protein in the endoplasmic reticulum. Accordingly, the cDNA of the present invention can be used for the production of the present protein as well as for the diagnosis and treatment of diseases caused by the abnormality of the present protein.
< HP01148 > (Sequences Nos. 4, 29, 54) The determination of the complete base sequence for the cDNA insert of clone HP01148 obtained from the human liver cDNA libraries revealed the structure consisting of a 5 'non-translation region. of 101 bp, an ORF of 1044 bp, and a 3 'non-translation region of 446 bp. The ORF codes for a protein consisting of 347 amino acid residues with a transmembrane domain at the N-terminus. Figure 6 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein remained in the membrane from the observation that the urokinase secretion was not identified, during transduction in COS7 cells of an expression vector in which a HindIII-PvulI fragment containing a cDNA fragment coding for 178 amino acid residue of the N-terminus in the present protein was inserted into the HindlII-PmaCI site of pSSD3. Therefore, the present protein was considered to be a type II membrane protein. The in vitro translation resulted in the formation of a translation product of 41 kDa that was almost consistent with the molecular weight of 38.101 predicted by ORF. Examination of the tissue expression pattern by northern blot hybridization using the cDNA fragment of the present invention revealed that a strong expression occurred in the spleen, as shown in Figure 7. It was also indicated that a slight expression occurred in the liver. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the bovine WC1 antigen (SWISS-PROT Accession No. P30205). Table 4 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the bovine WC1 antigen (WC). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 38%.
Table 4
HP MALLFSLILAICTRPGFLASPSGVRLVGGLHRCEGRVEVEQKGQWGTVCDDGW. ***. *. *. ****. . . * *** *** *
WC VLPQCNDFLSQPAGSAASEESSPYCSDSRQLRLVDGGGPCGGRVEIL QGSWGTICDDDW HP DIKDVAVLCRELGCGAASGTPSGILYEPPAEKEQKVLI SVSCTGTEDTLAQCEQEE-V
WC DLDDARWCRQLGCGEALNATGSAHF - GAGSGPIWLDDLNCTGKESHVWRCPSRGWGR HP YDCSHEEDAGASCENPESSFSPVPEGVRLADGPGHCKGRVEV HQNQWYTVCQTGWSLRA. **. *. ****. *. * * .. * * * .. ** ... *. ** ... . WC HDCRHKEDAGVIC- - SE- F LALRMVSEDQQCAGWLEVFYNGTWGSVCRSPMEDIT HP AKVVCRQI? CGRAVLTQKRCNKHAYGRK_? IWLSQMSCSGREATLQDCPSGPWGKNTCNHD
WC VSVICRQLGCGDSGS NTSVGLRE-GSRPRWVDLIQCR MDTSLWQCPSGPWKYSSCSPK HP EDTWVECE DPFDLRLVGGDNLCSGRLEVLHKGVWGSVCDDNWGEKE *. . . . . ** * •. *** *** **** ** *. * ** **** *, * WC EEAYISCEGRRPKSCPTAAACTDREKLRLRGGDSECSGRVEVWHNGSWGXVCDDSWSLAE HP DQWCKQLGCGKSLSPSDRDR CYGPGVGRIWLDNVRCSGEEQSLEQCQHRFWGFHDCTH. ***. ***** .. *,. *** *. ****. *. *. * * **. * ** **. * WC AEWCQQLGCGQALE-AVR- SAAFGPGNGSIWLDEV CGGRESSLWDCVAEPWGQSDCKH HP QEDVAVICSG. **. . * *** WC EEDAGVRCSGVRTTLPTTTAGTRTTSNSLPGIPSLPGVLC ILGSLLFLVLVILVT LL
In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed the homology of 90% or more (for example, Accession No. H91200), but it could not be assessed if these ESTs with partial sequences encoded for the same protein as the protein of the present invention. The bovine WC1 antigen has been found as a membrane antigen that is expressed specifically in T? D cells [Wijngaard, P.L.J. et al., J. Immunol. 149: 3273-3277 (1992)]. The region showing an analogy is termed the cysteine-rich domain of the scavenger receptor (SRCR) which also existed as a repeat sequence in macrophage scavenging receptors [Matsumoto, A. et al., Proc. Nati Acad. Sci, USA 87: 9133-9137 (1990)] The CD6 T cell differentiation antigen [Aruffo, A et al., J. Exp. Med 174: 949-952 (1991)] and the like. Since the present protein is expressed specifically in the spleen. This protein is considered to be deeply associated with the functions of the spleen and also functions as a receptor in the same way as other members of the SRCR family.
< HP01293 > (Sequences Nos. 5, 30, 55) The determination of the complete base sequence for the cDNA insert of the clone
HP01293 obtained from the human liver cDNA libraries revealed the structure consisting of a 5 'non-translation region of 89 bp, an ORF of 1665 bp, and a 3' non-translation region of 134 bp. The ORF codes for a protein consisting of 554 amino acid residues with 12 transmembrane domains. Figure 8 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation did not reveal the formation of different bands and revealed the formation of fuzzy bands in the high molecular weight position. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the rat cation transporter (Genbank, Accession No. X78855). Table 5 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the mouse interstitial cell protein (MM). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 78.1% between whole regions.
Table 5
HP MPTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAISLSQ ******. ****** *********. *** ** .. *** **********. *. **. ******** RN MPTVDDVLEQVGEFGWFQKQAFLLLCLI SASLAPIYVGIVFLGFTPGHYCQNPGVAELSQ
HP RCWSPAEELNYTVPGLGPAGEA-FLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLG *****. ************* .. ** **. **. *********. *. ***** . ** .. ***. ****
RN RCGWSQAEELNYTVPGLGPSDEASFLSQCMRYEVDWNQSTLDCVDPLSSLVANRSQLPLG
HP PCQDGWVYDTPGSSIVTEFNLVCADSW LDLFQSCLNAGFFFGSLGVGYFADRFGRKLCL ** .. *******************. *. **. ******. * ***. *** *** . ********** RN PCEHGWV? DTPGSSIVTEFNLVCGDAWKVDLFQSCVNLGFFLGSLVVGYIADRFGRI CL
HP I? TVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSGSRRTVAIM * *. ** .. ***** *. *. * **********. ****. * .. ********** ** ***. **.
RN LVTT VTSVSGVLTAVAPDYTSML FRLLQGMVSKGSWVSGYTLITEFVGSGYRRTTAIL HP YQMAFTVGLVALTGLAYA PHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAI **********. *. *. ***. *. ******************** ** ***********. * * #
RN YQMAFTVGLVGLAGVAYAIPDWRWLQLAVSLPTFLFLLYYWFVPESPRWLLSQ RTTRAV HP KIMDHIAQ NGKLPPADIJCMLSIJ_? DVTEiU_SPSFADLFRTPRl_ ^. ** .. *******. ********. ****., ** *********** . ***. * ******* t # *
RN RJMEQIAQKNGKVPPADLKlíLCLEEDASEKRSPSFADLFRT ^
HP LYQGLILHMGATSGNLYLDFLYSA VEIPGAFIALITIDRVGRIYPMAVSNL AGAACLV ******. *. *** .. ****** ** *** *. *** *. **** ***** m * # *** # ***** m
RN LYQGLIMHVGATGANLYLDFFYS SLVEFPAAFIILVTIDRIGRIYPIAASNLVTGAACLL
HP MIFISPDLHWI ^ IIIMCVGRMGITIAIQMICLVNAELYPTFVRNLGVMVCSSLCDIGGII **** ... ***** ... *. **** ** .. **. *********** # **** ** ** t ***. ***.
RN ÍCtFIPHEIJIWLNVTLACLGRMGATIVLQMVCLVNAELYPTFIRNLGMMVCSALCDLGGIF
HP TPFIVFRI? EVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETH DAENLG-RKAKPK ***. **** ***********. **** * ... *************** # # ***** ** f *. *
RN TPFM \ n? RIJdí? RWQALPLILFGVLGLTAGAMTLLLPETKGVALPETIEEAENLGRRKSKAK HP ENTIYLKVQTSEPSGT ******. ***.,. *. *
RN ENTIYLQVQTGKSSST
In addition, the search for GenBank using the base sequence of the present cDNA revealed that there was no human gene and human EST that possessed the homology of 90% or more.
The rat cation transporter has been found as a membrane protein that is related to the excretion of drug in the kidney [Grundermann, D. et al., Nature 372: 549-552 (1994)]. Consequently, the protein of the present invention that is homologous to this transporter is considered to have a similar function and can be used for the diagnosis and treatment of diseases caused by the abnormality of this protein. In addition, since the present protein is considered to be related to drug excretion, the cells in which this protein is expressed can be used as a tool for designing drugs from these drugs. Additionally, since the present protein is expressed primarily in the liver and kidney, a molecule that is prepared to possess an affinity for this protein is applicable to the drug's release system in these tissues.
< HP10013 > (Sequences Nos. 6, 31, 56) The determination of the complete base sequence for the cDNA insert of clone HP10013 obtained from the cDNA libraries of the human epidermoid carcinoma cell line KB revealed the structure consisting of a 5 'non-translation region of 96 bp, an ORF of 1053 bp, and a 3' non-translation region of 884 bp. The ORF codes for a protein consisting of 350 amino acid residues with a signal sequence at the N terminus and an internal transmembrane domain. Figure 9 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein functioned as a signal sequence in the N-terminus from the observation that the urokinase activity was detected in the culture medium, during transduction in the COS7 cells of an expression vector in which a HindIII-Eco065I fragment (treated with mung bean nuclease) containing a cDNA fragment encoding the 65 amino acid residues of the N-terminus in the present protein was inserted into the HindIII-EcoRV site of pSSD3. Therefore, the present protein is considered to be a type I membrane protein. The in vitro translation resulted in the formation of a 39 kDa translation product that was almost consistent with the molecular weight of 39.008 predicted by the ORF.
The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous with any of the known proteins. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there are some ESTs that possess 90% or more homology (eg, Accession No. HO7998), but none of them was shorter than the present cDNA. and did not contain the initiation codon
< HP10034 > (Sequences Nos. 7, 32, 57) The determination of the complete base sequence for the cDNA insert of the HP 10034 clone obtained from the cDNA libraries of the HT-1080 human fibrosarcoma cell line revealed the consistent structure in a 5 'non-translation region of 175 bp, an ORF of 630 bp, and a 3' non-translation region of 106 bp. The ORF codes for a protein consisting of 209 amino acid residues with 4 transmembrane domains. Figure 10 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 21 kDa translation product that was almost consistent with the molecular weight of 22,432 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the human tumor associated antigen L6 (SWISS-PROT, Accession No. P30408). Table 6 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the tumor-associated L6 antigen of human (L6). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 31.8%.
Table 6
HP MVSSPCTQASSRTCSRILGLSLGTAALFAAGANVALLLPNWDVTYLLRGLLGRHAMLGTG *. *. * ** **. *. * L6 'MCYGKCARCIGHSLVGLALLCIAANILLYFPNGETKYASENHLSRFVWFFSG HP LWGGGLMVLTAA-ILISL-MGWRYGCFS - KSGLCRSVLTALLSGGLALLGALICFVTSG. **** .. *. * .. *. *. ** .. * .. * ... * *. * .... L6 IVGGGLL LLPAFVFIGLEQDDCCGCCGHENCGKRCAMLSSVLAALIGIAGSGYCVIVAA HP VAI? _DGPFC »ÍFDVSSFNQTQAWICYGYPFKDLHSRNYLYDRSLWNSVCLEPSAAVVWHVSL .. *. **. *. *. *. * ** * * * .. * **. * *. *** L6 LGLAEGPLCL-D SLGQWNYTFASTE- -GQYLLDTSTWSE-CTEPKHIVE NVSL HP FSALLCISLLQLLLVWHVINSLLGLFCSLCEK ** ** * ... *** .. ** •. * .. * L6 FSILLALGGIEFILCLIQVINGVLGGICGFCCSHQQQYDC
In addition, the search for GenBank using the base sequence of the present cDNA revealed that no human gene existed and Est possessed the homology of 90% or more. The human tumor-associated L6 antigen is a member of the membrane superfamily of TM4 antigen proteins that are abundantly expressed on the cell surface of human tumors [Marken, J. S. et al., Proc Nati. Acad Sci. USA 89: 3503-3507 (1992)]. Since these membrane antigens are specifically expressed in specific cells and in cancer cells, an antibody that is prepared in order to bind to this antigen is applicable for a variety of diagnosis and as a carrier for drug release. In addition, the cells in which said membrane antigen is expressed by transduction of the membrane antigen gene are applicable for the detection of the corresponding ligand.
< HP10050 > (Sequences Nos. 8, 33, 58) The determination of the complete base sequence for the cDNA insert of the HP 10050 clone obtained from the cDNA libraries of the HT-1080 human fibrosarcoma cell line revealed the consistent structure in a 5 'non-translation region of 9 bp, an ORF of 492 bp, and a 3' non-translation region of 100 bp. The ORF codes for a protein consisting of 163 amino acid residues with a transmembrane domain. Figure 11 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 23 kDa translation product that was almost consistent with the molecular weight of 18,364 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any of the known proteins. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there are some ESTs that possessed 90% or more homology (eg, Accession No. HO3117), but many sequences were not distinct and the same ORF as that in the present cDNA was not identified.
< HP10071 > (Sequences Nos. 9, 34, 59) The determination of the complete base sequence for the cDNA insert of the HP 10071 clone obtained from the human stomach cancer cDNA libraries revealed the structure consisting of a non-human region. 5 'translation of 46 bp, an ORF of 279 bp, and a 3' non-translation region of 69 bp. The ORF codes for a protein consisting of 92 amino acid residues with 2 transmembrane domains. Figure 12 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 12 kDa translation product that was almost consistent with the molecular weight of 10,094 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any of the known proteins.
In addition, searching for GenBank using the base sequence of the present protein revealed that there are some ESTs that possessed 90% or more homology (eg, Accession No. R097442), but many sequences were not distinct and the same ORF that the one in the present cDNA was not identified.
< HP10076 > (Sequences Nos. 10, 35, 60) The determination of the complete base sequence for the cDNA insert of the clone
HP 10076 obtained from the cDNA libraries of the U937 human lymphoma cell line revealed the structure consisting of a 5 'non-translation region of 81 bp, an ORF of 519 bp, and a 3' non-translation region of 132 bp. The ORF encoded for a protein consisting of 172 amino acid residues with 2 transmembrane domains. Figure 13 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein remained in the membrane from the observation that the urokinase secretion was not identified during transduction in COS7 cells of an expression vector in which a fragment of HindIII-Eco0651 (previously treated with nuclease mung bean) containing a cDNA fragment encoding the 167 amino acid residues of the N-terminus in the present protein was inserted into the HindIII-EcoRV site of pSSD3. The in vitro translation resulted in the formation of a 24 kDa translation product that was almost consistent with the molecular weight of 18,450 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the baker yeast hypothetical membrane protein of 23.1 kDa (SWISS-PROT Accession No. P34222). Table 7 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the hypothetical 23.1 kDa yeast membrane protein (SC). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 47.5%. in the region of 139 amino acid residues of the termination C.
Table 7
HP HEYLAHPSTLGLAVGVACGMCLGWS
SC MITSFIiffiiaíTVSSNYTIALWATFTAISFAVGYQI? TSNASSTKKSSATLLRSKEMKEGK HP LRVCFGMLPKSKTSKTHTDTESEASILGD- SGEYZMILWRNDLKMGKGKVAAQCSHAAV ... * .. * .. *. *. **. * **. *. ** *. ***. ****. ***. SC LHNBTDEEESESEDESEDEDTSTSLNDIPGEV? MALVIRQDLGMTKGKIAAQCCHAAL HP SAYKQI QR NP__íIJ? Q EYCGQPKVVV APDEEXLIALI ^ HAKM GLT S IQD * ... * .. ** * .. * **. * ... * **. *. . * *. * ** *. * SC SCFRH_ATNPARASYNPIMT RWI_NAGQAKITI_KCPDKFTMDELYAKAISLGVNAAVIHD. HP AGRTQIAPGSQTVLGIGPGPADLID VTGHL LY *******. ** **** ** * ... * .. **. **** SC AGRTQIAAGSATVLGLGPAPKAVLDQITGDLKLY
In addition, searching for GenBank using the base sequence of the present protein revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. T74847), but many sequences were not distinct and the same ORF that the one in the present cDNA was not identified.
< HP10085 > (Sequences Nos. 11, 36, 61) The determination of the complete base sequence for the cDNA insert of the HP 10085 clone obtained from the cDNA libraries of the human lymphoma cell line U937 revealed the structure consisting of a 5 'non-translation region of 150 bp, an ORF of
450 bp, and a 3 'non-translation region of 97 bp. The ORF codes for a protein consisting of 149 amino acid residues with a transmembrane domain at the N-terminus. Figure 14 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein remained in the membrane from the observation that the urokinase secretion was not identified during transduction in COS7 cells of an expression vector in which a fragment of HindlII-EcoRI (after the Klenow treatment ) containing a cDNA fragment encoding the 57 amino acid residues of the N-terminus in the present protein was inserted into the HindIII-EcoRV site of pSSD3. Therefore, the present protein is considered to be a type II membrane protein. The in vitro translation resulted in the formation of a translation product of 20 kDa that was almost consistent with the molecular weight of 17,307 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the early activation human antigen CD69 (SWISS-PROT Accession No. Q07108). Table 8 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the early activation human antigen CD96 (CD). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 36.6%. in the region of 112 amino acid residues of the termination C.
Table 8 HP MMTKHKKCFI
CD MSSENCFVAENSSI ^ PESGQENDATSPHFSTRHEGSFQVPVLCAVM_TyVFITILIIALIA HP IVGVLITTNIIXLIVKLTRDSQSLCPYDWIGFQNKCYYFS EEGDWNSSKYNCSTQHADL. *. *. ** * * CD LSVGQYNCPGO.YTFSMPSDSHVSSCSEDWVGYQRKCYFISTVKRSWTSAQNACSEHGATL HP TIIDNIEEMNFRRYKCSSDHWIGUOMAKNRTGWVDGATFTKSFGMRGSEGCAYLSDDG. **. .. ****. ** ... **. *** .. **. CD AVIDSEKDMNFLKRYAGREEHWVGLK EPGHPWKWSNGKEFNNWFNVTGSDKCVFLKNTE
HP AATARCYTERKWICRKRIH ***. *
CD VSSMECEKNLYWICNKPYK
In addition, searching for GenBank using the base sequence of the present protein revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. H11808), but many sequences were not distinct and the same ORF that the one in the present cDNA was not identified. The early activation human antigen CD69 is a glycoprotein that appears on the surface of activated lymphocytes and a member of the type C lectin superfamily [Hamann, J. et al., J. Immunol. 150: 4920-4927 (1993)]. Since these membrane antigens are expressed specifically in some specific cells, an antibody that is prepared in order to bind to this antigen is applicable to a variety of diagnosis and as a carrier for drug release. In addition, cells in which said membrane antigen is expressed by transduction of the membrane antigen gene are applicable for the detection of the corresponding ligand.
< HP 10122 > (Sequences Nos. 12, 37, 62) The determination of the complete base sequence for the cDNA insert of clone HP10122 obtained from the human stomach cancer cDNA libraries revealed the structure consisting of a non-translational region. 'of 138 bp, an ORF of 567 bp, and a 3' non-translation region of 481 bp. The ORF codes for a protein consisting of 188 amino acid residues with two transmembrane domains. Figure 15 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 22 kDa translation product that was almost consistent with the molecular weight of 21,175 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed the homology of 90% or more (eg, Accession No. T80360), but many sequences were not distinct and the same ORF as that in the present cDNA was not identified.
< HP10136 > (Sequences Nos. 13, 38, 63) The determination of the complete base sequence for the cDNA insert of clone HP10136 obtained from the cDNA libraries of the human lymphoma cell line U937 revealed the structure consisting of a region of no translation 5 'of 81 bp, an ORF of 648 bp, and a non-translation region of 680 bp. The ORF codes for a protein consisting of 215 amino acid residues with a transmembrane domain at the C terminus. Figure 16 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 28 kDa translation product that was almost consistent with the molecular weight of 24,740 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the baker's yeast protein transport protein SLY2 (SWISS-PROT, Accession No. P22214). Table 9 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the transport protein of baker yeast protein SLY2 (SC). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 36.1%. in the complete regions.
Table 9
HP MV LTMIARVADGLP ^ SMQISDEQSGRDLQQYQSqAKQLFRiaLNE SPTRCTLEAGAMT * * * ***** **. * ***. *.
SC MIKSTLIYRE-DGLPLCTSVDNENDPS- LFEQ QKVKIWSRLTPQSATEATLESGSFE HP FHYIIEQGVCYLVLCEAAFP KI? FAYLEDLHSEFDEQHGKKVPTVS-RPYSFIEFDTFI. **. . *. *. *. ** ... * .. ***. **. *. ** *. . *** * .. **. *. SC IHYl ^ SMVY r ICESGY RNl ^ SY iro ^ QE EHSFA ^ HP QKT KLYIDSRARRNLGSINTELQDVQRIMVANIEEVLQRGEALSALDSKANNLSSLSEK *. *** * *, **. . * **. * .. ** *** ._ * **. **
sc QMTKKSYSDHECVQDNLDQLNQELVGVKQ: I ^ HP YRQDAKYLNMRSTYAKLAAVAVFFIMLIVYVRFWWL ** .. *. ***
SC YRKSAQKINFDLLISQYAPI-VIVAFFFVFL-FWWIFLK
In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed the homology of 90% or more (eg, Accession No. R80136), but were shorter in length than the present cDNA and no molecule containing the initiation codon was identified. It is known that the transport protein of baker yeast SLY2 protein is essential for the transport of endoplasmic reticulum protein to Golgi and that it is also associated with cell cycle control [Dascher, C: et al., Mol. Cell. Biol.11: 872-885 (1991)]. Therefore, the cDNA of the present invention can be used for the production of the present protein as well as for the diagnosis and treatment of diseases caused by the abnormality of the present protein.
< HP10175 > (Sequences Nos. 14, 39, 64) The determination of the complete base sequence for the cDNA insert of clone HP10175 obtained from the human stomach cancer cDNA libraries revealed the structure consisting of a non-translational region. 'of 173 bp, an ORF of 339 bp, and a 3' non-translation region of 462 bp. The ORF codes for a protein consisting of 112 amino acid residues with 4 transmembrane domains. Figure 17 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a translation product of 13 kDa that was almost consistent with the molecular weight of 11,564 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. W52852), but many sequences were not distinct and the same ORF as that in the present cDNA was not identified.
< HP10179 > (Sequences Nos. 15, 40, 65) The determination of the complete base sequence for the cDNA insert of clone HP10179 obtained from the cDNA libraries of the human epidermoid carcinoma cell line KB revealed the structure consisting of a 5 'non-translation region of 121 bp, an ORF of 345 bp, and a 3' non-translation region of 459 bp. The ORF codes for a protein consisting of 114 amino acid residues with 4 transmembrane domains. Figure 18 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 14 kDa translation product that was almost consistent with the molecular weight of 12,078 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. However, this protein was analogous to the protein encoded by the HP 10175 cDNA clone of the present invention. Table 10 indicates the comparison of the amino acid sequences between the protein encoded by HP 10179 and the protein encoded by HP 10175. "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 80.8%. in the complete regions.
Table 10
179 MEKPLFPLVPLHWFGFGYTALWSGGIVGYVKTGSVPS AAGLLFGSLAGLGAYQLYQDP. . **********. *** **** **** *********************** ***
175 MQDTGSWPLHWFGFGYAALVASGGIIGYVKAGSVPSLAAGLLFGSLAGLGAYQLS DP 179 RNVWGFIAATSVTFVGVMGMRSYYYGKFMPVGLIAGASLIilAA VGV? Lbl ÍTSD **** ** *** *. . *. **** *, *****. **********. ***** *.
175 RNVWVT'L-ATSGTI_AGIMGMRI? YHSGKFMPAGLIAGASLLMVAKVGVSMFNRPH
In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. N55991), but many sequences were not distinct and the same ORF as that in the present cDNA was not identified.
< HP10196 > (Sequences Nos. 16, 41, 66) The determination of the complete base sequence for the cDNA insert of the clone
HP10196 obtained from the cDNA libraries of the HT-1080 human fibrosarcoma cell line revealed the structure consisting of a 5 'non-translation region of 9 bp, an ORF of 984 bp, and a 3' non-translation region of 122 bp. The ORF codes for a protein consisting of 327 amino acid residues with a transmembrane domain at the N-terminus. Figure 19 describes the profile of hydrophobicity / hydrophilicity of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein remained in the membrane from the observation that the urokinase secretion was not identified during transduction in COS7 cells of an expression vector in which a HindIII-BglII fragment (after Klenow treatment ) containing a cDNA fragment encoding the 162 amino acid residues of the N-terminus in the present protein was inserted into the HindIII-EcoRV site of pSSD3. Thus, the present protein is considered to be a type II membrane protein. The in vitro translation resulted in the formation of a translation product of 37 kDa that was almost consistent with the molecular weight of 36,163 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. TI 7026), but were shorter in length than the present cDNA. and no molecule containing the initiation codon was identified.
< HP10235 > (Sequences Nos. 17, 42, 67) The determination of the complete base sequence for the cDNA insert of clone HP 10235 obtained from the cDNA libraries of the HT-1080 human fibrosarcoma cell line revealed the consistent structure in a 5 'non-translation region of 5 bp, an ORF of 1122 bp, and a 3' non-translation region of 594 bp. The ORF codes for a protein consisting of 373 amino acid residues with 11 transmembrane domains. Figure 20 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation did not reveal the formation of different bands and revealed the formation of fuzzy bands in the high molecular weight position. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was analogous to the human nucleolar protein HNP36 (EMBL Accession No. X86681). Table 11 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the human nucleolar protein HNP36 (NP). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 45.3%. in the complete regions. Table 11
HP MTLCAMLPLLLF YLNSFLH RIPQSVRILGSLVAILLVFLITAILVKVQLDALPFFVIT
HP HIKIVLINSFGAILQGSLFGLAGLLPASYTAPIMSGQGLAGFFASVAMICAIASGSELSE * ... ****. *. ******* *. * .. *., .. *******, ** .. **. ., ***. ..
NP MASVCFINSFSAVLGSLFGQLGTMPSTYSTLFLSG? GLAGIFAAAMLLSMASGVDAET HP SAFGYFITACAVIILTIICYLGLPRLEFYRYYQ LKLEGPGEQE-TKLDLISKGEE- **. ***** .., * ... *. ***. **. *. * ***. *. . ** **. *. .. *.
NP SAI? YFITPYVGIUlSIVCYLSLPHU ^ ARYYLAireSSQAQAQELETKAELLQSDENGIP HP - PRAGKEESGVSV- SNSQPTNESHSIK AILKNISVLAFSVCFIFTITIGMFPA *. »** • • • • • • • • • • • • • • • • • • • • • • • • • NP SSPQKVALTLJ3LDLEKEPESEPDEPQKPGKPSVFTVFKIWLTALCLVLVFTVTLSVFPA HPVTVEVKSSIAGSSTWERYFIPVSCFLTFNIFDWLGRSLTAVFMWPGKDSRWLPSLVLARL. *. *, ** * ... * * .. *** ***. ********. *. ** .. *** ** ** *.
NP ITAMVTSS-TSPGKWS FFNPICCFLLFNIl ^ WI? RSLTSYFLWPDiOSRLLPI VCI? F
HP VFVPLLLLCNII0PIU.YL VVFEHDAWFIFFMAAFAFSNGY1? SLCMCFGPKECV PAEAET. **** .. ** ... *. * ... *. **. ** ** ** ** *****. ** ** .. * .. * * * *, NP LFVPLFMLCHVPQRSRLPILFPQDAYFITFMLLFAVSNGYLVSLTMCLAPRQVLPHEREV HP AGAIMAFFLCLGLAGAVFSFLFRAIV ***. *. *** ** *, **. ****, * .. NP AGALMTFFLALGLSCGASLSFLFKALL In addition, the search for GenBank using the base sequence of the protein of the present invention revealed that there were some ESTs that possessed the homology of ninety percent or more (by example, Accession No. R57372), but it was not possible to assess whether these ESTs with partial sequences encoded for the same protein as the protein of the present invention. The human nucleolar protein HNP36 has been found as a product gene that plays a role in the growth and multiplication of cells [Williams, J. B. & Lanahan, A. A., Biochem. Biophys. Res. Commun. 213: 325-333 (1995)]. As a consequence of this, the protein of the present invention, which is homologous to said protein, is considered as a home protein essential for the growth and multiplication of cells and therefore can be used for the diagnosis and treatment of diseases caused for the abnormality of the protein.
< HP10297 > (Sequences Nos. 18, 43, 68) The determination of the complete base sequence for the cDNA insert of the clone
HP 10297 obtained from human stomach cancer cDNA libraries revealed the structure consisting of a 5 'non-translation region of 62 bp, an ORF of 552 bp, and a 3' non-translation region of 890 bp. The ORF codes for a protein consisting of 183 amino acid residues with a signal sequence at the N terminus and an inner transmembrane domain. Therefore, the present protein is considered to be a type I membrane protein. Figure 21 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 24 kDa translation product that was almost consistent with the molecular weight of 20, 574 predicted by the ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. R47823), but many sequences were not distinct and the same ORF as that in the present cDNA was not identified.
< HP 10299 > (Sequences Nos. 19, 44, 69) The determination of the complete base sequence for the cDNA insert of the HP 10299 clone obtained from the human stomach cancer cDNA libraries revealed the structure consisting of a non-translation region. 5 'of 92 bp, an ORF of 351 bp, and a 3' non-translation region of 89 bp. The ORF codes for a protein consisting of 116 amino acid residues with a transmembrane domain at the N-terminus. Figure 22 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the protein of the present invention remained in the membrane from the observation that the urokinase secretion was not identified during transduction in COS7 cells of an expression vector in which a fragment of HindIII-VspI (after of subjecting it to the Klenow treatment) containing a cDNA fragment encoding the 65 amino acid residues of the N-terminus in the present protein was inserted into the HindlII-PmaCI site of pSSD3. Therefore, the present protein is considered to be a type II membrane protein. The in vitro translation resulted in the formation of a 13 kDa translation product that was almost consistent with the molecular weight of 12,498 predicted by the ORF. Searching the protein database using the amino acid sequence of the protein of the present invention revealed that the protein was analogous to the hypothetical membrane protein of the baker's 16.5 kDa yeast (SWISS-PROT, No. Access P42834). Table 12 illustrates the comparison between the amino acid sequences of the human protein of the present invention (HP) and the hypothetical membrane protein of baker yeast of 16.5 kDa (SC). In Table 12 it should be understood that "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." Represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 53.0%. in the region of 66 amino acid residues of the termination C.
Table 12
HP MASTVVAVGLTIAAAGFAGRYVLQAMKHMEP VKQVF
SC MV PIIIGI? VTMVALSVKSGLNAWTVYKTLSPLTIAKLNNIRIENPTAGYRDALKFKSS HP QSLPKSAFSGGYYRGGFEPKMTKREAALILGVSP TANKGKIRDAHRRIMLLNHPDK *. ***. *. **. . ** *** . *. . . . ** *. **** SC LIDEELKNRLNQYQGGFAPRMTEPEALLILDISAREINHLDEKXLKKKHRKAMVRNHPDR HP GGSPYIAAKINEAKDLLEGQAKK *****. ******** .. ** SC GGSPYMAAKINEAKEVLERSVLLR R
In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. R27748), but many sequences were not distinct and the sequence was not identified. same ORF as that in the present cDNA.
< HP10301 > (Sequences Nos. 20, 45, 70) The determination of the complete base sequence for the cDNA insert of the clone
HP 10301 obtained from the cDNA libraries of the human epidermoid carcinoma KB cell line revealed the structure consisting of a 5 'non-translation region of 91 bp, an ORF of 459 bp, and a 3' non-translation region of 1 12 bp. The ORF codes for a protein consisting of 152 amino acid residues with 4 transmembrane domains. Figure 23 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of an 18 kDa translation product that was almost consistent with the molecular weight of 16,516 predicted from ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. N28828), but many sequences were not distinct and the sequence was not identified. same ORF as that in the present cDNA
< HP 10302 > (Sequences No. 21, 46, 71) The determination of the complete base sequence for the cDNA insert of clone HP 10302 obtained from the human liver cDNA libraries revealed the structure consisting of a non-translational region. 'of 133 bp, an ORF of 1680 bp, and a 3' non-translation region of 560 bp. The ORF codes for a protein consisting of 559 amino acid residues with 12 transmembrane domains. Figure 24 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation did not reveal the formation of different bands and revealed the formation of fuzzy bands in the high molecular weight position. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. N72434), but were shorter in length than that of the present cDNA. and no molecule containing the initiation codon was identified.
< HP10304 > (Sequences Nos. 22, 47, 72) The determination of the complete base sequence for the cDNA insert of clone HP 10304 obtained from the human osteosarcoma cDNA libraries U-2 revealed the structure consisting of a region of no 5 'translation of 10 bp, an ORF of 993 bp, and a 3' non-translation region of 313 bp. The ORF codes for a protein consisting of 330 amino acid residues with a signal sequence at the N terminus and an inner transmembrane domain. Therefore, the present protein is considered to be a type I membrane protein. Figure 25 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a translation product of 36 kDa that was almost consistent with the molecular weight of 36,840 predicted by ORF. The search of the protein database using the amino acid sequence of the present protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed the homology of 90% or more (eg, Accession No. N26840), but the same ORF was not identified as that in the present cDNA.
< HP10305 > (Sequences Nos. 23, 48, 73) The determination of the complete base sequence for the cDNA insert of clone HP 10305 obtained from the human osteosarcoma cDNA libraries U-2 OS revealed the structure consisting of a region of non-5 'translation of 109 bp, an ORF of 327 bp, and a 3' non-translation region of 457 bp. The ORF codes for a protein consisting of 108 amino acid residues with a transmembrane domain. Figure 26 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It was indicated that the present protein remained in the membrane from the observation that the urokinase secretion was not identified during transduction in COS7 cells of an expression vector in which a HindIII-Apal fragment (treated with nuclease of mung bean) containing a cDNA fragment encoding the 162 amino acid residues of the N-terminus in the present protein was inserted into the HindlII-PmaCI site of pSSD3. Therefore, the present protein is considered to be a type II membrane protein. The in vitro translation resulted in the formation of a translation product of 15 kDa that was almost consistent with the molecular weight of 12,199 predicted by the ORF. The search of the protein database using the amino acid sequence of the protein revealed that it was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. H02768), but many sequences were not distinct and the sequence was not identified. same ORF as that in the present cDNA.
< HP10306 > (Sequences Nos. 24, 49, 74) The determination of the complete base sequence for the cDNA insert of the clone
HP10306 obtained from cDNA libraries osteosarcoma U-2 OS human revealed the structure consisting of a region of no translation 5 '229 bp, an ORF of 306 bp, and a region of no translation 3' 155 bp. The ORF codes for a protein consisting of 101 amino acid residues with 2 transmembrane domains. Figure 27 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. The in vitro translation resulted in the formation of a 14 kDa translation product that was almost consistent with the molecular weight of 12,029 predicted from ORF. The search of the protein database using the amino acid sequence of the protein revealed that the protein was not analogous to any known protein. In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. H44711), but many sequences were not distinct and the sequence was not identified. same ORF as that in the present cDNA
< HP10328 > (Sequences Nos. 25, 50, 75) The determination of the complete base sequence for the cDNA insert of the clone
HP 10328 obtained from cDNA libraries cell line KB human epidermoid carcinoma revealed the structure consisting of a region of no translation 5 '117 bp, an ORF of 1119 bp, and a region of no translation 3' 950 bp. The ORF codes for a protein consisting of 372 amino acid residues with a transmembrane domain. Figure 28 describes the hydrophobicity / hydrophilicity profile of the present protein obtained by the Kyte-Doolittle method. It indicated that this protein remained in the membrane from the observation that the secretion of urokinase was not identified during transduction in COS7 cells of an expression vector in which a fragment HindlII- PMACI (treated with nuclease mung bean) containing a cDNA fragment encoding the 129 amino acid residues of the N-terminus in the present protein was inserted into the HindIII-Smal site of pSSD3. Therefore, it is considered that this is a type II membrane protein. The in vitro translation resulted in the formation of a translation product of 41 kDa that was almost consistent with the molecular weight of 42,514 predicted by ORF. The search of the protein database using the amino acid sequence of the protein revealed that it was analogous to the Drosophila neurological secretory signal protein (Gen Bank Accession No. U41449). Table 13 indicates the comparison of the amino acid sequences between the human protein of the present invention (HP) and the neurological secretory signal protein of Drosophila (DM). "-" represents an opening, "*" represents an amino acid residue identical to that in the protein of the present invention, and "." represents an amino acid residue analogous to that in the protein of the present invention. Both proteins have a homology of 38.6%. in the middle region of 202 amino acid residues. Table 13
HP MKyLRHRRPNATLILAIGAFTLLLFSLLVSPPTC VQEQPPAIPEALAWPTPPTRPAPAP
DM MQSKHRKLLLRCLLVLPLILLVDYCGLLTHL
HP CHANTSMVTHPDFATQPQHVQNFLLYRHCRHFPLLQDVPPSKCAQPVFLLLVIKSSPSNY
DM HEUIFERHFHYPIJNDDTGSGSASSGLDKFAYLRVPSFTAEVPVDQPARLTMLIKSAVGNS HP VRRELIiRRTWGRER VRGL LRLLFLVGTASNPHEARKVNRLLELEA THGDII ^ WDFHD ***. ***** * **. **. *** *. . *. . . ****** ** * DM RRREAIRRTWGYEGRFSDVHLRRVFLLGTAEDS - EKDVAW ESREHGDILQADFTD HP SFFNLTLKQVLFLQWQETRCANASFVLNGDDDVFAHTDNMVFYL QDHDPGRHLFVG ** *** * * *** * * *..... **. * DM AYFNNTIJCTMLGMRWASEQFNRSEFYLFVDDDYYVSAia? VLKFLGRGRQSHQPE-LLFAG HP QLIQNVGPIRAFWS YYVPEWTQNERYPPYCGGGGFLLSRFTAAAIJIRAAHVLDIFPID. *. . *. *. ** **. . *. ***. *. *. **. DM-TSPIJUIKFSKWYVSLEEYPFDRWPPYVTAGAFILSQKALRQLYAASVHLPLFRFD HVFQ HP DVFL (^ ^ CLELEGI PASHSGIRTSGVRAPSQHLSSFDPCFYRDLLLVHRFLPYEMLI_M D **. **. DM DVYLGIVALKAGISLQHCDDFRFHRPAY GPDSYS SVIASHEFGDPEEMTRVWNECRSAN HP ALNQPNLTCGNQTQIY
DM YA
In addition, searching for GenBank using the base sequence of the present cDNA revealed that there were some ESTs that possessed 90% or more homology (eg, Accession No. R75815), but were shorter in length than the present cDNA and no molecule containing the initiation codon was identified. The present invention provides human proteins having transmembrane domains, cDNAs encoding said proteins and eukaryotic cells expressing said cDNA. All of the proteins of the present invention are putative proteins that control the proliferation and differentiation of cells, because said proteins exist in the cell membrane. Therefore, the proteins of the present invention can be used as pharmaceutical compounds or as antigens for the preparation of antibodies against said proteins. Additionally, said DNAs can be used for the expression of large amounts of said proteins. Cells that express large amounts of membrane proteins with transfection of the genes of these membrane proteins can be applied for the detection of the corresponding ligands, the screening of novel low molecular weight drugs, and the like. In addition to the activities and uses described above, the polynucleotides and proteins of the present invention may present one or more of the biological uses or activities (including those associated with assays cited herein) that are identified below.
The uses or activities described for the proteins of the present invention may be provided by the administration or use of such proteins or by the administration or use of polynucleotides that encode such proteins (such as, for example, in the therapy of genes or vectors suitable for the introduction of DNA).
Uses in Research and Utilities The polynucleotides provided by the present invention can be used by the research community for various purposes. The polynucleotides can be used to express recombinant protein for analysis, characterization or therapeutic use; as markers for tissues in which the corresponding protein is expressed preferentially (either constitutively or in a particular stage of tissue differentiation or development or in diseased states); as molecular weight markers on Southern gels; as markers or flags of chromosomes (when marked) to identify chromosomes or for mapping of related gene positions; for comparison with endogenous DNA sequences in patients to identify potential genetic disorders; as probes for hybridization and thus discover new related DNA sequences; as a source of information to derive primary PCR for genetic screening; as a probe for "subtracting" known sequences in the process of discovering new polynucleotides; for the selection and labeling of oligomers for binding to a "genetic chip" or other support, including for the examination of expression patterns; to generate anti-protein antibodies using DNA immunization techniques; and as an antigen to generate anti-DNA antibodies or to stimulate another immune response: Where the polynucleotide codes for a protein that binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction) , the polynucleotide can also be used in interaction trap assays (such as, for example, that described in Guris et al., Cell 75: 791-803 (1993)) to identify polynucleotides that encode the other protein with which it occurs the binding or to identify inhibitors of the binding interaction. The proteins provided by the present invention can similarly be used in assays to determine biological activity, including in a panel of multiple proteins for high-throughput screening; to generate antibodies or to produce another immune response; as a reagent (including marker reagent) in assays designed to quantitatively determine levels of the protein (or its receptor) in biological fluids; as markers for tissues in which the corresponding protein is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in a diseased state); and, of course, to isolate correlative receptors or ligands. Where the protein binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the protein can be used to identify the other protein with which the binding occurs or to identify inhibitors of the union interaction. The proteins involved in these binding interactions can also be used to screen peptides or small inhibitors of molecules or agonists of the binding interaction. Any of these research utilities are capable of being developed in reactive grade or in game format (kit) for commercialization as research products. The methods for performing the uses listed above are well known to those skilled in the art. References describing such methods include, without limitation, "Molecular Cloning: A Laboratory Manual", 2nd ed. Cold Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", Academic Press, Berger, S.L. walk. Kimmel eds., 1987.
Nutritional Uses The polynucleotides and proteins of the present invention can also be used as nutritional sources or supplements. Such uses include, without limitation, use as an amino acid or protein supplement, use as a carbon source, use as a nitrogen source and use as a carbohydrate source. In such cases the protein or polynucleotide of the invention can be added to the feed of a particular organism or it can be administered as a separate solid or liquid preparation, such as in the form of a powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the protein or polynucleotide of the invention can be added to the medium in or on which the microorganism is cultured.
Cytokine and Proliferation Activity / Cell Differentiation A protein of the present invention can exhibit cell proliferation, cytokine (either induction or inhibition) or cell differentiation (either induction or inhibition) activity, or it can induce production of other cytokines in certain cell populations. Many protein factors discovered to date, including all known cytokines, have exhibited activity in one or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient confirmation of cytokine activity. The activity of a protein of the present invention is evidenced by one of a number of routine factor-dependent cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, UNCLE, B9, B9 / 11, BaF3 , MC9 / G, M + (preB M +), 2E8, RB5, DA1, 123, TI 165, HT2, CTLL2, TF-1, Mo7e and CMK. The activity of the protein of the invention may, among other means, be measured by the following methods: Assays for proliferation of thymocytes or T cells include, without limitation, those described in Current Protocols in Immunology, Ed. By J.E. Coligan, A.M. Kruisbeek, D.H. Marguiles, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley Interscience (Chapter 3, In Vitro Assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic Studies in Humans); Takai et al., J. Immunol. 137: 3494-3500, 1986; Bertagnolli et al., J. Immunol. 145: 1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133: 327-341, 1991; Bertagnolli, et al., J. Immunol. 149: 3778-3783, 1992; Bowman et al., J. Immunol. 152: 1756-1761, 1994. Assays for determining cytokine production and / or proliferation of spleen cells, lymph node cells or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, Kruisbeek, A.M. and Shevach. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol. 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto, 1994; and Measurements of mouse and human Interferon ?, Schreiber, R.D. In Current Protocols in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto, 1994. Trials to determine the proliferation and differentiation of hematopoietic and lymphopoietic cells include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, LS and Lipsky, P.E. In Current Protocols in Immunology. J.E.e.a Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173: 1205-1211, 1991; Moreau et al., Nature 336: 690-692, 1988; Greenberger et al., Proc. Nati Acad. Sci. U.S.A. 80: 2931-2938, 1983; Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in Immunology. J.E.e.a Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Nati Acad. Sci. U.S.A. 83: 1857-1861, 1986; Measurement of human Interleukin 11 - Bennett, F. Giannotti, J. Clark, S.C. and Turner, K.J. In Current Protocols in Immunology. J.E.e.a Coligan eds. Vol 1 pp. 6.15.1, John Wiley and Sons, Toronto. 1991. Measurement of mouse and human Interleukin 9 - Ciarletta, A. Giannotti, J., Clark, S.C. and Turner, K.J. In Current Protocols in Immunology. J.E.e.a Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. Assays to determine the responses of T cell clones to antigens (which will identify, among others, proteins that affect the interactions of APC-T cells as well as the effects of direct T cells by measuring proliferation and cytokine production) include , without limitation, those described in: Current Protocols in Immunology, Ed by JEea Coligan, AM Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro Assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Nati Acad. Sci. U.S.A. 77: 6091-6095, 1980; Weinberger et al., Eur. J. Immun. 1 1: 405-41 1, 1981; Takai et al., J. Immunol. 137: 3494-3500, 1986; Takei et al, J. Immunol. 140: 508-512, 1988.
Immunological Stimulation or Suppression Activity A protein of the present invention may also exhibit immunological stimulation or immune suppression activity, including without limitation the activities for which assays have been described herein. A protein may be useful in the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SCID)), for example, in the regulation (up or down) of the growth and proliferation of T and / or B lymphocytes, as well as carrying out the cytolytic activity of NK cells and other cell populations. These immunological deficiencies can be genetic or viral (eg, HIV) as well as bacterial or fungal infections, or they can result from autoimmune disorders. More specifically, infectious diseases caused by viruses, bacteria, fungi or other infection can be treated using a protein of the present invention, including HIV infections, hepatitis viruses, herpes viruses,
_ mycobacteria, lesmania spp, malaria spp and various fungal infections such as candidiasis. Of course, in this regard, a protein of the present invention may also be useful where it is generally indicated to strengthen the immune system, for example, in the treatment of cancer. Autoimmune disorders that can be treated using a protein of the present invention include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythromatosis, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, autoimmune thyroiditis, diabetes mellitus dependent of insulin, myasthenia gravis, graft versus host disease and eye inflammation disease by autoimmunity. Said protein of the present invention may also be useful in the treatment of reactions and allergic conditions, such as asthma (particularly allergic asthma) and other respiratory problems. Other conditions, in which immunological suppression is desired (including, for example, organ transplantation) can also be treated using a protein of the present invention. The use of the proteins of the invention may also be possible for immune responses, in a number of ways. The hyporegulation may be in the form of inhibition or blocking of an immune response already in progress, or may involve the prevention of the induction of an immune response. The functions of activated T cells can be inhibited by suppressing T cell responses or by inducing specific tolerance in T cells, or both. The immunosuppression of T cell responses is generally an active, not antigen-specific process, which requires continuous exposure of the T cells to the suppressive agent. Tolerance, which involves the induction of non-response or anergy in T cells, is susceptible to distinction from immunosuppression because it is generally antigen-specific and persists after the exposure to the tolerance-creating agent has ended. Operationally, tolerance can be demonstrated by the lack of a T cell response under re-exposure to a specific antigen in the absence of the tolerance creating agent. The hyporegulation or prevention of one or more antigenic functions (including without limitation functions of B lymphocyte antigen (such as, for example, B7)), for example, preventing the synthesis of high level lymphokine by activated T cells, will be useful in situations of tissue, skin and organ transplants and in graft versus host disease (GVHD). For example, blocking T cell function should result in reduced destruction of tissue in tissue transplants. Typically, in tissue transplants, rejection of the transplant is initiated through recognition as foreign by the T cells, followed by an immune reaction that destroys the transplant. The administration of a molecule that inhibits or blocks the interaction of a B7 lymphocyte antigen with its natural ligand (s) (s) in immune cells (such as a soluble monomeric form of a peptide having B7-2 activity alone or in conjunction with a monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., B7-1, B7-3) or blocking antibody), before transplantation can lead to the binding of the molecule to the natural ligand (s) in the immune cells without the transmission of the corresponding signal. -stimulating. Blocking the antigen function of the B lymphocyte in this case prevents the synthesis of cytokine by immune cells, such as T cells, and thus acts as an immunosuppressant. Moreover, the lack of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject. The induction of long-term tolerance by B lymphocyte antigen blocking reagents can avoid the need for repeated administration of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the function of a combination of B lymphocyte antigens. The effectiveness of a particular blocking reagent in the prevention of a rejection of organ transplantation or GVHD can be evaluated using animal models that are capable of predicting efficacy in humans. Examples of suitable systems that can be used include allogeneic cardiac grafts in rats and islet cell grafts, xenogenic pancreatic grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al. al., Science 257: 789-792 (1992) and Turka et al., Proc. Nati Acad. Sci. USA, 89: 11102-1 1105 (1992). In addition, murine models of GVHD (see Paul de., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the blocking effect of the B cell antigen function in vivo in the development of The blocking of the antigen function may also be therapeutically useful for the treatment of autoimmune diseases.Many autoimmune disorders are the result of the inappropriate activation of T cells that are reactive against the same tissue and which promotes the production of cytokines. auto-antibodies involved in the pathology of the diseases.The prevention of the activation of autoreactive T cells can reduce or eliminate symptoms of the disease.The administration of reagents that block the co-stimulation of T cells breaking the interactions receptorpligando of the antigens of B lymphocytes , can be used to inhibit the activation of T cells and prevent the production of autoantibody pos or cytokines derived from T cells that may be involved in the disease process. Additionally, blocking reagents can induce antigen-specific tolerance of autoreactive T cells that could lead to long-term relief of the disease. The efficacy of blocking reagents in the prevention or amelioration of autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune diseases. Examples include experimental murine autoimmune encephalitis, systemic lupus erythomatosis in MRL / lpr / lpr mice or NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul of., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856). The overregulation of an antigen function (preferably a B lymphocyte antigen function), as a means of over-regulated immune response, may also be useful in therapies. Overregulation of immune responses may be in the form of improving an existing immune response or eliciting an initial immune response. For example, improving an immune response through the stimulation of the B-cell antigen function may be useful in cases of viral infection. In addition, systemic viral diseases such as influenza, the common cold and encephalitis could be alleviated by the systemic administration of stimulant forms of B lymphocyte antigens.
Alternatively, immune anti-viral responses can be improved in an infected patient by removing T cells from the patient, by co-stimulating T cells in vitro with APCs pulsed with viral antigen either by expressing a peptide of the present invention or in conjunction with a stimulant of a soluble peptide of the present invention and reintroducing the T cells activated in vitro within the patient. Another method for improving anti-viral immune responses would be to isolate infected cells from a patient, transfecting them with a nucleic acid encoding a protein of the present invention as described herein, so that the cells express all or a portion of the protein on its surface, and reintroduce the transfected cells into the patient. The infected cells would now be able to deliver a co-stimulatory signal to, and thus activate, the T cells in vivo. In another application, upregulation or amelioration of the antigen function (preferably B lymphocyte antigen function) could be useful in the induction of tumor immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, carcinoma) transfected with a nucleic acid encoding at least one peptide of the present invention can be administered to a subject to overcome the specific tolerance to the tumor in the subject. If desired, the tumor cells can be transfected to express a combination of peptides. For example, tumor cells obtained from a patient can be transfected ex vivo with an expression vector that directs the expression of a peptide having similar activity to B7-2, alone or in combination with a peptide having activity similar to B7. -1 and / or activity similar to B7-3. The transfected tumor cells are returned to the patient to result in the expression of the peptides on the surface of the transfected cells. Alternatively, gene therapy techniques can be used to target a tumor cell for transfection in vivo. The presence of the peptide of the present invention, which has the activity of a B lymphocyte antigen (s) on the surface of the tumor cells, provides the necessary co-stimulation signal for the T cells to induce a immune response mediated by T cells against transfected tumor cells. In addition, tumor cells lacking MHC class I or MHC class II molecules, or that do not reexpress sufficient amounts of MHC class I or MHC class II molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a truncated portion of the cytoplasmic domain) of an MHC class I a chain protein and β2 microglobulin protein or an MHC class II a chain protein and a MHC class II β chain protein to express this forms MHC class I or MHC class II proteins on the surface of the cell. Expression of the appropriate MHC class I or classes II, in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a cell-mediated immune response T against the transfected tumor cell. Optionally, a gene encoding an antisense construct that blocks the expression of an associated MHC class II protein, such as the constant chain, can also be cotransfected with a DNA encoding a peptide that has the activity of B lymphocyte antigen to promote the presentation of antigens associated with tumor and induce tumor-specific immunity. Thus, the induction of an immune response mediated by T cells in a human subject may be sufficient to overcome tumor-specific tolerance in the subject. The activity of a protein of the invention can also, among other means, be measured by the following methods: Suitable assays for determining cytotoxicity of splenocytes or thymocytes include, without limitation, those described in Current Protocols in Immunology, Edited by J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro Assays for Mouse Lymphocyte Function 3.1.-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Nati Acad. Sci. USA 78: 2488-2492, 1981; Herrmann et al., J. Immunol. 128: 1968-1974, 1982; Handa et al., J. Immunol. 135-1564-1572, 1985; Takai et al., J. Immunol. 137: 3494-3500, 1986; Takai et al., J. Immunol. 140: 508-512. 1988; Hermann et al., Proc. Nati Acad. Sci. USA 78: 2488-2492, 1981; Herrmann et al., J. Immunol. 128: 1968-1974, 1982; Handa et al., J. Immunol. 135: 1564-1572, 1985; Takai et al., J. Immunol. 137: 3494-3500, 1986; Bowman et al., J. Virology 61: 1992-1998; Takai et al., J. Immunol. 140: 508-512, 1988; Bertagnolli et al., Cellular Immunology 133: 327-341, 1991; Brown et al., J. Immunol. 153: 3079-3092, 1994.
Assays to determine T cell-dependent immunoglobulin responses and isotype switching (which will identify, among others, proteins that modulate T cell-dependent antibody responses and that affect Thl / Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 144: 3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J.J. and Brunswick, M. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. Mixed lymphocyte reaction (MLR) tests (which will identify, among others, proteins that predominantly generate Thl and CTL responses) include, without limitation, those described in Current Protocols in Immunology, Edited by J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro Assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic Studies in Humans); Takai et al., J. Immunol. 137: 3494-3500, 1986; Takai et al., J. Immunol. 140: 508-512, 1988; Bertagnolli et al., J. Immunol. 149: 3778-3783, 1992. Dendritic cell-dependent assays (which will identify, among others, proteins expressed by dendritic cells that activate natural T cells) include, without limitation, those described in: Guery et al., J. Immunol. 134: 536-544, 1995; Inaba et al., Journal of Experimental Medicine 173: 549-559, 1991; Macatonia et al., Journal of Immunology 154: 5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182: 255-260, 1995; Nair et al., Journal of Virology 67: 4062-4069, 1993; Huang et al., Science 264: 961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169: 1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94: 797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172: 631-640, 1990. Tests to determine the survival / apoptosis of lymphocytes (which will identify, among others, the proteins that prevent apoptosis after superantigen induction and proteins). regulating lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13: 795-808, 1992; Gorczyca et al., Leukemia 7: 659-670, 1993 Gorczyca et al, Cancer Research 53: 1945-1951, 1993; Itoh et al., Cell 66: 233-243, 1991; Zacharchuk, Journal of Immunology 145: 4037-4045, 1990; Zamai et al., Cytometry 14: 891-897, 1993; Gorczyca et al., International Journal of Oncology 1: 639-648, 1992. Tests for proteins that influence early stages of T cell development and confinement include, without limitation, those described in Antica et al., Blood 84; 111 -117, 1994; Fine et al., Cellular Immunology 155: 111-122, 1994; Galy et al., Blood 85: 2770-2778, 1995; Toki et al., Proc. Nat. Acad. Sci. USA 88: 7548-7551, 1991.
Hematopoiesis Regulation Activity A protein of the present invention may be useful in the regulation of hematopoiesis and, consequently, in the treatment of deficiencies of myeloid or lymphoid cells. Even a marginal biological activity in support of colony-forming cells or factor-dependent cell lines indicates involvement in the regulation of hematopoiesis, that is, in supporting the growth and proliferation of erythroid progenitor cells alone or in combination with other cytokines. , indicating utility in this way, for example, in the treatment of various anemias or to be used in conjunction with irradiation / chemotherapy to stimulate the production of erythroid precursors and / or erythroid cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and monocytes / macrophages (ie, traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat consequent myelosuppression; in supporting the growth and proliferation of megakaryocytes and consequently of platelets, thereby allowing the prevention or treatment of various platelet disorders such as thrombocytopenia, and in general to be used in place or in a complementary manner to platelet transfusions; and / or in sustaining the growth and proliferation of hematopoietic progenitor cells, which are capable of maturing each and every one of the aforementioned hematopoietic cells and therefore find therapeutic utility in various disorders of progenitor cells (such as those usually treated with transplants, including, without limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria) as well as in the repopulation of the division of progenitor cells that results from irradiation / chemotherapy, either in vivo or ex vivo (that is, in conjunction with transplantation of bone marrow or with the transplantation of peripheral progenitor cells (homologous or heterologous)) as normal cells or genetically engineered by gene therapy. The activity of a protein of the invention can, among other means, be measured by the following methods: Appropriate tests to determine the proliferation and differentiation of several hematopoietic lines have been cited above. Tests for the differentiation of embryonic progenitor cells (which will identify, among others, proteins that influence the hematopoiesis of embryonic differentiation) include, without limitation, those described in Johansson et al. Cellular Biology 15: 141-151, 1995; Keller et al., Molecular and Cellular Biology 13: 473-486, 1993; McClanahan et al., Blood 81: 2903-2915, 1993. Tests to determine the survival and differentiation of progenitor cells (which will identify, among others, proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: Methylcellulose colony forming assays, Freshney, MG In Culture of Hematopoietic Cells, R.I. Freshney, et al. Vol pp 265-268, Wiley-Liss, Inc., New York, NY 1994; Hirayama et al., Proc Nati. Acad. Sci. USA 89: 5907-5911, 1992; Primitive hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc. New York, NY 1994; Neben et al., Experimental Hematology 22: 353-359, 1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic Cells. R.I. Freshney et al, eds. Vol pp. 1-21, Wiley-Liss, Inc. New York, NY 1994; Long term bone cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Alien, T. In Culture of Hematopoietic Cells. R.I. Freshney et al., Eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, NY 1994; Long term culture initiating cell assay, Sutherland, H.J. In Culture of Hematopietic Cells, R.I. Freshney et al., Eds. Vol pp. 139-162, Wiley-Liss, Inc. New York, NY. 1994
Tissue Growth Activity A protein of the present invention may also have utility in compositions used for the growth or regeneration of bone, cartilage, tendon, ligament and / or nerve tissue, as well as for healing wounds and repair or replacement of tissue, and in the treatment of burns, incisions and ulcers. A protein of the present invention, which induces the growth of cartilage and / or bone in circumstances in which there is normally no bone formed, has application in the healing of bone fractures and damages or defects to cartilage in humans and other animals. Said preparation using a protein of the invention can have prophylactic use in the reduction of both closed and open fractures and also in the improved fixation of artificial joints. The formation of new bone induced by osteogenic agents contributes to the repair of congenital craniofacial defects, induced by trauma, or induced by oncological resection, and is also useful in cosmetic plastic surgery. A protein of this invention may also be useful in the treatment of periodental disease and other teeth repair processes. Such agents can provide an environment for attracting bone-forming cells, stimulating the growth of bone-forming cells or inducing the differentiation of progenitors of bone-forming cells. A protein of the invention may also be useful in the treatment of osteoporosis or osteoarthritis, such as through the stimulation of bone and / or cartilage repair or by blocking inflammation or tissue destruction processes (collagenase activity, osteoclastic activity, etc.) mediated by inflammatory processes. Another category of tissue regeneration activity that may be attributable to the protein of the present invention is tendon / ligament formation. A protein of the present invention, which induces tissue similar to tendon / ligament or other tissue formation in circumstances in which said tissue is not normally formed, has application in the healing of tendon or ligament tears, deformities and other defects of tendon or ligament in humans and other animals. Such preparation employing a tissue-inducing protein similar to ligament tendon may have prophylactic use in the prevention of damage to tendon or ligament tissue, as well as in the use for improved fixation of tendon or ligament to bone or other tissues, and in repair of tendon or ligament tissue defects. The formation of new tendon / ligament tissue, induced by a composition of the present invention contributes to the repair of congenital tendon or ligament defects, induced by trauma or of other origin, and is also useful in cosmetic plastic surgery for fixation or repair of tendons or ligaments. The compositions of the present invention can provide an environment for attracting tendon or ligament forming cells, stimulating the growth of tendon or ligament forming cells, inducing differentiation of progenitors of tendon or ligament forming cells, or inducing cell growth or progenitors of ex vivo ligament tendon for in vivo return to effect tissue repair. The compositions of the invention may also be useful in the treatment of tendonitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions may also include an appropriate matrix and / or a sequestering agent as a carrier as is well known in the art. The protein of the present invention can also be useful for the proliferation of nerve cells and for the regeneration of nerve and brain tissue, that is, for the treatment of diseases and neuropathies of the central or peripheral nervous system, as well as of mechanical disrs and traumatic events that involve degeneration, death or injury to nerve cells or nerve tissue. More specifically, a protein can be used in the treatment of diseases of the peripheral nervous system, such as damage to the peripheral nerve, peripheral neuropathy and localized neuropathies, and diseases of the central nervous system, such as Alzheimer's disease, Parkinson's, Huntington's, amyotrophic lateral sclerosis and Shy-Drager syndrome. Additional conditions that can be treated in accordance with the present invention include mechanical and traumatic disorders, such as spinal cord disorders, head injuries and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies can also be treatable using a protein of the present invention. The proteins of the invention may also be useful in promoting the best and fastest closure of unhealed wounds, including without limitation, pressure ulcers, ulcers associated with vascular insufficiency, trauma and surgical wounds, and the like. It is expected that a protein of the present invention may also exhibit activity for the generation or regeneration of other tissues such as tissues of organs (including, for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle (soft, skeletal or cardiac) and vascular (including vascular endothelium), or to promote the growth of cells comprising said tissues. Part of the desired effects may be by inhibiting or modulating fibrotic healing to allow normal tissue to regenerate. A protein of the invention could also exhibit angiogenic activity. A protein of the present invention may also be useful for the protection or regeneration of the intestine and for the treatment of lung or liver fibrosis, reperfusion injury in various tissues and conditions resulting from systemic cytokine damage. A protein of the present invention may also be useful for promoting or inhibiting the differentiation of previously described tissues from precursor cells or tissues.; or to inhibit the growth of the tissues described above. The activity of a protein of the invention can, among other means, be measured by the following methods: Assays for determining tissue generation activity include, without limitation, those described in International Patent Publication No. WO95 / 16035 ( bone, cartilage, tendon); International Patent Publication No. WO95 / 05846 (nerve, nerve); International Patent Publication No. WO91 / 07491 (skin, endothelium). Assays for determining wound healing activity include, without limitation, those described in: Winter, Epidemial Wound Healing, pps. 71-112 (Maibach, Hl and Rovee, DT, eds.) Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol. 71: 382-84 (1978).
Activity of Activin / Inhibin A protein of the present invention may also exhibit activities related to activin or inhibin. Inhibins are characterized by their ability to inhibit the release of follicle stimulating hormone (FSH), while activins are characterized by their ability to stimulate the release of follicle stimulating hormone (FSH). Therefore, a protein of the present invention, alone or in heterodimers with a member of the alpha family of inhibins, may be useful as a contraceptive based on the ability to inhibit or decrease fertility in female mammals and decrease spermatogenesis in mammals males The administration of sufficient amounts of other inhibins can induce infertility in these mammals. Alternatively, the protein of the invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin beta group, may be useful as a fertility-inducing therapeutic, based on the ability of the activin molecules in the stimulation of the release of FSH from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A protein of the present invention may also be useful for advancing the onset of fertility in sexually immature mammals, for the purpose of increasing the productive performance cycle of domestic animals such as cows, sheep and pigs. The activity of the protein of the invention can, among other means, be measured by the following methods: Assays for determining activin / inhibin activity include, without limitation, those described in Vale et al., Endocrinology 91: 562-572 , 1972; Ling et al., Nature 321: 779-782, 1986; Vale et al., Nature 321: 776-779, 1986; Mason et al., Nature, 318: 659-663, 1985; Forage et al., Proc. Nati Acad. Sci. USA 83: 3091-3095, 1986.
Ouimyotactic / Ouimiokinetic Activity A protein of the present invention may have chemokinetic or chemotactic activity (eg, act as a chemokine) for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T cells, mast cells, eosinophilic, epithelial and / or endothelial. The chemokinetic and chemotactic proteins can be used to mobilize or attract a population of desired cells to a desired site of action. Chemotactic or chemokinetic proteins provide particular advantages in the treatment of wounds and other tissue injuries, as well as in the treatment of localized infections. For example, the attraction of lymphocytes, monocytes or neutrophils to tumors or sites of infection can result in improved immune responses against the tumor or infectious agent. A protein or peptide has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, the targeting or directed movement of said cell population. Preferably, the protein or peptide has the ability to directly stimulate the directed movement of cells. The fact that a particular protein has chemotactic activity for a population of cells can easily be determined by employing said protein or peptide in any known assay for cellular chemotaxis. The activity of a protein of the invention can, among other means, be measured by the following methods: Tests to determine chemotactic activity (which will identify proteins that induce or prevent chemotaxis) consist of assays that measure the ability of a protein to induce the migration of cells through a membrane as well as the ability of a protein to induce the adhesion of a population of cells to another population of cells. Appropriate assays for determining movement and adhesion include, without limitation, those described in Current Protocols in Immunology, Edited by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Published by Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and Beta Chemokines 6.12.1-6.12.28; Taub et al., J. Clin. Invest. 95: 1370-1376, 1995; Lind et al., APMIS 103: 140-146, 1995; Muller et al., Eur J. Immunol., 25; 1744-1748; Gruber et al., J. Of Immunol., 152: 5860-5867, 1994; Johnson et al., J. of Immunol., 153: 1762-1768, 1994.
Hemostatic and Thrombolytic Activity A protein of the invention may also exhibit haemostatic or thrombolytic activity. As a result of this, it is expected that said protein be useful in the treatment of various coagulation disorders (including hereditary disorders, such as hemophilia) or to improve coagulation and other hemostatic events in the treatment of wounds resulting from trauma, surgeries or other causes A protein of the invention may also be useful for dissolving or inhibiting the formation of thrombosis and for the treatment and prevention of conditions resulting therefrom (such as, for example, infarction of central or cardiac nervous system vessels (eg, attack). The activity of a protein of the present invention can, among other means, be measured by the following methods: Assays for determining haemostatic and thrombolytic activity include, without limitation, those described in: Linet et al., J. Clin. Pharmacol, 26: 131-140, 1986, Burdick et al., Thrombosis Res. 45: 413-419, 1987, Humphrey et al, Fibrinolysis 5: 71-79 (1991), Schaub, Prostaglandins 35: 467-474, 1988. .
Receptor / Ligand Activity A protein of the present invention can also demonstrate activity as a receptor, ligand of receptors or inhibitors or agonists of receptor / ligand interactions. Examples of such receptors and ligands include, without limitation, cytokine receptors and their ligands, kinase receptors and their ligands, phosphatase receptors and their ligands, receptors involved in cell-cell interactions and their ligands (including, without limitation, cell adhesion molecules (such as selectins, integrins and their ligands) and receptor / ligand pairs involved in antigen presentation, antigen recognition and development of cellular and humoral immune responses). The receptors and ligands are also useful for the screening of potential peptide or small molecule inhibitors of the relevant receptor / ligand interaction. A protein of the present invention (including, without limitation, fragments of receptors and ligands) may itself be useful as an inhibitor of receptor / ligand interactions. The activity of a protein of the present invention can, among other means, be measured by the following methods: Appropriate assays for determining receptor-ligand activity include, without limitation those described in: Current Protocols in Immunology, edited by J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 7.28, Measurements of Cellular Adhesion under static conditions 7.28.1-7.28.22), Takai et al., Proc. Nati Acad. Sci. USA 84: 6864-6868, 1987; Bierer et al., J. Exp. Med. 168: 1145-1156, 1988; Rosenstein et al., J. Exp. Med 169: 149-160 1989; Stoltenborg et al., J. Immunol. Methods 175: 59-68, 1994; Stitt et al., Cell 80: 661-670, 1995.
Anti-Inflammatory Activity The proteins of the present invention may also exhibit anti-inflammatory activity. The anti-inflammatory activity can be achieved by providing a stimulus to the cells involved in the inflammatory response, inhibiting or promoting cell-cell interactions (such as, for example, cell adhesion), inhibiting or promoting the chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cellular extravasation, or stimulating or suppressing the production of other factors that directly inhibit or promote an inflammatory response. Proteins exhibiting such activities can be used to treat inflammatory conditions (including chronic or acute conditions) including, without limitation, inflammation associated with infections (such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)), damage by ischemia-reperfusion, mortality by endotoxin, arthritis, hyperacute rejection mediated by complement, nephritis, lung damage induced by cytokine or chemokine, inflammatory bowel disease, Crohn's disease or resulting from overproduction of cytokines such as TNF or IL-1 The proteins of the invention may also be useful for treating anaphylaxis and hypersensitivity to a substance or antigen material.
Tumor Inhibiting Activity In addition to the activities described above for immunological treatment or tumor prevention, a protein of the present invention may exhibit other anti-tumor activities. A protein can inhibit the growth of tumors directly or indirectly (such as, for example, via ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or tumor precursor tissue, inhibiting the formation of tissues necessary to sustain tumor growth (such as, for example, inhibiting angiogenesis), causing production of other factors, agents or cell types that inhibit tumor growth, or by suppressing, eliminating or inhibiting factors, agents or cell types that promote tumor growth.
Other Activities A protein of the invention may also exhibit one or more of the following activities or effects: inhibit the growth, infection or function of, or kill, infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; affect (suppression or improvement) bodily characteristics, including, without limitation, height, weight, hair color, eye color, skin, fat / flesh or pigment ratio of other tissues, or size or partial body shape (such as, for example, example, increase or decrease of the chest, changes in the shape or configuration of bones); affect biorhythms or cycles or circadian rhythms; affect the fertility of male or female subjects, affect the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of fat, lipids, proteins, carbohydrates, vitamins, minerals, cofactors or other factors or nutritional components of the diet; affect behavioral characteristics, including, without limitation, appetite, libido, stress, perception (including disorders of perception), depression (including depressive disorders) and violent behaviors; provision of analgesic effects or other pain-reducing effects; promotion of the differentiation and growth of embryonic progenitor cells in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correct deficiencies of enzymes and the treatment of diseases related to deficiencies; treatment of hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for example, the ability to bind to antigens or complements); and the ability to act as an antigen in a vaccine composition to elevate the immune response against said protein or other material or entity that is reactive in some way with said protein.
LIST OF SEQUENCES
(2) INFORMATION FOR SEQ ID NO: 1: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 205 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Ho or sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP00442 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 1:
Met Thr Gly Leu Wing Leu Leu Tyr Ser Gly Val Phe Val Wing Phe Trp 1 5 10 15
Ala Cys Ala Leu Ala Val Gly Val Cys Tyr Thr He Phe Asp Leu Gly 20 25 30 Phe Arg Phe Asp Val Wing Trp Phe Leu Thr Glu Thr Ser Pro Phe Met 35 40 45 Trp Ser Asn Leu Gly He Gly Leu Ala He Ser Leu Ser Val Val Gly 50 55 60 Wing Wing Trp Gly He Tyr He Thr Gly Be Ser He He Gly Gly Gly 65 70 75 80
Val Lys Ala Pro Arg He Lys Thr Lys Asn Leu Val Ser He He Phe 85 90 95
Cys Glu Ala Wing Wing He Tyr Gly He He Met Wing He Val He Ser 100 105 110 Asn Met Wing Glu Pro Phe Ser Wing Thr Asp Pro Lys Wing He Gly His 115 120 125 Arg Asn Tyr His Wing Gly Tyr Ser Met Phe Gly Wing Gly Leu Thr Val 130 135 140 Gly Leu Ser Asn Leu Phe Cys Gly Val Cys Val Gly He Val Gly Ser 145 150 155 160
Gly Wing Wing Leu Wing Asp Wing Gln Asn Pro Being Leu Phe Val Lys He 165 170 175
Leu He Val Glu He Phe Gly Be Wing He Gly Leu Phe Gly Val He 180 185 190 Val Ala He Leu Gln Thr Ser Arg Val Lys Met Gly Asp 195 200 205
(2) INFORMATION FOR SEQ ID NO: 2: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 371 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Leukocyte (D) CLONE NAME: HP00804 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 2:
Met Ser His Glu Lys Ser Phe Leu Val Ser Gly Asp Asn Tyr Pro Pro 1 5 10 15
Pro Asn Pro Gly Tyr Pro Gly Gly Pro Gln Pro Pro Met Pro Pro Tyr 20 25 30 Wing Gln Pro Pro Tyr Pro Gly Wing Pro Tyr Pro Gln Pro Pro Phe Gln 35 40 Pro Pro Pro Tyr Gly Gln Pro Gly Tyr Pro His Gly Pro Ser Pro Tyr 50 55 60 Pro Gln Gly Gly Tyr Pro Gln Gly Pro Tyr Pro Gln Gly Gly Tyr Pro 65 70 75 80
Gln Gly Pro Tyr Pro Gln Glu Gly Tyr Pro Gln Gly Pro Tyr Pro Gln 85 90 95
Gly Gly Tyr Pro Gln Gly Pro Tyr Pro Gln Ser Pro Phe Pro Pro Asn 100 105 110 Pro Tyr Gly Gln Pro Gln Val Phe Pro Gly Gln Asp Pro Asp Ser Pro 115 120 125 Gln His Gly Asn Tyr Gln Glu Glu Pro Pro Ser Tyr Tyr Asp Asn 130 135 140 Gln Asp Phe Pro Wing Thr Asn Trp Asp Asp Lys Ser He Arg Glu Wing 145 150 155 160
Phe He Arg Lys Val Phe Leu Val Leu Thr Leu Gln Leu Ser Val Thr 165 170 175
Leu Ser Thr Val Ser Val Phe Thr Phe Val Wing Glu Val Lys Gly Phe 180 185 190 Val Arg Glu Asn Val Trp Thr Tyr Tyr Val Ser Tyr Wing Val Phe Phe 195 200 205 He Ser Leu He Val Leu Ser Cys Cys Gly Asp Phe Arg Arg Lys His 210 210 220 Pro Trp Asn Leu Val Ala Leu Ser Val Leu Thr Ala Ser Leu Ser Tyr 225 230 235 240
Met Val Gly Met He Wing Being Phe Tyr Asn Thr Glu Wing Val He Met 245 250 255 Wing Val Gly He Thr Thr Wing Val Cys Phe Thr Val Val He Phe Ser 260 265 270 Met Gln Thr Arg Tyr Asp Phe Thr Ser Cys Met Gly Val Leu Leu Val 275 280 285 Ser Val Val Leu Phe He Phe Ala He Leu Cys He Phe He Arg 290 295 300 Asn Arg He Leu Glu He Val Tyr Ala Ser Leu Gly Ala Leu Leu Phe 305 310 315 320
Thr Cys Phe Leu Wing Val Asp Thr Gln Leu Leu Leu Gly Asn Lys Glu 325 330 335
Leu Ser Leu Ser Pro Glu Glu Tyr Val Phe Ala Ala Leu Asn Leu Tyr 340 345 350 Thr Asp He He Asn He Phe Leu Tyr He Leu Thr He He Gly Arg 355 360 365 Wing Lys Glu 370
(2) INFORMATION FOR SEQ ID NO: 3: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 179 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP01098 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 3:
Met Leu Ser Leu Asp Phe Leu Asp Asp Val Arg Arg Met Asn Lys Arg 1 5 10 15
Gln Leu Tyr Tyr Gln Val Leu Asn Phe Gly Met He Val Ser Be Wing 20 25 30 Leu Met He Trp Lys Gly Leu Met Val He Thr Gly Ser Glu Ser Pro 35 40 45 He Val Val Leu Ser Gly Ser Met Glu Pro Ala Phe His Arg Gly 50 55 60 Asp Leu Leu Phe Leu Thr Asn Arg Val Glu Asp Pro He Arg Val Gly 65 70 75 80
Glu He Val Val Phe Arg He Glu Gly Arg Glu He Pro He Val His 85 90 95
Arg Val Leu Lys He His Glu Lys Gln Asn Gly His He Lys Phe Leu 100 105 110 Thr Lys Gly Asp Asn Asn Wing Val Asp Asp Arg Gly Leu Tyr Lys Gln 115 120 125 Gly Gln His Trp Leu Glu Lys Lys Asp Val Val Gly Arg Ala Arg Gly 130 135 140 Phe Val Pro Tyr He Gly He Val Thr He Leu Met Asn Asp Tyr Pro 145 150 155 160
Lys Phe Lys Tyr Ala Val Leu Phe Leu Leu Gly Leu Phe Val Leu Val 165 170 175
His Arg Glu
(2) INFORMATION FOR SEQ ID NO: 4: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 347 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP01148 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 4:
Met Ala Leu Leu Phe Ser Leu lie Leu Ala lie Cys Thr Arg Pro Gly 1 5 10 15
Phe Leu Wing Being Pro Gly Val Arg Leu Val Gly Gly Leu His Arg 20 25 30 Cys Glu Gly Arg Val Glu Val Glu Gln Lys Gly Gln Trp Gly Thr Val 35 40 45 Cys Asp Asp Gly Trp Asp He Lys Asp Val Wing Val Leu Cys Arg Glu 50 55 60 Leu Gly Cys Gly Wing Wing Ser Gly Thr Pro Ser Gly He Leu Tyr Glu 65 70 75 80
Pro Pro Wing Glu Lys Glu Gln Lys Val Leu He Gln Ser Val Ser Cys 85 90 95
Thr Gly Thr Glu Asp Thr Leu Wing Gln Cys Glu Gln Glu Glu Val Tyr 100 105 110 Asp Cys Ser His Glu Glu Asp Ala Gly Ala Ser Cys Glu Asn Pro Glu 115 120 125 Ser Ser Phe Ser Pro Val Pro Glu Val Val Lego Arg Wing Asp Gly Pro 130 135 140 Gly His Cys Lys Gly Arg Val Glu Val Lys His Gln Asn Gln Trp Tyr 145 150 155 160 Thr Val Cys Gln Thr Gly Trp Ser Leu Arg Ala Ala Lys Val Val Cys 165 170 175
Arg Gln Leu Gly Cys Gly Arg Wing Val Leu Thr Gln Lys Arg Cys Asn 180 185 190 Lys His Wing Tyr Gly Arg Lys Pro He Trp Leu Ser Gln Met Ser Cys 195 200 205 Ser Gly Arg Glu Wing Thr Leu Gln Asp Cys Pro Ser Gly Pro Trp Gly 210 t 215 220 Lys Asn Thr Cys Asn His Asp Glu Asp Thr Trp Val Glu Cys Glu Asp 225 230 235 240
Pro Phe Asp Leu Arg Leu Val Gly Gly Asp Asn Leu Cys Ser Gly Arg 245 250 255
Leu Glu Val Leu His Lys Gly Val Trp Gly Ser Val Cys Asp Asp Asn 260 265 270 Trp Gly Glu Lys Glu Asp Gln Val Val Cys Lys Gln Leu Gly Cys Gly 275 280 285 Lys Ser Leu Ser Pro Ser Phe Arg Asp Arg Lys Cys Tyr Gly Pro Gly 290 295 300 Val Gly Arg He Trp Leu Asp Asn Val Arg Cys Ser Gly Glu Glu Gln 305 310 315 320
Ser Leu Glu Gln Cys Gln His Arg Phe Trp Gly Phe His Asp Cys Thr 325 330 335
His Gln Glu Asp Val Wing Val He Cys Ser Gly 340 345
(2) INFORMATION FOR SEQ ID NO: 5: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 554 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP01293 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 5:
Met Pro Thr Val Asp Asp He Leu Glu Gln Val Gly Glu Ser Gly Trp 1 5 10 15
Phe Gln Lys Gln Wing Phe Leu He Leu Cys Leu Leu Wing Wing Phe 20 25 30 Wing Pro He Cys Val Gly He Val Phe Leu Gly Phe Thr Pro Asp His 35 40 45 His Cys Gln Ser Pro Gly Val Ala Glu Leu Ser Gln Arg Cys Gly Trp 50 55 60 Ser Pro Wing Glu Glu Leu Asn Tyr Thr Val Pro Gly Leu Gly Pro Wing 65 70 75 80
Gly Glu Wing Phe Leu Gly Gln Cys Arg Arg Tyr Glu Val Asp Trp Asn 85 90 95
Gln Ser Ala Leu Ser Cys Val Asp Pro Leu Wing Ser Leu Wing Thr Asn LOO 105 110 Arg Ser His Leu Pro Leu Gly Pro Cys Gln Asp Gly Trp Val Tyr Asp 115 120 125 Thr Pro Gly Ser Ser He Val Thr Glu Phe Asn Leu Val Cys Wing Asp 130 135 140 Ser Trp Lys Leu Asp Leu Phe Gln Ser Cys Leu Asn Wing Gly Phe Phe 145 150 155 160
Phe Gly Ser Leu Gly Val Gly Tyr Phe Wing Asp Arg Phe Gly Arg Lys 165 170 175
Leu Cys Leu Leu Gly Thr Val Leu Val Asn Ala Val Ser Gly Val Leu 180 185 190 Met Wing Phe Ser Pro Asn Tyr Met Ser Met Leu Leu Phe Arg Leu Leu 195 200 205 Gln Gly Leu Val Ser Lys Gly Asn Trp Met Wing Gly Tyr Thr Leu He 210 215 220 Thr Glu Phe Val Gly Ser Gly Ser Arg Arg Thr Val Wing He Met Tyr 225 230 235 240
Gln Met Wing Phe Thr Val Gly Leu Val Wing Leu Thr Gly Leu Wing Tyr 245 250 255
Wing Leu Pro His Trp Arg Trp Leu Gln Leu Wing Val Ser Leu Pro Thr 260 265 270 Phe Leu Phe Leu Leu Tyr Tyr Trp Cys Val Pro Glu Ser Pro Arg Trp 275 280 285 Leu Leu Ser Gln Lys Arg Asn Thr Glu Wing He Lys He Met Asp His 290 295 300 He Wing Gln Lys Asn Gly Lys Leu Pro Pro Wing Asp Leu Lys Met Leu 305 310 315 320
Ser Leu Glu Glu Asp Val Thr Glu Lys Leu Ser Pro Ser Phe Ala Asp 325 330 335
Leu Phe Arg Thr Pro Arg Leu Arg Lys Arg Thr Phe He Leu Met Tyr 340 345 350 Leu Trp Phe Thr Asp Ser Val Leu Tyr Gln Gly Leu He Leu His Met 355 360 365 Gly Ala Thr Ser Gly Asn Leu Tyr Leu Asp Phe Leu Tyr Ser Ala Leu 370 375 380 Val Glu He Pro Gly Ala Phe He Ala Ala Leu He Thr He Asp Arg Val 385 390 395 400
Gly Arg He Tyr Pro Met Wing Val Ser Asn Leu Leu Wing Gly Wing Wing 405 410 415
Cys Leu Val Met He Phe He Ser Pro Asp Leu His Trp Leu Asn He 420 425 430 He He Met Met Cys Val Gly Arg Met Gly He Thr He Ala He Gln Met 435, 440 445 He Cys Leu Val Asn Ala Glu Leu Tyr Pro Thr Phe Val Arg Asn Leu 450 455 460 Gly Val Met Val Cys Ser Ser Leu Cys Asp He Gly Gly He He Thr 465 470 475 480
Pro Phe He Val Phe Arg Leu Arg Glu Val Trp Gln Ala Leu Pro Leu 485 490 495
He Leu Phe Wing Val Leu Gly Leu Leu Wing Wing Gly Val Thr Leu Leu 500 505 510 Leu Pro Glu Thr Lys Gly Val Wing Leu Pro Glu Thr Met Lys Asp Wing 515 520 525 Glu Asn Leu Gly Arg Lys Wing Lys Pro Lys Glu Asn Thr He Tyr Leu 530 535 540 Lys Val Gln Thr Ser Glu Pro Ser Gly Thr 545 550
(2) INFORMATION FOR SEQ ID NO: 6: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 350 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10013 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 6:
Met Wing Val Phe Val Val Leu Leu Wing Leu Val Wing Gly Val Leu Gly 1 5 10 15
Asn Glu Phe Be He Leu Lys Ser Pro Gly Ser Val Val Phe Arg Asn 20 25 30 Gly Asn Trp Pro He Pro Gly Glu Arg He Pro Asp Val Ala Ala Leu 35 40 45 Ser Met Gly Phe Ser Val Lys Glu Asp Leu Ser Trp Pro Gly Leu Wing 50 55 60 16 Val Gly Asn Leu Phe His Arg Pro Arg Wing Thr Val Met Val Met Val 65 70 75 80
Lys Gly Val Asn Lys Leu Ala Leu Pro Pro Gly Ser Val He Ser Tyr 85 90 95
Pro Leu Glu Asn Wing Val Pro Phe Ser Leu Asp Ser Val Wing Asn Ser 100 105 110 He His Ser Leu Phe Ser Glu Glu Thr Pro Val Val Leu Gln Leu Ala 115, 120 Pro 125 Pro Glu Glu Arg Val Tyr Met Val Gly Lys Wing Asn Ser Val Phe 130 135 140 Glu Asp Leu Ser Val Thr Leu Arg Gln Leu Arg Asn Arg Leu Phe Gln 145 150 155 160
Glu Asn Ser Val Leu Ser Ser Leu Pro Leu Asn Ser Leu Ser Arg Asn 165 170 175
Asn Glu Val Asp Leu Leu Phe Leu Ser Glu Leu Gln Val Leu His Asp 180 185 190 He Ser Ser Leu Leu Ser Arg His Lys His Leu Ala Lys Asp His Ser 195 200 205 Pro Asp Leu Tyr Ser Leu Glu Leu Ala Gly Leu Asp Glu He Gly Lys 210 215 220 Arg Tyr Gly Glu Asp Ser Glu Gln Phe Arg Asp Wing Ser Lys He Leu 225 230 235 240
Val Asp Ala Leu Gln Lys Phe Wing Asp Asp Met Tyr Ser Leu Tyr Gly 245 250 255
Gly Asn Ala Val Val Glu Leu Val Thr Val Lys Ser Phe Asp Thr Ser 260 265 270 Leu He Arg Lys Thr Arg Thr He Leu Glu Wing Lys Gln Wing Lys Asn 275 280 285 Pro Wing Pro Pro Tyr Asn Leu Wing Tyr Lys Tyr Asn Phe Glu Tyr Ser 290 295 300 Val Val Phe Asn Met Val Leu Trp He Met He Ala Leu Ala Leu Ala 305 310 315 320
Val He He Thr Ser Tyr Asn He Trp Asn Met Asp Pro Gly Tyr Asp 325 330 335
Ser He He Tyr Arg Met Thr Asn Gln Lys He Arg Met Asp 340 345 350
(2) INFORMATION FOR SEQ ID NO: 7: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 209 (B) TYPE: Amino acid (D) TOPOLOGY: Linear (ii) SEQUENCE TYPE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10034 (xi) DESCRIPTION OF THE SEQUENCE : SEQ ID NO: 7:
Met Val Ser Ser Pro Cys Thr Gln Wing Being Ser Arg Thr Cys Ser Arg 1 5 10 15
He Leu Gly Leu Ser Leu Gly Thr Ala Ala Leu Phe Ala Ala Gly Ala 20 25 30 Asn Val Ala Leu Leu Pro Asn Trp Asp Val Thr Tyr Leu Leu Arg 35 40 45 Gly Leu Leu Gly Arg His Wing Met Leu Gly Thr Gly Leu Trp Gly Gly 50 55 60 Gly Leu Met Val Leu Thr Wing Wing He Leu He Ser Leu Met Gly Trp 65 70 75 80
Arg Tyr Gly Cys Phe Ser Lys Ser Gly Leu Cys Arg Ser Val Leu Thr 85 90 95
Wing Leu Leu Be Gly Gly Leu Wing Leu Leu Gly Wing Leu He Cys Phe 100 105 110 Val Thr Ser Gly Val Wing Leu Lys Asp Gly Pro Phe Cys Met Phe Asp 115 120 125 Val Ser Ser Phe Asn Gln Thr Gln Wing Trp Lys Tyr Gly Tyr Pro Phe 130 135 140 Lys Asp Leu His Ser Arg Asn Tyr Leu Tyr Asp Arg Ser Leu Trp Asn 145 150 155 160
Ser Val Cys Leu Glu Pro Ser Ala Ala Val Val Trp His Val Ser Leu 165 170 175
Phe Ser Ala Leu Leu Cys He Ser Leu Leu Gln Leu Leu Leu Val Val 180 185 190 Val His Val He Asn Ser Leu Leu Gly Leu Phe Cys Ser Leu Cys Glu
195 200 205 Lys
(2) INFORMATION FOR SEQ ID NO: 8: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 163 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10050 (xi) DESCRIPTION OF THE SEQUENCE : SEQ ID NO: 8:
Met Ala Ala Gly Leu Phe Gly Leu Ser Ala Arg Arg Leu Leu Ala Ala 1 5 10 15
Wing Wing Thr Arg Gly Leu Pro Wing Wing Arg Val Arg Trp Glu Be Ser 20 25 30 Phe Ser Arg Thr Val Val Wing Pro Wing Val Wing Gly Lys Arg Pro 35 40 45 Pro Glu Pro Thr Thr Pro Trp Gln Glu Asp Pro Glu Pro Glu Asp Glu 50 55 60 Asn Leu Tyr Glu Lys Asn Pro Asp Ser His Gly Tyr Asp Lys Asp Pro 65 70 75 80
Val Leu Asp Val Trp Asn Met Arg Leu Val Phe Phe Phe Gly Val Ser 85 90 95
He He Leu Val Leu Gly Ser Thr Phe Val Wing Tyr Leu Pro Asp Tyr 100 105 110 Arg Cys Thr Gly Cys Pro Arg Wing Trp Asp Gly Met Lys Glu Trp Ser 115 120 125 Arg Arg Glu Wing Glu Arg Leu Val Lys Tyr Arg Glu Wing Asn Gly Leu 130 135 140 Pro He Met Glu Be Asn Cys Phe Asp Pro Ser Lys He Gln Leu Pro 145 150 155 160
Glu Asp Glu
(2) INFORMATION FOR SEQ ID NO: 9: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 92 (B) TYPE: Amino acid (D) TOPOLOGY: Linear (ii) SEQUENCE TYPE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10071 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 9:
Met Thr Lys Leu Wing Gln Trp Leu Trp Gly Leu Wing He Leu Gly Ser 1 5 10 15
Thr Trp Val Wing Leu Thr Thr Gly Wing Leu Gly Leu Glu Leu Pro Leu 20 25 30 Ser Cys Gln Glu Val Leu Trp Pro Leu Pro Wing Tyr Leu Leu Val Ser 35 40 45 Wing Gly Cys Tyr Wing Leu Gly Thr Val Gly Tyr Arg Val Ala Thr Phe 50 55 60 His Asp Cys Glu Asp Wing Wing Arg Glu Leu Gln Ser Gln He Gln Glu 65 70 75 80
Wing Arg Wing Asp Leu Wing Arg Arg Gly Leu Arg Phe 85 90
(2) INFORMATION FOR SEQ ID NO: 10: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 172 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10076 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 10:
Met Glu Tyr Leu Ala His Pro Ser Thr Leu Gly Leu Ala Val Gly Val 1 5 10 15
Wing Cys Gly Met Cys Leu Gly Trp Ser Leu Arg Val Cys Phe Gly Met 20 25 30 Leu Pro Lys Ser Lys Thr Ser Lys Thr His Thr Asp Thr Glu Ser Glu 35 40 45 Wing Ser He Leu Gly Asp Ser Gly Glu Tyr Lys Met He Leu Val Val 50 55 60 Arg Asn Asp Leu Lys Met Gly Lys Gly Lys Val Wing Wing Gln Cys Ser 65 70 75 80
His Ala Ala Ala Ser Ala Tyr Lys Gln He Gln Arg Arg Asn Pro Glu 85 90 95
Met Leu Lys Gln Trp Glu Tyr Cys Gly Gln Pro Lys Val Val Val Lys 100 105 110 Wing Pro Asp Glu Glu Thr Leu He Wing Leu Leu Wing His Wing Lys Met 115 120 125 Leu Gly Leu Thr Val Ser Leu He Gln Asp Wing Gly Arg Thr Glu He 130 135 140 Wing Pro Gly Ser Gln Thr Val Leu Gly He Gly Pro Gly Pro Wing Asp 145 150 155 160
Leu He Asp Lys Val Thr Gly His Leu Lys Leu Tyr 165 170
(2) INFORMATION FOR SEQ ID NO: 11: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 149 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10085 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 11:
Met Met Thr Lys His Lys Lys Cys Phe He He Val Val Val Leu He 1 5 10 15
Thr Thr Asn He He Thr Leu He Val Lys Leu Thr Arg Asp Ser Gln 20 25 30 Being Leu Cys Pro Tyr Asp Trp He Gly Phe Gln Asn Lys Cys Tyr Tyr 35 40 45 Phe Ser Lys Glu Glu Gly Asp Trp Asn Ser Ser Lys Tyr Asn Cys Ser 50 55 60 Thr Gln His Wing Asp Leu Thr He He Asp Asn He Glu Glu Met Asn 65 70 75 80
Phe Leu Arg Arg Tyr Lys Cys Ser As Asp His Trp He Gly Leu Lys 85 90 95 Met Wing Lys Asn Arg Thr Gly Gln Trp Val Asp Gly Wing Thr Phe Thr 100 105 110 Lys Ser Phe Gly Met Arg Gly Ser Glu Gly Cys Wing Tyr Leu Ser Asp 115 120 125 Asp Gly Wing Wing Thr Wing Arg Cys Tyr Thr Glu Arg Lys Trp He Cys 130 135 140 Arg Lys Arg He His 145
(2) INFORMATION FOR SEQ ID NO: 12: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 188 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10122 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 12:
Met Ser Thr Met Phe Wing Asp Thr Leu Leu He Val Phe He Ser Val 1 5 10 15
Cys Thr Ala Leu Leu Ala Glu Gly He Thr Trp Val Leu Val Tyr Arg 20 25 30 Thr Asp Lys Tyr Lys Arg Leu Lys Wing Glu Val Glu Lys Gln Ser Lys 35 40 45 Lys Leu Glu Lys Lys Lys Glu Thr He Thr Glu Ser Ala Gly Arg Gln 50 55 60 Gln Lys Lys He Glu Arg Gln Glu Glu Lys Leu Lys Asn Asn Asn 65 70 75 80
Arg Asp Leu Ser Met Val Arg Met Lys Ser Met Phe Wing He Gly Phe 85 90 95
Cys Phe Thr Ala Leu Met Gly Met Phe Asn Ser He Phe Asp Gly Arg 100 105 110 Val Val Ala Lys Leu Pro Phe Thr Pro Leu Ser Tyr He Gln Gly Leu 115 120 125 Ser His Arg Asn Leu Leu Gly Asp Asp Thr Thr Asp Cys Ser Phe He 130 135 140 Phe Leu Tyr He Leu Cys Thr Met Ser He Arg Gln Asn He Gln Lys 145 150 155 160
He Leu Gly Leu Wing Pro Being Arg Wing Wing Thr Lys Gln Wing Gly Gly 165 170 175
Phe Leu Gly Pro Pro Pro Pro Ser Gly Lys Phe Ser 180 185
(2) INFORMATION FOR SEQ ID NO: 13: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 215 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10136 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 13:
Met Val Leu Leu Thr Met He Wing Arg Val Wing Asp Gly Leu Pro Leu 1 5 10 15
Wing Wing Met Met Gln Glu Asp Glu Gln Ser Gly Arg Asp Leu Gln Gln 20 25 30 Tyr Gln Ser Gln Wing Lys Gln Leu Phe Arg Lys Leu Asn Glu Gln Ser 35 40 45 Pro Thr Arg Cys Thr Leu Glu Wing Gly Wing Mßt Thr Phe His Tyr He 50 55 60 He Glu Gln Gly Val Cys Tyr Leu Val Leu Cys Glu Ala Wing Phe Pro 65 70 75 80
Lys Lys Leu Wing Phe Wing Tyr Leu Glu Asp Leu His Ser Glu Phe Asp 85 90 95
Glu Gln His Gly Lys Lys Val Pro Thr Val Ser Arg Pro Tyr Ser Phe 100 105 110 He Glu Phe Asp Thr Phe He Gln Lys Thr Lys Lys Leu Tyr He Asp 115 120 125 Ser Arg Ala Arg Arg Asn Leu Gly Ser He Asn Thr Glu Leu Gln Asp 130 135 140 Val Gln Arg He Met Val Wing Asn He Glu Glu Val Leu Gln Arg Gly 145 150 155 160
Glu Ala Leu Ser Ala Leu Asp Ser Lys Ala Asn Asn Leu Ser Ser Leu 165 170 175
Be Lys Lys Tyr Arg Gln Asp Wing Lys Tyr Leu Asn Met Arg Ser Thr 180 185 190 Tyr Wing Lys Wing Wing Val Wing Val Phe Phe He Met Leu He Val 195 200 205 Tyr Val Arg Phe Trp Trp Leu 210 215
(2) INFORMATION FOR SEQ ID NO: 14: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 112 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10175 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 14:
Met Gln Asp Thr Gly Ser Val Val Pro Leu His Trp Phe Gly Phe Gly 1 5 10 15
Tyr Ala Ala Leu Val Ala Ser Gly Gly He He Gly Tyr Val Lys Wing 20 25 30 Gly Ser Val Pro Ser Leu Ala Wing Gly Leu Lehe Phe Gly Ser Leu Wing 35 40 45 Gly Leu Gly Wing Tyr Gln Leu Ser Gln Asp Pro Arg Asn Val Trp Val 50 55 60 Phe Leu Wing Thr Ser Gly Thr Leu Wing Gly He Met Gly Met Arg Phe 65 70 75 80
Tyr His Ser Gly Lys Phe Met Pro Wing Gly Leu He Wing Gly Wing Ser 85 90 95
Leu Leu Met Val Ala Lys Val Gly Val Ser Met Phe Asn Arg Pro His 100 105 110
(2) INFORMATION FOR SEQ ID NO: 15: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 114 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELLULAR LINE: KB (D) CLONE NAME: HP10179 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 15:
Met Glu Lys Pro Leu Phe Pro Leu Val Pro Leu His Trp Phe Gly Phe 1 5 10 15
Gly Tyr Thr Ala Leu Val Val Ser Gly Gly He Val Gly Tyr Val Lys 20 25 30 Thr Gly Ser Val Pro Ser Leu Ala Wing Gly Leu Lehe Phe Gly Ser Leu 35 40 45 Wing Gly Leu Gly Wing Tyr Gln Leu Tyr Gln Asp Pro Arg Asn Val Trp 50 55 60 Gly Phe Leu Wing Wing Thr Ser Val Thr Phe Val Gly Val Met Gly Met 65 70 75 80
Arg Ser Tyr Tyr Tyr Gly Lys Phe Met Pro Val Gly Leu He Wing Gly 85 90 95
Ala Ser Leu Leu Met Ala Ala Lys Val Gly Val Arg Met Leu Met Thr 100 105 110 Ser Asp
(2) INFORMATION FOR SEQ ID NO: 16: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 327 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein. (iii) HYPOTHETICAL: No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10196 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 16:
Met Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Thr Asn Gly Thr Gly Gly 1 '5 10 15
Be Ser Gly Met Glu Val Asp Ala Ala Val Val Pro Ser Val Met Wing 20 25 30 Cys Gly Val Thr Gly Ser Val Ser Val Ala Leu His Pro Leu Val He 35 40 45 Leu Asn He Ser Asp His Trp He Arg Met Arg Ser Gln Glu Gly Arg 50 55 60 Pro Val Gln Val He Gly Ala Leu He Gly Lys Gln Glu Gly Arg Asn 65 70 75 80
He Glu Val Met Asn Ser Phe Glu Leu Leu Ser His Thr Val Glu Glu 85 90 95
Lys He He He As Asp Lys Glu Tyr Tyr Tyr Thr Lys Glu Glu Gln Phe 100 105 110 Lys Gln Val Phe Lys Glu Leu Glu Phe Leu Gly Trp Tyr Thr Thr Gly 115 120 125 Gly Pro Pro Asp Pro Ser Asp He His Val His Lys Gln Val Cys Glu 130 135 140 He He Glu Ser Pro Leu Phe Leu Lys Leu Asn Pro Met Thr Lys His 145 150 155 160
Thr Asp Leu Pro Val Ser Val Phe Glu Ser Val He Asp He He Asn 165 170 175
Gly Glu Wing Thr Met Leu Phe Wing Glu Leu Thr Tyr Thr Leu Wing Thr 180 185 190 Glu Glu Wing Glu Arg He Gly Val Asp His Val Wing Arg Met Thr Wing 195 200 205 Thr Gly Ser Gly Glu Asn Ser Thr Val Wing Glu His Leu He Wing Gln 210 215 220 His Ser Wing He Lys Met Leu His Ser Arg Val Lys Leu He Leu Glu 225 230 235 240
Tyr Val Lys Wing Ser Glu Wing Gly Glu Val Pro Phe Asn His Glu He 245 250 255
Leu Arg Glu Ala Tyr Ala Leu Cys His Cys Leu Pro Val Leu Ser Thr 260 265 270 Asp Lys Phe Lys Thr Asp Phe Tyr Asp Gln Cys Asn Asp Val Gly Leu 275 280 285 Met Wing Tyr Leu Gly Thr He Thr Lys Thr Cys Asn Thr Met Asn Gln 290 295 300 Phe Val Asn Lys Phe Asn Val Leu Tyr Asp Arg Gln Gly He Gly Arg 305 310 315 320
Arg Met Arg Gly Leu Phe Phe 325
(2) INFORMATION FOR SEQ ID NO: 17: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 373 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10235 (xi) DESCRIPTION OF THE SEQUENCE : SEQ ID NO: 17:
Met Thr Leu Cys Wing Met Leu Pro Leu Leu Leu Phe Thr Tyr Leu Asn 1 5 10 15
Be Phe Leu His Gln Arg He Pro Gln Ser Val Arg He Leu Gly Ser 20 25 30 Leu Val Ala He Leu Leu Val Phe Leu He Thr Ala He Leu Val Lys 35 40 45 Val Gln Leu Asp Ala Leu Pro Phe Phe Val He Thr Met He Lys He 50 55 60 Val Leu He Asn Ser Phe Gly Ala He Leu Gln Gly Ser Leu Phe Gly 65 70 75 80
Leu Wing Gly Leu Leu Pro Wing Being Tyr Thr Wing Pro He Met Being Gly 85 90 95
Gln Gly Leu Wing Gly Phe Phe Wing Ser Val Wing Met He Cys Wing 100 105 110 Wing Ser Gly Ser Glu Leu Ser Glu Ser Wing Phe Gly Tyr Phe He Thr 115 120 125 Wing Cys Wing Val He He Leu Thr He He Cys Tyr Leu Gly Leu Pro 130 135 140 Arg Leu Glu Phe Tyr Arg Tyr Tyr Gln Gln Leu Lys Leu Glu Gly Pro 145 150 155 160
Gly Glu Gln Glu Thr Lys Leu Asp Leu He Ser Lys Gly Glu Glu Pro 165 170 175
Arg Ala Gly Lys Glu Glu Ser Gly Val Ser Val Ser Asn Ser Gln Pro 180 185 190 Thr Asn Glu Ser His Ser lie Lys Ala He Leu Lys Asn He Ser Val 195 200 205 Leu Ala Phe Ser Val Cys Phe He Phe Thr He Thr He Gly Met Phe 210 215 220 Pro Wing Val Thr Val Glu Val Lys Ser Ser Wing Wing Gly Ser Ser Thr 225 230 235 240
Trp Glu Arg Tyr Phe He Pro Val Ser Cys Phe Leu Thr Phe Asn He 245 250 255
Phe Asp Trp Leu Gly Arg Ser Leu Thr Wing Val Phe Met Trp Pro Gly 260 265 270 Lys Asp Ser Arg Trp Leu Pro Ser Leu Val Leu Wing Arg Leu Val Phe 275 280 285 Val Pro Leu Leu Leu Leu Cys Asn He Lys Pro Arg Arg Tyr Leu Thr 290 295 300 Val Val Phe Glu His Asp Wing Trp Phe He Phe Phe Met Wing Wing Phe 305 310 315 320
Wing Phe Ser Asn Gly Tyr Leu Wing Ser Leu Cys Met Cys Phe Gly Pro 325 330 335
Lys Lys Val Lys Pro Wing Glu Wing Glu Thr Wing Gly Wing He Met Wing 340 345 350 Phe Phe Leu Cys Leu Gly Leu Wing Leu Gly Wing Val Phe Ser Phe Leu 355 360 365 Phe Arg Wing He Val 370
(2) INFORMATION FOR SEQ ID NO: 18: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 183 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10297 (i) DESCRIPTION OF SEQUENCE: SEQ ID NO: 18:
Met Lys Leu Leu Ser Leu Val Wing Val Val Gly Cys Leu Leu Val Pro 1 5 10 15
Pro Wing Glu Wing Asn Lys Ser Ser Glu Asp He Arg Cys Lys Cys He 20 25 30 Cys Pro Pro Tyr Arg Asn He Ser Gly His He Tyr Asn Gln Asn Val 35 40 45 Ser Gln Lys Asp Cys Asn Cys Leu His Val Val Glu Pro Met Pro Val 50 55 60 Pro Gly His Asp Val Glu Ala Tyr Cys Leu Leu Cys Glu Cys Arg Tyr 65 70 75 80
Glu Glu Arg Ser Thr Thr He Lys Val He He Val He Tyr Leu 85 90 95
Val Val Gly Ala Leu Leu Leu Tyr Met Ala Phe Leu Met Leu Val 00 105 110 Asp Pro Leu He Arg Lys Pro Asp Ala Tyr Thr Glu Gln Leu His Asn 115 120 125 Glu Glu Glu Asn Glu Asp Ala Arg Ser Met Ala Ala Ala Ala Ala Ser 130 135 140 Leu Gly Gly Pro Arg Ala Asn Thr Val Leu Glu Arg Val Glu Gly Ala 145 150 155 160
Gln Gln Arg Trp Lys Leu Gln Val Gln Glu Gln Arg Lys Thr Val Phe 165 170 175
Asp Arg His Lys Met Leu Ser 180
(2) INFORMATION FOR SEQ ID NO: 19: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 116 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10299 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 19:
Met Wing Being Thr Val Val Wing Val Gly Leu Thr He Wing Wing Wing Gly 1 5 10 15
Phe Wing Gly Arg Tyr Val Leu Gln Wing Met Lys His Met Glu Pro Gln 20 25 30 Val Lys Gln Val Phe Gln Ser Leu Pro Lys Ser Wing Phe Ser Gly Gly 35 40 45 Tyr Tyr Arg Gly Gly Phe Glu Pro Lys Met Thr Lys Arg Glu Wing Wing 50 55 60 Leu He Leu Gly Val Ser Pro Thr Wing Asn Lys Gly Lys He Arg Asp 65 70 75 80
Wing His Arg Arg He Met Leu Leu Asn His Pro Asp Lys Gly Gly Ser 85 90 95 Pro Tyr He Wing Wing Lys He Asn Glu Wing Lys Asp Leu Leu Glu Gly 100 105 110 Gln Wing Lys Lys 115
(2) INFORMATION FOR SEQ ID NO: 20: (i) CHARACTERISTICS OF THE SEQUENCE: ((A) LENGTH: 152 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL: No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELLULAR LINE: KB (D) CLONE NAME: HP10301 (xi) DESCRIPTION OF THE SEQUENCE : SEQ ID NO: 20:
Met Wing Val Leu Ser Lys Glu Tyr Gly Phe Val Leu Leu Thr Gly Wing 1 5 10 15
Wing Being Phe He Met Val Wing His Wing Leu Wing Asn Val Being Lys Wing 20 25 30 Arg Lys Lys Tyr Lys Val Glu Tyr Pro He Mßt Tyr Ser Thr Asp Pro 35 40 45 Glu Asn Gly His He Phe Asn Cys He Gln Arg Ala His Gln Asn Thr 50 55 60 Leu Glu Val Tyr Pro Pro Phe Leu Phe Phe Leu Wing Val Gly Gly Val 65 70 75 80
Tyr His Pro Arg He Wing Ser Gly Leu Gly Leu Wing Trp He Val Gly 85 90 95
Arg Val Leu Tyr Wing Tyr Gly Tyr Tyr Thr Gly Glu Pro Ser Lys Arg 100 105 110 Be Arg Gly Wing Leu Gly Be He Wing Leu Leu Gly Leu Val Gly Thr 115 120 125 Thr Val Cys Ser Wing Phe Gln His Leu Gly Trp Val Lys Ser Gly Leu 130 135 140 Gly Ser Gly Pro Lys Cys Cys His 145 150
(2) INFORMATION FOR SEQ ID NO: 21: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 559 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP10302 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 21:
Met Wing Pro Thr Leu Gln Gln Wing Tyr Arg Arg Arg Trp Trp Met Wing 1 5 10 15
Cys Thr Wing Val Leu Glu Asn Leu Phe Phe Be Wing Val Leu Leu Gly 20 25 30 Trp Gly Ser Leu Leu He He Leu Lys Asn Glu Gly Phe Tyr Ser Ser 35 40 45 Thr Cys Pro Wing Glu Be Ser Thr Asn Thr Thr Gln Asp Glu Gln Arg 50 55 60 Arg Trp Pro Gly Cys Asp Gln Gln Asp Glu Met Leu Asn Leu Gly Phe 65 70 75 80
Thr He Gly Ser Phe Val Leu Ser Wing Thr Thr Leu Pro Leu Gly He 85 90 95
Leu Met Asp Arg Phe Gly Pro Arg Pro Val Arg Leu Val Gly Ser Ala 100 105 110 Cys Phe Thr Ala Ser Cys Thr Leu Met Ala Leu Ala Ser Arg Asp Val 115 120 125 Glu Ala Leu Ser Pro Leu He Phe Leu Ala Leu Ser Leu Asn Gly Phe 130 135 140 Gly Gly He Cys Leu Thr Phe Thr Ser Leu Thr Leu Pro Asn Met Phe 145 150 155 160
Gly Asn Leu Arg Being Thr Leu Met Wing Leu Met He Gly Being Tyr Wing 165 170 175
Be Being Wing He Thr Phe Pro Gly He Lys Leu He Tyr Asp Wing Gly 180 185 190 Val Wing Phe Val Val He Met Phe Thr Trp Ser Gly Leu Wing Cys Leu 195 200 205 He Phe Leu Asn Cys Thr Leu Asn Trp Pro He Glu Wing Phe Pro Wing 210 215 220 Pro Glu Glu Val Asn Tyr Thr Lys Lys He Lys Leu Ser Gly Leu Wing 225 230 235 240
Leu Asp His Lys Val Thr Gly Asp Leu Phe Tyr Thr His Val Thr Thr 245 250 255
Met Gly Gln Arg Leu Ser Gln Lys Wing Pro Ser Leu Glu Asp Gly Ser 260 265 270 Asp Wing Phe Met Ser Pro Gln Asp Val Arg Gly Thr Ser Glu Asn Leu 275 280 285 Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu Cys Be Pro Thr Phe 290 295 300 Leu Trp Be Leu Leu Thr Met Gly Met Thr Gln Leu Arg He He Phe 305 310 315 320
Tyr Met Ala Ala Val Asn Lys Met Leu Glu Tyr Leu Val Thr Gly Gly 325 330 335
Gln Glu His Glu Thr Asn Glu Gln Gln Gln Lys Val Wing Glu Thr Val 340 345 350 Gly Phe Tyr Ser Val Phe Gly Wing Met Gln Leu Leu Cys Leu Leu 355 360 365 Thr Cys Pro Leu He Gly Tyr He Met Asp Trp Arg He Lys Asp Cys 370 375 380 Val Asp Wing Pro Thr Gln Gly Thr Val Leu Gly Asp Wing Arg Asp Gly 385 390 395 400
Val Ala Thr Lys Ser He Arg Pro Arg Tyr Cys Lys He Gln Lys Leu 405 410 415
Thr Asn Wing Be Wing Phe Thr Leu Thr Asn Leu Leu Leu Val Gly 420 425 430 Phe Gly He Thr Cys Leu He Asn Asn Leu His Leu Gln Phe Val Thr 435 440 445 Phe Val Leu His Thr He Val Arg Gly Phe Phe His Be Ala Cys Gly 450 455 460 Ser Leu Tyr Ala Ala Val Phe Pro Ser Asn His Phe Gly Thr Leu Thr 465 470 475 480
Gly Leu Gln Ser Leu Ha Be Wing Val Phe Ala Leu Leu Gln Gln Pro 485 490 495
Leu Phe Met Wing Met Val Gly Pro Leu Lys Gly Glu Pro Phe Trp Val 500 505 510 Asn Leu Gly Leu Leu Phe Ser Leu Leu Gly Phe Leu Leu Pro Ser 515 520 525 Tyr Leu Phe Tyr Tyr Arg Ala Arg Leu Gln Gln Glu Tyr Ala Ala Asn 530 535 540 Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser Glu Val Thr Ala 545 550 555
(2) INFORMATION FOR SEQ ID NO: 22: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 330 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10304 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 22:
Met Glu Gly Ala Pro Pro Gly Ser Leu Ala Leu Arg Leu Leu Leu Phe 1 5 10 15
Val Ala Leu Pro Wing Ser Gly Trp Leu Thr Thr Gly Wing Pro Glu Pro 20 25 30 Pro Pro Leu Ser Gly Wing Pro Gln Asp Gly He Arg He Asn Val Thr 35 40 45 Thr Leu Lys Asp Asp Gly Asp He Ser Lys Gln Gln Val Val Leu Asn 50 55 60 He Thr Tyr Glu Ser Gly Gln Val Tyr Val Asn Asp Leu Pro Val Asn 65 70 75 80
Ser Gly Val Thr Arg He Ser Cys Gln Thr Leu He Val Lys Asn Glu 85 90 95
Asn Leu Glu Asn Leu Glu Glu Lys Glu Tyr Phe Gly He Val Ser Val 100 105 110 Arg He Leu Val His Glu Trp Pro Met Thr Ser Gly Ser Ser Leu Gln 115 120 125 Leu He Val He Gln Glu Val Val Glu He Asp Gly Lys Gln Val 130 135 140 Gln Gln Lys Asp Val Thr Glu He Asp He Leu Val Lys Asn Arg Gly 145 150 155 160
Val Leu Arg His Ser Asn Tyr Thr Leu Pro Leu Glu Glu Ser Met Leu 165 170 175
Tyr Ser He Ser Arg Asp Ser Asp He Leu Phe Thr Leu Pro Asn Leu 180 185 190 Ser Lys Lys Glu Ser Val Ser Ser Leu Gln Thr Thr Ser Gln Tyr Leu 195 200 205 He Arg Asn Val Glu Thr Val Val Asp Glu Asp Val Leu Pro Gly Lys 210 215 220 Leu Pro Glu Thr Pro Leu Arg Ala Glu Pro Pro Ser Ser Tyr Lys Val 225 230 235 240
Met Cys Gln Trp Met Glu Lys Phe Arg Lys Asp Leu Cys Arg Phe Trp 245 250 255
Be Asn Val Phe Pro Val Phe Phe Gln Phe Leu Asn He Met Val Val 260 265 270 Gly He Thr Gly Wing Wing Val Val He Thr He Leu Val Val Phe Phe 275 '280 285 Pro Val Ser Glu Tyr Lys Gly He Leu Gln Leu Asp Lys Val Asp Val 290 295 300 He Pro Val Thr Ala He Asn Leu Tyr Pro Asp Gly Pro Glu Lys Arg 305 310 315 320
Wing Glu Asn Leu Glu Asp Lys Thr Cys He 325 330
(2) INFORMATION FOR SEQ ID NO: 23: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 108 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10305 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 23:
Met Ser Leu Thr Ser Ser Ser Ser Val Val Val Glu Trp He Ala Ala 1 5 10 15
Val Thr He Wing Wing Gly Thr Wing Wing He Gly Tyr Leu Wing Tyr Lys 20 25 30 Arg Phe Tyr Val Lys Asp His Arg Asn Lys Wing Met He Asn Leu His 35 40 45 He Gln Lys Asp Asn Pro Lys He Val His Wing Phe Asp Met Glu Asp 50 55 60 Leu Gly Asp Lys Wing Val Tyr Cys Arg Cys Trp Arg Ser Lys Lys Phe 65 70 75 80
Pro Phe Cys Asp Gly Ala His Thr Lys His Asn Glu Glu Thr Gly Asp 85 90 95
Asn Val Gly Pro Leu He He Lys Lys Lys Glu Thr 100 105
(2) INFORMATION FOR SEQ ID NO: 24: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 101 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Ostaosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10306 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 24:
Met Asn Leu Glu Arg Val Ser Asn Glu Glu Lys Leu Asn Leu Cys Arg 1 5 10 15
Lys Tyr Tyr Leu Gly Gly Phe Wing Phe Leu Pro Phe Leu Trp Leu Val 20 25 30 Asn He Phe Trp Phe Phe Arg Glu Wing Phe Leu Val Pro Wing Tyr Thr 35 40 45 Glu Gln Ser Gln He Lys Gly Tyr Val Trp Arg Ser Wing Val Gly Phe 50 55 60 Leu Phe Trp Val He Val Leu Thr Ser Trp He Thr He Phe Gln He 65 70 75 80
Tyr Arg Pro Arg Trp Gly Wing Leu Gly Asp Tyr Leu Ser Phe Thr He 85 90 95
Pro Leu Gly Thr Pro 100
(2) INFORMATION FOR SEQ ID NO: 25: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 372 (B) TYPE: Amino Acid (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: Protein (iii) HYPOTHETICAL : No (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELLULAR LINE: KB (D) CLONE NAME: HP10328 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 25:
Met Lys Tyr Leu Arg His Arg Arg Pro Asn Wing Thr Leu He Leu Wing 1 5 10 15
He Gly Wing Phe Thr Leu Leu Leu Phe Ser Leu Leu Val Ser Pro Pro 20 25 30 Thr Cys Lys Val Gln Gllu Pro Pro Wing Pro Glu Ala Leu Wing 35 40 45 Trp Pro Thr Pro Pro Thr Arg Pro Wing Pro Pro Wing Cys His Ala Asn 50 55 60 Thr Ser Met Val Thr His Pro Asp Phe Ala Thr Gln Pro Gln His Val 65 70 75 80
Gln Asn Phe Leu Leu Tyr Arg His Cys Arg His Phe Pro Leu Leu Gln 85 90 95
Asp Val Pro Pro Ser Lys Cys Wing Gln Pro Val Phe Leu Leu Val Val 105 105 110 He Lys Ser Ser Pro Ser Asn Tyr Val Arg Arg Glu Leu Leu Arg Arg 115 120 125 Thr Trp Gly Arg Glu Arg Lys Val Arg Gly Leu Gln Leu Arg Leu Leu 130 135 140 Phe Leu Val Gly Thr Ala Ser Asn Pro His Glu Ala Arg Lys Val Asn 145 150 155 160
Arg Leu Leu Glu Leu Glu Wing Gln Thr His Gly Asp He Leu Gln Trp 165 170 175
Asp Phe His Asp Ser Phe Phe Asn Leu Thr Leu Lys Gln Val Leu Phe 180 185 190 Leu Gln Trp Gln Glu Thr Arg Cys Wing Asn Wing Being Phe Val Leu Asn 195 200 205 Gly Asp Asp Asp Val Phe Wing His Thr Asp Asn Met Val Pha Tyr Leu 210 215 220 Gln Asp His Asp Pro Gly Arg His Leu Phe Val Gly Gln Leu He Gln 225 230 235 240
Asn Val Gly Pro He Arg Ala Phe Trp Ser Lys Tyr Tyr Val Pro Glu 245 250 255
Val Val Thr Gln Asn Glu Arg Tyr Pro Pro Tyr Cys Gly Gly Gly Gly 260 265 270 Phe Leu Leu Ser Arg Phe Thr Wing Wing Wing Leu Arg Wing Ala Wing 275 280 285 Val Leu Asp He Phe Pro He Asp Asp Val Phe Leu Gly Met Cys Leu 290 295 300 Glu Leu Glu Gly Leu Lys Pro Wing His Ser Gly He Arg Thr Ser 305 310 315 320
Gly Val Arg Ala Pro Ser Gln His Leu Ser Ser Phe Asp Pro Cys Phe 325 330 335
Tyr Arg Asp Leu Leu Leu Val His Arg Phe Leu Pro Tyr Glu Met Leu 340 345 350 Leu Met Trp Asp Ala Leu Asn Gln Pro Asn Leu Thr Cys Gly Asn Gln 355 360 365 Thr Gln He Tyr 370
(2) INFORMATION FOR SEQ ID NO: 26: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 615 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP00442 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 26: ATGACGGGGC TAGCACTGCT CTACTCCGGG GTCTTCGTGG CCTTCTGGGC CTGCGCGCTG 60
GCCGTGGGAG TCTGCTACAC CATTTTTGAT TTGGGCTTCC GCTTTGATGT GGCATGGTTC 120
CTGACGGAGA CTTCGCCCTT CATGTGGTCC AACCTGGGCA TTGGCCTAGC TATCTCCCTG 180
TCTGTGGTTG GGGCAGCCTG GGGCATCTAT ATTACCGGCT CCTCCATCAT TGGTGGAGGA 240 GTGAAGGCCC CCAGGATCAA GACCAAGAAC CTGGTCAGCA TCATCTTCTG TGAGGCTGTG 300
GCCATCTACG GCATCATCAT GGCAATTGTC ATTAGCAACA TGGCTGAGCC TTTCAGTGCC 360 ACAGACCCCA AGGCCATCGG CCATCGGAAC TACCATGCAG GCTACTCCAT GTTTGGGGCT 420
GGCCTCACCG TAGGCCTGTC TAACCTCTTC TGTGGAGTCT GCGTGGGCAT CGTGGGCAGT 480
GGGGCTGCCC TGGCCGATGC TCAGAACCCC AGCCTCTTTG TAAAGATTCT CATCGTGGAG 540
ATCTTTGGCA GCGCCATTGG CCTCTTTGGG GTCATCGTCG GAATTCTTCA GACCTCCAGA 600
GTGAAGATGG GTGAC 615
(2) INFORMATION FOR SEQ ID NO: 27: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1113 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Leukocyte (D) CLONE NAME: HP00804 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 27 : ATGTCCCATG AAAAGAGTTT TTTGGTGTCT GGGGACAACT ATCCTCCCCC CAACCCTGGA 60
TATCCGGGGG GGCCCCAGCC ACCCATGCCC CCCTATGCTC AGCCTCCCTA CCCTGGGGCC 120
CCTTACCCAC AGCCCCCTTT CCAGCCCTCC CCCTACGGTC AGCCAGGGTA CCCCCATGGC 180
CCCAGCCCCT ACCCCCAAGG GGGCTACCCA CAGGGTCCCT ACCCCCAAGG GGGCTACCCA 240
CAGGGCCCCT ACCCACAAGA GGGCTACCCA CAGGGCCCCT ACCCCCAAGG GGGCTACCCC 300
CAGGGGCCAT ATCCCCAGAG CCCCTTCCCC CCCAACCCCT ATGGACAGCC ACAGGTCTTC 360
CCAGGACAAG ACCCTGACTC ACCCCAGCAT GGAAACTACC AGGAGGAGGG TCCCCCATCC 420
TACTATGACA ACCAGGACTT CCCTGCCACC AACTGGGATG ACAAGAGCAT CCGACAGGCC 480
TTCATCCGCA AGGTGTTCCT AGTGCTGACC TTGCAGCTGT CGGTGACCCT GTCCACGGTG 540 TCTGTGTTCA CTTTTGTTGC GGAGGTGAAG GGCTTTGTCC GGGAGAATGT CTGGACCTAC 600
TATGTCTCCT ATGCTGTCTT CTTCATCTCT CTCATCGTCC TCAGCTGTTG TGGGGACTTC 660
CGGCGAAAGC ACCCCTGGAA CCTTGTTGCA CTGTCGGTCC TGACCGCCAG CCTGTCGTAC 720
ATGGTGGGGA TGATCGCCAG CTTCTACAAC ACCGAGGCAG TCATCATGGC CGTGGGCATC 780
ACCACAGCCG TCTGCTTCAC CGTCGTCATC TTCTCCATGC AGACCCGCTA CGACTTCACC 840
TCATGCATGG GCGTGCTCCT GGTGAGCATG GTGGTGCTCT TCATCTTCGC CATTCTCTGC 900
ATCTTCATCC GGAACCGCAT CCTGGAGATC GTGTACGCCT CACTGGGCGC TCTGCTCTTC 960
ACCTGCTTCC TCGCAGTGGA CACCCAGCTG CTGCTGGGGA ACAAGCAGCT GTCCCTGAGC 1020
CCAGAAGAGT ATGTGTTTGC TGCGCTGAAC CTGTACACAG ACATCATCAA CATCTTCCTG 1080
TACATCCTCA CCATCATTGG CCGCGCCAAG GAG 1113
(2) INFORMATION FOR SEQ ID NO: 28: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 537 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP01098 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO : 28: ATGCTGTCTC TAGACTTTTT GGACGATGTG CGGCGGATGA ACAAGCGGCA GCTCTATTAT 60
CAAGTCCTAA ATTTTGGAAT GATTGTCTCA TCGGCACTAA TGATCTGGAA GGGGTTAATG 120
GTAATAACTG GAAGTGAAAG TCCGATTGTA GTGGTGCTCA GTGGCAGCAT GGAACCTGCA 180
TTTCATAGAG GAGATCTTCT CTTTCTAACA AATCGAGTTG AAGATCCCAT ACGAGTGGGA 240
GAAATTGTTG TTTTTAGGAT AGAAGGAAGA GAGATTCCTA TAGTTCACCG AGTCTTGAAG 300
ATTCATGAAA AGCAAAATGG GCATATCAAG TTTTTGACCA AAGGAGATAA TAATGCGGTT 360
GATGACCGAG GCCTCTATAA ACAAGGACAA CATTGGCTAG AGAAAAAAAA TGTTGTGGGG 420
AGAGCCAGGG GATTTGTTCC TTATATTGGA ATTGTGACGA TCCTCATGAA TGACTATCCT 480
AAATTTAAGT ATGCAGTTCT CTTTTTTGCTG GGTTTATTCG TGCTGGTTCA TCGTGAG 537
(2) INFORMATION FOR SEQ ID NO: 29: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1041 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP01148 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 29 : ATGGCTCTGC TATTCTCCTT GATCCTTGCC ATTTGCACCA GACCTGGATT CCTAGCGTCT 60
CCATCTGGAG TGCGGCTGGT GGGGGGCCTC CACCGCTGTG AAGGGCGGGT GGAGGTGGAA 120
CAGAAAGGCC AGTGGGGCAC CGTGTGTGAT GACGGCTGGG ACATTAAGGA CGTGGCTGTG 180
TTGTGCCGGG AGCTGGGCTG TGGAGCTGCC AGCGGAACCC CTAGTGGTAT TTTGTATGAG 240
CCACCAGCAG AAAAAGAGCA AAAGGTCCTC ATCCAATCAG TCAGTTGCAC AGGAACAGAA 300
GATACATTGG CTCAGTGTGA GCAAGAAGAA GTTTATGATT GTTCACATGA AGAAGATGCT 360
GGGGCATCGT GTGAGAACCC AGAGAGCTCT TTCTCCCCAG TCCCAGAGGG TGTCAGGCTG 420
GCTGACGGCC CTGGGCATTG CAAGGGACGC GTGGAAGTGA AGCACCAGAA CCAGTGGTAT 480
ACCGTGTGCC AGACAGGCTG GAGCCTCCGG GCCGCAAAGG TGGTGTGCCG GCAGCTGGGA 540
TGTGGGAGGG CTGTACTGAC TCAAAAACGC TGCAACAAGC ATGCCTATGG CCGAAAACCC 600
ATCTGGCTGA GCCAGATGTC ATGCTCAGGA CGAGAAGCAA CCCTTCAGGA TTGCCCTTCT 660
GGGCCTTGGG GGAAGAACAC CTGCAACCAT GATGAAGACA CGTGGGTCGA ATGTGAAGAT 720
CCCTTTGACT TGAGACTAGT AGGAGGAGAC AACCTCTGCT CTGGGCGACT GGAGGTGCTG 780
CACAAGGGCG TATGGGGCTC TGTCTGTGAT GACAACTGGG GAGAAAAGGA GGACCAGGTG 840
GTATGCAAGC AACTGGGCTG TGGGAAGTCC CTCTCTCCCT CCTTCAGAGA CCGGAAATGC 900
TATGGCCCTG GGGTTGGCCG CATCTGGCTG GATAATGTTC GTTGCTCAGG GGAGGAGCAG 960
TCCCTGGAGC AGTGCCAGCA CAGATTTTGG GGGTTTCACG ACTGCACCCA CCAGGAAGAT 1020
GTGGCTGTCA TCTGCTCAGG A 1041
(2) INFORMATION FOR SEQ ID NO: 30: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1662 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Ho or sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP01293 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 30: ATGCCCACCG TGGATGACAT TCTGGAGCAG GTTGGGGAGT CTGGCTGGTT CCAGAAGCAA 60 GCCTTCCTCA TCTTATGCCT GCTGTCGGCT GCCTTTGCGC CCATCTGTGT GGGCATCGTC 120
TTCCTGGGTT TCACACCTGA CCACCACTGC CAGAGTCCTG GGGTGGCTGA GCTGAGCCAG 180
CGCTGTGGCT GGAGCCCTGC GGAGGAGCTO AACTATACAG TGCCAGGCCT GGGGCCCGCG 240
GGCGAGGCCT TCCTTGGCCA GTGCAGGCGC TATGAAGTGG ACTGGAACCA GAGCGCCCTC 300
AGCTGTGTAG ACCCCCTGGC TAGCCTGGCC ACCAACAGGA GCCACCTGCC GCTGGGTCCC 360
TGCCAGGATG GCTGGGTGTA TGACACGCCC GGCTCTTCCA TCGTCACTGA GTTCAACCTG 420
GTGTGTGCTG ACTCCTGGAA GCTCGACCTC TTTCAGTCCT GTTTGAATGC GGGCTTCTTC 480
TTTGGCTCTC TCGGTGTTGG CTACTTTGCA GACAGGTTTG GCCGTAAGCT GTGTCTCCTG 540
GGAACTGTGC TGGTCAACGC GGTGTCGGGC GTGCTCATGG CCTTCTCGCC CAACTACATG 600
TCCATGCTGC TCTTCCGCCT GCTGCAGGGC CTGGTCAGCA AGGGCAACTG GATGGCTGGC 660
TACACCCTAA TCACAGAATT TGTTGGCTCG GGCTCCAGAA GAACGGTGGC GATCATGTAC 720
CAGATGGCCT TCACGGTGGG GCTGGTGGCG CTTACCGGGC TGGCCTACGC CCTGCCTCAC 780
TGGCGCTGGC TGCAGCTGGC AGTCTCCCTG CCCACCTTCC TCTTCCTGCT CTACTACTGG 840
TGTGTGCCGG AGTCCCCTCG GTGGCTGTTA TCACAAAAAA GAAACACTGA AGCAATAAAG 900
ATAATGGACC ACATCGCTCA AAAGAATGGG AAGTTGCCTC CTGCTGATTT AAAGATGCTT 960
TCCCTCGAAG AGGATGTCAC CGAAAAGCTG AGCCCTTCAT TTGCAGACCT GTTCCGCACG 1020
CCGCGCCTGA GGAAGCGCAC CTTCATCCTG ATGTACCTGT GGTTCACGGA CTCTGTGCTC 1080
TATCAGGGGC TCATCCTGCA CATGGGCGCC ACCAGCGGGA ACCTCTACCT GGATTTCCTT 1140
TACTCCGCTC TGGTCGAAAT CCCGGGGGCC TTCATAGCCC TCATCACCAT TGACCGCGTG 1200
GGCCGCATCT ACCCCATGGC CGTGTCAAAT TTGTTGGCGG GGGCAGCCTG CCTCGTCATG 1260
ATTTTTTATCT CACCTGACCT GCACTGGTTA AACATCATAA TCATGTGTGT TGGCCGAATG 1320
GGAATCACCA TTGCAATACA AATGATCTGC CTGGTGAATG CTGAGCTGTA CCCCACATTC 1380
GTCAGGAACC TCGGAGTGAT GGTGTGTTCC TCCCTGTGTG ACATAGGTGG GATAATCACC 1440
CCCTTCATAG TCTTCAGGCT GAGGGAGGTC TGGCAAGCCT TGCCCCTCAT TTTGTTTGCG 1500
GTGTTGGGCC TGCTTGCCGC GGGAGTGACG CTACTTCTTC CAGAGACCAA GGGGGTCGCT 1560
TTGCCAGAGA CCATGAAGGA CGCCGAGAAC CTTGGGAGAA AAGCAAAGCC CAAAGAAAAC 1620
ACGATTTACC TTAAGGTCCA AACCTCAGAA CCCTCGGGCA CC 1662
(2) INFORMATION FOR SEQ ID NO: 31: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1050 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10013 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 31: ATGGCTGTGT TTGTCGTGCT CCTGGCGTTG GTGGCGGGTG TTTTGGGGAA CGAGTTTAGT 60
ATATTAAAAT CACCAGGGTC TGTTGTTTTC CGAAATGGAA ATTGGCCTAT ACCAGGAGAG 120
CGGATCCCAG ACGTGGCTGC ATTGTCCATG GGCTTCTCTG TGAAAGAAGA CCTTTCTTGG 180
CCAGGACTCG CAGTGGGTAA CCTGTTTCAT CGTCCTCGGG CTACCGTCAT GGTGATGGTG 240
AAGGGAGTGA ACAAACTGGC TCTACCCCCA GGCAGTGTCA TTTCGTACCC TTTGGAGAAT 300
GCAGTTCCTT TTAGTCTTGA CAGTGTTGCA AATTCCATTC ACTCCTTATT TTCTGAGGAA 360
ACTCCTGTTG TTTTGCAGTT GGCTCCCAGT GAGGAAAGAG TGTATATGGT AGGGAAGGCA 420
AACTCAGTGT TTGAAGACCT TTCAGTCACC TTGCGCCAGC TCCGTAATCG CCTGTTTCAA 480
GAAAACTCTG TTCTCAGTTC ACTCCCCCTC AATTCTCTGA GTAGGAACAA TGAAGTTGAC 540
CTGCTCTTTC TTTCTGAACT GCAAGTGCTA CATGATATTT CAAGCTTGCT GTCTCGTCAT 600
AAGCATCTAG CCAAGGATCA TTCTCCTGAT TTATATTCAC TGGAGCTGGC AGGTTTGGAT 660
GAAATTGGGA AGCGTTATGG GGAAGACTCT GAACAATTCA GAGATGCTTC TAAGATCCTT 720
GTTGACGCTC TGCAAAAGTT TGCAGATGAC ATGTACAGTC TTTATGGTGG GAATGCAGTG 780
GTAGAGTTAG TCACTGTCAA GTCATTTGAC ACCTCCCTCA TTAGGAAGAC AAGGACTATC 840
CTTGAGGCAA AACAAGCGAA GAACCCAGCA AGTCCCTATA ACCTTGCATA TAAGTATAAT 900
TTTGAATATT CCGTGGTTTT CAACATGGTA CTTTGGATAA TGATCGCCTT GGCCTTGGCT 960
GTGATTATCA CCTCTTACAA TATTTGGAAC ATGGATCCTG GATATGATAG CATCATTTAT 1020
AGGATGACAA ACCAGAAGAT TCGAATGGAT 1050
(2) INFORMATION FOR SEQ ID NO: 32: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 627 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10034 (xi) DESCRIPTION SEQUENCE: SEQ ID NO: 32: ATGGTGTCCT CTCCCTGCAC GCAGGCAAGC TCACGGACTT GCTCCCGTAT CCTGGGACTG 60 AGCCTTGGGA CTGCAGCCCT GTTTGCTGCT GGGGCCAACG TGGCACTCCT CCTTCCTAAC 120 TGGGATGTCA CCTACCTGTT GAGGGGCCTC CTTGGCAGGC ATGCCATGCT GGGAACTGGG 180 CTCTGGGGAG GAGGCCTCAT GGTACTCACT GCAGCTATCC TCATCTCCTT GATGGGCTGG 240 AGATACGGCT GCTTCAGTAA GAGTGGGCTC TGTCGAAGCG TGCTTACTGC TCTGTTGTCA 300 GGTGGCCTGG CTTTACTTGG AGCCCTGATT TGCTTTGTCA CTTCTGGAGT TGCTCTGAAA 360 GATGGTCCTT TTTGCATGTT TGATGTTTCA TCCTTCAATC AGACACAAGC TTGGAAATAT 420 GGTTACCCAT TCAAAGACCT GCATAGTAGG AATTATCTGT ATGACCGTTC GCTCTGGAAC 480 TCCGTCTGCC TGGA GCCCTC TGCAGCTGTT GTCTGGCACG TGTCCCTCTT CTCCGCCCTT 540 CTGTGCATCA GCCTGCTCCA GCTTCTCCTG GTGGTCGTTC ATGTCATCAA CAGCCTCCTG 600 GGCCTTTTCT GCAGCCTCTG CGAGAAG 627
(2) INFORMATION FOR SEQ ID NO: 33: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 489 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10050 (xi) DESCRIPTION SEQUENCE: SEQ ID NO: 33: ATGGCGGCTG GGCTGTTTGG TTTGAGCGCT CGCCGTCTTT TGGCGGCAGC GGCGACGCGA 60 GGGCTCCCGG CCGCCCGCGT CCGCTGGGAA TCTAGCTTCT CCAGGACTGT GGTCGCCCCG 120 TCCGCTGTGG CGGGAAAGCG GCCCCCAGAA CCGACCACAC CGTGGCAAGA GGACCCAGAA 180 CCCGAGGACG AAAACTTGTA TGAGAAGAAC CCAGACTCCC ATGGTTATGA CAAGGACCCC 240 GTTTTGGACG TCTGGAACAT GCGACTTGTC TTCTTCTTTG GCGTCTCCAT CATCCTGGTC 300 CTTGGCAGCA CCTTTGTGGC CTATCTGCCT GACTACAGGT GCACAGGGTG TCCAAGAGCG 360 TGGGATGGGA TGAAAGAGTG GTCCCGCCGC GAAGCTGAGA GGCTTGTGAA ATACCGAGAG 420 GCCAATGGCC TTCCCATCAT GGAATCCAAC TGCTTCGACC CCAGCAAGAT CCAGCTGCCA 480 GAGGATGAG 489
(2) INFORMATION FOR SEQ ID NO: 34: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 276 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear * - í (ii) ) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10071 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: ATGACGAAAT TAGCGCAGTG GCTTTGGGGA CTAGCGATCC TGGGCTCCAC CTGGGTGGCC 60
CTGACCACGG GAGCCTTGGG CCTGGAGCTG CCCTTGTCCT GCCAGGAAGT CCTGTGGCCA 120
CTGCCCGCCT ACTTGCTGGT GTCCGCCGGC TGCTATGCCC TGGGCACTGT GGGCTATCGT 180
GTGGCCACTT TTCATGACTG CGAGGACGCC GCACGCGAGC TGCAGAGCCA GATACAGGAG 240
GCCCGAGCCG ACTTAGCCCG CAGGGGGCTG CGCTTC 276
(2) INFORMATION FOR SEQ ID NO: 35: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 516 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10076 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 35: ATGGAATATT TGGCTCATCC CAGTACACTC GGCTTGGCTG TTGGAGTTGC TTGTGGCATG 60 TGCCTGGGCT GGAGCCTTCG AGTATGCTTT GGGATGCTCC CCAAAAGCAA GACGAGCAAG 120 ACACACACAG ATACTGAAAG TGAAGCAAGC ATCTTGGGAG ACAGCGGGGA GTACAAGATG 180 ATTCTTGTGG TTCGAAATGA CTTAAAGATG GGAAAAGGGA AAGTGGCTGC CCAGTGCTCT 240 CATGCTGCTG TTTCAGCCTA CAAGCAGATT CAAAGAAGAA ATCCTGAAAT GCTCAAACAA 300 TGGGAATACT GTGGCCAGCC CAAGGTGGTG GTCAAAGCTC CTGATGAAGA AACCCTGATT 360 GCATTATTGG CCCATGCAAA AATGCTGGGA CTGACTGTAA GTTTAATTCA AGATGCTGGA 420 CGTACTCAGA TTGCACCAGG CTCTCAAACT GTCCTAGGGA TTGGGCCAGG ACCAGCAGAC 480 CTAATTGACA AAGTCACTGG T CACCTAAAA CTTTAC 516
(2) INFORMATION FOR SEQ ID NO: 36: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 447 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10085 '(xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 36: ATGATGACCA AACATAAAAA GTGTTTTATA ATTGTTGGTG TTTTAATAAC AACTAATATT 60
ATTACTCTGA TAGTTAAACT AACTCGAGAT TCTCAGAGTT TATGCCCCTA TGATTGGATT 120
GGTTTCCAAA ACAAATGCTA TTATTTCTCT AAAGAAGAAG GAGATTGGAA TTCAAGTAAA 180
TACAACTGTT CCACTCAACA TGCCGACCTA ACTATAATTG ACAACATAGA AGAAATGAAT 240
TTTCTTAGGC GGTATAAATG CAGTTCTGAT CACTGGATTG GACTGAAGAT GGCAAAAAAT 300
CGAAGAGGAC AATGGGTAGA TGGAGCTACA TTTACCAAAT CGTTTGGCAT GAGAGGGAGT 360
GAAGGATGTG CCTACCTCAG CGATGATGGT GCAGCAACAG CTAGATGTTA CACCGAAAGA 420
AAATGGATTT GCAGGAAAAG AATACAC 447
(2) INFORMATION FOR SEQ ID NO: 37: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 564 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10122 (i) DESCRIPTION OF SEQUENCE: SEQ ID NO : 37: ATGAGCACTA TGTTCGCGGA CACTCTCCTC ATCGTTTTTA TCTCTGTGTG CACGGCTCTG 60
CTCGCAGAGG GCATAACCTG GGTCCTGGTT TACAGGACAG ACAAGTACAA GAGACTGAAG 120
GCAGAAGTGG AAAAACAGAG TAAAAAATTG GAAAAGAAGA AGGAAACAAT AACAGAGTCA 180
GCTGGTCGAC AACAGAAAAA GAAAATAGAG AGACAAGAAG AGAAACTGAA GAATAACAAC 240
AGAGATCTAT CAATGGTTCG AATGAAATCC ATGTTTGCTA TTGGCTTTTG TTTTACTGCC 300
CTAATGGGAA TGTTCAATTC CATATTTGAT GGTAGAGTGG TGGCAAAGCT TCCTTTTACC 360
CCTCTTTCTT ACATCCAAGG ACTGTCTCAT CGAAATCTGC TGGGAGATGA CACCACAGAC 420
TGTTCCTTCA TTTTCCTGTA TATTCTCTGT ACTATGTCGA TTCGACAGAA CATTCAGAAG 480
ATTCTCGGCC TTGCCCCTTC ACGAGCCGCC ACCAAGCAGG CAGGTGGATT TCTTGGCCCA 540 CCACCTCCTT CTGGGAAGTT CTCT 564
(2) INFORMATION FOR SEQ ID NO: 38: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 645 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear '(ii) TYPE SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELL LINE: U937 (D) CLONE NAME: HP10136 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 38: ATGGTGTTGC TAACAATGAT CGCCCGAGTG GCGGACGGGC TCCCGCTGGC CGCCTCGATG 60
CAGGAGGACG AACAGTCTGG CCGGGACCTT CAACAGTATC AGAGTCAGGC TAAGCAACTC 120
TTTCGAAAGT TGAATGAACA GTCCCCTACC AGATGTACCT TGGAAGCAGG AGCCATGACT 180
TTTCACTACA TTATTGAGCA GGGGGTGTGT TATTTGGTTT TATGTGAAGC TGCCTTCCCT 240
AAGAAGTTGG CTTTTGCCTA CCTAGAAGAT TTGCACTCAG AATTTGATGA ACAGCATGGA 300
AAGAAGGTGC CCACTGTGTC CCGACCCTAT TCCTTTATTG AATTTGATAC TTTCATTCAG 360
AAAACCAAGA AGCTCTACAT TGACAGTCGT GCTCGAAGAA ATCTAGGCTC CATCAAGACT 420
GAATTGCAAG ATGTGCAGAG GATCATGGTG GCCAATATTG AAGAAGTGTT ACAACGAGGA 480
GAAGCACTCT CAGCATTGGA TTCAAAGGCT AACAATTTGT CCAGTCTGTC CAAGAAATAC 540
CGCCAGGATG CGAAGTACTT GAACATGCGT TCCACTTATG CCAAACTTGC AGCAGTAGCT 600
GTATTTTTCA TCATGTTAAT AGTGTATGTC CGATTCTGGT GGCTG 645
(2) INFORMATION FOR SEQ ID NO: 39: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 336 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10175 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO : 39: ATGCAGGACA CTGGCTCAGT AGTGCCTTTG CATTGGTTTG GCTTTGGCTA CGCAGCACTG 60
GTTGCTTCTG GTGGGATCAT TGGCTATGTA AAAGCAGGCA GCGTGCCGTC CCTGGCTGCA 120 GGGCTGCTCT TTGGCAGTCT AGCCGGCCTG GGTGCTTACC AGCTGTCTCA GGATCCAAGG 180
AACGTTTGGG TTTTCCTAGC TACATCTGGT ACCTTGGCTG GCATTATGGG AATGAGGTTC 240
TACCACTCTG GAAAATTCAT GCCTGCAGGT TTAATTGCAG GTGCCAGTTT GCTGATGGTC 300
GCCAAAGTTG GAGTTAGTAT GTTCAACAGA CCCCAT 336
(2) INFORMATION FOR SEQ ID NO: 40: '(i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 342 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Epidermoid carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10179 (xi) DESCRIPTION SEQUENCE: SEQ ID NO: 40: ATGGAGAAGC CCCTCTTCCC ATTAGTGCCT TTGCATTGGT TTGGCTTTGG CTACACAGCA 60 CTGGTTGTTT CTGGTGGGAT CGTTGGCTAT GTAAAAACAG GCAGCGTGCC GTCCCTGGCT 120 GCAGGGCTGC TCTTCGGCAG TCTAGCCGGC CTGGGTGCTT ACCAGCTGTA TCAGGATCCA 180 AGGAACGTTT GGGGTTTCCT AGCCGCTACA TCTGTTACTT TTGTTGGTGT TATGGGAATG 240 AGATCCTACT ACTATGGAAA ATTCATGCCT GTAGGTTTAA TTGCAGGTGC CAGTTTGCTG 300 ATGGCCGCCA AAGTTGGAGT TCGTATGTTG ATGACATCTG AT 342
(2) INFORMATION FOR SEQ ID NO: 41: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 981 (B) TYPE: Nualáic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10196 (xi) DESCRIPTION SEQUENCE: SEQ ID NO: 41: ATGGCGGCGG CGGCGGCGGC GGCTGCAGCT ACGAACGGGA CCGGAGGAAG CAGCGGGATG 60 GAGGTGGATG CAGCAGTAGT CCCCAGCGTG ATGGCCTGCG GAGTGACTGG GAGTGTTTCC 120 GTCGCTCTCC ATCCCCTTGT CATTCTCAAC ATCTCAGACC ACTGGATCCG CATGCGCTCC 180 CAGGAGGGGC GGCCTGTGCA GGTGATTGGG GCTCTGATTG GCAAGCAGGA GGGCCGAAAT 240 ATCGAGGTGA TGAACTCCTT TGAGCTGCTG TCCCACACCG TGGAAGAGAA GATTATCATT 300 GACAAGGAAT ATTATTACAC CAAGGAGGAG CAGTTTAAAC AGGTGTTCAA GGAGCTGGAG 360 TTTCTGGGTT GGTATACCAC AGGGGGGCCA CCTGACCCCT CGGACATCCA CGTCCATAAG 420 CAGGTGTGTG AGATCATCGA GAGCCCCCTC TTTCTGAAGT TGAACCCTAT GACCAAGCAC 480 ACAGATCTTC CT GTCAGCGT TTTTGAGTCT GTCATTGATA TAATCAATGG AGAGGCCACA 540 ATGCTGTTTG CTGAGCTGAC CTACACTCTG GCCACAGAGG AAGCGGAACG CATTGGTGTA 600 GACCACGTAG CCCGAATGAC AGCAACAGGC AGTGGAGAGA ACTCCACTGT GGCTGAACAC 660 CTGATAGCAC AGCACAGCGC CATCAAGATG CTGCACAGCC GCGTCAAGCT CATCTTGGAG 720 TACGTCAAGG CCTCTGAAGC GGGAGAGGTC CCCTTTAATC ATGAGATCCT GCGGGAGGCC 780 TATGCTCTGT GTCACTGTCT CCCGGTGCTC AGCACAGACA AGTTCAAGAC AGATTTTTAT 840 GATCAATGCA ACGACGTGGG GCTCATGGCC TACCTCGGCA CCATCACCAA AACGTGCAAC 900 ACCATGAACC AGTTTGTGAA CAAGTTCAAT GTCCTCTACG ACCGACAAGG CATCGGCAGG 960 AGAATGCGCG GGCTCTTTTT C 981
(2) INFORMATION FOR SEQ ID NO: 42: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1119 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) LINE CELL: HT-1080 (D) NAME CLONE: HP10235 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: ATGACCCTAT GTGCCATGCT GCCCCTGCTG TTATTCACCT ACCTCAACTC CTTCCTGCAT 60 CAGAGGATCC CCCAGTCCGT ACGGATCCTG GGCAGCCTGG TGGCCATCCT GCTGGTGTTT 120 CTGATCACTG CCATCCTGGT GAAGGTGCAG CTGGATGCTC TGCCCTTCTT TGTCATCACC 180 ATGATCAAGA TCGTGCTCAT TAATTCATTT GGTGCCATCC TGCAGGGCAG CCTGTTTGGT 240 CTGGCTGGCC TTCTGCCTGC CAGCTACACG GCCCCCATCA TGAGTGGCCA GGGCCTAGCA 300 GGCTTCTTTG CCTCCGTGGC CATGATCTGC GCTATTGCCA GTGGCTCGGA GCTATCAGAA 360 AGTGCCTTCG GCTACTTTAT CACAGCCTGT GCTGTTATCA TTTTGACCAT CATCTGTTAC 420 CTGGGCCTGC CCCGCCTGGA ATTCTACCGC TACTACCAGC AGCTCAAGCT TGAAGGACCC 480 GGGGAGCAGG AGACCAAGTT GGACCTCATT AGCAAAGGAG AGGAGCCAAG AGCAGGCAAA 540 GAGGAATCTG GAGTTTCAGT CTCCAACTCT CAGCCCACCA ATGAAAGCCA CTCTATCAAA 600%% GCCATCCTGA AAAATATCTC AGTCCTGGCT TTCTCTGTCT GCTTCATCTT CACTATCACC 660
ATTGGGATGT TTCCAGCCGT GACTGTTGAG GTCAAGTCCA GCATCGCAGG CAGCAGCACC 720
TGGGAACGTT ACTTCATTCC TGTGTCCTGT TTCTTGACTT TCAATATCTT TGACTGGTTG 780
GGCCGGAGCC TCACAGCTGT ATTCATGTGG CCTGGGAAGG ACAGCCGCTG GCTGCCAAGC 840
CTGGTGCTGG CCCGGCTGGT GTTTGTGCCA CTGCTGCTGC TGTGCAACAT TAAGCCCCGC 900
CGCTACCTGA CTGTGGTCTT CGAGCACGAT GCCTGGTTCA TCTTCTTCAT GGCTGCCTTT 960
GCCTTCTCCA ACGGCTACCT CGCCAGCCTC TGCATGTGCT TCGGGCCCAA GAAAGTGAAG 1020
CCAGCTGAGG CAGAGACCGC AGGAGCCATC ATGGCCTTCT TCCTGTGTCT GGGTCTGGCA 1080
CTGGGGGCTG TTTTCTCCTT CCTGTTCCGG GCAATTGTG 1119
(2) INFORMATION FOR SEQ ID NO: 43: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 549 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10297 (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO : 43: ATGAAGCTCT TATCTTTGGT GGCTGTGGTC GGGTGTTTGC TGGTGCCCCC AGCTGAAGCC 60
AACAAGAGTT CTGAAGATAT CCGGTGCAAA TGCATCTGTC CACCTTATAG AAACATCAGT 120
GGGCACATTT AGAACCAGAA TGTATCCCAG AAGGACTGCA ACTGCCTGCA CGTGGTGGAG 180
CCCATGCCAG TGCCTGGCCA TGACGTGGAG GCCTACTGCC TGCTGTGCGA GTGCAGGTAC 240
GAGGAGCGCA GCACCACCAC CATCAAGGTC ATCATTGTCA TCTACCTGTC CGTGGTGGGT 300
GCCCTGTTGC TCTACATGGC CTTCCTGATG CTGGTGGACC CTCTGATCCG AAAGCCGGAT 360
GCATACACTG AGCAACTGCA CAATGAGGAG GAGAATGAGG ATGCTCGCTC TATGGCAGCA 420
GCTGCTGCAT CCCTCGGGGG ACCCCGAGCA AACACAGTCC TGGAGCGTGT GGAAGGTGCC 480
CAGCAGCGGT GGAAGCTGCA GGTGCAGGAG CAGCGGAAGA CAGTCTTCGA TCGGCACAAG 540
ATGCTCAGC 549
(2) INFORMATION FOR SEQ ID NO: 44: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 348 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10299, (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 44: ATGGCCAGTA CAGTGGTAGC AGTTGGACTG ACCATTGCTG CTGCAGGATT TGCAGGCCGT 60 TACGTTTTGC AAGCCATGAA GCATATGGAG CCTCAAGTAA AACAAGTTTT TCAAAGCCTA 120 CCAAAATCTG CCTTCAGTGG TGGCTATTAT AGAGGTGGGT TTGAACCCAA AATGACAAAA 180 CGGGAAGCA GCATTAATAC TAGGTGTAAG CCCTACTGCC AATAAAGGGA AAATAAGAGA 240 GCTCATCGAC GAATTATGCT TTTAAATCAT CCTGACAAAG GAGGATCTCC TTATATAGCA 300 GCCAAAATCA ATGAAGCTAA AGATTTACTA GAAGGTCAAG CTAAAAAA 348
(2) INFORMATION FOR SEQ ID NO: 45: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 456 (B) TYPE: Nuclide Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10301 (xi) DESCRIPTION SEQUENCE: SEQ ID NO: 45: ATGGCTGTCC TCTCTAAGGA ATATGGTTTT GTGCTTCTAA CTGGTGCTGC CAGCTTTATA 60 ATGGTGGCCC ACCTAGCCAT CAATGTTTCC AAGGCCCGCA AGAAGTACAA AGTGGAGTAT 120 CCTATCATGT ACAGCACGGA CCCTGAAAAT GGGCACATCT TCAACTGCAT TCAGCGAGCC 180 CACCAGAACA CGTTGGAAGT GTATCCTCCC TTCTTATTTT TTCTAGCTGT TGGAGGTGTT 240 TACCACCCGC GTATAGCTTC TGGCCTGGGC TTGGCCTGGA TTGTTGGACG AGTTCTTTAT 300 GCTTATGGCT ATTACACGGG AGAACCCAGC AAGCGTAGTC GAGGAGCCCT GGGGTCCATC 360 GCCCTCCTGG GCTTGGTGGG CACAACTGTG TGCTCTGCTT TCCAGCATCT TGGTTGGGTT 420 AAAAGTGGCT TGGGCAGTGG ACCCAAATGC TGCCAT 456
(2) INFORMATION FOR SEQ ID NO: 46: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1677 (B) TYPE: Nucleic Acid (C) BRANCH: Do ^ le (D) TOPOLOGY ": Linear (ii) ) TYPE OF SEQUENCE: cDNA for mRNA (i) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP10302 '(xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 46: ATGGCCCCCA CGCTGCAACA GGCGTACCGG AGGCGCTGGT GGATGGCCTG CACGGCTGTG 60
CTGGAGAACC TCTTCTTCTC TGCTGTACTC CTGGGCTGGG GCTCCCTGTT GATCATTCTG 120
AAGAACGAGG GCTTCTATTC CAGCACGTGC CCAGCTGAGA GCAGCACCAA CACCACCCAG 180
GATGAGCAGC GCAGGTGGCC AGGCTGTGAC CAGCAGGACG AGATGCTCAA CCTGGGCTTC 240
ACCATTGGTT CCTTCGTGCT CAGCGCCACC ACCCTGCCAC TGGGGATCCT CATGGACCGC 300
TTTGGCCCCC GACCCGTGCG GCTGGTTGGC AGTGCCTGCT TCACTGCGTC CTGCACCCTC 360
ATGGCCCTGG CCTCCCGGGA CGTGGAAGCT CTGTCTCCGT TGATATTCCT GGCGCTGTCC 420
CTGAATGGCT TTGGTCGCAT CTGCCTAACG TTCACTTCAC TCACGCTGCC CAACATGTTT 480
GGGAACCTGC GCTCCACGTT AATGGCCCTC ATGATTGGCT CTTACGCCTC TTCTGCCATT 540
ACGTTCCCAG GAATCAAGCT GATCTACGAT GCCGGTGTGG CCTTCGTGGT CATCATGTTC 600
ACCTGGTCTG GCCTGGCCTG CCTTATCTTT CTGAACTGCA CCCTCAACTG GCCCATCGAA 660
GCCTTTCCTG CCCCTGAGGA AGTCAATTAC ACGAAGAAGA TCAAGCTGAG TGGGCTGGCC 720
CTGGACCACA AGGTGACAGG TGACCTCTTC TACACCCATG TGACCACCAT GGGCCAGAGG 780
CTCAGCCAGA AGGCCCCCAG CCTGGAGGAC GGTTCGGATG CCTTCATGTC ACCCCAGGAT 840
GTTCGGGGCA CCTCAGAAAA CCTTCCTGAG AGGTCTGTCC CCTTACGCAA GAGCCTCTGC 900
TCCCCCACTT TCCTGTGGAG CCTCCTCACC ATGGGCATGA CCCAGCTGCG GATCATCTTC 960
TACATGGCTG CTGTGAACAA GATGCTGGAG TACCTTGTGA CTGGTGGCCA GGAGCATGAG 1020
ACAAATGAAC AGCAACAAAA GGTGGCAGAG ACAGTTGGGT TCTACTCCTC CGTCTTCGGG 1080
GCCATGCAGC TGTTGTGCCT TCTCACCTGC CCCCTCATTG GCTACATCAT GGACTGGCGG 1140
ATCAAGGACT GCGTGGACGC CCCAACTCAG GGCACTGTCC TCGGAGATGC CAGGGACGGG 1200
GTTGCTACCA AATCCATCAG ACCACGCTAC TGCAAGATCC AAAAGCTCAC CAATGCCATC 1260
AGTGCCTTCA CCCTGACCAA CCTGCTGCTT GTGGGTTTTG GCATCACCTG TCTCATCAAC 1320
AACTTACACC TCCAGTTTGT GACCTTTGTC CTGCACACCA TTGTTCGAGG TTTCTTCCAC 1380
TCAGCCTGTG GGAGTCTCTA TGCTGCAGTG TTCCCATCCA ACCACTTTGG GACGCTGACA 1440
GGCCTGCAGT CCCTCATCAG TGCTGTGTTC GCCTTGCTTC AGCAGCCACT TTTCATGGCG 1500
ATGGTGGGAC CCCTGAAAGG AGAGCCCTTC TGGGTGAATC TGGGCCTCCT GCTATTCTCA 1560 CTCCTGGGAT TCCTGTTGCC TTCCTACCTC TTCTATTACC GTGCCCGGCT CCAGCAGGAG 1620 TACGCCGCCA ATGGGATGGG CCCACTGAAG GTGCTTAGCG GCTCTGAGGT GACCGCA 1677
(2) INFORMATION FOR SEQ ID NO: 47: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 990 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10304 (i) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 47: ATGGAGGGCG CTCCACCGGG GTCGCTCGCC CTCCGGCTCC TGCTGTTCGT GGCGCTACCC 60
GCCTCCGGCT GGCTGACGAC GGGCGCCCCC GAGCCGCCGC CGCTGTCCGG AGCCCCACAG 120
GACGGCATCA GAATTAATGT AACTACACTG AAAGATGATG GGGACATATC TAAACAGCAG 180
GTTGTTCTTA ACATAACCTA TGAGAGTGGA CAGGTGTATG TAAATGACTT ACCTGTAAAT 240
AGTGGTGTAA CCCGAATAAG CTGTCAGACT TTGATAGTGA AGAATGAAAA TCTTGAAAAT 300
TTGGAGGAAA AAGAATATTT TGGAATTGTC AGTGTAAGGA TTTTAGTTCA TGAGTGGCCT 360
ATGACATCTG GTTCCAGTTT GCAACTAATT GTCATTCAAG AAGAGGTAGT AGAGATTGAT 420
GGAAAACAAG TTCAGCAAAA GGATGTCACT GAAATTGATA TTTTAGTTAA GAACCGGGGA 480
GTACTCAGAC ATTCAAACTA TACCCTCCCT TTGGAAGAAA GCATGCTCTA CTCTATTTCT 540
CGAGACAGTG ACATTTTATT TACCCTTCCT AACCTCTCCA AAAAAGAAAG TGTTAGTTCA 600
CTGCAAACCA CTAGCCAGTA TCTTATCAGG AATGTGGAAA CCACTGTAGA TGAAGATGTT 660
TTACCTGGCA AGTTACCTGA AACTCCTCTC AGAGCAGAGC CGCCATCTTC ATATAAGGTA 720
ATGTGTCAGT GGATGGAAAA GTTTAGAAAA GATCTGTGTA GGTTCTGGAG CAACGTTTTC 780
CCAGTATTCT TTCAGTTTTT GAACATCATG GTGGTTGGAA TTACAGGAGC AGCTGTGGTA 840
ATAACCATCT. TAAAGGTGTT TTTCCCAGTT TCTGAATACA AAGGAATTCT TCAGTTGGAT 900
AAAGTGGACG TCATACCTGT GACAGCTATC AACTTATATC CAGATGGTCC AGAGAAAAGA 960
GCTGAAAACC TTGAAGATAA AACATGTATT 990
(2) INFORMATION FOR SEQ ID NO: 48: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 324 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10305, (xi) ) SEQUENCE DESCRIPTION: SEQ ID NO: 48: ATGAGTCTGA CTTCCAGTTC CAGCGTACGA GTTGAATGGA TCGCAGCAGT TACCATTGCT 60 GCTGGGACAG CTGCAATTGG TTATCTAGCT TACAAAAGAT TTTATGTTAA AGATCATCGA 120 AATAAAGCTA TGATAAACCT TCACATCCAG AAAGACAACC CCAAGATAGT ACATGCTTTT 180 GACATGGAGG ATTTGGGAGA TAAAGCTGTG TACTGCCGTT GTTGGAGGTC CAAAAAGTTC 240 CCATTCTGTG ATGGGGCTCA CACAAAACAT AACGAAGAGA CTGGAGACAA TGTGGGCCCT 300 CTGATCATCA AGAAAAAAGA AACT 324
(2) INFORMATION FOR SEQ ID NO: 49: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 303 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10306 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: ATGAACCTGG AGCGAGTGTC CAATGAGGAG AAATTGAACC TGTGCCGGAA GTACTACCTG 60 GGGGGGTTTG CTTTCCTGCC TTTTCTCTGG TTGGTCAACA TCTTCTGGTT CTTCCGAGAG 120 GCCTTCCTTG TCCCAGCCTA CACAGAACAG AGCCAAATCA AAGGCTATGT CTGGCGCTCA 180 GCTGTGGGCT TCCTCTTCTG GGTGATAGTG CTCACCTCCT GGATCACCAT CTTCCAGATC 240 TACCGGCCCC GCTGGGGTGC CCTTGGGGAC TACCTCTCCT TCACCATACC CCTGGGCACC 300 CCC 303
(2) INFORMATION FOR SEQ ID NO: 50: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1116 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: ADNo for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10328 (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 50: ATGAAGTATC TCCGGCACCG GCGGCCCAAT GCCACCCTCA TTCTGGCCAT CGGCGCTTTC 60
ACCCTCCTCC TCTTCAGTCT GCTAGTGTCA CCACCCACCT GCAAGGTCCA GGAGCAGCCA 120
CCGGCGATCC CCGAGGCCCT GGCCTGGCCC ACTCCACCCA CCCGCCCAGC CCCGGCCCCG 180
TGCCATGCCA ACACCTCTAT GGTCACCCAC CCGGACTTCG CCACGCAGCC GCAGCACGTT 240
CAGAACTTCC TCCTGTACAG ACACTGCCGC CACTTTCCCC TGCTGCAGGA CGTGCCCCCC 300
TCTAAGTGCG CGCAGCCGGT CTTCCTGCTG CTGGTGATCA AGTCCTCCCC TAGCAACTAT 360
GTGCGCCGCG AGCTGCTGCG GCGCACGTGG GGCCGCGAGC GCAAGGTACG GGGTTTGCAG 420
CTGCGCCTCC TCTTCCTGGT GGGCACAGCC TCCAACCCGC ACGAGGCCCG CAAGGTCAAC 480
CGGCTGCTGG AGCTGGAGGC ACAGACTCAC GGAGACATCC TGCAGTGGGA CTTCCACGAC 540
TCCTTCTTCA ACCTCACGCT CAAGCAGGTC CTGTTCTTAC AGTGGCAGGA GACAAGGTGC 600
GCCAACGCCA GCTTCGTGCT CAACGGGGAT GATGACGTCT TTGCACACAC AGACAACATG 660
GTCTTCTACC TGCAGGACCA TGACCCTGGC CGCCACCTCT TCGTGGGGCA ACTGATCCAA 720
AACGTGGGCC CCATCCGGGC TTTTTGGAGC AAGTACTATG TGCCAGAGGT GGTGACTCAG 780
AATGAGCGGT ACCCACCCTA TTGTGGGGGT GGTGGCTTCT TGCTGTCCCG CTTCACGGCC 840
GCTGCCCTGC GCCGTGCTGC CCATGTCTTG GACATCTTCC CCATTGATGA TGTCTTCCTG 900
GGTATGTGTC TGGAGCTTGA GGGACTGAAG CCTGCCTCCC ACAGCGGCAT CCGCACGTCT 960
GGCGTGCGGG CTCCATCGCA ACACCTGTCC TCCTTTGACC CCTGCTTCTA CCGAGACCTG 1020
CTGCTGGTGC ACCGCTTCCT ACCTTATGAG ATGCTGCTCA TGTGGGATGC GCTGAACCAG 1080
CCCAACCTCA CCTGCGGCAA TCAGACACAG ATCTAC 1116
(2) INFORMATION FOR SEQ ID NO: 51: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 986 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (Vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1Q80 (D) CLONE NAME: HP10442 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 82 .. 699 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 51: AGACTGCGGG ACGGACGGTG GACGCTGGGA CGCGTTTGTA GCTCCGGCCC CGCCGTTCCG 60
ACCCCCGCCG CCGTCGCCGC C ATG ACG GGG CTA GCA CTG CTC TAC TCC GGG 111 t Met Thr Gly Leu Wing Leu Leu Tyr Ser Gly 1 5 10 GTC TTC GTG GCC TTC TGG GCC TGC GCG CTG GCC GTG GGA GTC TGC TAC 159 Val Pha Val Wing Phe Trp Ala Cys Ala Leu Ala Val Gly Val Cys Tyr 15 20 25 ACC ATT TTT GAT TTG GGC TTC CGC TTT GAT GTG GCA TGG TTC CTG ACG 207 Thr He Phe Asp Leu Gly Phe Arg Phe Asp Val Wing Trp Phe Leu Thr 30 35 40 GAG ACT TCG CCC TTC ATG TGG TCC AAC CTG GGC ATT GGC CTA GCT ATC 255 Glu Thr Ser Pro Phe Met Trp Ser Asn Leu Gly Ha Gly Leu Wing He 45 50 55 TCC CTG TCT GTG GTT GGG GCA GCC TGG GGC ATC TAT ATT ACC GGC TCC 303 Ser Leu Ser Val Val Gly Wing Wing Trp Gly He Tyr He Thr Gly Ser 60 65 70 TCC ATC ATT GGT GGA GGA GTG AAG GCC CCC AGG ATC AAG ACC AAG AAC 351
Be He He Gly Gly Gly Val Lys Pro Wing Arg He Lys Thr Lys Asn 75 80 85 90 CTG GTC AGC ATC ATC TTC TGT GAG GCT GTG GCC ATC TAC GGC ATC ATC 399 Leu Val Ser He He Phe Cys Glu Ala Val Wing He Tyr Gly He He 95 100 105 ATG GCA ATT GTC ATT AGC AAC ATG GCT GAC CCT TTC AGT GCC ACA GAC 447 Met Wing He Val He Ser Asn Met Wing Glu Pro Phe Ser Wing Thr Asp 110 115 120 CCC AAG GCC ATC GGC CAT CGG AAC TAC CAT GCA GGC TAC TCC ATG TTT 495 Pro Lys Wing He Gly His Arg Asn Tyr His Wing Gly Tyr Ser Met Phe 125 130 135 GGG GCT GGC CTC ACC GTA GGC CTG TCT AAC CTC TTC TGT GGA GTC TGC 543 Gly Wing Gly Leu Thr Val Gly Leu Ser Asn Leu Phe Cys Gly Val Cys 140 145 150 GTG GGC ATC GTG GGC AGT GGG GCT GCC CTG GCC GAT GCT CAG AAC CCC 591 Val Gly He Val Gly Ser Gly Ala Ala Leu Ala Asp Ala Gln Asn Pro 155 160 165 170 AGC CTC TTT GTA AAG ATT CTC ATC GTG GAG ATC TTT GGC AGC GCC ATT 639 Ser Leu Phe Val Lys He Leu He Val Glu He Phe Gly Ser Wing He 175 180 185 GGC CTC TTT GGG GTC ATC GTC GCA ATT CTT CAG ACC TCC AGA GTG AAG 687 Gly Leu Phe Gly Val He Val Wing He Leu Gln Thr Ser Arg V to Lys 190 195 200 GGT ATG GAC TAGATGATAT GTGTGGGTGG GGCCGTGCCT Met Gly Asp CACT 730 205 TTTATTTATT GCTGGTTTTC CTGGGACAGC TGGAGCTGTG TCCCTTAGCC TTTCAGAGGC 790 TTGGTGTTCA GGGCCCTCCC TGCACTCCCC TCTTGCTGCG TGTTGATTTG GAGGCACTGC 850 AGTCCAGGCC GAGTCCTCAG TGCGGGGAGC AGGCTGCTGC TGCTGACTCT GTGCAGCTGC 910 GCACCTGTGT CCCCCACCTC CACCCTCAAC CCATCTTCCT AGTGTTTGTG AAATAAACTT 970 986 GGTATTTGTC TGGGTC
(2) INFORMATION FOR SEQ ID NO: 52: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1824 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Leukocyte (D) CLONE NAME: HP00804 (ix) SEQUENCE CHARACTERISTICS: (A) CODE OF CHARACTERIZATION: CDS (B) EXISTENCE POSITION: 133 .. 1248 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 52: GGCCCAGCTG AGCGGCCGCC GAGCGGGTGC GGGTGCGGGC GCATCGGCCA TCACCGCGCG 60
GCCGCGCAGC GGACACCGTG CGTACCGGCC TGCGGCGCCC GGCCACCGGG GCGGACCGCG 120
GAACCCGAGG CC ATG TCC CAT GAA AAG AGT TTT TTG GTG TCT GGG GAC AAC 171 Met Ser His Glu Lys Ser Phe Leu Val Ser Gly Asp Asn 1 5 10 TAT CCT CCC CCC AAC CCT GGA TAT CCG GGG GGG CCC CAG CCA CCC ATG 219 Tyr Pro Pro Pro Asn Pro Gly Tyr Pro Gly Gly Pro Gln Pro Pro Met 15 20 25 CCC CCC TAT GCT CAG CCT CCC TAC CCT GGG GCC CCT TAC CAC GAG CCC 267
Pro Pro Tyr Wing Gln Pro Pro Tyr Pro Gly Wing Pro Tyr Pro Gln Pro 30 35 40 45 CCT TTC CAG CCC TCC CCC TAC GGT CAG CCA GGG TAC CCC CAT GGC CCC 315 Pro Phe Gln Pro Ser Pro Tyr Gly Gln Pro Gly Tyr Pro His Gly Pro 50 55 60 AGC CCC TAC CCC CAA GGG GGC TAC CAC CAG GGT CCC TAC CCC CAA GGG 363 Ser Pro Tyr Pro Gln Gly Tyr Pro Gln Gly Pro Tyr Pro Gln Gly 65 70 75 GGC TAC CCA CAG GGC CCC TAC CCA CAA GAG GGC TAC CCA CAG GGC CCC 411 Gly Tyr Pro Gln Gly Pro Tyr Pro Gln Glu Gly Tyr Pro Gln Gly Pro 80 85 90 TAC CCC CAA GGG GGC TAC CCC CAG GGG CCA TAT CCC CAG AGC CCC TTC 459 Tyr Pro Glu Gly Gly Tyr Pro Gln Gly Pro Tyr Pro Gln Ser Pro Phe 95 100 105 CCC CCC AAC CCC TAT GGA CAG CCA CAG GTC TTC CCA GGA CAA GAC CCT 507 Pro Pro Asn Pro Tyr Gly Gln Pro Gln Va Phe Pro Gly Gln Asp Pro 110 115" "120 125 GAC TCA CCC CAG CAT GGA AAC TAC CAG GAG GGT CCC CCA TCC TAC 555 Asp Ser Pro Gln His Gly Asn Tyr Gln Glu Glu Gly Pro Pro Ser Tyr 130 135 140 TAT GAC AAC CAG GAC TTC CCT GCC ACC AAC TGG GAT GAC AAG AGC ATC 603 Tyr Asp Asn Gln Asp Phe Pro Wing Thr Asn Trp Asp Asp Lys Ser He .145 150 155 CGA CAG GCC TTC ATC CGC AAG GTG TTC CTA GTG CTG ACC TTG CAG CTG 651 Arg Gln Wing Phe He Arg Lys Val Phe Leu Val Leu Thr Lau Gln Leu 160 165 170 TCG GTG ACC CTG TCC ACG GTG TCT GTG TTC ACT TTT GTT GCG GTG 699 Ser Val Thr Leu Ser Thr Val Ser Val Phe Thr Phe Val Ala Glu Val 175 180 185 AAG GGC TTT GTC CGG GAG AAT GTC TGG ACC TAC TAT GTC TCC TAT GCT 747 Lys Gly Phe Val Arg Glu Asn Val Trp Thr Tyr Tyr Val Ser Tyr Wing 190 195 200 205 GTC TTC TTC ATC TCT CTC ATC GTC CTC AGC TGT TGT GGG GAC TTC CGG 795 Val Phe Phe He Ser Leu He Val Leu Ser Cys Cys Gly Asp Phe Arg 210 215 220 CGA AAG CAC CCC TGG AAC CTT GTT GCA CTG TCG GTC CTG ACC GCC AGC 843 Arg Lys His Pro Trp Asn Leu Val Ala Leu Ser Val Leu Thr Wing Ser 225 230 235 CTG TCG TAC ATG GTG GGG ATG ATC GCC AGC TTC TAC AAC ACC GAG GCA 891 Leu Ser Tyr Met Val Gly Mat Has Wing Ser Phe Tyr Asn Thr Glu Wing 240 245 250 GTC ATC ATG GCC GTG GGC ATC ACC ACA GCC GTC TGC TTC ACC GTC GTC 939 Val He Mßt Wing Val Gly He Thr Thr Wing Val Cys Phe Thr Val Val 255 260 265 ATC TTC TCC ATG CAG ACC CGC TAC GAC TTC ACC TCA TGC ATG GGC GTG 987 He Phe Ser Mat Gln Thr Arg Tyr Asp Phe Thr Ser Cys Met Gly Val 270 275 280 285 CTC CTG GTG AGC ATG GTG GTG CTC TTC ATC TTC GCC ATT CTC TGC ATC 1035 Leu Leu Val Ser Met Val Val Leu Phe He Pha Wing He Leu Cys Ha 290 295 300 TTC ATC CGG AAC CGC ATC CTG GAG ATC GTG TAC GCC TCA CTG GGC GCT 1083 Phe He Arg Asn Arg He Leu Glu He Val Tyr Ala Ser Leu Gly Wing 305 310 315 CTG CTC TTC ACC TGC TTC CTC GCA GTG GAC ACC CAG CTG CTG CTG GGG 1131 Leu Leu Phe Thr Cys Phß Leu Ala Val Asp Thr Gln Leu Leu Leu Gly 320 325 330 AAC AAG CAG CTG TCC CTG AGC CCA GAA GAG TAT GTG TTT GCT GCG CTG 1179 Asn Lys Gln Leu Ser Leu Ser Pro Glu Glu Tyr Val Phe Ala Ala Leu 335 340 345 AAC CTG TAC ACÁ GAC ATC ATC AAC ATC TTC CTG TAC ATC CTC ACC ATC 1227 Asn Leu Tyr Thr Asp He He Asn He Pha Leu Tyr He Leu Thr He 350 355 360 365 ATT GGC CGC GCC AAG GAG TAGCCGAGCT CCAGCTCGCT GTGCC 1270 He Gly Arg Wing Lys Glu 370 CGCTCAGGTG GCACGGCTGG CCTGGACCCT GCCCCTGGCA CGGCAGTGCC AGCTGTACTT 1330TTGTCCCCAG GCACAGCCTA GGGAAAAGGA TGCCTCTCTC CAACCCTCCT 1390
GTATGTACAC TGCAGATACT TCCATTTGGA CCCGCTGTGG CCACAGCATG GCCCCTTTAG_1450_TCCTCCCGCC C.CCGCCAAGG GGCACCAAGG CCACGTTTCC GTGCCACCTC CTGTCTACTC 1510
ATTGTTGCAT GAGCCCTGTC TGCCAGCCCA CCCCAGGGAC TGGGGGCAGC ACCAGGTCCC 1570
GGGGAGAGGG ATTGAGCCAA GAGGTGAGGG TGCACGTCTT CCCTCCTGTC CCAGCTCCCC 1630
AGCCTGGCGT AGAGCACCCC TCCCCTCCCC CCCACCCCCC TGGAGTGCTG CCCTCTGGGG 1690
ACATGCGGAG TGGGGGTCTT ATCCCTGTGC TGAGCCCTGA GGGCAGAGAG GATGGCATGT 1750
TTCAGGGGAG GGGGAAGCCT TCCTCTCAAT TTGTTGTCAG TGAAATTCCA ATAAATGGGA 1810
TTTGCTCTCT GCCT 1824
(2) INFORMATION FOR SEQ ID NO: 53: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1076 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP01098 (ix) SEQUENCE CHARACTERISTICS: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 62 .. 601 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 53: AGTTCCGCCC GCTGGTCATC GCGCCCTTTC CCCTGCCGGT GTCCTGCTCG CCGTCCCCGC 60
C ATG CTG TCT CTA GAC TTT TTG GAC GAT GTG CGG CGG ATG AAC AAG CGG 109 Met Leu Ser Leu Asp Phe Leu Asp Asp Val Arg Arg Met Asn Lys Arg 1 5 10 15 CAG CTC TAT TAT CAA GTC CTA AAT TTT GGA ATG ATT GTC TCA TCG GCA 157 Gln Leu Tyr Tyr Gln Val Leu Asn Phe Gly Met He Val Ser Be Ala? 20 25 30 CTA ATG ATG TGG AAG GGG TTA ATG GTA ATA ACT GGA AGT GAA AGT CCG 205 Lau Mßt He Trp Lys Gly Leu Met Val He Thr Gly Ser Glu Ser Pro 35 40 45 ATT GTA GTG GTG CTC AGT GGC AGC ATG GAA CCT GCA TTT CAT AGA GGA 253 He Val Val Leu Ser Gly Ser Met Glu Pro Wing Phe His Arg Gly 50 55 60 GAT CTT CTC TTT CTA ACA AAT CGA GTT GAA GAT CCC ATA CGA GTG GGA 301 Asp Leu Leu Phs Leu Thr Asn Arg Val Glu Asp Pro He Arg Val Gly 65 70 75 80 GAA ATT GTT GTT TTT AGG ATA GAA GGA AGA GAG ATT CCT ATA GTT CAC 349 Glu He Val Val Phe Arg He Glu Gly Arg Glu He Pro He Val His 85 90 95 CGA GTC TTG AAG ATT CAT GAA AAG CAA AAT GGG CAT ATC AAG TTT TTG 397 Arg Val Leu Lys He His Glu Lys Gln Asn Gly His Ha Lys Phe Leu, 100 105 110 ACC AAA GGA GAT AAT AAT GCG GTT GAT GAC CGA GGC CTC TAT AAA CAA 445 Thr Lys Gly Asp Asn Asn Wing Val Asp Asp Arg Gly Leu Tyr Lys Gln 115 120 125 GGA CAA CAT TGG CTA GAG AAA AAA GAT GTT GTG GGG AGA GCC AGG GGA 493 Gly Gln His Trp Leu Glu Lys Lys Asp Val Val Gly Arg Ala Arg Gly 130 135 140 TTT GTT CCT TAT ATT GGA ATT GTG ACG ATC CTC ATG AAT GAC TAT CCT 541 Phe Val Pro Tyr He Gly He Val Thr He Leu Met Asn Asp Tyr Pro 145 150 155 160 AAA TTT AAG TAT GCA GTT CTC TTT TTG CTG GGT TTA TTC GTG CTG GTT 589 Lys Phe Lys Tyr Wing Val Leu Phe Leu Leu Gly Leu Phe Val Leu Val 165 170 175 CAT CGT GAG TA AGAAGCC TGCCTTGCTG TTCCTGGGAA GAT 630
His Arg Glu GCCATAGTTT TCGTTACTGG ATGTTTGGAG TAGATACTGG TCTGTGATTG GTGGAATGGA 690
GAACACACGT GTTGGTGCTT CTGGGTAGCA CTGGTTTGCA TTAGTTTATG TTTCCATGCC 750
AGAGTTTGTG TGGGCGGGCG CATGTGCACC ACAGAGTGCA CTCGAGGGGA CTTTCAGTCA 810
CAGGATTTCA TAATTGTCAT TGTCACACTT TCAAATTTTT GTACATCAGT GAATTTTTTT 870
ATATTAAAAG GTTGAGCCAA AGCCCCCAGT GTTTGTATTT TGAAGCCAAG CTTCACTTCT 930
AAAGTGCCTA CAGAGACTTG TAAATGAAAA TGCAGCTCTG CACGAGTTTG AAACCGTCAT 990
ACCTCCTTCT ATTAGGAATG GCATATACTG AGGTGGTCGT AAGTCTTAAC TTCTAAAATT 1050
TTAAATAAAA GACTTTGCAC ATTGAG 1076
(2) INFORMATION FOR SEQ ID NO: 54: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1591 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP01148 (ix) SEQUENCE CHARACTERISTICS: (A) CODE OF CHARACTERIZATION: CDS (B) EXISTENCE POSITION: 102 .. 1145 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 54: GTCCCTCCTC TTAACATACT TGCAGCTAAA ACTAAATATT GCTGCTTGGG GACCTCCTTC 60
TAGCCTTAAA TTTCAGCTCA TCACCTTCAC CTGCCTTGGT C ATG GCT CTG CTA TTC 116 Met Ala Leu Leu Phe 1 5 TCC TTG ATC CTT GCC ATT TGC ACC AGA CCT GGA TTC CTA GCG TCT CCA 164 Ser Leu Ha Leu Ala He Cys Thr Arg Pro Gly Phe Lau Ala Ser Pro 10 15 20 TCT GGA GTG CGG CTG GTG GGG GGC CTC CAC CGC TGT GAA GGG CGG GTG 212 Ser Gly Val Arg Leu Val Gly Gly Leu His Arg Cys Glu Gly Arg Val 25 30 35 GAG GTG GAA CAG AAA GGC CAG TGG GGC ACC GTG TGT GAT GAC GGC TGG 260 Glu Val Glu Gln Lys Gly Gln Trp Gly Thr Val Cys Asp Asp Gly Trp 40 45 50 GAC ATT AAG GAC GTG GCT GTG TTG TGC CGG GAG CTG GGC TGT GGA GCT 308 Asp He Lys Asp Val Ala Val Leu Cys Arg Glu Leu Gly Cly Gly Wing 55 60 65 GCC AGC GGA ACC CCT AGT GGT ATT TTG TAT GAG CCA CCA GCA GAA AAA 356
Wing Being Gly Thr Pro Being Gly He Leu Tyr Glu Pro Pro Wing Glu Lys 70 75 80 85 GAG CAA AAG GTC CTC ATC CAA TCA GTC AGT TGC ACÁ GGA ACÁ GAA GAT 404 Glu Gln Lys Val Leu He Gln Ser Val Ser Cys Thr Gly Thr Glu Asp 90 95 100 ACÁ TTG GCT CAG TGT GAG CAA GAA GAA GTT TAT GAT TGT TCA CAT GAA 452 Thr Leu Wing Gln Cys Glu Gln Glu Glu Val Tyr Asp Cys Ser His Glu 105 110 115 GAA GAT GCT GGG GCA TCG TGT GAG AAC CCA GAG AGC TCT TTC TCC CCA 500 Glu Asp Wing Gly Wing Ser Cys Glu Asn Pro Glu Ser Ser Phe Sar Pro 120 125 130 GTC CCA GAG GGT GTC AGG CTG GCT GAC GGC CCT GGG CAT TGC AAG GGA 548 Val Pro Glu Gly Val Arg Leu Wing Asp Gly Pro Gly His Cys Lys Gly 135 140 145 CGC GTG GAA GTG AAG CAC CAG AAC CAG TGG TAT ACC GTG TGC CAG ACA 596 Arg Val Glu Val Lys His Gln Asn Gln Trp Tyr Thr Val Cys Gln Thr 150 155 160 165 GGC TGG AGC CTC CGG GCC GCA AAG GTG GTG TGC CGG CAG CTG GGA TGT 644 Gly Trp Ser Leu Arg Ala Ala Lys Val Val Cys Arg Gln Leu Gly Cys 170 175 180 GGG AGG GCT GTA CTG ACT CAA AAA CGC TGC AAC AAG CAT GCC TAT GGC 692 Gly Arg Wing Val Leu Thr Gln Lys Arg Cys Asn Lys His Wing Tyr Gly 185 190 195 CGA AAA CCC ATC TGG CTG AGC CAG ATG TCA TGC TCA GGA CGA GAA GCA 740 Arg Lys Pro He Trp Leu Ser Gln Met Ser Cys Ser Gly Arg Glu Wing 200 205 210 ACC CTT CAG GAT TGC CCT TCT GGG CCT TGG GGG AAG AAC ACC TGC AAC 788 Thr Leu Gln Asp Cys Pro Ser Gly Pro Trp Gly Lys Asn Thr Cys Asn 215 220 225 CAT GAT GAA GAC ACG TGG GTC GAA TGT GAA GAT CCC TTT GAC TTG AGA 836
His Asp Glu Asp Thr Trp Val Glu Cys Glu Asp Pro Phe Asp Leu Arg 230 235 240 245 CTA GTA GGA GAC AAC CTC TGC TCT GGG CGA CTG GAG GTG CTG CAC 884 Leu Val Gly Gly Asp Asn Leu Cys Ser Gly Arg Leu Glu Val Leu His 250 255 260 AAG GGC GTA TGG GGC TCT GTC TGT GAT GAC AAC TGG GGA GAA AAG GAG 932 Lys Gly Val Trp Gly Ser Val Cys Asp Asp Asn Trp Gly Glu Lys Glu 265 270 275 GAC CAG GTG GTA TGC AAG CAA CTG GGC TGT GGG AAG TCC CTC TCT CCC 980
Asp Gln Val Val Cys Lys Gln Leu Gly Cys Gly Lys Ser Leu Ser Pro 280 285 290 TCC TTC AGA GAC CGG AAA TGC TAT GGC CCT GGG GTT GGC CGC ATC TGG 1028 Ser Phe Arg Asp Arg Lys Cys Tyr Gly Pro Gly Val Gly Arg He Trp 295 300 305 CTG GAT AAT GTT CGT TGC TCA GGG GAG GAG CAG TCC CTG GAG CAG TGC 1076 Leu Asp Asn Val Arg Cys Ser Gly Glu Glu Gln Ser Leu Glu Gln Cys 310 315 320 325 CAG CAC AGA TTT TGG GGG TTT CAC GAC TGC ACC CAC CAG GAA GAT GTG 1124 Gln His Arg Phe Trp Gly Phe His Asp Cys Thr His Gln Glu Asp Val 330 335 340 GCT GTC ATC TGC TCA GGA TAGTATCCTG GTGTTGCTTG ACCTGGCC 1170
Wing Val He Cys Ser Gly 345 CCCCTGGCCC CGCCTGCCCT CTGCTTGTTC TCCTGAGCCC TGATTATCCT CATACTCATT 1230
CTGGGGCTCA GGCTTGAGCC ACTACTCCCT CATCCCCTCA GGAGTCTGAA CACTGGGCTT 1290
ATGCCTTACT CTCAGGGACA AGCAGCCCCC ATTGCTGCCT GTAGATGTGA GCTGTTGAGT 1350
TCCCTCTTGC TGGGGAAGAT GAGCTTCCAT GTATCCTGTG CTCAACCCTG ACCCTTTGAC 1410
ACTGGTTCTG GCCTTTCCTG CCTTTTCTCA AGCTGCCTGG AATCCTCAAA CCTGTCACTT 1470
TGGTCAGATG TGCAGACCAT TACTAAGGTC TATGTCTGCA AACATTACTA ATCTAGGTCC 1530
TATTACTAAT CTATGTCTGC AAACATTAAA GGAATGAAAC AATGAAAGGA ACATTTGAAA 1590
G 1591
(2) INFORMATION FOR SEQ ID NO: 55: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1888 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: linked (D) CLONE NAME: HP01293 (ix) SEQUENCE CHARACTERISTICS: (A) CODE CHARACTERIZATION: CDS (B) EXISTENCE POSITION: 90 .. 1754 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 55: CCTTTTTCAAA GATCTCTGAG GGAGACATTG CACCTGGCCA CTGCAGCCCA GAGCAGGTCT 60
GGCCACGGCC ATGAGCATGC TGAGCCATC ATG CCC ACC GTG GAT GAC ATT CTG 113 Met Pro Thr Val Asp Asp He Leu 1 May GAG CAG GTT GGG GAG TCT GGC TGG TTC CAG AAG CAA GCC TTC CTC ATC 161 Glu Gln Val Gly Glu Ser Gly Trp Phe Gln Lys Gln Ala Phß Leu He 10 15 20 TTA TGC CTG CTG TCG GCT GCC TTT GCG CCC ATC TGT GTG GGC ATC GTC 209 Leu Cys Leu Lau Be Ala Wing Phe Wing Pro He Cys Val Gly He Val 25 30 35 40 TTC CTG GGT TTC ACÁ CCT GAC CAC CAC TGC CAG AGT CCT GGG GTG GCT 257 Phe Leu Gly Phe Thr Pro Asp His His Cys Gln Ser Pro Gly Val Wing 45 50 55 GAG CTG AGC CAG CGC TGT GGC TGG AGC CCT GCG GAG GAG CTG AAC TAT 305 Glu Leu Ser Gln Arg Cys Gly Trp Ser Pro Ala Glu Glu Leu Asn Tyr 60 65 70 ACA GTG CCA GGC CTG GGG CCC GCG GGC GAG GCC TTC CTT GGC CAG TGC 353 Thr Val Pro Gly Leu Gly Pro Ala Gly Glu Ala Phe Leu Gly Gln Cys 75 80 85 AGG CGC TAT GAA GTG GAC TGG AAC CAG AGC GCC CTC AGC TGT GTA GAC 401 Arg Arg Tyr Glu Val Asp Trp Asn Gln Ser Wing Leu Ser Cys Val Asp 90 95 100 CCC CTG GCT AGC CTG GCC ACC AAC A GG AGC CAC CTG CCG CTG GGT CCC 449 Pro Leu Wing Ser Leu Wing Thr Asn Arg Ser His Leu Pro Leu Gly Pro 105 110 115 120 TGC CAG GAT GGC TGG GTG TAT GAC ACG CCC GGC TCT TCC ATC GTC ACT 497 Cys Gln Asp Gly Trp Val Tyr Asp Thr Pro Gly Ser Ser He Val Thr 125 130 135 GAG TTC AAC CTG GTG TGT GCT GAC TCC TGG AAG CTG GAC CTC TTT CAG 545 Glu Phe Asn Leu Val Cys Wing Asp Ser Trp Lys Leu Asp Leu Phe Gln 140 145 150 TCC TGT TTG AAT GCG GGC TTC TTC TTT GGC TCT CTC GGT GTT GGC TAC 593 Ser Cys Lau Asn Ala Gly Phe Phe Phe Gly Ser Leu Gly Val Gly Tyr 155 160 165 TTT GCA GAC AGG TTT GGC CGT AAG CTG TGT CTC CTG GGA ACT GTG CTG 641 Phe Wing Asp Arg Phe Gly Arg Lys Leu Cys Leu Leu Gly Thr Val Leu 170 175 180 GTC AAC GCG GTG TCG GGC GTG CTC ATG GCC TTC TCG CCC AAC TAC ATG 689 Val Asn Wing Val Sar Gly Val Leu Met Wing Pha Ser Pro Asn Tyr Met 185 185 195 200 TCC ATG CTG CTC TTC CGC CTG CTG CAG GGC CTG GTC AGC AAG GGC AAC 737 Ser Met Leu Leu Phe Arg Leu Leu Gln Gly Leu Val Ser Lys Gly Asn 205 210 215 TGG AT G GCT GGC TAC ACC CTA ATC ACA GAA TTT GTT CGC TCG GGC TCC 785 Trp Met Ala Gly Tyr Thr Leu He Thr Glu Phe Val Gly Ser Gly Ser 220 225 230 AGA AGA ACG GTG GCG ATC ATG TAC CAG ATG GCC TTC ACG GTG GGG CTG 833 Arg Arg Thr Val Wing He Met Tyr Gln Met Wing Ph Ph Thr Val Gly Leu 235. 240 245 GTG GCG CTT ACC GGG CTG GCC TAC GCC CTG CCT CAC TGG CGC TGG CTG 881 Val Ala Leu Thr Gly Leu Ala Tyr Ala Leu Pro His Trp Arg Trp Leu 250 255 260 CAG CTG GCA GTC TCC CTG CCC ACC TTC CTC TTC CTG CTC TAC TAC TGG 929 Gln Leu Wing Val Ser Leu Pro Thr Phe Leu Phe Leu Leu Tyr Tyr Trp 265 270 275 280 TGT GTG CCG GAG TCC CCT CGG TGG CTG TTA TCA CAA AAA AGA AAC ACT 977 Cys Val Pro Glu Ser Pro Arg Trp Leu Leu Ser Gln Lys Arg Asn Thr 285 290 295 GAA GCA ATA AAG ATA ATG GAC CAC ATC GCT CAA AAG AAT GGG AAG TTG 1025 Glu Wing He Lys He Met Asp His He Wing Gln Lys Asn Gly Lys Leu 300 305 310 CCT CCT GCT GAT TTA AAG ATG CTT TCC CTC GAA GAG GAT GTC ACC GAA 1073 Pro Pro Wing Asp Leu Lys Met Leu Ser Leu Glu Glu Asp Val Thr Glu 315 320 325 AAG CTG AGC CCT TCA TTT GCA GAC CTG TTC CGC ACG CCG CGC CTG AGG 1121 Lys Leu Ser Pro Ser Phe Wing Asp Leu Phe Arg Thr Pro Arg Leu Arg 330 335 340 AAG CGC ACC TTC ATC CTG ATG TAC CTG TGG TTC ACG GAC TCT GTG CTC 1169 Lys Arg Thr Phe He Leu Met Tyr Leu Trp Phe Thr Asp Ser Val Leu 345 350 355 360 TAT CAG GGG CTC ATC CTG CAC ATG GGC GCC ACC AGC GGG AAC CTC TAC 1217 Tyr Gln Gly Leu He Leu His Met Gly Ala Thr Ser Gly Asn Leu Tyr 365 370 375 CTG GAT TTC CTT TAC TCC GCT CTG GTC GAA ATC CCG GGG GCC TTC ATA 1265 Leu Asp Phe Leu Tyr Ser Ala Leu Val Glu He Pro Gly Ala Phe He 380 385 390 GCC CTC ATC ACC ATT GAC CGC GTG GGC GCC ATC TAC CCC ATG GCC GTG 1313 Ala Leu lia Thr He Asp Arg Val Gly Arg lia Tyr Pro MSST Ala Val 395 400 405 TCA AAT TTG TTG GCG GGG GCA GCC TGC CTC GTC ATG ATT TTT ATC TCA 1361 Ser Asn Leu Leu Wing Wing Wing Wing Cys Leu Val Met He Phe He Ser 410 415 420 CCT GAC CTG CAC TGG TTA AAC ATC ATA ATC ATG TGT GTG GGC CGA ATG 1409 Pro Asp Leu His Trp Leu Asn Ha He He Met Cys Val Gly Arg Met 425 430 435 440 GGA ATC ACC ATT GCA ATA CAA ATG ATC TGC CTG GTG AAT GCT CTG 1457 Gly He Thr He Wing He Gln Met He Cys Leu Val Asn Wing Glu Leu 445 450 455 TAC CCC ACÁ TTC GTC AGG AAC CTC GGA GTG ATG GTG TGT TCC TCC CTG 1505 Tyr Pro Thr Phe Val Arg Asn Leu Gly Val Met Val Cys Ser Ser Leu 460 465 470 TGT GAC ATA GGT GGG ATA ATC ACC CCC TTC ATA GTC TTC AGG CTG AGG 1553 Cys Asp He Gly Gly He He Thr Pro Phe He Val Phe Arg Leu Arg 475 480 485 GAG GTC TGG CAA GCC TTG CCC CTC ATT TTG TTG GCG GTG TTG GGC CTG 1601 Glu Val Trp Gln Ala Leu Pro Leu He Leu Phe Ala Val Leu Gly Leu 490, 495 500 CTT GCC GCG GGA GTG ACG CTA CTT CTT CCA GAG ACC AAG GGG GTC GCT 1649 Leu Ala Wing Gly Val Thr Leu Leu Pro Glu Thr Lys Gly Val Wing 505 510 515 520 TTG CCA GAG ACC ATG AAG GAC GCC GAG AAC CTT GGG AGA AAA GCA AAG 1697 Leu Pro Glu Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys 525 530 535 CCC AAA GAA AAC ACG ATT TAC CTT AAG GTC CAA ACC TCA GAA CCC TCG 1745 Pro Lys Glu Asn Thr He Tyr Leu Lys Val Gln Thr Ser Glu Pro Ser 540 545 550 GGC ACC TGAGAGAGAT GTTTTGCGGC GATGTCGTGT TGGAGGGATG AAGATGGAG 1800
Gly Thr TTATCCTCTG CAGAAATTCC TAGACGCCTT CACTTCTCTG TATTCTTCCT CATACTTGCC 1860
TACCCCCAAA TTAATATCAG TCCTAAAG 1888
(2) INFORMATION FOR SEQ ID NO: 56: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 2033 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10013 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 97 .. 1149 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 56: GAGTCCGAGC GCGTCACCTC CTCACGCTGC GGCTGTCGCCCGTGTCCCGC CGGCCCGTTC 60
CGTGTCGCCC CGCAGTGCTG CGGCCGCCGC GGCACC ATG GCT GTG TTT GTC GTG 114 Met Wing Val Phe Val Val 1 5 CTC CTG GCG TTG GTG GCG GGT GTT TTG GGG AAC GAG TTT AGT ATA TTA 162 Leu Leu Ala Leu Val Ala Gly Val Leu Gly Asn Glu Phe Ser He Leu 10 15 20 AAA TCA CCA GGG TCT GTT GTT TTC CGA AAT GGA AAT TGG CCT ATA CCA 210 Lys Ser Pro Gly Ser Val Val Phe Arg Asn Gly Asn Trp Pro He Pro 25 30"35 GGA GAG CGG ATC CCA GAC GTG GCT GCA TTG TCC ATG GGC TTC TCT GTG 258 Gly Glu Arg He Pro Asp Val Wing Wing Leu Ser Mat Gly Phe Ser Val 40 45 50 AAA GAA GAC CTT TCT TGG CCA GGA CTC GCA GTG GGT AAC CTG TTT CAT 306
Lys Glu Asp Leu Ser Trp Pro Gly Leu Wing Val Gly Asn Leu Phe His 55. 60 65 70 CGT CCT CGG GCT ACC GTC ATG GTG ATG GTG AAG GGA GTG AAC AAA CTG 354 Arg Pro Arg Wing Thr Val Met Val Met Val Lys Gly Val Asn Lys Leu 75 80 85 GCT CTA CCC CCA GGC AGT GTC ATT TCG TAC CCT TTG GAG AAT GCA GTT 402 Wing Leu Pro Pro Gly Ser Val He Ser Tyr Pro Leu Glu Asn Wing Val 90 95 100 CCT TTT AGT CTT GAC AGT GTT GCA AAT TCC ATT CAC TCC TTT TCT 450 Pro Phe Ser Leu Asp Ser Val Wing Asn Ser He His Ser Leu Phe Ser 105 110 115 GAG GAA ACT CCT GTT GTT TTG CAG TTG GCT CCC AGT GAG GAA AGA GTG 498
Glu Glu Thr Pro Val Val Leu Gln Leu Pro Wing Glu Glu Arg Val 120 125 130 TAT ATG GTA GGG AAG GCA AAC TCA GTG TTT GAA GAC CTT TCA GTC ACC 546 Tyr Met Val Gly Lys Wing Asn Ser Val Phe Glu Asp Leu Ser Val Thr 135 140 145 150 TTG CGC CAG CTC CGT AAT CGC CTG TTT CAA GAA AAC TCT GTT CTC AGT 594 Leu Arg Gln Leu Arg Asn Arg Leu Phe Gln Glu Asn Ser Val Leu Ser 155 160 165 TCA CTC CCC CTC AAT TCT CTG AGT AGG AAC AAT GAA GTT GAC CTG CTC 642 Ser Leu Pro Leu Asn Ser Leu Ser Arg Asn Asn Glu Val Asp Leu Leu 170 175 180 TTT CTT TCT GAA CTG CAA GTG CTA CAT GAT ATT TCA AGC TTG CTG TCT 690 Pha Leu Ser Glu Leu Gln Val Leu His Asp He Ser Ser Leu Leu Ser 185 190 195 CGT CAT AAG CAT CTA GCC AAG GAT CAT TCT CCT GAT TTA TAT TCA CTG 738 Arg His Lys His Leu Wing Lys Asp His Ser Pro Asp Leu Tyr Ser Leu 200 205 210 GAG CTG GCA GGT TTG GAT GAA ATT GGG AAG CGT TAT GGG GAA GAC TCT 786 Glu Leu Wing Gly Leu Asp Glu He Gly Lys Arg Tyr Gly Glu Asp Ser 215 220 225 230 GAA CAA TTC AGA GAT GCT TCT AAG ATC CTT GTT GAC GCT CTG CAA AAG 834 Glu Gln Phß Arg Asp Ala Ser Lys He Leu Val Asp Ala Leu Gln Lys 235 240 245 TTT GCA GAT GAC ATG TAC AGT CTT TAT GGT GGG AAT GCA GTG GTA GAG 882 Phß Ala Asp Asp Mat Tyr Ser Leu Tyr Gly Gly Asn Wing Val Val Glu 250 255 260 TTA GTC ACT GTC AAG TCA TTT GAC ACC TCC CTC ATT AGG AAG ACA AGG 930 Leu Val Thr Val Lys Ser Phe Asp Thr Ser Leu He Arg Lys Thr Arg 265 270 275 ACT ATC CTT GAG GCA AAA CAA GCG AAG AAC CCA GCA AGT CCC TAT AAC 978
Thr He Leu Glu Wing Lys Gln Wing Lys Asn_ Pro Wing Pro Tyr Asn 280 285 290 CTT GCA TAT AAT TAT AAT TTT GAA TAT TCC GTG GTT TTC AAC ATG GTA 1026 Leu Wing Tyr Lys Tyr Asn Phe Glu Tyr Ser Val Val Pha Asn Met Val 295 300 305 310 CTT TGG ATA ATG ATC GCC TTG GCC TTG GCT GTG ATT ATC ACC TCT TAC 1074 Leu Trp He Met He Wing Leu Wing Lau Wing Val Ha Ha Thr Ser Tyr 315 320 325 AAT ATT TGG AAC ATG GAT CCT GGA TAT GAT AGC ATC ATT TAT AGG ATG 1122 Asn He Trp Asn Met Asp Pro Gly Tyr Asp Ser He He Tyr Arg Met 330 335 340 AC AAC CAG AAG ATT CGA ATG GAT TGAATGTTAC CTGTGCCAGA ATTA 1170
Thr Asn Gln Lys He Arg Met Asp 345 350 GAAAAGGGGG TTGGAAATTG GCTGTTTTGT TAAAATATAT CTTTTAGTGT GCTTTAAAGT 1230
AGATAGTATA CTTTACATTT ATAAAAAAAA ATCAAATTTT GTTCTTTATT TTGTGTGTGC 1290
CTGTGATGTT TTTCTAGAGT GAATTATAGT ATTGACGTGA ATCCCACTGT GGTATAGATT 1350
CCATAATATG CTTGAATATT ATGATATAGC CATTTAATAA CATTGATTTC ATTCTGTTTA 1410
ATGAATTTGG AAATATGCAC TGAAAGAAAT GTAAAACATT TAGAATAGCT CGTGTTATGG 1470
AAAAAAGTGC ACTGAATTTA TTAGACAAAC TTACGAATGC TTAACTTCTT TACACAGCAT 1530
AGGTGAAAAT CATATTTGGG CTATTGTATA CTATGAACAA TTTGTAAATG TCTTAATTTG 1590
ATGTAAATAA CTCTGAAACA AGAGAAAAGG TTTTTAACTT AGAGTAGCCC TAAAATATGG 1650
ATGTGCTTAT ATAATCGCTT AGTTTTGGAA CTGTATCTGA GTAACAGAGG ACAGCTGTTT 1710
TTTAACCCTC TTCTGCAAGT TTGTTGACCT ACATGGGCTA ATATGGATAC TAAAAATACT 1770
ACATTGATCT AAGAAGAAAC TAGCCTTGTG GAGTATATAG ATGCTTTTCA TTATACACAC 1830
AAAAATCCCT GAGGGACATT TTGAGGCATG AATATAAAAC ATTTTTATTT CAGTAACTTT 1890
TCCCCCTGTG TAAGTTACTA TGGTTTGTGG TACAACTTCA TTCTATAGAA TATTAAGTGG 1950
AAGTGGGTGA ATTCTACTTT TTATGTTGGA GTGGACCAAT GTCTATCAAG AGTGACAAAT 2010
AAAGTTAATG ATGATTCCAA AAC 2033
(2) INFORMATION FOR SEQ ID NO: 57: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 911 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10034 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 176 .. 805 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 57: ACGCCTGGGT GACCTCTACG TATATACAGA GCCTCCCTGG CCCTCCTGGA AAGAGTCCTG 60
GAAAGACAAC CTTCAGGTCC AGCCCTGGAG CTGGAGGAGT GGAGCCCCAC TCTGAAGACG 120
CAGCCTTTCT CCAGGTTCTG TCTCTCCCAT TCTGATTCTT GACACCAGAT GCAGG ATG 178 1 GTG TCC TCT CCC TGC ACG CAG GCA AGC TCA CGG ACT TGC TCC CGT ATC 226 Val Ser Ser Pro Cys Thr Gln Wing Ser Ser Arg Thr Cys Ser Arg Ha 5 10 15 CTG GGA CTG AGC CTT GGG ACT GCA GCC CTG TTT GCT GCT GGG GCC AAC 274 Leu Gly Leu Ser Leu Gly Thr Ala Ala Leu Phß Ala Ala Gly Ala Asn 20 25 30 GTG GCA CTC CTC CTT CCT AAC TGG GAT GTC ACC TAC CTG TTG AGG GGC 322 Val Ala Leu Leu Leu Pro Asn Trp Asp Val Thr Tyr Leu Leu Arg Gly 35 40 45 CTC CTT GGC AGG CAT GCC ATG CTG GGA ACT GGG CTC TGG GGA GGA GGC 370
Leu Leu Gly Arg His Wing Met Leu Gly Thr Gly Leu Trp Gly Gly Gly 50 55 60 65 CTC ATG GTA CTC ACT GCA GCT ATC CTC ATC TCC TTG ATG GGC TGG AGA 418 Leu Met Val Leu Thr Ala Wing He Leu He Ser Leu Met Gly Trp Arg 70 75 80 TAC GGC TGC TTC AGT AAG AGT GGG CTC TGT CGA AGC GTG CTT ACT GCT 466 Tyr Gly Cys Phe Ser Lys Ser Gly Leu Cys Arg Ser Val Leu Thr Wing 85 90 95 CTG TTG TCA GGT GGC CTG GCT TTA CTT GGA GCC CTG ATT TGC TTT GTC 514 Leu Leu Ser Gly Gly Leu Ala Leu Leu Gly Ala Leu lia Cys Phe Val 100 105 110 ACT TCT GGA GTT GCT CTG AAA GAT GGT CCT TTT TGC ATG TTT GAT GTT 562 Thr Ser Gly Val Ala Leu Lys Asp Gly Pro Phe Cys Met Phe Asp Val 115 120 125 TCA TCC TTC AAT CAG ACA CAA GCT TGG AAA TAT GGT TAC CCA TTC AAA 610 Ser Ser Phe Asn Gln Thr Gln Wing Trp Lys Tyr Gly Tyr Pro Phe Lys 130 135 140 145 GAC CTG CAT AGT AGG AAT TAT CTG TAT GAC CGT TCG CTC TGG AAC TCC 658 Asp Leu His Ser Arg Asn Tyr Leu Tyr Asp Arg Ser Leu Trp Asn Ser 150 155 160 GTC TGC CTG GAG CCC TCT GCA GCT GTT GTC TGG CAC G TG TCC CTC TTC 706 Val Cys Leu Glu Pro Ser Ala Ala Val Val Trp His Val Ser Leu Phe 165 170 175 TCC GCC CTT CTG TGC ATC AGC CTG CTC CAG CTT CTC CTG GTG GTC GTT 754 Ser Ala Leu Leu Cys He Ser Leu Lau Gln Leu Leu Leu Val Val Val 180 185 190 t * CAT GTC ATC AAC AGC CTC CTG GGC CTT TTC TGC AGC CTC TGC GAG AAG 802
His Val He Asn Ser Leu Leu Gly Leu Phe Cys Ser Leu Cys Glu Lys 195 200 205 TGACAGGC AGAACCTTCA CTTGCAAGCA TGGGTGTTTA TCATCATCGG CTGTCTTGAA 860 TCCTTTCTAC AAGGAGTGGG TACGAATTAT AAACAAACTT CCCCTTTAGG T 911
(2) INFORMATION FOR SEQ ID NO: 58: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 601 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10050 (ix) SEQUENCE CHARACTERISTICS: (A) CODE CHARACTERIZATION: CDS (B) EXISTENCE POSITION: 10 .. 501 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 58: CCATCTGTC ATG GCG GCT GGG CTG TTT GGT TTG AGC GCT CGC CGT CTT TTG 51 Met Wing Wing Gly Leu Phe Gly Leu Wing Wing Arg Arg Leu Leu 1 5 10 GCG GCA GCG GCG ACG CGA GGG CTC CCG GCC GCC CGC GTC CGC TGG GAA 99ing Thr Arg Gly Leu Pro Wing Wing Arg Val Arg Trp Glu 15 20 25 30 TCT AGC TTC TCC AGG ACT GTG GTC GCC CCG TCC GCT GTG GCG GGA AAG 147 Ser Ser Pha Ser Arg Thr Val Val Wing Pro Ser Wing Val Ala Gly Lys 35 40 45 CGG CCC CCA GAA CCG ACC ACA CCG TGG CAA GAG GAC CCA GAA CCC GAG 195 Arg Pro Pro Glu Pro Thr Pro Trp Gln Glu Asp Pro Glu Pro Glu 50 55 60 GAC GAA AAC TTG TAT GAG AAG AAC CCA GAC TCC CAT GGT TAT GAC AAG 243 Asp Glu Asn Leu Tyr Glu Lys Asn Pro Asp Ser His Gly Tyr Asp Lys 65 70 75 GAC CCC GTT TTG GAC GTC TGG AAC ATG CGA CTT GTC TTC TTC TTG GGC 291 Asp Pro Val Leu Asp Val Trp Asn Met Arg Leu Val Phe Phe Phe Gly 80 85 90 GTC TCC ATC ATC CTG GTC CTT GGC AGC ACC TTT GTG GCC TAT CTG CCT 339
Val Ser He Leu Val Leu Gly Ser Thr Phe Val Wing Tyr Leu Pro 95 100 105 110 GAC TAC AGG TGC ACA GGG TGT CCA AGA GCG TGG GAT GGG ATG AAA GAG 387 Asp Tyr Arg Cys Thr Gly Cys Pro Arg Wing Trp Asp Gly Met Lys Glu 115 120 125 TGG TCC CGC GAC GCT AGG CTG GTG AAA TAC CGA GAG GCC AAT 435 Trp Ser Arg Arg Glu Wing Glu Arg Leu Val Lys Tyr Arg Glu Wing Asn 130 135 140 GGC CTT CCC ATC ATG GAA TCC AAC TGC TTC GAC CCC AGC AAG ATC CAG 483 Gly Leu Pro He Met Glu Be Asn Cys Phß Asp Pro Ser Lys He Gln 145 150 155 CTG CCA GAG GAT GAG TGACCAGTTG CTAAGTGGGG CTCAAGAAGC AC 530
Leu Pro Glu Asp Glu 160 CGCCTTCCCC ACCCCCTGCC TGCCATTCTG ACCTCTTCTC AGAGCACCTA ATTAAAGGGG 590
CTGAAAGTCT G 601
(2) INFORMATION FOR SEQ ID NO: 59: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 394 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10071 (ix) SEQUENCE CHARACTERISTICS: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 47 .. 325 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 59: AACATCCGGG CCGCGCGGGG AAGGGGAGAC GTGGGGTATA GTGACC ATG ACG AAA 55 Met Thr Lys 1 TTA GCG CAG TGG CTT TGG GGA CTA GCG ATC CTG GGC TCC ACC TGG GTG 103 Leu Ala Gln Trp Leu Trp Gly Leu Ala He Leu Gly Ser Thr Trp Val 5 10 15 GCC CTG ACC GGCC ACG GGT TTG GGC CTG GAG CTG CCC TTG TCC TGC CAG 151
Wing Leu Thr Thr Gly Wing Leu Gly Leu Glu Leu Pro Leu Ser Cys Gln 20 25 30 35 GAA GTC CTG TGG CCA CTG CCC GCC TAC TTG CTG GTG TCC GCC GGC TGC 199 Glu Val Lau Trp Pro Leu Pro Wing Tyr Leu Leu Val Ser Wing Gly Cys 40 45 50 TAT GCC CTG GGC ACT GTG GGC TAT CGT GTG GCC ACT TTT CAT GAC TGC 247 Tyr Wing Leu Gly Thr Val Gly Tyr Arg Val Wing Thr Phe His Asp Cys 55 60 65 GAG GAC GCC GCA CGC GAG CTG CAG AGC CAG ATA CAG GAG GCC CGA GCC 295 Glu Asp Wing Wing Arg Glu Leu Gln Ser Gln He Gln Glu Wing Arg Wing 70 75 80 GAC TTA GCC CGC AGG GGG CTG CGC TTC TGACAGCCTA ACCCCATT 340 Asp Leu Wing Arg Arg Gly Leu Arg Phe 85 90 CCTGTGCGGA CAGCCCTTCC TCCCATTTCC CATTAAAGAG CCAGTTTATT TTCT 394
(2) INFORMATION FOR SEQ ID NO: 60: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 732 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR line: U937 (D) CLONE NAME: HP10076 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 82 .. 600 (C) CHARACTERIZATION METHOD: E (i) DESCRIPTION OF SEQUENCE: SEQ ID NO: 60: AGAAACGTGT TCGCTGCCCA GAAGAAGGGA AGGCGCGAGT GAGGAAAGGA GGTACTGTAG_60_ATGCCCTCCA AATCCTTGGT T ATG GAA TAT TTG GCT CAT CCC AGT ACA CTC 111 Met Glu Tyr Leu Ala His Pro Ser Thr Leu 1 5 10 GGC TTG GCT GTT GGA GTT GCT TGT GGC ATG TGC CTG GGC TGG AGC CTT 159 Gly Leu Ala Val Gly Val Wing Cys Gly Met Cys Leu Gly Trp Ser Leu 15 20 25 CGA GTA TGC TTT GGG ATG CTC CCC AAA AGC AAG ACG AGC AAG ACA CAC 207 Arg Val Cys Phe Gly Met Leu Pro Lys Ser Lys Thr Ser Lys Thr His 30 35 40 ACÁ GAT ACT GAA AGT GAA GCA AGC ATC TTG GGA AGC GGG GAG TAC 255 Thr Asp Thr Glu Ser Glu Wing Ser Ha Leu Gly Asp Ser Gly Glu Tyr 45 50 55 AAG ATG ATT CTT GTG GTT CGA AAT GAC TTA AAG ATG GGA AAA GGG AAA 303 Lys Met He Leu Val Val Arg Asn Asp Leu Lys Met Gly Lys Gly Lys 60 65 70 GTG GCT GCC GAG TGC TCT CAT GCT GCT GTT TCA GCC TAC AAG CAG ATT 351
Val Wing Wing Gln Cys Ser His Wing Wing Val Wing Tyr Lys Gln He 75 80 85 90 CAA AGA AGA AAT CCT GAA ATG CTC AAA CAA TGG GAA TAC TGT GGC CAG 399 Gln Arg Arg Asn Pro Glu Met Leu Lys Gln Trp Glu Tyr Cys Gly Gln 95 100 105 CCC AAG GTG GTG GTC AAA GCT CCT GAT GAA GAA ACC CTG ATT GCA TTA 447 Pro Lys Val Val Lys Ala Pro Asp Glu Glu Thr Leu Ha Ala Leu 110 115 120 TTG GCC CAT GCA AAA ATG CTG GGA CTG ACT GTA AGT TTA ATT CAA GAT 495
Leu Ala His Wing Lys Mat Leu Gly Leu Thr Val Ser Leu He Gln Asp 125 130 135 GCT GGA CGT ACT CAG ATT GCA CCA GGC TCT CAA ACT GTC CTA GGG ATT 543 Wing Gly Arg Thr Gln Ha Wing Pro Gly Ser Gln Thr Val Leu Gly He 140 145 150 GGG CCA GGA CCA GCA GAC CTA ATT GAC AAA GTC ACT GGT CAC CTA AAA 591 Gly Pro Gly Pro Wing Asp Leu He Asp Lys Val Thr Gly His Leu Lys 155, 160 165 170 CTT TAC TAGGTGGACT TTGATATGAC AACAACCCCT CCATCACAAG TGT 640
Leu Tyr TTGAAGCCTG TCAGATTCTA ACAACAAAAG CTGAATTTCT TCACCCAACT TAAATGTTCT 700
TGAGATGAAA ATAAAACCTA TTCCCATGTT CT 732
(2) INFORMATION FOR SEQ ID NO: 61: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 697 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: DNAs for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10085 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 151 .. 600 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 61: TATACCTCTA GTTTGGAGCT GTGCTGTAAA AACAAGAGTA ACATTTTTAT ATTAAAGTTA 60
AATAAAGTTA CAACTTTGAA GAGAGTTTCT GCAAGACATG ACACAAAGCT GCTAGCAGAA 120
AATCAAAACG CTGATTAAAA GAAGCACGGT ATG ACC AAA CAT AAA AAG TGT 174 Met Mat Thr Lys His Lys Lys Cys 1 5 TTT ATA ATT GTT GGT GTT TTA ATA ACA ACT AAT ATT ATT ACT CTG ATA 222 Phe He He Val Val Gly Val Leu Ha Thr Thr Asn He He Thr Leu He 10 15 20 GTT AAA CTA ACT CGA GAT TCT CAG AGT TTA TGC CCC TAT GAT TGG ATT 270
Val Lys Leu Thr Arg Asp Ser Gln Ser Leu Cys Pro Tyr Asp Trp He 25 30 35 40 GGT TTC CAA AAC AAA TGC TAT TAT TTC TCT AAA GAA GAA GGA GAT TGG 318 Gly Phe Gln Asn Lys Cys Tyr Tyr Phe Ser Lys Glu Glu Gly Asp Trp 45 50 55 AAT TCA AGT AAA TAC AAC TGT TCC ACT FAC CAT GCC GAC CTA ACT ATA 366 Asn Ser Ser Lys Tyr Asn Cys Ser Thr Gln His Wing Asp Leu Thr He 60 65 70 ATT GAC AAC ATA GAA GAA ATG AAT TTT CTT AGG CGG TAT AAA TGC AGT 414 He Asp Asn He Glu Glu Met Asn Phe Leu Arg Arg Tyr Lys Cys Ser 75 80 85 TCT GAT CAC TGG ATT GGA CTG AAG ATG GCA AAA AAT CGA ACA GGA CAA 462 Ser Asp His Trp He Gly Leu Lys Mßt Ala Lys Asn Arg Thr Gly Gln 90 95 100 TGG GTA GAT GGA GCT ACA TTT ACC AAA TCG TTT GGC ATG AGA GGG AGT 510 Trp Val Asp Gly Wing Thr Phe Thr Lys Ser Phe Gly Met Arg Gly Ser 105 110 115 120 GAA GGA TGT GCC TAC CTC AGC GAT GAT GCA GCA GCA ACA GCT AGA TGT 558 Glu Gly Cys Wing Tyr Leu Ser Asp Asp Gly Wing Wing Thr Wing Arg Cys 125 130 135 TAC ACC GAA AGA AAA TGG ATT TGC AGG AAA AGA ATA CAC TAA 600 Tyr Thr Glu Arg Lys Trp He Cys Arg Lys Arg Ha His 140 145 GTTAATGTCT AAGATAATGG GGAAAATAGA AAATAACATT ATTAAGTGTA AAACCAGCAA 660 AGTACTTTTT TAATTAAACA AAGTTCGAGT TTTGTAC 697
(2) INFORMATION FOR SEQ ID NO: 62: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1186 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10122 (ix) SEQUENCE CHARACTERISTICS: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 139 .. 705 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 62: AAGTGCGATC TTCGGGCTGT CAGAGTTGGT CTGTTACTCG GTGGTGGCGG AGTCTACGGA 60
AGCCGTTTTC GCTTCACTTT TCCTGGCTGT AGAGCGCTTT CCCCCTGGCG GGTGAGAGTG 120
CAGAGACGAA GGTGCGAG ATG AGC ACT ATG TTC GCG GAC ACT CTC CTC ATC 171 Mat Ser Thr Met Phe Wing Asp Thr Leu Leu He 1 5 10 GTT TTT ATC TCT GTG TGC ACG GCT CTG CTC GG GG GGC ATA ACC TGG 219 Val Phe He Ser Val Cys Thr Ala Leu Leu Ala Glu Gly He Thr Trp 15 20 25 GTC CTG GTT TAC AGG ACÁ GAC AAG TAC AAG AGA CTG AAG GCA GAA GTG 267 Val Leu Val Tyr Arg Thr Asp Lys Tyr Lys Arg Leu Lys Wing Glu Val 30 35 40 GAA AAA CAG AGT AAA AAA TTG GAA AAG AAG AAG GAA ATA ATA GAG 315 Glu Lys Gln Ser Lys Lys Leu Glu Lys Lys Lys Glu Thr He Thr Glu 45 50 55 TCA GCT GGT CGA CAA CAG AAA AAA AAA ATA GAG AGA CAA GAA GAG AAA 363 Ser Ala Gly Arg Gln Gln Lys Lys He Glu Arg Gln Glu Glu Lys 60 65 70 75 CTG AAG AAT .AAC AAC AGA GAT CTA TCA ATG GTT CGA ATG AAA TCC ATG 411 Leu Lys Asn Asn Asg Arg Asp Leu Ser Met Val Arg Met Lys Ser Met 80 85 90 TTT GCT ATT GGC TTT TGT TTT ACT GCC CTA ATG GGA ATG TTC AAT TCC 459 Phe Ala He Gly Phe Cys Phß Thr Ala Leu Met Gly Met Phe Asn Ser 95 100 105 ATA TTT GAT GGT AGA GTG GG AAG CTT CCT TTT ACC CCT CTT TCT 507 He Phe Asp Gly Arg Val Val Ala Lys Leu Pro Phe Thr Pro Lau Ser 110 115 1 20 TAC ATC CAA GGA CTG CTT CAT CGA AAT CTG GG GAC GAC ACC AC 555 Tyr He Gln Gly Leu Ser His Arg Asn Leu Leu Gly Asp Asp Thr Thr 125 130 135 GAC TGT TCC TTC ATT TTC CTG TAT ATT CTC TGT ACT ATG TCG ATT CGA 603 Asp Cys Ser Phe He Phe Leu Tyr He Leu Cys Thr Mßt Ser He Arg 140 145 150 155 CAG AAC ATT CAG AAG ATT CTC GGC CTT GCC CCT TCA CGA GCC GCC ACC 651 Gln Asn He Gln Lys He Leu Gly Leu Wing Pro Ser Arg Wing Wing Thr 160 165 170 AAG CAG GCA GGT GTC TTT CTT GGC CCA CCA CCT CCT TCT GGG AAG TTC 699
Lys Gln Wing Gly Gly Phe Leu Gly Pro Pro Pro Ser Gly Lys Phe 175 180 185 TCT TGAACTCAAG AACTCTTTAT TTTCTATCAT TCTTTCTAGA CACACACA 750
Be CATCAGACTG GCAACTGTTT TGTAGCAAGA GCCATAGGTA GCCTTACTAC TTGGGCCTCT 810
TTCTAGTTTT GAATTATTTC TAAGCCTTTT GGGTATGATT AGAGTGAAAA TGGCAGCCAG 870
CAAACTTGAT AGTGCTTTTG GTCCTAGATG ATTTTTATCA AATAAGTGGA TTGATTAGTT 930
AAGTTCAGGT AATGTTTATG TAATGAAAAA CAAATAGCAT CCTTCTTGTT TCATTTACAT 990
AAGTATTTTC TGTGGGACCG ACTCTCAAGG CACTGTGTAT GCCCTGCAAG TTGGCTGTCT 1050
ATGAGCATTT AGAGATTTAG AAGAAAAATT TAGTTTGTTT AACCCTTGTA ACTGTTTGTT 1110
TTGTTGTTGT TTTTTTTTTCA AGCCAAATAC ATGACATAAG ATCAATAAAG AGGCCAAATT 1170
TTTAGCTGTT TTATGT 1186
(2) INFORMATION FOR SEQ ID NO: 63: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1409 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Lymphoma (C) CELLULAR LINE: U937 (D) CLONE NAME: HP10136, (i?) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 82 .. 729 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 63: ATAACTGTTG TCGCGGCGGA GGAAGTGAGG ACGGCGCCAA GGGCCTTCCG GGCCAGTGTT 60
GGATCCCTGT AGTTTGTGAA G ATG GTG TTG CTA ATG ATC GCC CGA GTG 111 Met Val Leu Leu Thr Met lie Wing Arg Val 1 5 10 GCG GAC GGG CTC CCG CTG GCC GCC TCG ATG CAG GAG GAC GAA CAG TCT 159 Wing Asp Gly Leu Pro Leu Ala Ala Ser Met Gln Glu Asp Glu Gln Ser 15 20 25 GGC CGG GAC CTT CAA CAG TAT CAG AGT CAG GCT AAG CAA CTC TTT CGA 207 Gly Arg Asp Leu Gln Gln Tyr Gln Sar Gln Ala Lys Gln Leu Phe Arg 30 35 40 AAG TTG AAT GAA CAG TCC CCT ACC AGA TGT ACC TTG GAA GCA GGA GCC 255 Lys Leu Asn Glu Gln Ser Pro Thr Arg Cys Thr Leu Glu Ala Gly Wing 45 50 55 ATG ACT TTT CAC TAC ATT ATT GAG CAG GGG GTG TGT TAT TTG GTT TTA 303 Met Thr Pha His Tyr Ha He Glu Gln Gly Val Cys Tyr Leu Val Leu 60 65 70 TGT GAA GCT GCC TTC CCT AAG AAG TTG GCT TTT GCC TAC CTA GAA GAT 351
Cys Glu Wing Wing Phe Pro Lys Lys Leu Wing Phe Wing Tyr Leu Glu Asp 75 80 85 90 TTG CAC TCA GAA TTT GAT GAA CAG CAT GGA AAG AAG GTG CCC ACT GTG 399 Lau His Ser Glu Phe Asp Glu Gln His Gly Lys Lys Val Pro Thr Val 95 100 105 TCC CGA CCC TAT TCC TTT ATT GAA TTT GAT ACT TTC ATT CAG AAA ACC 447 Ser Arg Pro Tyr Ser Phe He Glu Phe Asp Thr Phe Ha Gln Lys Thr 110 115 120 AAG AAG CTC TAC ATT GAC AGT CGT GCT CGA AGA AAT CTA GGC TCC ATC 495 Lys Lys Tyr He Asp Ser Arg Wing Arg Arg Asn Leu Gly Ser He 125 130 135 AAC ACT GAA TTG CAA GAT GTG CAG AGG ATC ATG GTG GCC AAT ATT GAA 543 Asn Thr Glu Leu Gln Asp Val Gln Arg He Met Val Wing Asn He Glu 140 145 150 GAA GTG TTA CAA CGA GGA GAA GCA CTC TCA GCA TTG GAT TCA AAG GCT 591 Glu Val Leu Gln Arg Gly Glu Ala Leu Sar Ala Leu Asp Ser Lys Ala 155 160 165 170 AAC AAT TTG TCC AGT CTG TCC AAG AAA TAC CGC CAG GAT GCG AAG TAC 639 Asn Asn Leu Ser Ser Leu Ser Lys Lys Tyr Arg Gln Asp Wing Lys Tyr 175 180 185 TTG AAC ATG CGT TCC ACT TAT GCC AAA CTT GCA GCA G TA GCT GTA TTT 687 Leu Asn Mat Arg Ser Thr Tyr Wing Lys Leu Wing Wing Val Wing Val Phe 190 195 200 TTC ATC ATG TTA ATA GTG TAT GTC CGA TTC TGG TGG CTG TGAA 730
Phe Ha Met Leu Ha Val Tyr Val Arg Phe Trp Trp Lau 205 210 215 ATAATGAATA CAGTCACTGG TAAGGGAGAA CCTAGAACCC AGTAGGTGTA TATTTTCAGG 790
AAACTGAGCT CACAGAGATG TGTATTAGAA TCCAAGTGGA ACTTCTGCCT CTAAAGACCT 850
TGCAAGAAAA GAGATGCCCT GAAAATGAAA GGTTGCACCT CATTTAATGA AGCTTAACCC 910
TATGTAGAAA GTCTCTTTCG GGGGCAGAGG CTTTCTCTGG GTGCCAAGCC ATATATATTA 970
GGGAATAGTA GATTGTTAAT TTCGTTTTTT CCCTCCCAGT GCATTTTAAA AACAGCACTG 1030
GCTGGGGCAT TCTCATTCTC TGATGGAGCC ATCAATGAGA TTTAACTTAG TCAACCTGTG 1090
CTAGCAACAT TCTGAAATTC CTTCAAAGAA GGCAGTCCTT TGGGAAGGTG TTTTTTTTTT 1150
TTTTTTTTTT TTTGACTCTA ATCAACATTC CTTTTGTTGG TGACATTTGT GATTTTCAGT 1210
AATCTGAGTT TTTGATGGCC TTTTAAACAA GACTCCAGTA TGTGAAGGTT AATTGCTGTG 1270
CTCCACAGAT CTTGTCTATT GGCCCCTGTA GAAAGTTAAG CTTTGTTGTT TTCCTTTTAT 1330
AATTTGCTTA TTGCACAATT GCTTTAGGGT AAGTGAATTA TATTAAGATG CCTTGAAATT 1390
ATAGCACTCC TTGATTAAG 1409
(2) INFORMATION FOR SEQ ID NO: 64: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 974 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10175 (ix) SEQUENCE CHARACTERISTICS: (A) CODE OF CHARACTERIZATION: CDS (B) POSITION OF EXISTENCE: 174 .. 512 (C) METHOD OF CHARACTERIZATION: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 64:
AGAGCCGCTC CCCTCTCCTC GCCCCGCCAC CGGGACGGAG AGCGCCCGCC GCTGCATTTC 60 CGGCGACACC TCGCAGTCAT TCCTGCGGCT TGCGCGCCCT TGTAGACAGC CGGGGCCTTC 120 GTGAGACCGG TGCAGGCCTG GGGTAGTCTC CTGTCTGGAC AGAGAAGAGA AAA ATG 176 Met 1 CAG GAC ACT GGC TCA GTA GTG CCT TTG CAT TGG TTT GGC TTT GGC TAC 224 G n Asp Thr Gly Ser Val Val Pro Leu Hia Trp Phe Gly Phe Gly Tyr 5 10 15 GCA GCA CTG GTT GCT TCT GGT GGG ATC ATT GGC TAT GTA AAA GCA GGC 272 Wing Wing Leu Val Wing Gly Gly He He He Gly Tyr Val Lys Wing Gly 20 25 30 AGC GTG CCG TCC CTG GCT GCA GGG CTG CTC TTT GGC AGT CTA GCC GGC 320 Ser Val Pro Ser Leu Ala Wing Gly Leu Leu Phe Gly Ser Leu Wing Gly 35 40 45 CTG GGT GCT TAC CAG CTG TCT CAG GAT CCA AGG AAC GTT TGG GTT TTC 368
Leu Gly Wing Tyr Gln Leu Ser Gln Asp Pro Arg Asn Val Trp Val Phe 50 55 60 65 CTA GCT ACT TCT GGT ACC TTG GCT GGC ATT ATG GGA ATG AGG TTC TAC 416 Leu Wing Thr Ser Gly Thr Leu Wing Gly Met Met Gly Met Arg Phe Tyr 70 75 80 CAC TCT GGA AAA TTC ATG CCT GCA GGT TTA ATT GCA GGT GCC AGT TTG 464 His Ser Gly Lys Phe Met Pro Wing Gly Leu He Wing Gly Wing Ser Leu 85 90 95 CTG ATG GTC GCC AAA GTT GGA GTT AGT ATG TTC AAC AGA CCC CAT 509
Met Val Leu Ala Gly Val Val Lys Ser Met Arg Phe Asn Pro His 100 105 110 T AGCAGAAGTC ATGTTCCAGC TTAGACTGAT GAAGAATTAA AAATCTGCAT 560 CTTCCACTAT TTTCAATATA TTAAGAGAAA TAAGTGCAGC ATTTTTGCAT CTGACATTTT 620 ACCTAAAAAA AAAGACACCA AACTTGGCAG AGAGGTGGAA AATCAGTCAT GATTACAAAC 680 CTACAGAGGT GGCGAGTATG TAACACAAGA GCTTAATAAG ACCCTCATAG AGCTTGATTC 740
TTGTATATTG ATGTTGTCTT TTCTTTCTGT ATCTGTAGGT AAATCTCAAG GGTAAAATGT 800 S. TAGGTGTCAG CTTTCAGGGC TCTGAAACCC TATTCCCTGC TCTGAGGAAC AGTGTGAAAA 860
AAAGTCTTTT AGGAGATTTA CAATATCTGT TCTTTTGCTC ATCTTAGACC ACAGACTGAC 920 TTTGAAATTA TGTTAAGTGA AATATCAATG TAAATAAAGT TTACTATAAA TAAT 974
(2) INFORMATION FOR SEQ ID NO: 65: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 925 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10179 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 122 .. 466 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 65: AATCGCGTTT CCGGAGAGAC CTGGCTGCTG TGTCCCGCGG CTTGCGCTCC GTAGTGGACT 60
CCGCGGGCCT TCGGCAGATG CAGGCCTGGG GTAGTCTCCT TTCTGGACTG AGAAGAGAAG 120
ATG GAG AAG CCC CTC TTC CCA TTA GTG CCT TTG CAT TGG TTT GGC TTT 168
Met Glu Lys Pro Leu Phe Pro Leu Val Pro Leu His Trp Pha Gly Phe 1 5 10 15 GGC TAC ACA GCA CTG GTT GTT TCT GGT GGG ATC GTT GGC TAT GTA AAA 216 Gly Tyr Thr Ala Leu Val Val Ser Gly Gly Ha Val Gly Tyr Val Lys 20 25 30 ACÁ GGC AGC GTG CCG TCC CTG GCA GCA GGG CTG CTC TTC GGC AGT CTA 264 Thr Gly Ser Val Pro Sar Lau Wing Wing Wing Gly Leu Leu Phe Gly Ser Leu 35 40 45 GCC GGC CTG GGT GCT TAC CAG CTG TAT CAG GAT CCT AGG AAC GTT TGG 312 Wing Gly Leu Gly Wing Tyr Gln Leu Tyr Gln Asp Pro Arg Asn Val Trp 50 55 60 GGT TTC CTA GCC GCT ACT TCT GTT ACT TTT GTT GGT GTT ATG GGA ATA 360 Gly Phe Leu Wing Ala Thr Ser Val Thr Phe Val Gly Val Met Gly Met 65 70 75 80 AGA TCC TAC TAC TAT TGA TGA TGA ATG CCT GTA TGA GTA TTA ATT GCA GGT 408 Arg Ser Tyr Tyr Tyr Gly Lys Phe Met Pro Val Gly Leu He Wing Gly 85 90 95 GCC AGT TTG CTG ATG GCC GCC AAA GTT GGA GTT CGT ATG TTG ATG ACÁ 456 Wing Ser Leu Lau Mat Wing Wing Val Gly Val Arg Met Leu Met Thr 100 105 110 TCT GAT TAGCAGAAGT CATGTTCGCA GCTTGGACTC ATGAAGGATT AAAAAT CT 510
Be Asp GCATCTTCCA CTATTTTCAA TGTATTAAGA GAAATAAGTG CAGCATTTTT GCATCTGACA 570
TTTTACCTAA AAAAAAAAAG ACACCAAATT TGGCGGAGGG GTGGAAAATC AGTTGTTACC 630
ATTATAACCC TACAGAGGTG GTGAGCATGT AACATGAGCT TATTGAGACC ATCATAGAGA 690
TCGATTCTTG TATATTGATT TTATCTCTTT CTGTATCTAT AGGTAAATCT CAAGGGTAAA 750
ATGTTAGGTG TTGACATTGA GAACCCTGAA ACCCCATTCC CTGCTCAGAG GAACAGTGTG 810
AAAAAAAATC TCTTGAGAGA TTTAGAATAT CTTTTCTTTT GCTCATCTTA GACCACAGAC 870
TGACTTTGAA ATTATGTTAA GTGAAATATC AATGAAAATA AACTTTACTA TAAAT 925
(2) INFORMATION FOR SEQ ID NO: 66: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1115 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: ADNa for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosaraome (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10196 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 10 .. 993 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 66: GCGGGGAAA ATG GCG GCG GCG GCG GCG GCG GCT GCA GCT ACG AAC GGG ACC 51 Met Wing Wing Wing Wing Wing Wing Wing Wing Thr Asn Gly Thr 1 5 10 GGA GGA AGC AGC GGG ATG GAG GTG GAT GCA GCA GTA GTC CCC AGC GTG 99
Gly Gly Be Ser Gly Met Glu Val Asp Ala Wing Val Val Pro Ser Val 15 20 25 30 ATG GCC TGC GGA GTG ACT GGG AGT GTT TCC GTC GCT CTC CAT CCC CTT 147 Met Wing Cys Gly Val Thr Gly Ser Val Sar Val Ala Leu His Pro Leu 35 40 45 GTC ATT CTC AAC ATC TCA GAC CAC TGG ATC CGC ATG CGC TCC CAG GAG 195 Val Ha Leu Asn He Ser Asp His Trp He Arg Mat Arg Ser Gln Glu 50 55 60 GGG CGG CCT GTG GAG GTG ATT GGG GCT CTG ATT GGC AAG CAG GAG GGC 243 Gly Arg Pro Val Gln Val He Gly Ala Leu Ha Gly Lys Gln Glu Gly 65 70 75 CGA AAT ATC GAG GTG ATG AAC TCC TTT GAG CTG CTG TCC CAC ACC GTG 291 Arg Asn He Glu Val Met Asn Ser Phe Glu Leu Leu Ser His Thr Val 80 85 90 GAA GAG AAG ATT ATC ATT GAC AAG GAA TAT TAT TAC ACC AAG GAG GAG 339
Glu Glu Lys Ha He As Asp Lys Glu Tyr Tyr Tyr Thr Lys Glu Glu 95 100 105 110 CAG TTT AAA CAG GTG TTC AAG GAG CTG GAG TTT CTG GGT TGG TAT ACC 387 Gln Phe Lys Gln Val Phe Lys Glu Leu Glu Pha Leu Gly Trp Tyr Thr 115 120 125 ACÁ GGG GGG CCA CCT GAC CCC TCG GAC ATC CAC GTC CAT AAG CAG GTG 435 Thr Gly Gly Pro Pro Asp Pro Sar Asp He His Val His Lys Gln Val 130 135 140 TGT GAG ATC ATC GAG AGC CCC CTC TTT CTG AAG TTG AAC CCT ATG ACC 483 Cys Glu He He Glu Ser Pro Leu Phe Leu Lys Leu Asn Pro Mßt Thr 145 150 155 AAG CAC ACA GAT CTT CCT GTC AGC GTT TTT GAG TCT GTC ATT GAT ATA 531 Lys His Thr Asp Leu Pro Val Ser Val Phe Glu Ser Val He Asp He 160 165 170 ATC AAT GGA GAG GCC ACA ATG CTG TTT GCT GAG CTG ACC TAC ACT CTG 579 He Asn Gly Glu Wing Thr Met Leu Phe Wing Glu Leu Thr Tyr Thr Leu 175 180 185 190 GCC ACÁ GAG GAA GCG GAA CGC ATT GGT GTA GAC CAC GTA GCC CGA ATG 627 Wing Thr Glu Glu Wing Glu Arg He Gly Val Asp His Val Wing Arg Met 195 200 205 ACÁ GCA ACÁ GGC AGT GGA GAG AAC TCC ACT GTG GCT GAA CAC CTG ATA 675 Thr Wing Thr Gly Ser Gly Glu Asn Sar Thr Val Wing Glu His Lau He 210 215 220 GCA CAG CAC AGC GCC ATC AAG ATG CTG CAC AGC CGC GTC AAG CTC ATC 723 Wing Gln His Ser Wing Ha Lys Met Leu His Ser Arg Val Lys Leu Ha 225 230 235 TTG GAG TAC GTC AAG GCC TCT GAA GCG GGA GTC GTC CCC TTT AAT CAT 771 Leu Glu Tyr Val Lys Wing Ser Glu Wing Gly Glu Val Pro Phe Asn His 240 245 250 GAG ATC CTG CGG GAG GCC TAT GCT CTG TGT CAC TGT CTC CCG GTG CTC 819 Glu He Lau Arg Glu Ala Tyr Ala Lau Cys His Cys Leu Pro Val Leu 255 260 265 270 AGC ACÁ GAC AAG TTC AAG ACÁ GAT TTT TAT GAT CAA TGC AAC GAC GTG 867 Ser Thr Asp Lys Phe Lys Thr Asp Phe Tyr Asp Gln Cys Asn Asp Val 275 280 285 GGG CTC ATG GCC TAC CTC GGC ACC ATC ACC AAA ACG TGC AAC ACC ATG 915 Gly Leu Mat Wing Tyr Leu Gly Thr He Thr Lys Thr Cys Asn Thr Met 290 295 300 AAC CAG TTT GTG AAC AAG TTC AAT GTC CTC TAC GAC CGA CAA CAG GGC ATC 963 Asn Gln Phß Val Asn Lys Phe Asn Val Leu Tyr Asp Arg Gln Gly He 305 310 315 GGC AGG AGA ATG CGC G GG CTC TTT TTC TGATGAGGGT 1000
Gly Arg Arg Met Arg Gly Leu Phe Phe 320 325 ACTTGAAGGG CTGATGGACA GGGGTCAGGC AACTATCCCA AAGGGGAGGG CACTACACTT 1060
CCTTGAGAGA AACCACTGTC ATTAATAAAA GGGGAGCAGC CCCTGAGCAC CCCTG 1115
(2) INFORMATION FOR SEQ ID NO: 67: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1721 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Fibrosarcoma (C) CELLULAR LINE: HT-1080 (D) CLONE NAME: HP10235 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CODE OF CHARACTERIZATION: CDS (B) POSITION OF EXISTENCE: 6 .. 1127 (C) METHOD OF CHARACTERIZATION: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 67: ATGTC ATG ACC CTA TGT GCC ATG CTG CCC CTG CTG TTA TTC ACC TAC CTC 50 Met Thr Leu Cys Wing Met Leu Pro Leu Leu Leu Phe Thr Tyr Leu 1 5 10 15 AAC TCC TTC CTG CAT CAG AGG ATC CCC CAG TCC GTA CGG ATC CTG GGC 98 Asn Ser Phe Leu His Gln Arg He Pro Gln Ser Val Arg He Leu Gly 20 25 30 AGC CTG GTG GCC ATC CTG CTG GTG TTT CTG ATC ACT GCC ATC CTG GTG 146 Ser Lau Val Ala He Leu Leu Val Phe Leu He Thr Ala He Leu Val 35 40 4 5 AAG GTG CAG CTG GAT GCT CTG CCC TTC TTT GTC ATC ACC ATG ATC AAG 194 Lys Val Gln Leu Asp Ala Leu Pro Phe Phe Val He Thr Met He Lys 50 55 60 ATC GTG CTC ATT AAT TCA TTT GGT GCC ATC CTG CAG GGC AGC CTG TTT 242 Ha Val Leu He Asn Ser Phe Gly Ala Ha Lau Gln Gly Ser Leu Pha 65 70 75 GGT CTG GCT GGC CTT CTG CCT GCC AGC TAC ACG GCC CCC ATC ATG AGT 290
Gly Leu Wing Gly Leu Leu Pro Wing Being Tyr Thr Wing Pro Ha Met Being 80 85 90 95 GGC CAG GGC CTA GCA GGC TTC TTT GCC TCC GTG GCC ATG ATC TGC GCT 338 Gly Gln Gly Leu Wing Gly Phe Phe Wing Ser Val Wing Mat He Cys Wing 100 105 110 ATT GCC AGT GGC TCG GAG CTA TCA GAA AGT GCC TTC GGC TAC TTT ATC 386 Ha Wing Ser Gly Ser Glu Leu Ser Glu Ser Wing Pha Gly Tyr Phß He 115 120 125 AC GCC TGT GCT GTT ATC ATT TTG ATC ATC TGT TAC CTG GGC CTG 434 Thr Ala Cys Ala Val He He Leu Thr Ha He Cys Tyr Leu Gly Leu 130 135 140 CCC CGC CTG GAA TTC TAC TAC TAC TAC CAG CAG CTC AAG CTT GAA GGA 482 Pro Arg Leu Glu Phe Tyr Arg Tyr Tyr Gln Gln Leu Lys Leu Glu Gly 145 150 155 CCC GGG GAG CAG GAG ACC AAG TTG GAC CTC ATT AGC AAA GGA GAG GAG 530 Pro Gly Glu Gln Glu Thr Lys Leu Asp Leu Ha Ser Lys Gly Glu Glu 160 165 170 175 CCA AGA GCA GGC AAA GAG GAA TCT GGA GTT TCA GTC TCC AAC TCT CAG 578 Pro Arg Wing Gly Lys Glu Glu Ser Gly Val Sar Val Ser Asn Ser Gln 180 185 190 CCC ACC AAT GAA AGC CAC TCT ATC AAA GCC ATC CTG AAA AAT ATC TCA 626 Pro Thr Asn Glu Ser His Lys Wing He Leu Lys Asn He Ser 195 200 205 GTC CTG GCT TTC TCT GTC TGC TTC ATC TTC ACT ATC ACC ATT GGG ATG 674 Val Leu Wing Phe Ser Val Cys Phe He Phe Thr He Thr He Gly Met 210 215 220 TTT CCA GCC GTG ACT GTT GAG GTC AAG TCC AGC ATC GCA GGC AGC AGC 722 Pha Pro Wing Val Thr Val Glu Val Lys Ser Ser He Wing Gly Ser Ser 225 230 235 ACC TGG GAA CGT TAC TTC ATT CCT GTG TTC TGT TTC TTG ACT TTC AAT 770 Thr Trp Glu Arg Tyr Phe He Pro Val Ser Cys Phe Leu Thr Pha Asn 240 245 250 255 ATC TTT GAC TGG TTG GGC CGG AGC CTC ACÁ GCT GTA TTC ATG TGG CCT 818 Ha Pha Asp Trp Leu Gly Arg Ser Leu Thr Wing Val Phe Met Trp Pro 260 265 270 GGG AAG GAC AGC CGC TGG CTG CCA AGC CTG GTG CTG GCC CGG CTG GTG 866
Gly Lys Asp Ser Arg Trp Leu Pro Ser Leu Val Leu Wing Arg Leu Val 275 280 285 TTT GTG CCA CTG CTG CTG CTG TGC AAC ATT AAG CCC CGC CT CT 914 Phß Val Pro Leu Leu Leu Leu Cys Asn He Lys Pro Arg Arg Tyr Leu 290 295 300 ACT GTG GTC TTC GAG CAC GAT GCC TGG TTC ATC TTC TTC ATG GCT GCC 962 Thr Val Val Pha Glu His Asp Wing Trp Phe He Phe Phe Met Wing Wing 305 310 315 TTT GCC TTC TCC AAC GGC TAC CTC GCC AGC CTC TGC TGG ATG TGC TTC GGG 1010 Pha Wing Pha Ser Asn Gly Tyr Leu Wing Lau Cys Mys Cys Pha Gly 320 325 330 335 CCC AAG AAA GTG AAG CCA GCT GAG GAC GAC ACC GCA GCC GCC ATC ATG 1058 Pro Lys Lys Val Lys Pro Ala Glu Ala Glu Thr Ala Gly Ala He Met 340 345 350 GCC TTC CTG TGT CTG GGT CTG GCA CTG GGG GCT GTT TTC TCC TTC 1106 Wing Phe Phß Leu Cys Leu Gly Leu Wing Leu Gly Wing Val Phe Ser Phe 355 360 365 CTG TTC CGG GCA ATT GTG TGACAAAGGA TGGACAGAAG GACTGC 1150
Leu Phe Arg Wing He Val 370 CTGCCTCCCT CCCTGTCTGC CTCCTGCCCC TTCCTTCTGC CAGGGGTGAT CCTGAGTGGT 1210
CTGGCGGTTT TTTCTTCTAA CTGACTTCTG CTTTCCACGG CGTGTGCTGG GCCCGGATCT 1270
CCAGGCCCTG GGGAGGGAGC CTCTGGACGG ACAGTGGGGA CATTGTGGGT TTGGGGCTCA 1330
GAGTCGAGGG ACGGGGTGTA GCCTCGGCAT TTGCTTGAGT TTCTCCACTC TTGGCTCTGA 1390
CTGATCCCTG CTTGTGCAGG CCAGTGGAGG CTCTTGGGCT TGGAGAACAC GTGTGTCTCT 1450
GTGTATGTGT CTGTGTGTCT GCGTCCGTGT CTGTCAGACT GTCTGCCTGT CCTGGGGTGG 1510
CTAGGAGCTG GGTCTGACCG TTGTATGGTT TGACCTGATA TACTCCATTC TCCCCTGCGC 1570
CTCCTCCTCT GTGTTCTCTC CATGTCCCCC TCCCAACTCC CCATGCCCAG TTCTTACCCA 1630
TCATGCACCC TGTACAGTTG CCACGTTACT GCCTTTTTTA AAAATATATT TGACAGAAAC 1690
CAGGTGCCTT CAGAGGCTCT CTGATTTAAA T 1721
(2) INFORMATION FOR SEQ ID NO: 68: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1504 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10297 (ix) SEQUENCE CHARACTERISTICS: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 63 .. 614 (C) CHARACTERIZATION METHOD: E (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: CTTTTGCGGC TGCAGCGGGC TTGTAGGTGT CCGGCTTTGC TGGCCCAGCA AGCCTGATAA 60
GC ATG AAG CTC TTA TCT TTG GTG GCT GTG GTC GGG TGT TTG CTG GTG 107
Mat Lys Leu Leu Ser Leu Val Wing Val Val Gly Cys Leu Leu Val 1 5 10 15 CCC CCA GCT GAA GCC AAC AAG AGT TCT GAA GAT ATC CGG TGC AAA TGC 155 Pro Pro Wing Glu Wing Asn Lys Ser Sar Glu Asp He Arg Cys Lys Cys 20 25 30 ATC TGT CCA CCT TAT AGA AAC ATC AGT GG GCAC ATT TAC AAC CAG AAT 203 Ha Cys Pro Pro Tyr Arg Asn He Ser Gly His He Tyr Asn Gln Asn 35 40 45 GTA TCC CAG AAG GAC TGC AAC TGC CTG CAC GTG GTG GAG CCC ATG CCA 251 Val Ser Gln Lys Asp Cys Asn Cys Leu His Val Val Glu Pro Met Pro 50 55 60 GTG CCT GGC CAT GAC GTG GAG GCC TAC TGC CTG CTG TGC GAG TGC AGG 299 Val Pro Gly His Asp Val Glu Ala Tyr Cys Leu Leu Cys Glu Cys Arg 65 70 75 TAC GAG GAG CGC AGC ACC ACC ACC ATC AAG GTC ATC ATT GTC ATC TAC 347
Tyr Glu Glu Arg Ser Thr Thr He Lys Val He Ha Val Ha Tyr 80 85 90 95 CTG TCC GTG GTG GGT GCC CTG TTG CTC TAC ATG GCC TTC CTG ATG CTG 395 Leu Ser Val Val Gly Ala Leu Leu Lau Tyr Met Ala Phe Leu Met Leu 100 105 110 GTG GAC CCT CTG ATC CGA AAG CCG GAT GCA TAC ACT GAG CAA CTG CAC 443 Val Asp Pro Lau He Arg Lys Pro Asp Ala Tyr Thr Glu Gln Leu His 115 120 125 AAT GAG GAG GAG AAT GAG GAT GCT CGC TCT ATG GCA GCA GCT GCT GCT GCA 491 Asn Glu Glu Glu Asn Glu Asp Wing Arg Sar Met Wing Wing Wing Wing Wing 130 135 140 TCC CTC GGG GGA CCC CGA GCC AAC GTC CTG GAG CGG CGG GTG GGT 539 Ser Leu Gly Gly Pro Arg Ala Asn Thr Val Leu Glu Arg Val Glu Gly 145 150 155 GCC CAG CAG CGG TGG AAG CTG CAG GTG CAG GAG CAG CGG AAG ACA GTC 587 Wing Gln Gln Arg Trp Lys Leu Gln Val Gln Glu Gln Arg Lys Thr Val 160 165 170 175 TTC GAT CGG CAC AAG ATG CTC AGC TAGATGGGCT GGTGTGGTTG GGTCAAGGC 640 Phe Asp Arg His Lys Met Leu Ser 180 CCCAACACCA TGGCTGCCAG CTTCCAGGCT GGACAAAGCA GGGGGCTACT TCTCCCTTCC 700
CTCGGTTCCA GTCTTCCCTT TAAAAGCCTG TGGCATTTTT CCTCCTTCTC CCTAACTTTA 760 GAAATGTTGT ACTTGGCTAT TTTGATTAGG GAAGAGGGAT GTGGTCTCTG ATCTCTGTTG 820
TCTTCTTGGG TCTTTGGGGT TGAAGGGAGG GGGAAGGCAG GCCAGAAGGG AATGGAGACA 880
TTCGAGGCGG CCTCAGGAGT GGATGCGATC TGTCTCTCCT GGCTCCACTC TTGCCGCCTT 940
CCAGCTCTGA GTCTTGGGAA TGTTGTTACC CTTGGAAGAT AAAGCTGGGT CTTCAGGAAC 1000
TCAGTGTCTG GGAGGAAAGC ATGGCCCAGC ATTCAGCATG TGTTCCTTTC TGCAGTGGTT 1060
CTTATCACCA QCTCCCTCCC AGCCCCAGCG CCTCAGCCCC AGCCCCAGCT CCAGCCCTGA 1120
GGACAGCTCT GATGGGAGAG CTGGGCCCCC TGAGCCCACT GGGTCTTCAG GGTGCACTGG 1180
AAGCTGGTGT TCGCTGTCCC CTGTGCACTT CTCGCACTGG GGCATGGAGT GCCCATGCAT 1240
ACTCTGCTGC CGGTCCCCTC ACCTGCACTT GAGGGGTCTG GGCAGTCCCT CCTCTCCCCA 1300
GTGTCCACAG TCACTGAGCC AGACGGTCGG TTGGAACATG AGACTCGAGG CTGAGCGTGG 1360
ATCTGAACAC CACAGCCCCT GTACTTGGGT TGCCTCTTGT CCCTGAACTT CGTTGTACCA 1420
GTGCATGGAG AGAAAATTTT GTCCTCTTGT CTTAGAGTTG TGTGTAAATC AAGGAAGCCA 1480
TCATTAAATT GTTTTATTTC TCTC 1504
(2) INFORMATION FOR SEQ ID NO: 69: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 532 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Stomach cancer (D) CLONE NAME: HP10299 (ix) SEQUENCE CHARACTERISTICS: (A) CODE OF CHARACTERIZATION: CDS (B) POSITION OF EXISTENCE: 93 .. 443 (C) METHOD OF CHARACTERIZATION:? (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 69: GCTCTCTGGT AAAGGCGTGC AGGTGTTGGC CGCGGCCTCT GAGCTGGGAT GAGCCGTGCT 60
CCCGGTGGAA GCAAGGGAGC CCAGCCGGAG CC ATG GCC AGT ACÁ GTG GTA 113 GTA Mat Wing Thr Val Val Wing 1 5 GTT GGA CTG ACC ATT GCT GCT GCA GGA TTT GCA GGC CGT TAC GTT TTG 161 Val Gly Leu Thr Ha Wing Wing Wing Gly Ala Gly Wing Arg Tyr Val Lau 10 15 20 CAA GCC ATG AAG CAT ATG GAG CCT CAA GTA AAA CAA GTT TTT CAA AGC 209 Gln Wing Met Lys His Met Glu Pro Gln Val Lys Gln Val Pha Gln Ser 25 30 35 CTA CCA AAA TCT GCC TTC AGT GGT GGC TAT TAT AGA GGT GGG TTT GAA 257 Leu Pro Lys Ser Wing Phe Ser Gly Gly Tyr Tyr Arg Gly Gly Phe Glu 40 45 _ 50 55 CCC AAA ATG AA AAA CGG GAA GCA GCA TTA ATA CTA GGT GTA AGC CCT 305 Pro Lys Mat Thr Lys Arg Glu Wing Wing Leu He Leu Gly Val Ser Pro 60 65 70 ACT GCC AAT AAA GGG AAA ATA AGA GAT GCT CAT CGA CGA ATT ATG CTT 353 Thr Wing Asn Lys Gly Lys He Arg Asp Wing His Arg Arg He Mßt Leu 75 80 85 TTA AAT CAT CCT GAC AAA GGA GGA TCT CCT TAT ATA GCA GCC AAA ATC 401 Leu Asn His Pro Asp Lys Gly Gly Ser Pro Tyr He Ala Wing Lys He 90 95 100 AAT GAA GCT AAA GAT TTA CTA GAA GGT CAA GCT AAA AAA TGAAGTAAAT 450 Asn Glu Ala Lys Asp Leu Leu Glu Gly Gln Ala Lys Lys 105 110 115 GTATGATGAA TTTTAAGTTC GTATTAGTTT ATGTATATGA GTACTAAGTT TTTATAATAA 510 AATGCCTCAG AGCTACAATT TT 532
(2) INFORMATION FOR SEQ ID NO: 70: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 662 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for RNAra (i) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10301 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 92 .. 550 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 70: TCTAGCCCCG CCCCAGGCGA GGGCGCCGCA CCCACACCGC GCTGCGCAGT TTTGTTCTGC 60
TCCAGCTGTT CGAAGGTGAT CCAGACGCAA G ATG GCT CTC TCT AAG GAA 112 Mat Wing Val Leu Ser Lys Glu 1 5 TAT GGT TTT GTG CTT CTA ACT GGT GCT GCC AGC TTT ATA ATG GTG GCC 160 Tyr Gly Phe Val Leu Lau Thr Gly Ala Wing Phß Ser He Mßt Val Wing 10 15 20 CAC CTA GCC ATC AAT GTT TCC AAG GCC CGC AAG AAG TAC AAA GTG GAG 208 His Leu Wing Asn Val Ser Lys Wing Arg Lys Lys Tyr Lys Val Glu 25 30 35 TAT CCT ATC ATG TAC AGC ACG GAC CCT GAA AAT GGG CAC ATC TTC AAC 256
Tyr Pro Ha Met Tyr Ser Thr Asp Pro Glu Asn Gly His He Pha Asn 40 45 50 55 TGC ATT CAG CGA GCC CAC CAG AAC_ACG TTG GAA GTG TAT CCT CCC TTC 304 Cys He Gln Arg Ala His Gln Asn ~ Thr Leu Glu Val Tyr Pro Pro Phß 60 65 70 TTA TTT TTT CTA GCT GTT GGT GTT TAC CAC CCG CGT ATA GCT TCT 352 Leu Phe Phe Lau Wing Val Gly Gly Val Tyr His Pro Arg He Wing Ser 75 80 85 GGC CTG GGC TTG GCC TGG ATT GTT GGA CGA GTT CTT TAT GCT TAT GGC 400
Gly Leu Gly Leu Wing Trp He Val Gly Arg Val Leu Tyr Wing Tyr Gly 90, 95 100 TAT TAC ACG GGA GAA CCC AGC AAG CGT AGT CGA GGA GCC CTG GGG TCC 448 Tyr Tyr Thr Gly Glu Pro Ser Lys Arg Ser Arg Gly Ala Leu Gly Ser 105 110 115 ATC GCC CTC CTG GGC TTG GTG GGC ACÁ ACT GTG TGC TCT GCT TTC CAG 496 He Ala Leu Leu Gly Leu Val Gly Thr Thr Val Cys Sar Ala Phe Gln 120 125 130 135 CAT CTT GGT TGG GTT AAA AGT GGC TTG GGC AGT GGA CCC AAA TGC TGC 544 His Leu Gly Trp Val Lys Ser Gly Leu Gly Ser Gly Pro Lys Cys Cys 140 145 150 CAT TAAAGAATTA TAGGGGTTTA AAAACTCTCA TTCATTTTAA ATG 590
His ACTTACCTTT ATTTCCAGTT ACATTTTTTT TCTAAATATA ATAAAAACTT ACCTGGCATC 650
AGCCTCATAC CT 662
(2) INFORMATION FOR SEQ ID NO: 71: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 2372 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Liver (D) CLONE NAME: HP10302 (ix) SEQUENCE CHARACTERISTICS: (A) CODE OF CHARACTERIZATION: CDS (B) EXISTENCE POSITION: 134 .. 1813 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF SEQUENCE: SEQ ID NO: 71: GAAGACCCCA GCGCCGGCGC GGCTCAGGGC TGGGCCCACG GGACTCCGGA CGCGCCGCGA 60
AAGCGTTGCG CTCCCGGAGG CGTCCGCAGC TGCTGGCTGC TCATTTGCCG GTGACCGGAG 120
GCTCGGGGCC AGC ATG GCC CCC ACG CTG CAA CAG GCG TAC CGG AGG CGC 169 Met Wing Pro Thr Leu Gln Gln Wing Tyr Arg Arg Arg 1 5 10 TGG TGG ATG GCC TGC ACG GCT GTG CTG GAG AAC CTC TTC TTC TCT GCT 217 Trp Trp Mss Ala Cys Thr Ala Val Leu Glu Asn Leu Phß Phe Ser Wing 15 20 25 * GTA CTC CTG GGC TGG GGC TCC CTG TTG ATC ATT CTG AAG AAC GAG GGC 265 Val Leu Leu Gly Trp Gly Ser Leu Leu lie? L Leu Lys Asn Glu Gly 30 35 40 TTC TAT TCC AGC ACG TGC CCA GCT GAG AGC AGC ACC AAC ACC ACC CAG 313
Phe Tyr Ser Ser Thr Cys Pro Wing Glu Ser Ser Thr Asn Thr Thr Gln 45 50 55 60 GAT GAG CAG CGC AGG TGG CCA GGC TGT GAC CAG CAG GAC GAG ATG CTC 361 Asp Glu Gln Arg Arg Trp Pro Gly Cys Asp Gln Gln Asp Glu Met Leu 65 70 75 AAC CTG GGC TTC ACC ATT GGT TCC TTC GTG CTC AGC GCC ACC ACC CTG 409 Asn Leu Gly Phe Thr He Gly Ser Phe Val Leu Ser Wing Thr Thr Leu 80 85 90 CCA CTG GGG ATC CTC ATG GAC CGC TTT GGC CCC CGA CCC GTG CGG CTG 457 Pro Leu Gly He Lau Mat Asp Arg Phe Gly Pro Arg Pro Val Arg Leu 95 100 105 GTT GGC AGT GCC TGC TTC ACT GCG TCC TGC ACC CTC ATG GCC CTG GCC 505 Val Gly Ser Ala Cys Phe Thr Ala Ser Cys Thr Leu Met Ala Leu Wing 110 115 120 TCC CGG GAC GTG GAA GCT CTG TCT CCG TTG ATA TTC CTG GCG CTG TCC 553 Ser Arg Asp Val Glu Ala Leu Ser Pro Leu He Pha Leu Ala Leu Ser 125 130 135 140 CTG AAT GGC TTT GGT GGC ATC TGC CTA ACG TTC ACT TCA CTC ACG CTG 601 Leu Asn Gly Phe Gly He Cys Leu Thr Phe Thr Ser Leu Thr Leu 145 150 155 CCC AAC ATG TTT GGG AAC CTG CGC TCC ACG TTA ATG GCC CT C ATG ATT 649 Pro Asn Met Phe Gly Asn Leu Arg Ser Thr Leu Mßt Ala Leu Met He 160 165 170 GGC TCT TAC GCC TCT TCT GCC ATT ACG TTC CCA GGA ATC AAG CTG ATC 697 Gly Ser Tyr Ala Ser Ser Ala Lie Thr Phe Pro Gly Ha Lys Leu He 175 180 185 TAC GAT GCC GGT GTG GCC TTC GTG GTC ATC ATG TTC ACC TGG TCT GGC 745 Tyr Asp Wing Gly Val Wing Phe Val Val He Mat Phe Thr Trp Ser Gly 190 195 200 CTG GCC TGC CTT ATC TTT CTG AAC TGC ACC CTC AAC TGG CCC ATC GAA 793 Leu Wing Cys Leu Ha Phß Leu Asn Cys Thr Leu Asn Trp Pro He Glu 205 210 215 220 GCC TTT CCT GCC CCT GAG GAA GTC AAT TAC ACG AAG AAG ATC AAG CTG 841 Wing Phe Pro Pro Glu Glu Val Asn Tyr Thr Lys Lys He Lys Leu 225 230 235 AGT GGG CTG GCC CTG GAC CAC AAG GTG ACA GGT GAC CTC TTC TAC ACC 889 Sar Gly Lau Ala Leu Asp His Lys Val Thr Gly Asp Leu Phe Tyr Thr 240 245 250 CAT GTG ACC ACC ATG GGC CAG AGG CTC AGC CAG AAG GCC CCC AGC CTG 937 His Val Thr Thr Met Gly Gln Arg Leu Ser Gln Lys Ala Pro Ser Leu 255 260 265 GAG GAC GGT TCG GAT GTC TTC ATG TCA CCC CAG GAT GTT CGG GGC ACC 985 Glu Asp Gly Ser Asp Wing Phe Mßt Ser Pro Gln Asp Val Arg Gly Thr 270 275 280 TCA GAA AAC CTT CCT GAG AGG TCT GTC CCC TTA CGC AAG AGC CTC TGC 1033 Ser Glu Asn Leu Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu Cys 285 290 295 300 TCC CCC ACT TTC CTG TGG AGC CTC CTC ACC ATG GGC ATG ACC CAG CTG 1081 Ser Pro Thr Phe Leu Trp Ser Leu Leu Thr Met Gly Met Thr Gln Leu 305 310 315 CGG ATC ATC TTC TAC ATG GCT GCT GTG AAC AAG ATG CTG GAG TAC CTT 1129 Arg Ha Ha Pha Tyr Met Wing Wing Val Asn Lys Met Leu Glu Tyr Leu 320 325 330 GTG ACT GGT GGC CAG GAG CAT GAG ACÁ AAT GAA CAG CAA CAA AAG GTG 1177 Val Thr Gly Gly Gln Glu His Glu Thr Asn Glu Gln Gln Gln Lys Val 335 340 345 GCA GAG HERE GTT GGG TTC TAC TCC TCC GTC TTC GGG GCC ATG CAG CTG 1225 Wing Glu Thr Val Gly Phe Tyr Ser Val Pha Gly Ala Met Gln Leu 350 355 360 TTG TGC CTT CTC ACC TGC CCC CTC ATT GGC TAC ATC ATG GAC TGG CGG 1273 Leu Cys Leu Leu Thr Cys Pro Leu He Gly Tyr He Met Asp Trp Arg 365 370 375 380 ATC AAG GAC TGC GTG GAC GCC CCA ACT GAG GGC ACT GTC CTC GGA GAT 1321 He Lys Asp Cys Val Asp Ala Pro Thr Gln Gly Thr Val Leu Gly Asp 385 390 395 GCC AGG GAC GGG GTT GCT ACC AAA TCC ATC AGA CCA CGC TAC TGC AAG 1369 Ala Arg Asp Gly Val Ala Thr Lys Ser he Arg Pro Arg Tyr Cys Lys 400 405 410 ATC CAA AAG CTC ACC AAT GCC ATC AGT GCC TTC ACC CTG ACC AAC CTG 1417 Ha Gln Lys Leu Thr Asn Ala has Ser Ala Phe Thr Lau Thr Asn Leu 415 420 425 CTG CTT GTG GGT TTT GGC ATC ACC TGT CTC ATC AAC AAC TTA CAC CTC 1465 Leu Leu Val Gly PHSS Gly He Thr Cys Leu I Asn Asn Leu His Leu 430,435,440 CAG TTT GTG ACC TTT GTC CTG CAC ACC ATT GTT CGA GGT TTC TTC CAC 1513 Phe Val Gln Thr Val Leu His Thr PHSS Val Arg Gly Phe He Phe His TCA 445 450 455 460 GCC CTC TGT GGG AGT TAT GTG GCT GCA CCA TTC TCC CAC AAC TTT 1561 Ala Cys Ser Ser Gly Leu Tyr Ala Ala Val Phe Pro Sar Asn His Phe 465 470 475 GGG ACG CTG ACÁ GGC CTG CAG TCC CTC ATC AGT GCT GTG TTC GCC TTG 1609 Gly Thr Leu Thr Gly Leu Gln Ser Leu Ha Ser Ala Val Phe Ala Leu 480,485,490 CTT CAG CAG CCA CTT TTC ATG GCG ATG GTG GGA CCC CTG AAA GGA GAG 1657 Leu Gln Gln Pro Leu Phe Met Ala MSST Val Gly Pro Leu Lys Gly Glu 495 500 505 CCC TTC TGG GTG AAT CTG GGC CTC CTG CTA TTC TCA CTC CTG GGA TTC 1705 Pro PHSS Trp Val Asn Lau Gly Leu Leu Leu PHSS Ser Leu Lau Gly Phe 510 515 520 CTG TTG CCT TCC TAC CTC TTC TAT TAC CGT GCC CGG CTC CAG CAG GAG 1753 Lau Leu Pro Sar Tyr Leu PHSS Tyr Tyr Arg Ala Arg Leu Gln Gln Glu 525 530 535 540 TAC GCC GCC AAT GGG ATG GGC CCA CTG AAG GTG CTT AGC GGC TCT GAG 1801 Tyr Ala Ala Asn Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser Glu 545 550 555 GTG ACC GCA TAGACTTCTC AGACCAAGGG ACCTGGATGA 1840 Val Thr Wing CAGGCAATCA AGGCCTGAGC AACCAAAAGG AGTGCCCCAT ATGGCTTTTC TACCTGTAAC 1900
ATGCACATAG AGCCATGGCC GTAGATTTAT AAATACCAAG AGAAGTTCTA TTTTTGTAAA 1960
GACTGCAAAA AGGAGGAAAA AAAAACCTTC AAAAACGCCC CCTAAGTCAA CGCTCCATTG 2020
ACTGAAGACA GTCCCTATCC TAGAGGGGTT GAGCCTTCTT CCTCCTTGGG TTGGAGGAGA 2080
CCAGGGTGCC TCTTATCTCC TTCTAGCGGT CTGCCTCCTG GTACCTCTTG GGGGGATCGG 2140
CAAACAGGCT ACCCCTGAGG TCCCATGTGC CATGAGTGTG CACACATGCA TGTGTCTGTG 2200
TATGTGTGAA TGTGAGAGAG ACACAGCCCT CCTTTCAGAA GGAAAGGGGC CTGAGGTGCC 2260
AGCTGTGTCC TGGGTTAGGG GTTGGGGGTC GGCCCCTTCC AGGGCCAGGA GGGCAGGTTC 2320
CCTCTCTGGT GCTGCTGCTT GCAAGTCTTA GAGGAAATAA AAAGGGAAGT GAG 2373
(2) INFORMATION FOR SEQ ID NO: 72: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 1316 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10304 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CODE OF CHARACTERIZATION: CDS (B) POSITION OF EXISTENCE: 11 .. 1003 (C) METHOD OF CHARACTERIZATION: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 72: GTTGTCCAAG ATG GAG GGC GCT CCA CCG GGG TCG CTC GCC CTC CGG CTC 49 Met Glu Gly Ala Pro Pro Gly Ser Leu Ala Leu Arg Leu 1 5 10 CTG CTG TTC GTG GCG CTA CCC GCC TCC GGC TGG CTG ACG ACG GGC GCC 97 Leu Leu Phß Val Ala Lau Pro Ala Sar Gly Trp Leu Thr Thr Gly Wing 15 20 25 CCC GAG CCG CCG CCG CTG TCC GGA GCC CCA CAG GAC GGC ATC AGA ATT 145
Pro Glu Pro Pro Pro Leu Ser Gly Pro Wing Gln Asp Gly Ha Arg Ha 30 35 40 45 AAT GTA ACT AA CTG AAA GAT GAT GG GAC ATA TCT AAA CAG CAG GTT 193 Asn Val Thr Thu Leu Lys Asp Asp Gly Asp He Ser Lys Gln Gln Val 50 55 60 GTT CTT AAC ATA ACC TAT GAG AGT GGA CAG GTG TAT GTA AAT GAC TTA 241 Val Leu Asn He Thr Tyr Glu Ser Gly Gln Val Tyr Val Asn Asp Leu 65 70 75 CCT GTA AAT AGT GGT GTA ACC CGA ATA AGC TGT CAG ACT TTG ATA GTG 289
Pro Val Asn Ser Gly Val Thr Arg He Ser Cys Gln Thr Lau He Val 80 85 90 AAG AAT GAA AAT CTT GAA AAT TTG GAG GAA AAA GAA TAT TTT GGA ATT 337 Lys Asn Glu Asn Leu Glu Asn Leu Glu Glu Lys Glu Tyr Phe Gly He 95 100 105 GTC AGT GTA AGG ATT TTA GTT CAT GAG TGG CCT ATG ACT TCT GGT TCC 385 Val Ser Val Arg He Leu Val His Glu Trp Pro Mat Thr Ser Gly Ser 110 115 120 125 AGT TTG CAA CTA ATT GTC ATT CAA GAA GAG GTA GTA GTA ATT GAT GGA 433 Ser Leu Gln Leu He Val He Gln Glu Val Val Glu He Asp Gly 130 135 140 AAA CAA GTT CAG CAA AAG GAT GTC ACT GAA ATT GAT ATT TTA GTT AAG 481 Lys Gln Val Gln Gln Lys Asp Val Thr Glu He Asp He Leu Val Lys 145 150 155 AAC CGG GGA GTA CTC AGA CAT TCA AAC TAT ACC CTC CCT TTG GAA 529 Asn Arg Gly Val Leu Arg His Ser Asn Tyr Thu Leu Pro Lau Glu Glu 160 165 170 'AGC ATG CTC TAC TCT ATT TCT CGA GAC AGT GAC ATT TTA TTT ACC CTT 577 Ser Met Lau Tyr Ser Ha Be Arg Asp Ser Asp He Leu Phe Thr Leu 175 180 185 CCT AAC CTC TCC AAA AAA GAA AGT GTT AGT TCA CTG CAA ACC ACT AGC 625 Pro Asn Leu Ser Lys Lys Glu Ser Val Ser Ser Lau Gln Thr Thr Ser 190 195 200 205 CAG TAT CTT ATC AGG AAT GTG GAA ACC ACT GTA GAAT GAT GTT TTA 673 Gln Tyr Leu Ha Arg Asn Val Glu Thr Thr Val Asp Glu Asp Val Leu 210 215 220 CCT GGC AAG TTA CCT GAA ACT CCT CTC AGA GCA GAG CCG CCA TCT TCA 721 Pro Gly Lys Leu Pro Glu Thr Pro Leu Arg Wing Glu Pro Pro Ser Ser 225 230 235 TAT AAG GTA ATG TGT CAG TGG ATG GAA AAG TTT AGA AAA GAT CTG TGT 769 Tyr Lys Val Met Cys Gln Trp Mßt Glu Lys Phe Arg Lys Asp Leu Cys 240 245 250 AGG TTC TGG AGC AAC GTT TTC CCA GTA TTC TTT CAG TTT TTG AAC ATC 817 Arg Pha Trp Ser Asn Val Pha Pro Val Pha Phe Gln Pha Leu Asn He 255 260 265 ATG GTG GTT GGA ATT ACÁ GGA GCT GCT GTG GTA ATTA ACC ATC TTA AAG 865 Met Val Val Gly He Thr Gly Ala Ala Val Val He Thr He Leu Lys 270 275 280 285 GTG TTT TTC CTC GTT TCT GAA TAC AAA GGA ATT CTT CAG TTG GAT AAA 913 Val Pha Pha Pro Val Ser Glu Tyr Lys Gly He Leu Gln Lau Asp Lys 290 295 300 GTG GAC GTC ATA CCT GTG ACÁ GCT ATC AAC TTA TAT CCA GAT GGT CCA 961 Val Asp Val He Pro Val Thr Wing Ha Asn Leu Tyr Pro Asp Gly Pro 305 310 315 GAG AAA AGA GCT GAA AAC CTT GAA GAT AAA ACA TGT ATT TAAAACGCCA 1010 Glu Lys Arg Ala Glu Asn Leu Glu Asp Lys Thr Cys He 320 325 330 TCTCATATCA TGGACTCCGA AGTAGCCTGT TGCCTCCAAA TTTGCCACTT GAATATAATT 1070
TTCTTTAAAT CGTTAAGAAT CAGTTTATAC ACTAGAGAAA TTGCTAAACT CTAAGACTGC 1130
CTGAAAATTG ACCTTTACAG TGCCAAGTTA AAGTTTACCT TATTCTCGGC CGGGTGCAGT 1190
GGCTCATGCC TGTAATCCCA GGACTTTGGG AGGCCAATGC GGGCGGATCA CGAGGTCAGA 1250
TCAAGACCAT CCTGCCAACA TGGTGAAACC CTGTCTCTAC TAAAAAAAAT AAAAAAGTTA 1310
GCTGGG 1316
(2) INFORMATION FOR SEQ ID NO: 73: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 893 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: cDNA for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10305 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) POSITION OF EXISTENCE: 110 .. 436 (C) METHOD OF CHARACTERIZATION: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 73: ATCGCGGAGT CGGTGCTTTA GTACGCCGCT GGCACCTTTA CTCTCGCCGG CCGCGCGAAC 60
CCGTTTGAGC TCGGTATCCT AGTGCACACG CCTTGCAAGC GACGGCGCC ATG AGT CTG 118 Met Ser Leu 1 ACT TCC AGT TCC AGC GTA CGA GTT GAA TGG ATC GCA GCA GTT ACC ATT 166 Thr Ser Ser Ser Ser Val Arg Val Glu Trp He Ala Ala Val Thr He 5 10 15 GCT GCT GGG ACÁ GCT GCA ATT GGT TAT CTA GCT TAC AAA AGA TTT TAT 214
Wing Wing Gly Thr Wing Wing He Gly Tyr Leu Wing Tyr Lys Arg Phe Tyr 20 25 30 35 GTT AAA GAT CAT CGA AAT AAA GCT ATG ATA AAG CTT CAC ATC CAG AAA 262 Val Lys Asp His Arg Asn Lys Ala Met He Asn Leu His He Gln Lys 40 45 50 GAC AAC CCC AAG ATA GTA CAT GCT TTT GAC ATG GAG GAT TTG GGA GAT 310 Asp Asn Pro Lys He Val His Wing Phe Asp Mat Glu Asp Leu Gly Asp 55 60 65 AAA GCT GTG TAC TGC TGT TGT TGG AGG TCC AAA AAG TTC CCA TTC TGT 358 Lys Wing Val Tyr Cys Arg Cys Trp Arg Sar Lys Lys Pha Pro Phe Cys 70 75 80 GAT GGG GCT CAC AAA CAT AAC GAA GAG ACT GGA GAC AAT GTG GGC 406 Asp Gly Ala His Thr Lys His Asn Glu Glu Thr Gly Asp Asn Val Gly 85 90 95 CCT CTG ATC ATC AAG AAA AAA GAA ACT TAAATGGACA CTTTTGA 450
Pro Leu Ha He Lys Lys Lys-Glu Thr 100, 105 TGCTGCAAAT CAGCTTGTCG TGAAGTTACC TGATTGTTTA ATTAGAATGA CTACCACCTC 510
TGTCTGATTC ACCTTCGCTG GATTCTAAAT GTGGTATATT GCAAACTGCA GCTTTCACAT 570
TTATGGCATT TGTCTTGTTG AAACATCGTG GTGCACATTT GTTTAAACAA AAAAAAAAAA 630
AAAAAGGAAA AACCAACCTC ATGGCCTGTG GGTTATTTTG GTCTTGTAAG GATCCATTTC 690
TTTAAAATAC TGACATATAG AGTTGTACCT TATATAGAAT ATAGTTGTAT CTTGAAGTCA 750
ACATATTAAA TTATTCTCAA AATTATGTAT TTGCAGATTG TACTTGTAAG TTTCAAAGAA 810
AAATTACCAT CTTTTCATAT TGACCTGGAA ACTAAATAGG ATGTGATTCA GCTACATTAA 870
TTTCTTAATA CAATCTAGGA AAG 893
(2) INFORMATION FOR SEQ ID NO: 74: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 690 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: ADNa for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Osteosarcoma (C) CELLULAR LINE: U-2 OS (D) CLONE NAME: HP10306 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CHARACTERIZATION CODE: CDS (B) EXISTENCE POSITION: 230 .. 535 (C) CHARACTERIZATION METHOD: E (xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 74: TAACAGCGCA TGCGTGCAGT GTTGCCTCGC CCAAAGAAGA CTACAATCTC CAGGGAAACC 60
TGGGGCGTCT CGCGCAAACG TCCATAACTG AAAGTAGCTA AGGCACCCCA GCCGGAGGAA 120
GTGAGCTCTC CTGGGGCGTG GTTGTTCGTG ATCCTTGCAT CTGTTACTTA GGGTCAAGGC 180
TTGGGTCTTG CCCCGCAGAC CCTTGGGACG ACCCGGCCCC AGCGCAGCT ATG AAC CTG 238 Met Asn Leu 1 GAG CGA GTG TCC AAT GAG GAG AAA TTG AAC CTG TGC CGG AAG TAC TAC 286 Glu Arg Val Ser Asn Glu Glu Lys Leu Asn Leu Cys Arg Lys Tyr Tyr 10 15 CTG GGG GGC TTT GCT TTC CTG CCT TTT CTC TGG TTG GTC AAC ATC TTC 334 Leu Gly Gly Phe Wing Phe Leu Pro Phe Leu Trp Leu Val Asn Ha Phe 20 25 30 35 TGG TTC TTC CGA GAG GCC TTC CTT GTC CCA GCC TAC HERE GAA CAG AGC 382
Trp Phe Phe Arg Glu Wing Phe Leu Val Pro Wing Tyr Thr Glu Gln Ser 40 45 50 CAA ATC AAA GGC TAT GTC TGG CGC TCA GCT GTG GGC TTC CTC TTC TGG 430 Gln Ha Lys Gly Tyr Val Trp Arg Ser Wing Val Gly Pha Lau Phß Trp 55 60 65 GTG ATA GTG CTC ACC TCC TGG ATC ACC ATC TTC CAG ATC TAC CGG CCC 478 Val He Val Leu Thr Ser Trp He Thr He Phe Gln He Tyr Arg Pro 70 75 80 CGC TGG GGT GCC CTT GGG GAC TAC CTC TCC TTC ACC ATA CCC CTG GGC 526 Arg Trp Gly Wing Leu Gly Asp Tyr Lau Ser Phe Thr He Pro Leu Gly 85 90 95 ACC CCC TGACAACTTC TGCACATACT GGGGCCCTGC TTATTCTCCC AGGACAGG 580
Thr Pro 100 CTCCTTAAAG CAGAGGAGCC TGTCCTGGGA GCCCCTTCTC AAACTCCTAA GACTTGTTTT 640
CATGTCCCAC GTTCTCTGCT GACATCCCCC AATAAAGGAC CCTAACTTTC 690
(2) INFORMATION FOR SEQ ID NO: 75: (i) CHARACTERISTICS OF THE SEQUENCE: (A) LENGTH: 2186 (B) TYPE: Nucleic Acid (C) BRANCH: Double (D) TOPOLOGY: Linear (ii) TYPE OF SEQUENCE: ADNa for mRNA (vi) ORIGINAL SOURCE: (A) ORGANISM SPECIES: Homo sapiens (B) TYPE OF CELL: Squamous cell carcinoma (C) CELL LINE: KB (D) CLONE NAME: HP10328 (ix) CHARACTERISTICS OF THE SEQUENCE: (A) CODE OF CHARACTERIZATION: CDS (B) POSITION OF EXISTENCE: 118 .. 1236 (C) METHOD OF CHARACTERIZATION: E. { xi) DESCRIPTION OF THE SEQUENCE: SEQ ID NO: 75: ACTCTTTCTT CGGCTCGCGA GCTGAGAGGA GCAGGTAGAG GGGCAGAGGC GGGACTGTCG 60
TCTGGGGGAG CCGCCCAGGA GGCTCCTCAG GCCGACCCCA GACCCTGGCT GGCCAGG 117
ATG AAG TAT CTC CGG CAC CGG CCC AAT GCC ACC CTC ATT CTG GCC 165 Met Lys Tyr Leu Arg His Arg Arg Pro Asn Wing Thr Leu He Leu Wing 1 5 10 15 ATC GGC GCT TTC ACC CTC CTC CTC TTC AGT CTG CTA GTG TCA CCA CCC 213 lie Gly Ala Pha Thr Leu Leu Lau Pha Ser Lau Lau Val Ser Pro Pro 20 25 30 ACC TGC AAG GTC CAG GAG CAG CCA CCG GCG ATC CCC GAG GCC CTG GCC 261 Thr Cys Lys Val Gln Glu Gln Pro Pro Ala Has Pro Glu Ala Leu Wing 35 40 45 TGG CCC ACT CCA CCC ACC CGC CCA GCC CCG GCC CCG TGC CAT GCC AAC 309 Trp Pro Thr Pro Pro Thr Arg Pro Wing Pro Pro Wing Cys His Wing Asn 50 55 60 ACC TCT ATG GTC ACC CAC CCG GAC TTC GCC ACG CAG CCG CAG CAC GTT 357
Thr Ser Mßt Val Thr His Pro Asp Phe Ala Thr Gln Pro Gln His Val 65 70 75 80 CAG AAC TTC CTC CTG TAC AGA CAC TGC CGC CAC TTT CCC CTG CTG CAG 405 Gln Asn Phe Leu Leu Tyr Arg His Cys Arg His Pha Pro Leu Leu Gln 85 90 95 GAC GTG CCC CCC TCT AAG TGC GCG CAG CCG GTC TTC CTG CTG CTG GTG 453 Asp Val Pro Pro Ser Lys Cys Ala Gln Pro Val Phe Leu Leu Leu Val 100 105 110 ATC AAG TCC TCC CCT AGC AAC TAT GTG CGC GG CTG CTG CGG CGC 501 Ha Lys Ser Ser Pro Ser Asn Tyr Val Arg Arg Glu Leu Arg Arg 115 120 125 ACG TGG GGC CGC GAG CGC AAG GTA CGG GGT TTG CAG CTG CGC CTC CTC 549 Thr Trp Gly Arg Glu Arg Lys Val Arg Gly Lau Gln Leu Arg Leu Leu 130 135 140 TTC CTG GTG GGC ACA GCC TCC AAC CCG CAC GAG GCC CGC AAG GTC AAC 597 Phe Leu Val Gly Thr Ala Ser Asn Pro His Glu Ala Arg Lys Val Asn 145 150 155 160 CGG CTG CTG GAG CTG GAG GCA CAG ACT CAC GGA GAC ATC CTG CAG TGG 645 Arg Leu Leu Glu Leu Glu Wing Gln Thr His Gly Asp He Leu Gln Trp 165 170 175 GAC TTC CAC GAC TCC TTC TTC AAC CTC ACG CTC AA G CAG GTC CTG TTC 693 Asp Pha His Asp Sar Phe Pha Asn Leu Thr Leu Lys Gln Val Leu Phß 180 185 190 TTA CAG TGG CAG GAG HERE AGG TGC GCC AAC GCC AGC TTC GTG CTC AAC 741 Leu Gln Trp Gln Glu Thr Arg Cys Wing Asn Wing Ser Phe Val Leu Asn 195 200 205 GGG GAT GAT GTC GTC TTT GCA CAC ACA GAC AAC ATG GTC TTC TAC CTG 789 Gly Asp Asp Asp Val Phe Wing His Thr Asp Asn Met Val Ph? Tyr Leu 210 215 220 GAG GAC CAT GAC CCT GGC CGC CAC CTC TTC GTG GGG CAA CTG ATC CAA 837 Gln Asp His Asp Pro Gly Arg His Leu Phe Val Gly Gln Leu He Gln 225 230 235 240 AAC GTG GGC CCC ATC CGG GCT TTT TGG AGC AAG TAC TAT GTG CCA GAG 885 Asn Val Gly Pro He Arg Ala Pha Trp Ser Lys Tyr Tyr Val Pro Glu 245 250 255 GTG GTG ACT CAG AAT GAG CGG TAC CCA CCC TAT TGT GGG GGT GGT GGC 933 Val Val Thr Gln Asn Glu Arg Tyr Pro Pro Tyr Cys Gly Gly Gly Gly 260 265 270 TTC TTG CTG TCC CGC TTC ACG GCC GCT GCC CTG CGC CGT GCT GCC CAT 981 Pha Leu Lau Ser Arg Phe Thr Wing Wing Wing Leu Arg Arg Wing Wing His 275 280 285 GTC TTG GAC ATC TT C CCC ATT GAT GAT GTC TTC CTG GGT ATG TGT CTG 1029 Val Leu Asp He Pha Pro He Asp Asp Val Pha Lau Gly Mßt Cys Leu 290 295 300 GAG CTT GAG GG CTG AAG CCT GCC TCC CAC AGC GGC ATC CGC ACG TCT 1077 Glu Leu Glu Gly Leu Lys Pro Wing Ser Gly He Arg Thr Ser 305 310 315 320 GGC GTG CGG GCT CCA TCG CAA CAC CTG TCC TCC TTT GAC CCC TGC TTC 1125 Gly Val Arg Ala Pro Ser Gln His Leu Ser Ser Phe Asp Pro Cys Phe 325 330 335 TAC CGA GAC CTG CTG GTG CAC CGC TTC CTA CCT TAT GAG ATG CTG 1173 Tyr Arg Asp Leu Leu Leu Val His Arg Pha Leu Pro Tyr Glu Mat Leu 340 345 350 CTC ATG TGG GAT GCG CTG AAC CAG CCC AAC CTC ACC TGC GGC AAT CAG 1221 Leu Met Trp Asp Ala Leu Asn Gln Pro Asn Leu Thr Cys Gly Asn Gln 355 360 365 ACÁ CAG ATC TAC TGAGTCAGCA TCAGGGTCCC CAGCCTCTGG GCTCCTG 1270
Thr Gln He Tyr 370 TTTCCATAGG AAGGGGCGAC ACCTTCCTCC CAGGAAGCTG AGACCTTTGT GGTCTGAGCA 1330
TAAGGGAGTG CCAGGGAAGG TTTGAGGTTT GATGAGTGAA TATTCTGGCT GGCGAACTCC 1390
TACACATCCT TCAAAACCCA CCTGGTACTG TTCCAGCATC TTCCCTGGAT GGCTGGAGGA 1450
ACTCCAGAAA ATATCCATCT TCTTTTTTGTG GCTGCTAATG GCAGAAGTGC CTGTGCTAGA 1510
GTTCCAACTG TGGATGCATC CGTCCCGTTT GAGTCAAAGT CTTACTTCCC TGCTCTCACC 1570
TACTCACAGA CGGGATGCTA AGCAGTGCAC CTGCAGTGGT TTAATGGCAG ATAAGCTCCG 1630
TCTGCAGTTC CAGGCCAGCC AGAAACTCCT GTGTCCACAT AGAGCTGACG TGAGAAATAT 1690
CTTTCAGCCC AGGAGAGAGG GGTCCTGATC TTAACCCTTT CCTGGGTCTC AGACAACTCA 1750
GAAGGTTGGG GGGATACCAG AGAGGTGGTG GAATAGGACC GCCCCCTCCT TACTTGTGGG 1810
ATCAAATGCT'GTAATGGTGG AGGTGTGGGC AGAGGAGGGA GGCAAGTGTC CTTTGAAAGT 1870
TGTGAGAGCT CAGAGTTTCT GGGGTCCTCA TTAGGAGCCC CCATCCCTGT GTTCCCCAAG 1930
AATTCAGAGA ACAGCACTGG GGCTGGAATG ATCTTTAATG GGCCCAAGGC CAACAGGCAT 1990
ATGCCTCACT ACTGCCTGGA GAAGGGAGAG ATTCAGGTCC TCCAGCAGCC TCCCTCACCC 2050
AGTATGTTTT ACAGATTACG GGGGGACCGG GTGAGCCAGT GACCCCCTGC AGCCCCCAGC 2110
TTCAGGCCTC AGTGTCTGCC AGTCAAGCTT CACAGGCATT GTGATGGGGC AGCCTTGGGG 2170
AATATAAAAT TTTGTG 2186
Claims (5)
- Novelty of the Invention 1. A protein containing any of the amino acid sequences represented by Sequence No. 1 to Sequence No. 2, or Sequence No. 4 to Sequence No. 25.
- A DNA encoding any of the proteins described in claim 1.
- 3. A cDNA containing any of the base sequences represented by Sequence No, 26 to Sequence No. 50.
- 4. A cDNA as described in claim 3, which comprises any of the base sequences represented by Sequence No. 51 to Sequence No. 75.
- 5. A transformed eukaryotic cell, capable of expressing any DNAs as described in claims 2 to 4, and which produces a protein as described in claim 1.
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP8-301429 | 1996-11-13 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MXPA99004356A true MXPA99004356A (en) | 2000-12-06 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20030092175A1 (en) | Human proteins having transmembrane domains and dnas encoding these proteins | |
| US20050074842A1 (en) | Human proteins having transmembrane domains and DNAs encoding these proteins | |
| CA2336225A1 (en) | Human proteins having hydrophobic domains and dnas encoding these proteins | |
| WO1998011217A2 (en) | HUMAN PROTEINS HAVING SECRETORY SIGNAL SEQUENCES AND DNAs ENCODING THESE PROTEINS | |
| AU2549399A (en) | Human proteins having transmembrane domains and dnas encoding these proteins | |
| MXPA99004356A (en) | Human proteins having transmembrane domains and dnas encoding these proteins | |
| CA2305337A1 (en) | Human proteins having secretory signal sequences and cdnas encoding these proteins | |
| CA2305886A1 (en) | Human proteins having transmembrane domains and cdnas encoding these proteins | |
| CA2305386A1 (en) | Proteins alike to human complement factor h and cdnas encoding these proteins | |
| CA2311237A1 (en) | Human proteins having transmembrane domains and dnas encoding these proteins | |
| US6500939B1 (en) | cDNAs coding for human proteins having transmembrane domains | |
| AU729019B2 (en) | Human type-I membrane protein and DNA encoding this protein | |
| CA2308120A1 (en) | Human proteins having transmembrane domains and cdnas encoding these proteins | |
| MXPA00003435A (en) | HUMAN PROTEINS HAVING TRANSMEMBRANE DOMAINS AND cDNAs ENCODING THESE PROTEINS | |
| JP2001519154A (en) | Human protein having transmembrane domain and DNA encoding the same | |
| US20040048339A1 (en) | Human proteins having transmembrane domains and cDNAs encoding these proteins | |
| MXPA00008085A (en) | HUMAN PROTEINS HAVING TRANSMEMBRANE DOMAINS AND DNAs ENCODING THESE PROTEINS | |
| JP2001508407A (en) | Human protein having transmembrane domain and DNA encoding the same | |
| MXPA00005074A (en) | HUMAN PROTEINS HAVING TRANSMEMBRANE DOMAINS AND DNAs ENCODING THESE PROTEINS | |
| CA2331326A1 (en) | Human glycoprotease-like proteins and dnas encoding these proteins | |
| MXPA99002408A (en) | Human type-i membrane protein and dna encoding this protein | |
| MXPA00003437A (en) | cDNAs CODING FOR HUMAN PROTEINS HAVING TRANSMEMBRANE DOMAINS |