CA2420932A1 - Novel mammalian nuclear receptor l66 and methods of use - Google Patents
Novel mammalian nuclear receptor l66 and methods of use Download PDFInfo
- Publication number
- CA2420932A1 CA2420932A1 CA002420932A CA2420932A CA2420932A1 CA 2420932 A1 CA2420932 A1 CA 2420932A1 CA 002420932 A CA002420932 A CA 002420932A CA 2420932 A CA2420932 A CA 2420932A CA 2420932 A1 CA2420932 A1 CA 2420932A1
- Authority
- CA
- Canada
- Prior art keywords
- nucleic acid
- receptor
- sequence
- leu
- nuclear receptor
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 102000006255 nuclear receptors Human genes 0.000 title claims abstract description 108
- 108020004017 nuclear receptors Proteins 0.000 title claims abstract description 108
- 108020005497 Nuclear hormone receptor Proteins 0.000 title claims abstract description 100
- 238000000034 method Methods 0.000 title claims abstract description 84
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 96
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 65
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 46
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 46
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 35
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 79
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 66
- 108020004414 DNA Proteins 0.000 claims description 62
- 229920001184 polypeptide Polymers 0.000 claims description 54
- 230000000295 complement effect Effects 0.000 claims description 36
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 21
- 239000002299 complementary DNA Substances 0.000 claims description 21
- 239000002773 nucleotide Substances 0.000 claims description 21
- 125000003729 nucleotide group Chemical group 0.000 claims description 21
- 108091034117 Oligonucleotide Proteins 0.000 claims description 19
- 239000013598 vector Substances 0.000 claims description 19
- 239000003795 chemical substances by application Substances 0.000 claims description 18
- 239000013604 expression vector Substances 0.000 claims description 16
- 238000012216 screening Methods 0.000 claims description 15
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 239000000074 antisense oligonucleotide Substances 0.000 claims description 11
- 238000012230 antisense oligonucleotides Methods 0.000 claims description 11
- 102000053642 Catalytic RNA Human genes 0.000 claims description 10
- 108090000994 Catalytic RNA Proteins 0.000 claims description 10
- 108091092562 ribozyme Proteins 0.000 claims description 10
- 239000011230 binding agent Substances 0.000 claims description 7
- 230000003915 cell function Effects 0.000 claims description 7
- 230000002401 inhibitory effect Effects 0.000 claims description 7
- 238000010276 construction Methods 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 238000002864 sequence alignment Methods 0.000 claims description 3
- 108091005461 Nucleic proteins Chemical group 0.000 claims 1
- 239000001963 growth medium Substances 0.000 claims 1
- 238000001273 protein sequence alignment Methods 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 abstract description 146
- 108020003175 receptors Proteins 0.000 abstract description 90
- 102000005962 receptors Human genes 0.000 abstract description 89
- 230000008569 process Effects 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 135
- 235000018102 proteins Nutrition 0.000 description 93
- 230000014509 gene expression Effects 0.000 description 52
- 241000282414 Homo sapiens Species 0.000 description 45
- 230000000694 effects Effects 0.000 description 39
- 230000027455 binding Effects 0.000 description 38
- 239000000126 substance Substances 0.000 description 36
- 239000000523 sample Substances 0.000 description 28
- 150000001875 compounds Chemical class 0.000 description 27
- 238000003556 assay Methods 0.000 description 26
- 239000003446 ligand Substances 0.000 description 25
- 241000699666 Mus <mouse, genus> Species 0.000 description 23
- 239000012634 fragment Substances 0.000 description 22
- 238000009396 hybridization Methods 0.000 description 22
- 239000000203 mixture Substances 0.000 description 17
- 238000003752 polymerase chain reaction Methods 0.000 description 17
- 239000013615 primer Substances 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 210000001519 tissue Anatomy 0.000 description 17
- 241001465754 Metazoa Species 0.000 description 15
- 241000699660 Mus musculus Species 0.000 description 15
- 238000013518 transcription Methods 0.000 description 15
- 230000035897 transcription Effects 0.000 description 15
- 238000001890 transfection Methods 0.000 description 14
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 13
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 13
- 230000006870 function Effects 0.000 description 13
- 238000003786 synthesis reaction Methods 0.000 description 13
- 239000003814 drug Substances 0.000 description 12
- 108020001756 ligand binding domains Proteins 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 230000002441 reversible effect Effects 0.000 description 12
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 11
- 238000001514 detection method Methods 0.000 description 11
- 230000003993 interaction Effects 0.000 description 11
- 102000034527 Retinoid X Receptors Human genes 0.000 description 10
- 108010038912 Retinoid X Receptors Proteins 0.000 description 10
- 239000000556 agonist Substances 0.000 description 10
- 239000005557 antagonist Substances 0.000 description 10
- 239000003613 bile acid Substances 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 10
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 10
- 102000040430 polynucleotide Human genes 0.000 description 10
- 108091033319 polynucleotide Proteins 0.000 description 10
- 239000002157 polynucleotide Substances 0.000 description 10
- 230000009261 transgenic effect Effects 0.000 description 10
- 150000001413 amino acids Chemical class 0.000 description 9
- 230000000692 anti-sense effect Effects 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 229940079593 drug Drugs 0.000 description 9
- 102000037865 fusion proteins Human genes 0.000 description 9
- 108020001507 fusion proteins Proteins 0.000 description 9
- 229940088597 hormone Drugs 0.000 description 9
- 239000005556 hormone Substances 0.000 description 9
- 230000001404 mediated effect Effects 0.000 description 9
- 108091026890 Coding region Proteins 0.000 description 8
- 230000004568 DNA-binding Effects 0.000 description 8
- 102000040945 Transcription factor Human genes 0.000 description 8
- 108091023040 Transcription factor Proteins 0.000 description 8
- 210000003527 eukaryotic cell Anatomy 0.000 description 8
- 239000000284 extract Substances 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 238000000338 in vitro Methods 0.000 description 8
- 230000005764 inhibitory process Effects 0.000 description 8
- 238000002955 isolation Methods 0.000 description 8
- 208000030159 metabolic disease Diseases 0.000 description 8
- 238000007423 screening assay Methods 0.000 description 8
- 230000014616 translation Effects 0.000 description 8
- 102100039556 Galectin-4 Human genes 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 239000002502 liposome Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 238000003259 recombinant expression Methods 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 150000003839 salts Chemical class 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 230000001413 cellular effect Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 239000012636 effector Substances 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 210000004408 hybridoma Anatomy 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 210000001938 protoplast Anatomy 0.000 description 6
- 238000011282 treatment Methods 0.000 description 6
- 239000000260 (2E,6E)-3,7,11-trimethyldodeca-2,6,10-trien-1-ol Substances 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102000016978 Orphan receptors Human genes 0.000 description 5
- 108070000031 Orphan receptors Proteins 0.000 description 5
- 206010035226 Plasma cell myeloma Diseases 0.000 description 5
- 235000012000 cholesterol Nutrition 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000004064 dysfunction Effects 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 229940043259 farnesol Drugs 0.000 description 5
- 229930002886 farnesol Natural products 0.000 description 5
- 230000003054 hormonal effect Effects 0.000 description 5
- 230000001900 immune effect Effects 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 210000004379 membrane Anatomy 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- 201000000050 myeloid neoplasm Diseases 0.000 description 5
- 210000001236 prokaryotic cell Anatomy 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- CRDAMVZIKSXKFV-UHFFFAOYSA-N trans-Farnesol Natural products CC(C)=CCCC(C)=CCCC(C)=CCO CRDAMVZIKSXKFV-UHFFFAOYSA-N 0.000 description 5
- 230000002103 transcriptional effect Effects 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- -1 ~i-galactosidase Proteins 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- HSINOMROUCMIEA-FGVHQWLLSA-N (2s,4r)-4-[(3r,5s,6r,7r,8s,9s,10s,13r,14s,17r)-6-ethyl-3,7-dihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]-2-methylpentanoic acid Chemical compound C([C@@]12C)C[C@@H](O)C[C@H]1[C@@H](CC)[C@@H](O)[C@@H]1[C@@H]2CC[C@]2(C)[C@@H]([C@H](C)C[C@H](C)C(O)=O)CC[C@H]21 HSINOMROUCMIEA-FGVHQWLLSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 4
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 4
- 108090000079 Glucocorticoid Receptors Proteins 0.000 description 4
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 102000034570 NR1 subfamily Human genes 0.000 description 4
- 108020001305 NR1 subfamily Proteins 0.000 description 4
- 241000283973 Oryctolagus cuniculus Species 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 108700008625 Reporter Genes Proteins 0.000 description 4
- 238000001261 affinity purification Methods 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 239000001506 calcium phosphate Substances 0.000 description 4
- 229910000389 calcium phosphate Inorganic materials 0.000 description 4
- 235000011010 calcium phosphates Nutrition 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 235000014113 dietary fatty acids Nutrition 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 239000000194 fatty acid Substances 0.000 description 4
- 229930195729 fatty acid Natural products 0.000 description 4
- 150000004665 fatty acids Chemical class 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 238000000520 microinjection Methods 0.000 description 4
- 238000007899 nucleic acid hybridization Methods 0.000 description 4
- 210000004940 nucleus Anatomy 0.000 description 4
- 239000008177 pharmaceutical agent Substances 0.000 description 4
- 239000008194 pharmaceutical composition Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000002285 radioactive effect Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 239000007790 solid phase Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 210000001550 testis Anatomy 0.000 description 4
- 108091006106 transcriptional activators Proteins 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 4
- 108020004463 18S ribosomal RNA Proteins 0.000 description 3
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- 102100038495 Bile acid receptor Human genes 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- MBPKYKSYUAPLMY-DCAQKATOSA-N Cys-Arg-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MBPKYKSYUAPLMY-DCAQKATOSA-N 0.000 description 3
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 3
- 229920002307 Dextran Polymers 0.000 description 3
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 3
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 3
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- 101000603876 Homo sapiens Bile acid receptor Proteins 0.000 description 3
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 3
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 3
- 108060001084 Luciferase Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- TWEWRDAAIYBJTO-ULQDDVLXSA-N Met-Tyr-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N TWEWRDAAIYBJTO-ULQDDVLXSA-N 0.000 description 3
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 3
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 3
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 3
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 3
- 102100023132 Transcription factor Jun Human genes 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 239000003610 charcoal Substances 0.000 description 3
- 239000007795 chemical reaction product Substances 0.000 description 3
- 238000004587 chromatography analysis Methods 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000007877 drug screening Methods 0.000 description 3
- 239000012091 fetal bovine serum Substances 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 230000003053 immunization Effects 0.000 description 3
- 230000002163 immunogen Effects 0.000 description 3
- 230000005847 immunogenicity Effects 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 238000012544 monitoring process Methods 0.000 description 3
- 238000002887 multiple sequence alignment Methods 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 239000002853 nucleic acid probe Substances 0.000 description 3
- 239000002751 oligonucleotide probe Substances 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 102000054765 polymorphisms of proteins Human genes 0.000 description 3
- 239000011148 porous material Substances 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000001509 sodium citrate Substances 0.000 description 3
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 230000004936 stimulating effect Effects 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 230000010474 transient expression Effects 0.000 description 3
- 238000010396 two-hybrid screening Methods 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- XMRPGKVKISIQBV-UHFFFAOYSA-N (+-)-5- Pregnane-3,20-dione Natural products C1CC2CC(=O)CCC2(C)C2C1C1CCC(C(=O)C)C1(C)CC2 XMRPGKVKISIQBV-UHFFFAOYSA-N 0.000 description 2
- RUDATBOHQWOJDD-UHFFFAOYSA-N (3beta,5beta,7alpha)-3,7-Dihydroxycholan-24-oic acid Natural products OC1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)CC2 RUDATBOHQWOJDD-UHFFFAOYSA-N 0.000 description 2
- AKHONHCNYRUTLT-NXBFSHDWSA-N (8R,9S,10R,13S,14S,17S)-15,15,17-trihydroxy-10,13-dimethyl-2,6,7,8,9,11,12,14,16,17-decahydro-1H-cyclopenta[a]phenanthren-3-one Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4(O)O)O)[C@@H]4[C@@H]3CCC2=C1 AKHONHCNYRUTLT-NXBFSHDWSA-N 0.000 description 2
- YYROPELSRYBVMQ-UHFFFAOYSA-N 4-toluenesulfonyl chloride Chemical compound CC1=CC=C(S(Cl)(=O)=O)C=C1 YYROPELSRYBVMQ-UHFFFAOYSA-N 0.000 description 2
- XMRPGKVKISIQBV-XWOJZHJZSA-N 5beta-pregnane-3,20-dione Chemical compound C([C@H]1CC2)C(=O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H](C(=O)C)[C@@]2(C)CC1 XMRPGKVKISIQBV-XWOJZHJZSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 2
- RFNDQEWMNJMQHD-SZMVWBNQSA-N Arg-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RFNDQEWMNJMQHD-SZMVWBNQSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 2
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 102000008169 Co-Repressor Proteins Human genes 0.000 description 2
- 108010060434 Co-Repressor Proteins Proteins 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 101100202237 Danio rerio rxrab gene Proteins 0.000 description 2
- 101100309320 Danio rerio rxrga gene Proteins 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 102000008857 Ferritin Human genes 0.000 description 2
- 108050000784 Ferritin Proteins 0.000 description 2
- 238000008416 Ferritin Methods 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 2
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- 102000003676 Glucocorticoid Receptors Human genes 0.000 description 2
- 102100033417 Glucocorticoid receptor Human genes 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 2
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 2
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 2
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 2
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 2
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 2
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 2
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- WOGNGBROIHHFAO-JYJNAYRXSA-N Met-Tyr-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N WOGNGBROIHHFAO-JYJNAYRXSA-N 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108090001146 Nuclear Receptor Coactivator 1 Proteins 0.000 description 2
- 102100037223 Nuclear receptor coactivator 1 Human genes 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- BELBBZDIHDAJOR-UHFFFAOYSA-N Phenolsulfonephthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 BELBBZDIHDAJOR-UHFFFAOYSA-N 0.000 description 2
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 108020004518 RNA Probes Proteins 0.000 description 2
- 239000003391 RNA probe Substances 0.000 description 2
- 101150050070 RXRA gene Proteins 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 241000251131 Sphyrna Species 0.000 description 2
- 101000582398 Staphylococcus aureus Replication initiation protein Proteins 0.000 description 2
- 108010017842 Telomerase Proteins 0.000 description 2
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- QYSXJUFSXHHAJI-XFEUOLMDSA-N Vitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C/C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-XFEUOLMDSA-N 0.000 description 2
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- SHGAZHPCJJPHSC-YCNIQYBTSA-N all-trans-retinoic acid Chemical compound OC(=O)\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-YCNIQYBTSA-N 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 230000003042 antagnostic effect Effects 0.000 description 2
- 239000000823 artificial membrane Substances 0.000 description 2
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 102000005936 beta-Galactosidase Human genes 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 238000000423 cell based assay Methods 0.000 description 2
- 230000024245 cell differentiation Effects 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 108091008394 cellulose binding proteins Proteins 0.000 description 2
- 229960001091 chenodeoxycholic acid Drugs 0.000 description 2
- 125000003907 chenodeoxycholic acid group Chemical group 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000003081 coactivator Effects 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 208000037765 diseases and disorders Diseases 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000012202 endocytosis Effects 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000054 fungal extract Substances 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- JYGXADMDTFJGBT-VWUMJDOOSA-N hydrocortisone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 JYGXADMDTFJGBT-VWUMJDOOSA-N 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 2
- 229960003531 phenolsulfonphthalein Drugs 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 230000001766 physiological effect Effects 0.000 description 2
- 230000006461 physiological response Effects 0.000 description 2
- 239000000419 plant extract Substances 0.000 description 2
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 2
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 230000006337 proteolytic cleavage Effects 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000009711 regulatory function Effects 0.000 description 2
- 230000033458 reproduction Effects 0.000 description 2
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 2
- 229930002330 retinoic acid Natural products 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000002689 soil Substances 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 239000003270 steroid hormone Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 229940036555 thyroid hormone Drugs 0.000 description 2
- 239000005495 thyroid hormone Substances 0.000 description 2
- 229960001727 tretinoin Drugs 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 241001529453 unidentified herpesvirus Species 0.000 description 2
- 235000019166 vitamin D Nutrition 0.000 description 2
- 239000011710 vitamin D Substances 0.000 description 2
- 238000001086 yeast two-hybrid system Methods 0.000 description 2
- 239000011701 zinc Substances 0.000 description 2
- 229910052725 zinc Inorganic materials 0.000 description 2
- CRDAMVZIKSXKFV-FBXUGWQNSA-N (2-cis,6-cis)-farnesol Chemical compound CC(C)=CCC\C(C)=C/CC\C(C)=C/CO CRDAMVZIKSXKFV-FBXUGWQNSA-N 0.000 description 1
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 1
- CZWUESRDTYLNDE-UHFFFAOYSA-N (2z)-2-[(2e,4e,6e)-7-[1-(5-carboxypentyl)-3,3-dimethyl-5-sulfoindol-1-ium-2-yl]hepta-2,4,6-trienylidene]-1-ethyl-3,3-dimethylindole-5-sulfonate Chemical compound CC1(C)C2=CC(S([O-])(=O)=O)=CC=C2N(CC)\C1=C/C=C/C=C/C=C/C1=[N+](CCCCCC(O)=O)C2=CC=C(S(O)(=O)=O)C=C2C1(C)C CZWUESRDTYLNDE-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- FUFLCEKSBBHCMO-UHFFFAOYSA-N 11-dehydrocorticosterone Natural products O=C1CCC2(C)C3C(=O)CC(C)(C(CC4)C(=O)CO)C4C3CCC2=C1 FUFLCEKSBBHCMO-UHFFFAOYSA-N 0.000 description 1
- VOXZDWNPVJITMN-ZBRFXRBCSA-N 17β-estradiol Chemical compound OC1=CC=C2[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 VOXZDWNPVJITMN-ZBRFXRBCSA-N 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- WEEMDRWIKYCTQM-UHFFFAOYSA-N 2,6-dimethoxybenzenecarbothioamide Chemical compound COC1=CC=CC(OC)=C1C(N)=S WEEMDRWIKYCTQM-UHFFFAOYSA-N 0.000 description 1
- ZOOGRGPOEVQQDX-UUOKFMHZSA-N 3',5'-cyclic GMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-UUOKFMHZSA-N 0.000 description 1
- WEVYNIUIFUYDGI-UHFFFAOYSA-N 3-[6-[4-(trifluoromethoxy)anilino]-4-pyrimidinyl]benzamide Chemical compound NC(=O)C1=CC=CC(C=2N=CN=C(NC=3C=CC(OC(F)(F)F)=CC=3)C=2)=C1 WEVYNIUIFUYDGI-UHFFFAOYSA-N 0.000 description 1
- QAEBOWDWHNLKKP-UHFFFAOYSA-N 4,4-diaminoheptanediamide Chemical compound NC(CCC(=O)N)(CCC(=O)N)N QAEBOWDWHNLKKP-UHFFFAOYSA-N 0.000 description 1
- DJTOLSNIKJIDFF-LOVVWNRFSA-N 5alpha-Androstan-3beta-ol Chemical compound C([C@@H]1CC2)[C@@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CCC[C@@]2(C)CC1 DJTOLSNIKJIDFF-LOVVWNRFSA-N 0.000 description 1
- LIZDKDDCWIEQIN-UHFFFAOYSA-N 6-[2-[5-(3-ethyl-1,1-dimethyl-6,8-disulfobenzo[e]indol-2-ylidene)penta-1,3-dienyl]-1,1-dimethyl-6,8-disulfobenzo[e]indol-3-ium-3-yl]hexanoate Chemical compound C1=CC2=C(S(O)(=O)=O)C=C(S(O)(=O)=O)C=C2C(C2(C)C)=C1N(CC)\C2=C\C=C\C=C\C1=[N+](CCCCCC([O-])=O)C2=CC=C(C(=CC(=C3)S(O)(=O)=O)S(O)(=O)=O)C3=C2C1(C)C LIZDKDDCWIEQIN-UHFFFAOYSA-N 0.000 description 1
- 101150100400 66 gene Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000007299 Amphiregulin Human genes 0.000 description 1
- 108010033760 Amphiregulin Proteins 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- JBQORRNSZGTLCV-WDSOQIARSA-N Arg-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 JBQORRNSZGTLCV-WDSOQIARSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- OMFXVFTZEKFJBZ-UHFFFAOYSA-N Corticosterone Natural products O=C1CCC2(C)C3C(O)CC(C)(C(CC4)C(=O)CO)C4C3CCC2=C1 OMFXVFTZEKFJBZ-UHFFFAOYSA-N 0.000 description 1
- MFYSYFVPBJMHGN-ZPOLXVRWSA-N Cortisone Chemical compound O=C1CC[C@]2(C)[C@H]3C(=O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 MFYSYFVPBJMHGN-ZPOLXVRWSA-N 0.000 description 1
- MFYSYFVPBJMHGN-UHFFFAOYSA-N Cortisone Natural products O=C1CCC2(C)C3C(=O)CC(C)(C(CC4)(O)C(=O)CO)C4C3CCC2=C1 MFYSYFVPBJMHGN-UHFFFAOYSA-N 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- SDXQKJAWASHMIZ-CIUDSAMLSA-N Cys-Glu-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SDXQKJAWASHMIZ-CIUDSAMLSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- FMGSKLZLMKYGDP-UHFFFAOYSA-N Dehydroepiandrosterone Natural products C1C(O)CCC2(C)C3CCC(C)(C(CC4)=O)C4C3CC=C21 FMGSKLZLMKYGDP-UHFFFAOYSA-N 0.000 description 1
- 108091027757 Deoxyribozyme Proteins 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 102000001301 EGF receptor Human genes 0.000 description 1
- 108060006698 EGF receptor Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 102100038595 Estrogen receptor Human genes 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 108091006027 G proteins Proteins 0.000 description 1
- 101150083413 GRIP1 gene Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 102100029974 GTPase HRas Human genes 0.000 description 1
- 108010001515 Galectin 4 Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 241000288105 Grus Species 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 108090001102 Hammerhead ribozyme Proteins 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 1
- 102000003964 Histone deacetylase Human genes 0.000 description 1
- 108090000353 Histone deacetylase Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000584633 Homo sapiens GTPase HRas Proteins 0.000 description 1
- 101000633984 Homo sapiens Influenza virus NS1A-binding protein Proteins 0.000 description 1
- 101000979342 Homo sapiens Nuclear factor NF-kappa-B p105 subunit Proteins 0.000 description 1
- 101000603962 Homo sapiens Oxysterols receptor LXR-alpha Proteins 0.000 description 1
- 101001050288 Homo sapiens Transcription factor Jun Proteins 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102100029241 Influenza virus NS1A-binding protein Human genes 0.000 description 1
- SHGAZHPCJJPHSC-NUEINMDLSA-N Isotretinoin Chemical compound OC(=O)C=C(C)/C=C/C=C(C)C=CC1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-NUEINMDLSA-N 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- VFQOCUQGMUXTJR-DCAQKATOSA-N Leu-Cys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N VFQOCUQGMUXTJR-DCAQKATOSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- KTINOHQFVVCEGQ-XIRDDKMYSA-N Lys-Trp-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(O)=O)C(O)=O KTINOHQFVVCEGQ-XIRDDKMYSA-N 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241001599018 Melanogaster Species 0.000 description 1
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- PCZOHLXUXFIOCF-UHFFFAOYSA-N Monacolin X Natural products C12C(OC(=O)C(C)CC)CC(C)C=C2C=CC(C)C1CCC1CC(O)CC(=O)O1 PCZOHLXUXFIOCF-UHFFFAOYSA-N 0.000 description 1
- 208000003445 Mouth Neoplasms Diseases 0.000 description 1
- 206010048723 Multiple-drug resistance Diseases 0.000 description 1
- 108010085220 Multiprotein Complexes Proteins 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101000741806 Mus musculus Peroxisome proliferator-activated receptor gamma Proteins 0.000 description 1
- 101100107522 Mus musculus Slc1a5 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- IGFHQQFPSIBGKE-UHFFFAOYSA-N Nonylphenol Natural products CCCCCCCCCC1=CC=C(O)C=C1 IGFHQQFPSIBGKE-UHFFFAOYSA-N 0.000 description 1
- 108090001145 Nuclear Receptor Coactivator 3 Proteins 0.000 description 1
- 102100023050 Nuclear factor NF-kappa-B p105 subunit Human genes 0.000 description 1
- 102100022883 Nuclear receptor coactivator 3 Human genes 0.000 description 1
- 102100022935 Nuclear receptor corepressor 1 Human genes 0.000 description 1
- 101710153661 Nuclear receptor corepressor 1 Proteins 0.000 description 1
- 102100040557 Osteopontin Human genes 0.000 description 1
- 108010081689 Osteopontin Proteins 0.000 description 1
- 102100038476 Oxysterols receptor LXR-alpha Human genes 0.000 description 1
- 241000282320 Panthera leo Species 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 108010064851 Plant Proteins Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 102000012412 Presenilin-1 Human genes 0.000 description 1
- 108010036933 Presenilin-1 Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- 101800004937 Protein C Proteins 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 108700020978 Proto-Oncogene Proteins 0.000 description 1
- 102000052575 Proto-Oncogene Human genes 0.000 description 1
- 102000007568 Proto-Oncogene Proteins c-fos Human genes 0.000 description 1
- 108010071563 Proto-Oncogene Proteins c-fos Proteins 0.000 description 1
- 230000007022 RNA scission Effects 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 102400000827 Saposin-D Human genes 0.000 description 1
- 101800001700 Saposin-D Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- JOHPFOKBAAOQDI-UBHSHLNASA-N Ser-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JOHPFOKBAAOQDI-UBHSHLNASA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- 102000007451 Steroid Receptors Human genes 0.000 description 1
- 108010085012 Steroid Receptors Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101150088517 TCTA gene Proteins 0.000 description 1
- 241000053227 Themus Species 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- QFEYTTHKPSOFLV-OSUNSFLBSA-N Thr-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H]([C@@H](C)O)N QFEYTTHKPSOFLV-OSUNSFLBSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 101710195626 Transcriptional activator protein Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- WQYPAGQDXAJNED-AAEUAGOBSA-N Trp-Cys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N WQYPAGQDXAJNED-AAEUAGOBSA-N 0.000 description 1
- OWSRIUBVJOQHNY-IHPCNDPISA-N Trp-Lys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N OWSRIUBVJOQHNY-IHPCNDPISA-N 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- RFZFBOQPPFCOKG-BZSNNMDCSA-N Val-Trp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N RFZFBOQPPFCOKG-BZSNNMDCSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 229930003316 Vitamin D Natural products 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000008484 agonism Effects 0.000 description 1
- 230000001270 agonistic effect Effects 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000003281 allosteric effect Effects 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 102000005840 alpha-Galactosidase Human genes 0.000 description 1
- 108010030291 alpha-Galactosidase Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000012197 amplification kit Methods 0.000 description 1
- 108010064539 amyloid beta-protein (1-42) Proteins 0.000 description 1
- 230000019552 anatomical structure morphogenesis Effects 0.000 description 1
- 230000008485 antagonism Effects 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 239000012223 aqueous fraction Substances 0.000 description 1
- FOIVPCKZDPCJJY-JQIJEIRASA-N arotinoid acid Chemical compound C=1C=C(C(CCC2(C)C)(C)C)C2=CC=1C(/C)=C/C1=CC=C(C(O)=O)C=C1 FOIVPCKZDPCJJY-JQIJEIRASA-N 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 239000012131 assay buffer Substances 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 102000030904 bile acid binding Human genes 0.000 description 1
- 108091022863 bile acid binding Proteins 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000012888 bovine serum Substances 0.000 description 1
- 210000000069 breast epithelial cell Anatomy 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 229960005069 calcium Drugs 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229960002713 calcium chloride Drugs 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- PFKFTWBEEFSNDU-UHFFFAOYSA-N carbonyldiimidazole Chemical compound C1=CN=CN1C(=O)N1C=CN=C1 PFKFTWBEEFSNDU-UHFFFAOYSA-N 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000002032 cellular defenses Effects 0.000 description 1
- 230000010001 cellular homeostasis Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 102000021178 chitin binding proteins Human genes 0.000 description 1
- 108091011157 chitin binding proteins Proteins 0.000 description 1
- BIWJNBZANLAXMG-YQELWRJZSA-N chloordaan Chemical compound ClC1=C(Cl)[C@@]2(Cl)C3CC(Cl)C(Cl)C3[C@]1(Cl)C2(Cl)Cl BIWJNBZANLAXMG-YQELWRJZSA-N 0.000 description 1
- 150000001841 cholesterols Chemical class 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- VNFPBHJOKIVQEB-UHFFFAOYSA-N clotrimazole Chemical compound ClC1=CC=CC=C1C(N1C=NC=C1)(C=1C=CC=CC=1)C1=CC=CC=C1 VNFPBHJOKIVQEB-UHFFFAOYSA-N 0.000 description 1
- 229960004022 clotrimazole Drugs 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- OMFXVFTZEKFJBZ-HJTSIMOOSA-N corticosterone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@H](CC4)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 OMFXVFTZEKFJBZ-HJTSIMOOSA-N 0.000 description 1
- 229960004544 cortisone Drugs 0.000 description 1
- ZZIALNLLNHEQPJ-UHFFFAOYSA-N coumestrol Natural products C1=C(O)C=CC2=C1OC(=O)C1=C2OC2=CC(O)=CC=C12 ZZIALNLLNHEQPJ-UHFFFAOYSA-N 0.000 description 1
- 239000007822 coupling agent Substances 0.000 description 1
- 238000012866 crystallographic experiment Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- UWFYSQMTEOIJJG-FDTZYFLXSA-N cyproterone acetate Chemical compound C1=C(Cl)C2=CC(=O)[C@@H]3C[C@@H]3[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@@](C(C)=O)(OC(=O)C)[C@@]1(C)CC2 UWFYSQMTEOIJJG-FDTZYFLXSA-N 0.000 description 1
- 229960000978 cyproterone acetate Drugs 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- FMGSKLZLMKYGDP-USOAJAOKSA-N dehydroepiandrosterone Chemical compound C1[C@@H](O)CC[C@]2(C)[C@H]3CC[C@](C)(C(CC4)=O)[C@@H]4[C@@H]3CC=C21 FMGSKLZLMKYGDP-USOAJAOKSA-N 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000003935 denaturing gradient gel electrophoresis Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- RGLYKWWBQGJZGM-ISLYRVAYSA-N diethylstilbestrol Chemical compound C=1C=C(O)C=CC=1C(/CC)=C(\CC)C1=CC=C(O)C=C1 RGLYKWWBQGJZGM-ISLYRVAYSA-N 0.000 description 1
- 229960000452 diethylstilbestrol Drugs 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 229960005309 estradiol Drugs 0.000 description 1
- 229930182833 estradiol Natural products 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000006277 exogenous ligand Substances 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 230000009123 feedback regulation Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000002875 fluorescence polarization Methods 0.000 description 1
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 108090001052 hairpin ribozyme Proteins 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000013632 homeostatic process Effects 0.000 description 1
- 108091008039 hormone receptors Proteins 0.000 description 1
- 229960000890 hydrocortisone Drugs 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 229960002899 hydroxyprogesterone Drugs 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000006882 induction of apoptosis Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 102000027411 intracellular receptors Human genes 0.000 description 1
- 108091008582 intracellular receptors Proteins 0.000 description 1
- 230000008863 intramolecular interaction Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000006651 lactation Effects 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000000670 ligand binding assay Methods 0.000 description 1
- 208000012987 lip and oral cavity carcinoma Diseases 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- PCZOHLXUXFIOCF-BXMDZJJMSA-N lovastatin Chemical compound C([C@H]1[C@@H](C)C=CC2=C[C@H](C)C[C@@H]([C@H]12)OC(=O)[C@@H](C)CC)C[C@@H]1C[C@@H](O)CC(=O)O1 PCZOHLXUXFIOCF-BXMDZJJMSA-N 0.000 description 1
- 229960004844 lovastatin Drugs 0.000 description 1
- QLJODMDSTUBWDW-UHFFFAOYSA-N lovastatin hydroxy acid Natural products C1=CC(C)C(CCC(O)CC(O)CC(O)=O)C2C(OC(=O)C(C)CC)CC(C)C=C21 QLJODMDSTUBWDW-UHFFFAOYSA-N 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 238000007403 mPCR Methods 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- QNTJKRDAWNRWRV-UHFFFAOYSA-M methyl sulfate;trimethyl(propyl)azanium Chemical compound COS([O-])(=O)=O.CCC[N+](C)(C)C QNTJKRDAWNRWRV-UHFFFAOYSA-M 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- VKHAHZOOUSRJNA-GCNJZUOMSA-N mifepristone Chemical compound C1([C@@H]2C3=C4CCC(=O)C=C4CC[C@H]3[C@@H]3CC[C@@]([C@]3(C2)C)(O)C#CC)=CC=C(N(C)C)C=C1 VKHAHZOOUSRJNA-GCNJZUOMSA-N 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- SNQQPOLDUKLAAF-UHFFFAOYSA-N nonylphenol Chemical compound CCCCCCCCCC1=CC=CC=C1O SNQQPOLDUKLAAF-UHFFFAOYSA-N 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 102000027450 oncoproteins Human genes 0.000 description 1
- 108091008819 oncoproteins Proteins 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 229940056360 penicillin g Drugs 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 230000000858 peroxisomal effect Effects 0.000 description 1
- 239000003614 peroxisome proliferator Substances 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 230000010399 physical interaction Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 235000021118 plant-derived protein Nutrition 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 229960002847 prasterone Drugs 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000002243 primary neuron Anatomy 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 229960000856 protein c Drugs 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 229940044551 receptor antagonist Drugs 0.000 description 1
- 239000002464 receptor antagonist Substances 0.000 description 1
- 238000001525 receptor binding assay Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 108010054624 red fluorescent protein Proteins 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000018406 regulation of metabolic process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 102000003702 retinoic acid receptors Human genes 0.000 description 1
- 108090000064 retinoic acid receptors Proteins 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 1
- 229960001225 rifampicin Drugs 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 210000002325 somatostatin-secreting cell Anatomy 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- LXMSZDCAJNLERA-ZHYRCANASA-N spironolactone Chemical compound C([C@@H]1[C@]2(C)CC[C@@H]3[C@@]4(C)CCC(=O)C=C4C[C@H]([C@@H]13)SC(=O)C)C[C@@]21CCC(=O)O1 LXMSZDCAJNLERA-ZHYRCANASA-N 0.000 description 1
- 229960002256 spironolactone Drugs 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 229960002385 streptomycin sulfate Drugs 0.000 description 1
- 238000005556 structure-activity relationship Methods 0.000 description 1
- 210000001768 subcellular fraction Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 238000012956 testing procedure Methods 0.000 description 1
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 239000003104 tissue culture media Substances 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- OCHOKXCPKDPNQU-DFACZZRHSA-N trans-nonachlor Chemical compound ClC1=C(Cl)[C@]2(Cl)[C@@H]3[C@@H](Cl)[C@H](Cl)[C@@H](Cl)[C@@H]3[C@@]1(Cl)C2(Cl)Cl OCHOKXCPKDPNQU-DFACZZRHSA-N 0.000 description 1
- 108091006108 transcriptional coactivators Proteins 0.000 description 1
- 230000037426 transcriptional repression Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 238000003160 two-hybrid assay Methods 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 235000019155 vitamin A Nutrition 0.000 description 1
- 239000011719 vitamin A Substances 0.000 description 1
- 150000003710 vitamin D derivatives Chemical class 0.000 description 1
- 235000005282 vitamin D3 Nutrition 0.000 description 1
- 239000011647 vitamin D3 Substances 0.000 description 1
- QYSXJUFSXHHAJI-YRZJJWOYSA-N vitamin D3 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C\C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-YRZJJWOYSA-N 0.000 description 1
- 229940046008 vitamin d Drugs 0.000 description 1
- 229940021056 vitamin d3 Drugs 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70567—Nuclear receptors, e.g. retinoic acid receptor [RAR], RXR, nuclear orphan receptors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- High Energy & Nuclear Physics (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to a novel nuclear receptor called "L66" or also FXR-.beta. a homologue of the FXR-.alpha., a prototypical type 2 nuclear receptor. The invention also relates to the isolated nucleic acid sequence of L66 and the isolated protein thereof. The invention further relates to processes for isolating and/or producing the nucleic acid or the protein as well as methods of use of the receptor L66.
Description
Title:
BACKGROUND OF THE INVENTION
Multicellular organisms are dependent on advanced mechanisms of information transfer between cells and body compartments. The information that is transmitted can be highly complex and can result in the alteration of genetic programs involved in cellular differentiation, proliferation, or reproduction. The signals, or hormones, are often simple molecules, such as peptides, fatty acid, or cholesterol derivatives.
Many of these signals produce their effects by ultimately changing the transcription of specific genes. One well-studied group of proteins that mediate a cell's response to a variety of signals is the family of transcription factors lenown as nuclear receptors, hereinafter referred to often as "NR". Members of this group include receptors for steroid hormones, vitamin D, ecdysone, cis and trans retinoic acid, thyroid hormone, bile acids, cholesterol-derivatives, fatty acids (and other peroxisomal proliferators), as well as so-called orphan receptors, proteins that are structurally similar to other members of this group, but for which no ligands are known (Escriva, H. et al., Ligand binding was acquired during evolution of nuclear receptors, PNAS, 94, 6803 -6808, 1997). Orphan receptors may be indicative of unknown signaling pathways in the cell or may be nuclear receptors that function without ligand activation. The activation of transcription by some of these orphan receptors may occur in the absence of an exogenous ligand and/or through signal transduction pathways originating from the cell surface (Mangelsdorf, D. J. et al., The nuclear receptor superfamily: the second decade, Cell 83, 835-839, 1995).
In general, three functional domains have been defined in NRs. An amino terminal domain is believed to have some regulatory function. A DNA-binding domain hereinafter referred to as "DBD" usually comprises two zinc finger elements and recognizes a specific Hormone Responsive Element hereinafter referred to as "HRE"
within the promoters of responsive genes. Specific amino acid residues in the "DBD"
have been shown to confer DNA sequence binding specificity (Schena, M. &
Yamamoto, K.R., Mammalian Glucocorticoid Receptor Derivatives Enhance Transcription in Yeast, Science, 241:965-967, 1988). A Ligand-binding-domain hereinafter referred to as "LBD" is at the carboxy-terminal region of known NRs. In °:
the absence of hormone, the LBD appears to interfere with the interaction of the DBD
with its HRE. Hormone binding seems to result in a conformational change in the NR
and thus opens this interference (Brzozowski et al., Molecular basis of agonism and antagonism in the oestogen receptor, Nature, 389, 753 - 758, 1997; Wagner et al., A
structural role for hormone in the fihyroid hormone receptor, Nature, 378, 690 - 697.
1995). A NR without the HBD constitutively activates transcription but at a low level.
Both the amino-terminal domain and the LBD appear to have firanscription activation functions hereinafter referred to as "TAF". Acidic residues in the amino-terminal domains of some nuclear receptors may be important for these transcription factors to interact with RNA polymerase. TAF activity may be dependent on interactions with other protein factors or nuclear components (Diamond et al., Transcription Factor Interactions: Selectors of Positive or Negative Regulation from a Single DNA
Element, Science, 249:1266-1272 , 1990). Certain oncoproteins (e.g., c-Jun and c-Fos) can show synergistic or antagonistic activity with glucocorticoid receptors (GR) in transfected cells. Furthermore, the receptors for estrogen and vitamins A
and D, and fatty acids have been shown to interact, either physically or functionally, with the Jun and Fos components of AP-1 in the transactivation of steroid- or AP-1 regulated genes.
Coactivators or transcriptional activators are proposed to bridge between sequence specific transcription factors, the basal transcription machinery and in addition to influence the chromatin structure of a target cell. Several proteins like SRC-1, ACTR, and Grip1 interact with NRs in a ligand enhanced manner (Heery et al., A
signature motif in transcriptional coactivators mediates binding to nuclear receptors, Nature, 387, 733 - 736; Heinzel et al., A complex containing N-CoR, mSin3 and histone deacetylase mediates transcriptional repression, Nature 387, 43 - 47, 1997).
Furthermore, the physical interaction with negative receptor-interacting proteins or corepressors has been demonstrated (Xu et al., Coactivator and Corepressor complexes in nuclear receptor function, Curr Opin Genet Dev, 9 (2), 140 - 147, 1999).
Nuclear receptor modulators like steroid hormones affect the growth and function of specific cells by binding to intracellular receptors and forming nuclear receptor-ligand complexes. Nuclear receptor-hormone complexes then interact with a hormone response element (HRE) in, the control region of specific genes and alter specific gene expression.
Over the past decade, new members of the nuclear hormone gene family have been identified that lack known ligands. These orphan receptors can be used to uncover signaling molecules that regulate yet unidentified physiological networks.
Some of these orphan receptors are constitutively active in transactivate target genes without the need to interact with a ligand (MangelsdorFet al., 1995).
Farnesoid X Receptor alpha (hereinafter FXR-a) is a prototypical type 2 nuclear receptor (US Pat. 6,005,086) which activates genes upon binding to promoter region of target genes in a heterodimeric fashion with Retinoid X Receptor (hereinafter RXR, Forman et al., Cell, 81, 687-93, 1995).. The relevant physiological ligands of FXR-a seem to be bile acids. The most potent is chenodeoxycholic acid, which regulates the expression of several genes that participate in bile acid homeostisis. Farnesoid, originally described fio activate the rat ortholog at high concentration does not activate the human or mouse receptor. It is highly expressed in the liver, intestine and kidney. Like LXR-a FXR-a is involved in intracrine signaling.
The relevant physiological ligands of NR1 H4 (as FXR-a is also called) seem to be bile acids (Makishima et al., Science, 284, 1362-65, 1999; Parks et al., Science, 284, 1365-68, 1999). The most potent is chenodeoxycholic acid, which regulates the expression of several genes that participate in bile acid homeostasis.
Consequently, FXR-a is proposed to be a nuclear bile acid sensor. As a result, it modulates both, the synthetic output of bile acids in the liver and their recycling in the intestine (by regulating bile acid binding proteins?. Its is also activated by retoinic acid and TTNPB at supraphisiological concentration. Furthermore, it regulates the conversion of dietry cholesterol into bile acids by regulation the metabolizing genes like CYP7-a. This is a feed back regulation since the receptor is activated by bile acids.
Through its regulatory function in cholesterol and bile acid metabolism an FXR-a homologue could serve as a target for cholesterol lowering drugs and exert beneficial effects in diseases like artheriosclerosis and other metabolic disorders.
It was thus an object of the present invention to provide for a novel nuclear receptor.
In a preferred embodiment of the invention it was an object to provide for a homologue of FXR-a. It was an object of the present invention to provide for means of producing this receptor as well as means of screening for agonists and antagonists to the receptor. Further objects of the invention are outlined below.
SUMMARY OF THE INVENTION
The present invention provides, inter alia, a novel nuclear receptor protein.
In a preferred embodiment of the invention a novel FXR-a homologue is provided for.
Also provided is the nucleic acid sequence encoding this novel nuclear receptor protein, as well as compounds and methods for using this protein and its nucleic acid sequence.
The present invention provides a novel proteins, nucleic acids, and methods useful for developing and identifying compounds for the treatment of such diseases and disorders as metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases and in preferred embodiments, high cholesterol and artheriosclerosis as well as other metabolic disorders.
Identified and disclosed herein is the protein sequence for a novel nuclear receptor and the nucleic acid sequence encoding this nuclear receptor L66, which we call the L66 nuclear receptor (or simply "L66") receptor, or also FXR-(i.
The importance of this discovery is manifested in the effects of FXR-~3 to modulate genes involved in cellular functions like regulation of metabolism and cell homeostasis, cell proliferation and differentiation, pathological cellular aberrations, or cellular defense mechanisms including tumor development, i.e. cancer.
Thus, this L66 protein is useful for screening for L66 agonists and antagonist activity for controlling these conditions.
In one aspect of the present invention, we provide isolated nucleic acid sequences for a novel receptor, the L66 receptor. In particular, we provide the cDNA
sequences, protein sequences as well as the genomic sequences encoding the human L66 receptor, as well as the cDNA sequence, protein sequence and genomic sequence of the Mus musculus (mouse) receptor.
These nucleic acid sequences have a variety of uses. For example, they are useful for making vectors and for transforming cells, both of which are ultimately useful for production of the L66 protein.
They are also useful as scientific research tools for developing nucleic acid probes for determining L66 expression levels, e.g., to identify diseased or otherwise abnormal states. They are useful for developing analytical tools such as anti sense oligonucleotides for selectively inhibiting expression of the L66 gene to determine physiological responses.
In another aspect of the present invention, we provide a homogenous composition comprising the L66 protein. The protein is useful for screening drugs for agonist and antagonist activity, and, therefore, for screening for drugs useful in regulating physiological responses associated with L66. Specifically, antagonists to the receptor could be used to treat diseases and disorders as metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases and in preferred embodiments, high cholesterol and artheriosclerosis as well as other metabolic disorders, whereas agonists could be used for the treatment of these conditions. The proteins are also useful for developing antibodies for detection of the protein.
Flowing from the foregoing are a number of other aspects of the invention, including (a) vectors, such as plasmids, comprising the L66 nuclear receptor nucleic acid sequence that may further comprise additional regulatory elements, e.g., promoters, (b) transformed cells that express the L66, (c) nucleic acid probes, (d) antisense oligonucleotides, (e) agonists, (f) antagonists, and (g) transgenic mammals.
Further aspects of the invention comprise methods for making and using the foregoing compounds and compositions.
The foregoing merely summarizes certain aspects of the present invention and is not intended, nor should it be construed, to limit the invention in any manner.
All patents and other publications recited herein are hereby incorporated by reference in their entirety.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
THE L66 PROTEIN AND NUCLEIC ACID:
The present invention comprises a novel member of the nuclear receptor superfamily which the inventors herein refer to as L66. Particularly preferred embodiments of the L66 receptor are those having an amino acid sequence substantially the same as SEQ ID NO. 3, 24 and/or 19. Examination of the amino acid sequence confirms that the present protein is indeed a member of the nuclear receptor family (see also US
6,005,086) which is closely related to FXR (see also figure 3 for the domain composition). The carboxy-terminal ligand binding domain "LBD" of L66 is a complex region encoding subdomains for ligand binding, often dimerization and transcriptional activation.
The nucleic acids claimed above may be present in various forms, i.e. as an RNA, DNA, cDNA or as genomic DNA.
As used herein, if reference to L66, the L66 receptor, the nuclear receptor L66 or the L66 nuclear receptor is made it is meant as a reference to any protein having an amino acid sequence substantially the same as SEQ ID NO.: 3, 24 and/or 19.
The present invention also comprises nucleic acid sequences encoding the L66 receptor, which nucleic acid sequences are substantially the same as SEQ ID
NO.
1 and SEQ ID NO. 17 (cDNAs) or splice variants thereof 4, 6, 8, 10, 12, 22, their sequence complements SEQ ID NO. 2 and 18 or complements of said splice variants 5, 7, 9, 11, 13, 23. SEQ ID NO 4encodes the human cDNA L66 receptor and is a preferred embodiment. SEQ OD NO. 17 (cDNA) encodes the mouse receptor (Mus musculus). SEQ ID NO. 20 represents the genomic sequence of L66 locus from the mouse (Mus musculus) (see also Fig. 15)1n one embodiment, the nucleic acid sequence according to the invention comprises the sequence according to SEQ ID
NO. 1, 4, 6, 8, 10, 12, 17, 20( 4 to 12 representing various splice variants of the human L66) and/or 22 or portions thereof, in a preferred embodiment nucleic acid sequence according to the invention consists of the sequence according to SEQ
ID
NO. 1, 4, 6, 8, 10 ( 4 to 10 representing various splice variants of the human L66) and/or 17 or portions thereof.
Herein the "complement" refers to the complementary strand of the nucleic acid according to the invention, thus the strand that would hybridize to the nucleic acid according to the invention. in accordance with standard biological terminology all DNA sequences herein are however written in 5'-3' orientation, thus the if a complement is mentioned (see also figures) it is actually a "reverse"
complement (as also stated in the figures). For simplification purposes they may however sometimes be referred to simply as "complements".
As used herein, a protein "having an amino acid sequence substantially the same or similar as SEQ ID NO x" (where "x" is the number of one of the protein sequences recited in the Sequence Listing) means a profiein whose amino acid sequence is the same as SEQ ID NO x or differs only in a way such that at least 50% of the residues compared in a sequence alignment with SEQ ID NO. x are identical, preferably 75%
of the residues are identical, even more preferably 95% of the residues are identical and most preferably at least 98% of the residues are identical.
Those skilled in the art will appreciate that conservative substitutions of amino acids can be made without significantly diminishing the protein's affinity for interacting proteins, DNA binding sites, L66 receptor modulators, e.g. small molecular hydrophobic compounds, or RNA.
Other substitutions may be made that increase the protein's affinity for these compounds. Making and identifying such proteins is a routine matter given the teachings herein, and can be accomplished, for example, by altering the nucleic acid sequence encoding the protein (as disclosed herein), inserting it into a vector, transforming a cell, expressing the nucleic acid sequence, and measuring the binding affinity of the resulting protein, all as taught herein.
As used herein the term "a molecule having a nucleotide sequence substantially the same as SEQ ID NO y" (wherein "y" is the number of one of the protein-encoding nucleotide sequences listed in the Sequence Listing) means a nucleic acid encoding a protein "having an amino acid sequence substantially the same as SEQ ID NO
y+I"
(wherein "y+I" is the number of the amino acid sequence for which nucleotide sequence "y" codes) as defined above. This definition is intended to encompass natural allelic variations in the L66 sequence. Cloned nucleic acid provided by the present invention may encode L66 protein of any species of origin, including (but not limited to), for example, mouse, rat, rabbit, cat, dog, primate, and human.
Preferably, the nucleic acid provided by the invention encodes L66 receptors of mammalian, preferably mouse and most preferably human origin.
_ 9 _ Preferably, the L66 receptors proteins provided by the invention are of mammalian, more preferably mouse and most preferably human origin.
The inventors have found (see figures and examples) that the L66 gene in humans is expressed primarily in testis. In mice however expression may be observed also in other tissues.
IDENTIFICATION OF VARIANTS AND HOMOLOGUES AS WELL AS USE OF
PROBES:
Nucleic acid hybridization probes provided by the invention are nucleic acids consisting essentially of the nucleotide sequences complementary to any sequence depicted in SEQ ID NO. 1, 4, 6, 8, 10, 12, 17, 20, 22, 2, 5, 7, 9, 11, 13, 18, 21 and/or 23 or a part thereof and that are effective in nucleic acid hybridization Nucleic acid hybridization probes provided by the invention are nucleic acids capable of detecting i.e. hybridizing to the gene encoding the polypeptides according to SEQ
ID NO. 3, 24 and/or 19.
Nucleic acid probes are useful for detecting L66 gene expression in cells and tissues using techniques well-known in the art, including, but not limited to, Northern blot hybridization, in situ hybridization, and Southern hybridization to reverse transcriptase - polymerase chain reaction product DNAs. The probes provided by the present invention, including oligonucleotide probes derived therefrom, are also useful for Southern hybridization of mammalian, preferably human, genomic DNA for screening for restriction fragment length polymorphism (RFLP) associated with certain genetic disorders. As used herein, the term complementary means a nucleic acid having a sequence that is sufficiently complementary in the Watson-Crick sense to a target nucleic acid to bind to the target under physiological conditions or experimental conditions those skilled in the art routinely use when employing probes.
It is understood in the art that a nucleic acid sequence will hybridize with a complementary nucleic acid sequence under high stringent conditions as defined herein, even though some mismatches maybe present. Such closely matched, but not perfectly complementary sequences are also encompassed by the present invention. For example, differences may occur through genetic code degeneracy, or by naturally occurring or man made mutations and such mismatched sequences would still be encompassed by the present claimed invention.
Preferably, the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO. 1, 4, 17, and/or 22 or splice variants thereof) and/or their complements can be used to derive oligonucleotide fragments (probes) of various length. If the probe is used to detect a human L66 sequence most preferably a complement of SEQ ID NO. 1 and/or 4 or its complement is used. If the probe is supposed to detect a mouse or a rodent sequence probes complementary to SEQ ID NO. 17 and/or 22, or their respective complements are preferred. Stretches of 17 to 30 nucleotides are used frequently but depending on the screening parameters longer sequences as 40, 50, 100, 150 up to the full length of the sequence may be used. Those probes can be synthesized chemically and are obtained readily from commercial oligonucleotide providers. Chemical synthesis has improved over the years and chemical synthesis of oligonucleotides as long as 100-200 bases is possible. The field might advance further to allow chemical synthesis of even longer fragments. Alternatively, probes can also be obtained by biochemical de novo synthesis of single stranded DNA.
In this case the nucleotide sequence of the nuclear receptor L66 or its complement (see figures) serve as a template and the corresponding complementary strand is synthesized. A variety of standard fiechniques such as nick translation or primer extension from specific primers or short random oligonucleotides can be used to synthesize the probe (Molecular Cloning: A Laboratory Manual (3 Volume Set) by Joseph Sambrook, David W. Russell, Joe Sambrook, 2100 pages 3rd edition (January 15, 2001; . Molecular cloning: a laboratory manual. Cold Spring Harbor Press, Cold Spring Harbor, 1989)). Nucleic acid reproduction technologies exemplified by the polymerise chain reaction (Saiki, R.K. et al. Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerise. Science 239, 487-491 (1988)) are commonly applied to synthesize probes. In the case of techniques using specific primers the nucleic acid sequence of the nuclear receptor L66 or its complement is not only used as a template in the biochemical reaction but also to derive the specific primers which are needed to prime the reaction.
In some cases one might also consider to use the nucleic acid sequence of the cofactor or its complement as a template to synthesize an RNA probe. A
promoter sequence for a DNA-dependent RNA polymerase has to be introduced at the 5'-end of sequence. As an example this can be done by cloning the sequence infio a vector which carries the respective promoter sequence. It is also possible to introduce the needed sequence by synthesizing a primer with the needed promoter in the form of a 5' "tail". The chemical synthesis of a RNA probe is another option.
Appropriate means are available to detect the event of a hybridization. There is a wide variety of labels and detection systems, e.g. radioactive isotopes, fluorescent, or chemiluminescent molecules which can be linked to the probe. Furthermore, there are methods of introducing haptens which can be detected by antibodies or other ligands such as the avidin/biotin high affinity binding system.
Hybridization can take place in solution or on solid phase or in combinations of the two, e.g. hybridization in solution and subsequent capture of the hybridization product onto a solid phase by immobilized antibodies or by ligand coated magnetic beads.
Hybridization probes act by forming selectively duplex molecules with complementary stretches of a sequence of a gene or a cDNA. The selectivity of the process can be controlled by varying the conditions of hybridization. To select sequences which are identical highly homologous to the sequence of interest stringent conditions for the hybridization will be used, e.g. low salt in the range of 0.02 M to 0.15 M
salt and/or high temperatures in the range from 50°C degrees centigrade to 70°C degrees centigrade. Stringency can be further improved by the addition of formamide to the hybridization solution. The use of stringent conditions which means that only little mismatch or a complete match will lead to a hybridization product would be used to isolate closely related members of the same gene family. Thus, as used herein stringent hybridization conditions are those where between 0.02 M to 0.15 M
salt and/or high temperatures in the range from 50°C degrees centigrade to 70°C
degrees centigrade are applied.
12_ The use of highly stringent conditions or conditions of "high stringency"
means that only very little mismatch or a complete match which lead to a hybridization product would be used to isolate very closely related members of the same gene family.
Thus, as used herein highly stringent hybridization conditions are those where between 0.02 - 0.3 M salt and 65°C degrees centigrade are applied for about 5 to 18 hours of hybridization time and additionally, the sample filters are washed twice for about 15 minutes each at between 60°C - 65°C degrees centigrade, wherein the first washing fluid contains about 0.1 M salt (NaCI and/or Sodium Citrate) and the second contains only about 0.02 M salfi (NaCI and/,or Sodium Citrate). In a preferred embodiment the following conditions are considered to be highly stringent:
Hybridisation in a buffer containing 2 x SSC (0.03 M Sodium Citrate, 0.3 M
NaCI) at 65°C - 68°C degrees centigrade for 12 hours, followed by a washing step for 15 minutes in 0.5 x SSC, 0.1 % SDS, and a washing step for 15 minutes at 65°C degrees centigrade in 0.1 x SSC, 0.1 % SDS.
Less stringent hybridization conditions, e.g. 0.15 M salt - 1 M salt and/or temperatures from 22°C degrees centigrade to 56°C degrees centigrade are applied in order to detect functionally equivalent genes in the same species or for orthologous sequences from other species.
Unspecific hybridization products are removed by washing the reaction products repeatedly in 2 x SSC solution and increasing the temperature.
DEGENERATE PCR AND CLONING OF HOMOLOGUES
The nucleotide sequence of the nuclear receptor L66 or its complement can be used to design primers for a polymerase chain reaction. Due to the degeneracy of the genetic code the respective amino acid sequence is used to design oligonucleotides in which varying bases coding for the same amino acid are included. Numerous design rules for degenerate primers have been published (Compton et al, 1990).
As in hybridization there are a number of factors known to vary the stringency of the PCR. The most important parameter is the annealing temperature. To allow annealing of primers with imperFect matches annealing temperatures are often much lower than the standard annealing temperature of 55°C, e.g. 35°C
to 52°C degrees can be chosen. PCR reaction products can be cloned. Either the PCR product is cloned directly, with reagents and protocols from commercial manufacturers (e.g.
from Invitrogen, San Diego, USA). Alternatively, restriction sites can be introduced intro the PCR product via a 5'-tail of the PCR primers and used for cloning.
Primers for the amplification of the entire or partial pieces of the L66 gene or mRNA, or for reverse transcription may be designed making use of the sequences according to the invention, i.e. those depicted in the figures' below.
GENETIC VARIANTS
Fragments from the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO.
1, 4 or the mouse, ,i.e. 17 or 22) or their complements can be used to cover the whole sequence with overlapping sets of PCR primers. Also the genomic sequences may be used (see figures for sequences). These primers are used to produce PCR
products using genomic DNA from a human diversity panel of healthy individuals or genomic DNA from individuals which are phenotypically conspicuous. Also the genomic sequences may be used, i.e. that of the human clone as deposited by the applicant (deposit number DSM 14483 ) or that of the mouse according to SEQ ID
NO. 20 (or the complement thereof). The I?CR products can be screened for polymorphisms, for example by denaturing gradient gel electrophoresis, binding to proteins detecting mismatches or cleaving heteroduplices or by denaturing high-performance liquid chromatography. Products which display mutations need to be sequenced to identify the nature of the mutation. Alternatively, PCR products can be sequenced directly opnitting the mutation screening step to identify genetic polymorphisms. If genetic variants are identified and are associated wifih a discrete phenotype, these genetic variations can be included in diagnostic assays. The normal variation of the human population is of interest in designing screening assays as some variants might interact better or worse with a respective lead substance (a pharmacodynamic application). Polymorphisms or mutations which can be correlated to phenotypic outcome are a tool to extend the knowledge and the commercial applicability of the nucleotide sequence of the nuclear receptor L66 or its complement or their gene product, as variants might have a slightly different molecular behavior or desired properties. Disease-causing mutations or polymorphisms allow the replacement of this disease inducing gene copy with a wild-type copy by means of gene therapy approaches and/or the modulation of the activity of the gene product by drugs.
PREPARATION OF POLYNUCLEOTIDES:
DNA which encodes receptor L66 may be obtained, in view of the instant disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from appropriate cells or cell line cultures, by screening genomic libraries from appropriate cells, or by combinations of fihese procedures, as illustrated below.
Screening of mRNA or genomic DNA may be carried out with oligonucleotide probes generated from the L66 nucleotide sequences information provided herein.
Probes may be labeled with a detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent group in accordance with known procedures and used in conventional hybridization assays, as described in greater detail in the Examples below. Alternatively, the L66 nucleotide sequence may be obtained by use of the polymerase chain reaction (PCR) procedure, with the PCR oligonucleotide primers being produced from the L66 nucleotide sequences provided herein.
Upon purification or synthesis, the nucleic acid according to the invention may be labeled, e.g. for use as a probe.
As single and differential labeling agents and methods, any agents and methods which are known in the art can be used. For example, single and differential labels may consist of fihe group comprising enzymes such as ~i-galactosidase, alkaline phosphatase and peroxidase, enzyme substrates, coenzymes, dyes, chromophores, fluorescent, chemiluminescent and bioluminescent labels such as FITC, CyS, Cy5.5, Cy7, Texas-Red and IRD40(Chen et al. (1993), J. Chromatog. A 652: 355-360 and Kambara et al. (1992), Electrophoresis 13: 542-546), ligands or haptens such as biotin, and radioactive isotopes such as 3H, 355, 32P 1251 and'4C.
EXPRESSION OF THE L66 PROTEIN/POLYPETIDE:
The nuclear receptor L66 nucleic acid or polypeptide may be synthesized in host cells transformed with a recombinant expression construct comprising a nucleic acid encoding the nuclear receptor L66.
Such a recombinant expression construct can also be comprised of a vector that is a replicable DNA construct.
Amplification vectors do not require expression control domains. All that is needed is fihe ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants. See, Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd Edition, Cold Spring Harbor Press, New York, 1989).
An expression vector comprises a polynucleotide operatively linked to a prokaryotic promoter. Alternatively, an expression vector is a polynucleotide operatively linked to an enhancer-promoter that is a eukaryotic promoter, and the expression vector further has a polyadenylation signal that is positioned 3' of the carboxy-terminal amino acid and within a transcriptional unit of the encoded polypeptide. A
promoter is a region of a DNA molecule typically within about 500 nucleotide pairs in front of (upstream of) the point at which transcription begins (i.e., a transcription start site). In general, a vector contains a replicon and control sequences which are derived from species compatible with the host cell. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells.
Another type of discrete trahscription regulatory sequence element is an enhancer.
An enhancer provides specificity of time, location and expression level for a particular encoding region (e.g., gene). A major function of an enhancer is to increase the level of transcription of a coding sequence in a cell.
As used herein, the phrase "enhancer-promoter" means a composite unit that contains both enhancer and promoter elements. An enhancer-promoter is operatively linked to a coding sequence that encodes at least one gene product.
An enhancer-promoter used in a vector construct of the present invention may be any enhancer-promoter that drives expression in a prokaryotic or eukaryotic cell to be transformed/transfected.
A coding sequence of an expression vector is operatively linked to a transcription terminating region. RNA polymerase transcribes an encoding DNA sequence through a site where polyadenylation occurs.
An expression vector comprises a polynucleotide that encodes a nuclear receptor L66 polypeptide. Such a polynucleotide is meant to include a sequence of nucleotide bases encoding a nuclear receptor L66 polypeptide sufficient in length to distinguish said segment from a polynucleotide segment encoding a non- nuclear receptor polypeptide.
A polypeptide of the invention may also encode biologically functional polypeptides or peptides which have variant amino acid sequences, such as with changes selected based on considerations such as the relative hydropathic score of fihe amino acids being exchanged.
These variant sequences are those isolated from natural sources or induced in the sequences disclosed herein using a mutagenic procedure such as site-directed mutagenesis.
Furthermore, an expression vector of the present invention may contain regulatory elements for optimized translation of the polypeptide in prokaryotic or eukaryotic systems. This sequences are operatively located around the transcription start site and are most likely similar to ribosome recognition sites like prokaryotic ribosome binding sites (RBS) or eukaryotic Kozak sequences as known in the art (Kozak M., Initiation of translation in prokaryotes and eukaryotes. Gene 234, 187-208 (1999).
An expression vector of the present invention is useful both as a means for preparing quantities of the nuclear receptor L66 polypeptide-encoding DNA itself, and as a means for preparing the encoded nuclear receptor L66 polypeptide and peptides.
It is contemplated that where nuclear receptor L66 polypeptides of the invention are made by recombinant means, one may employ either prokaryotic or eukaryotic expression vectors as shuttle systems.
Where expression of recombinant nuclear receptor L66 polypeptides is desired and a eukaryotic host is contemplated, it is most desirable to employ a vector such as a plasmid, that incorporates a eukaryotic origin of replication. Additionally, for the purposes of expression in eukaryotic systems, one desires to position the nuclear receptor L66 encoding sequence or if desired parts thereof (SEQ ID NO. 1, 4, 6, 8, 10, 12, 17, and/or 22) adjacent to and under the control of an effective eukaryotic promoter, To bring a coding sequence under control of a promoter, whether it is eukaryotic or prokaryotic, what is generally needed is to position the 5' end of the translation initiation site of the proper translational reading frame of the polypeptide between about 1 and about 2000 nucleotides 3' of or downsfiream with respect to the promoter chosen.
Furthermore, where eukaryotic expression is anticipated, one would typically desire to incorporate into the transcriptional unit which includes the nuclear receptor L66 polypeptide, an appropriate polyadenylation side.
The invention provides homogeneous compositions of mammalian nuclear receptor L66 polypeptide produced by transformed prokaryotic or eukaryotic cells as provided herein. Such homogeneous compositions are intended to be comprised of mammalian nuclear receptor L66 protein that comprises at least 90% of the protein in such homogenous composition. The invention also provides membrane preparation from cells expressing mammalian nuclear receptor L66 polypeptide as the result of transformation with a recombinant expression construct, as described here.
Within the scope of the present invention the terms recombinant protein or coding sequence both also include tagged versions of the protein depicted in SEQ ID
NO. 3, 19 and/or 22, and/or encoded by the nucleic acids according to the invention and fusion proteins of said proteins or parts thereof such as splice variants with any other recombinant protein. Tagged versions here means that small epitopes of 3-20 amino acids are added fio the original protein by extending the coding sequence either at the 5'or the 3'terminus leading to N-terminal or C-terminal extended proteins respectively, or that such small epitopes are included elsewhere in the protein. The same applies for fusion proteins where the added sequences are coding for longer proteins, varying between 2 and 100 kDa. Tags and fusion proteins are usually used to facilitate purification of recombinant proteins by specific antibodies or affinity matrices or to increase solubility of recombinanfi proteins within the expression host.
Fusion proteins are also of major use as essential parts of yeast two hybrid screens for interaction partners of recombinant proteins.
Tags used in the scope of the present invention may include but are not limited to the following: EEF (alpha Tubulin), B-tag (QYPALT), E tag (GAPVPYPDPLEPR) c-myc Tag (EQKLISEEDL), Flag epitope (DYKDDDDK, HA tag (YPYDVPDYA), 6 or 10 x His Tag, HSV (QPELAPEDPED), Pk-Tag (GKPIPNPLLGLDST), protein C
(EDQVDPRLIDGK), T7 (MASMTGGQQMG), VSV-G (YTDIEMNRLGK), Fusion proteines may include Thioredoxin, Glutathiontransferase (GST), Maltose binding Protein (MBP), Cellulose Binding protein (CBD), chitin binding protein, ubiquitin, the Fc part of Immunoglobulins, and the IgG binding domain of Staphylococcus aureus protein A. These examples of course are illustrative and not limiting and the standard amino acid one letter code was used above.
For expression of recombinant proteins in living cells or organisms, vector constructs harboring recombinant L66 nuclear receptor as set forth in SEQ ID NO. 1 or 17 are transformed or transfected into appropriate host cells. Preferably, a recombinant host cell of the present invention is transfected with a polynucleotide of SEQ ID
NO. 1, 4, 22 or 17.
Means of transforming or transfecting cells with exogenous polynucleotide such as DNA molecules are well known in the art and include techniques such as calcium-phosphate- or DEAE-dextran-mediated transfection, protoplast fusion, electroporation, liposome mediated transfection, direct microinjection and virus infection (Sambrook et al., 1989).
The most frequently applied technique for transformation of prokaryotic cells is transformation of bacterial cells after treatment with calciumchloride to increase permeability (Dagert & Ehrlich, 1979), but a variety of other methods is also available for one skilled in the art.
The most widely used method for transfection of eukaryotic cells is transfection mediated by either calcium phosphate or DEAE-dextran. Although the mechanism remains obscure, it is believed that the transfected DNA enters the cytoplasm of the cell by endocytosis and is transported to the nucleus. Depending on the cell type, up to 90% of a population of cultured cells may be transfected at any one time.
Because of its high efficiency, transfection mediated by calcium phosphate or DEAE-dextran is the method of choice for studies requiring transient expression of the foreign nucleic acid in large numbers of cells. Calcium phosphate-mediated transfection is also used to establish cell lines that integrate copies of the foreign DNA, which are usually arranged in head-to-tail tandem arrays into the host cell genome.
In the protoplast fusion method, protoplasts derived from bacteria carrying high numbers of copies of a plasmid of interest are mixed directly with cultured mammalian cells. After fusion of the cell membranes (usually with polyethylene glycol), the contents of the bacterium are delivered into the cytoplasm of the mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast fusion is not as efficient as transfection for many of the cell lines that are commonly used for transient expression assays, but it is useful for cell lines in which endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple copies of the plasmid DNA tandemly integrated into the host chromosome.
The application of brief, high-voltage electric pulses to a variety of mammalian and plant cells leads to the formation of nanometer sized pores in the plasma membrane.
DNA is taken directly into the cell cytoplasm either through these pores or as a consequence of the redistribution of membrane components that accompanies closure of the pores. Electroporation may be extremely efficient and may be used both for transient expression of cloned genes and for establishment of cell lines that carry integrated copies of the gene of interest. Electroporation, in contrast to calcium phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell lines that carry one, or at most a few, integrated copies of the foreign DNA.
Liposome transfection involves encapsulation of DNA and RNA within liposomes, followed by fusion of the liposomes with the cell membrane. The mechanism of how DNA is delivered into the cell is unclear but transfection efficiencies may be as high as 90%.
Direct microinjection of a DNA molecule into nuclei has the advantage of not exposing DNA to cellular compartments such as !ow-pH endosomes. Microinjection is therefore used primarily as a method to establish lines of cells that carry integrated copies of the DNA of interest.
The use of adenovirus as a vector for cell transfection is well known in the art.
Adenovirus vector-mediated cell transfection has been reported for various cells (Stratford-Perricaudet et al., 1992), A transfected cell may be prokaryotic or eukaryotic, transfection may be transient or stable. Where it is of interest to produce a full length human or mouse L66 protein, cultured mammalian mouse, or human cells are of particular interest.
!n another aspect, the recombinant host cells of the present invention are prokaryotic host cells. In addition to prokaryotes, eukaryotic microbes, such as yeast may also be used illustrative examples for suitable cells and organisms for expression of recombinant proteins are belonging to but not limited to the following examples:
Insect cells, such as Drosophila Sf21, SF9 cells or others, Expression strains of Escherichia coli, such as XL1 blue, BRL21, M15, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Hansenlua polymorpha and Pichia pastoris strains, immortalized mammalian cell lines such as AtT-20, VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, and W138, BHK, COSM6, COS-7, 293 and MDCK
cells, BHK-21 cells, Att 20HeLa cells, HeK 294, T47 D cells and others.
Expression of recombinant proteins within the scope of this invenfiion can also be performed in vitro. This may occur by a two step procedure, thereby producing first mRNA by in vitro transcription of an apt polynucleotide construct followed by in vitro translation with convenient cellular extracts. These cellular extracts may be reticulocyte lysates but are not limited to this type. In vitro transcription may be performed by T7 or SP6 DNA polymerase or any other RNA polymerase which can recognize per se or with the help of accessory factors the promoter sequence contained in the recombinant DNA construct of choice. Alternatively one of the recently made available one step coupled transcription/translation systems may be used for in vitro translation of DNA coding for the proteins of this invention, e.g. from Roche Molecular Biochemicals. One illustrative but not limiting example for such a system is the TNT~ T7 Quick System by Promega.
Expression of recombinant proteins in transfected cell may occur constitutively or upon induction. Procedures depend on the Cell/vector combination used and are well known in the art. In all.cases, transfected cells are maintained for a period of time sufficient for expression of the recombinant L66 nuclear receptor protein. A
suitable maintenance time depends strongly on the cell type and organism used and is easily ascertainable by one skilled in the art. Typically, maintenance time is from about 2 hours to about 14 days. For the same reasons and for sake of protein stability and solubility incubation temperatures during maintenance time may vary from 20°C to 42 °C.
Recombinant proteins are recovered or collected either from the transfected cells or the medium in which those cells are cultured. Recovery comprises cell disruption, isolation and purification of the recombinant protein. Isolation and purification techniques for polypeptides are well-known in the art and include such procedures as precipitation, filtration, chromatography, elecfirophoresis and the like.
In a preferred embodiment, purification includes but is not limited to affinity purification of tagged or non-tagged recombinant proteins. This is a well established robust technique easily adapted to any tagged protein by one skilled in the art. For affinity purification of tagged proteins, small molecules such as gluthathione, maltose or chitin, specific proteins such as the IgG binding domain of Staphylococcus aureus protein A, antibodies or specific chelates which bind with high affinity to the tag of the recombinant protein are employed. For afFinity purification of non-tagged proteins 22_ .
specific monoclonal or polyclonal antibodies, which were raised against said protein, can be used. Alternatively immobilized specific interactors of said protein may be employed for affinity purification. Interactors include native or recombinant proteins as well as native or artificial specific low molecular weight ligands.
CHEMICAL SYNTHESIS OF THE POLYPEPTIDE ACCORDING TO THE
INVENTION:
Alternatively, the protein itself may be produced using chemical methods to synthesize any of the amino acid sequences according to the invention or that is encoded by the nucleotide sequences according to the invention (SEQ ID NO. 1, 4, 17 or 22) and/or a portion thereof and/or splice variants thereof. For example, peptide synthesis can be performed using conventional Merrifield solid phase f-Moc or t-Boc chemistry or various solid-phase techniques (Roberge, J. Y, et al. (1995) Science 269: 202-204) and automated synthesis may be achieved, for example, using the ABI
431A Peptide Synthesizer (Perkin Elmer). The newly synthesized peptides) may be substantially purified by preparative high perFormance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, WH Freeman and Co., New York, N.Y.). The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure;
Creighton, supra). Additionally, the amino acid sequences according to the invention, i.e. SEQ ID NO. 3, 24 or 19 or the sequence that is encoded by SEQ ID NO. 1, 4, 17 or 22 or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide.
SCREENING ASSAYS
The invention also concerns a method for screening for agents which are capable of inhibiting the cellular function of the nuclear receptor L66 comprising the steps of contacting one or more candidate agents with a polypeptide according to the invention, removing unbound agents) and' detecting whether the agents) interact with the polypeptide of the nuclear receptor.
_ 23 The invention also concerns method for inhibiting the cellular function of the nuclear receptor L66, comprising the steps of contacting a cell with a binding agent of a polypeptide previously identified as outlined herein whereby the cellular function of L66 is inhibited.
Such a binding agent may be an antibody., RNA. an anti-sense oligonucleotide.
a ribozyme or one of substances shown below or identified in a respective assay as disclosed herein.
In still a further embodiment, the present invention concerns a method for identifying new nuclear receptor inhibitory or stimulatory substances, which may be termed as "candidate substances". It is contemplated that this screening technique proves useful in the general identification of compounds that serve the purpose of inhibiting or stimulating nuclear receptor activity.
In one embodiment of the invention the following subsfiances are disclosed as potential interactors of the nuclear receptor according to the invention:
Steroids: dexamethasone-t-butylacetate, RU486, progesterone, 17-aipha-hydroxyprogesterone, 1,16-alpha dimethylpregnenolone, 17-alpha-hydroxypregnenonlone, pregnenonlone, 5beta-pregnane-3,20-dione, pregnenonlone-16-carbonitrile, 5beta-pregnane-3,20-dione, androstanol, corticosterone, dehydroepiandrosterone, dihydroxytestosterone, estradiol, cortisol, cortisone, dihydroxytestosterone.
Other substances: transnonachlor, chlordane, spironolactone, cyproterone acetate, rifampicin, nefipine, diethylstilbestrol, coumesterol, clotrimazole, lovastatin, phenoarbital, pthalic acid, nonylphenol, 1,4-bis(2-(3,5-dichloropyridyloxy1))benzene, This also includes the use of heteromultimeric complexes of the nuclear receptor with other proteins, such as heterodimeric complexes with RXR, or any other binding partner.
Accordingly, in screening assays to identify pharmaceuticals agents which affect nuclear receptor activity, it is proposed that compounds isolated from natural sources, such as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or even extracts from animal sources, or marine, forest or soil samples, may be assayed for the presence of potentially useful pharmaceutical agents.
It will be understood that that the pharmaceutical agents to be screened could also be derived from chemical compositions or man-made compounds. The candidate substances can could also include monoclonal or polyclonal antibodies, peptides or proteins, such as those derived from recombinant DNA technology or by other means, including chemical peptide synthesis. The active compounds may include fragments or parts or derivatives of naturally-occurring compounds or may be only found as active combinations of known compounds which are otherwise inactive.
We anticipate that such screens will in some cases lead to the isolation of agonists of nuclear receptors, in other cases to the isolation of antagonists. In other instances, substances will be identified that have mixed agonistic and antagonistic effects, or affect nuclear receptors in any other way.
CELL BASED ASSAYS
To identify a candidate substance capable of influencing L66 nuclear receptor activity, one first obtains a recombinant cell line. One designs the cell line in such a way that the activity of the nuclear receptor leads to the expression of a protein which has an easily detectable phenotype ( a reporter), such as luciferase, fluorescent proteins such as green or red fluorescent protein, beta-galactosidase, alpha-galactosidase, beta-lactamase, chloramphenicol-acetyl-transferase, beta-glucuronidase, or any protein which can be detected by a secondary reagent such as an antibody.
Methods for detecting proteins using antibodies, such as ELISA assays, are well known to those skilled in the art.
Here, the amount of reporter protein present reflects the transcriptional activity of the nuclear receptor. This recombinant cell line is then screened for the effect of _25 substances on the expression of the reporters, thus measuring the effect of these substances on the activity of the nuclear receptor. These substances can be derived from natural sources, such as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or even extracts from animal sources, or marine, forest or soil samples, maybe assayed for the presence of potentially useful pharmaceutical agents. It will be understood that that the pharmaceutical agents to be screened may be derived from chemical compositions or man-made compounds.
The candidate substances can also include monoclonal or polyclonal antibodies, peptides or proteins, such as those derived from recombinant DNA technology or by other means, including chemical peptide synthesis. The active compounds may include fragments or parts or derivatives of naturally-occurring compounds or may be only found as active combinations of known compounds which are otherwise inactive.
In general the assay comprises, contacting a suitable cell containing a reporter under the control of the L66 nuclear receptor with a test compound, monitoring said host cell for the expression of the reporter gene, wherein expression of the reporter reflects the transcriptional activity of the nuclear receptor L66, and therefore reflects effects of the compound on the nuclear receptor.
In other embodiments of the invention assays are included where measuring the activity of a dimer of the nuclear receptor L66 and another protein, such as RXR
takes place. Further included are assays aiming at the identification of compounds which specifically influence only the monomeric, homodimeric or homomultimeric form of the nuclear receptor, or influencing only multimeric forms of the nuclear receptor. Such assays include measuring the effect of a compound on the nuclear receptor in the absence of a binding partner, and measuring the effect of a compound on the nuclear receptor in the presence of a binding partner, such as RXR. One skilled in the art will find numerous more assays which are equally covered by the invention.
A cell line where the activity of a nuclear receptor determines the expression of a reporter can be obtained by creating a fusion gene driving the expression of a fusion protein. consisting of the ligand-binding domain of the L66 nuclear receptor fused to the DNA binding domain of a transcription factor with a known specificity for a given DNA sequence (the binding site). This DNA sequence (the binding site) can then be inserted in one or multiple copies before (5') to the promoter driving the expression of the reporter. Transcription factors useful for this approach include bacterial proteins, such as IexA, yeast proteins, such as Gal4, mammalian proteins such as NFkappaB
or NFAT, the glucocorticoid receptor, the estrogen receptor, or plant proteins. The binding sites for these proteins can all be used in combination with the appropriate transcription factor to generate a useful reporter assay.
Another way to screen for inhibitors is to identify binding sites on DNA for the L66 nuclear receptor, and operatively link this binding site to a promoter operatively linked to a reporter gene. Included among others are binding sites for heterodimers of the L66 nuclear receptor with another protein, such as RXR.
Furthermore, transgenic animals described in the invention can be used to derive cell lines useful for cellular screening assays.
Cell lines useful for such an assay include many different kinds of cells, including prokaryotic, animal, fungal, plant and human cells. Yeasfi cells can be used in this assay, including Saccharomyces cerevisiae and Schizosaccharomyces pombe cells.
Another way to build cellular assays to measure the effect of compounds is the use of the yeast two hybrid system (see for example see, for example, U.S. Pat.
No.
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol.
Chem.
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al.
(1993) Oncogene 8:1693-1696; PCT Publication No. WO 94/10300, and U.S. Pat.
No. 5,667,973), and or possible variants of the basic two hybrid system as discussed e.g in Vidal M, Legrain P, Nucleic Acids Res. 1999 Feb 15;27(4):919-29.
Briefly, the two hybrid assay relies on reconstituting in vivo a functional transcriptional activator protein from two separate fusion proteins. In particular, the method makes use of chimeric genes which express hybrid proteins. To illustrate, a first hybrid gene comprises the coding sequence for a DNA-binding domain of a transcriptional activator fused in frame to the coding sequence for a TI polypeptide. The second hybrid protein encodes a transcriptional activation domain fused in frame to a sample gene from a cDNA library. If the bait and sample hybrid proteins are able to interact, e.g., form a TI-dependent complex, they bring into close proximity the two domains of the transcriptional activator. This proximity is sufficient to cause transcription of a reporter gene which is operably linked to a transcriptional regulatory site responsive to the transcriptional activator, and expression of the reporter gene can be detected and used to score for the interaction of the TI and sample proteins.
In such assays, one primarily measures the effect of a compound on a given interaction involving the L66 nuclear receptor and a binding protein. In a preferred embodiment of the invention systems using other hosts such as prokaryotes as E.
coli, or eukaryotic mammalian cells are described.
Two hybrid systems using hybrid protein fusions with other proteins than transcription factors, including enzymes such as beta-galactosidase or dihydrofolate reductase may also be applied. These assays are useful both to monitor the effect of a compound, including peptides, proteins or nucleic acids on an interaction of a nuclear receptor with a given binding partner, as well as to identify novel proteins or nucleic acids interacting with the nuclear receptor.
Monitoring the influence of compounds on cells may be applied not only in basic drug screening, but also in clinical trials. In such clinical trials, the expression of a panel of genes may be used as a "read out" of a particular drug's therapeutic effect.
CELL-FREE ASSAYS
Recombinant forms of the polypeptide according to SEQ ID NO. 3, 24 or 19 or as encoded by the nucleic acids according to the invention can be used in cell-free screening assays aiming at the isolation of compounds affecting the activity of nuclear receptors. In such an assay, the nuclear receptor polypeptide is brought into contact with a substance to test if the substance has an effect on the activity of the L66 receptor.
_ 28 The detection of an interaction between an agent and a receptor may be accomplished through techniques well-known in the art. These techniques include but are not limited to centrifugation, chromatography, electrophoresis and spectroscopy. The use of isotopically labeled reagents in conjunction with these techniques or alone is also contemplated. Commonly used radioactive isotopes include 3H, 14C, . 2ZNa, 32P~ 33P~ 355 45Ca~ 60C0~ 1251, and 1311. Commonly used stable isotopes include '2H, .13 C, 15N, 1s0.
For example, if an agent binds to the receptor of the present invention, the binding may be detected by using radiolabeled agent or radiolabeled receptor. Briefly, if radiolabeled agent or radiolabeled receptor is utilized, the agent-receptor complex may be detected by liquid scintillation or by exposure to x-ray film or phosho-imaging devices.
One way to screen for substances affecting nuclear receptor activity is to measure the effect of the binding of nuclear receptors to ligands, such as cofactors, activators, repressors, DNA, RNA, proteins, antibodies, peptides or other substances, including chemical compounds known to affect receptor activity. Assays measuring the binding of a protein to a ligand are well known in the art, such as ELISA assays, FRET
assays, bandshift assays, plasmon-resonance based assays, scintilllation proximity assays, fluorescence polarization assays.
In one example, a mixture containing the L66 polypeptide, effector and candidate substance is allowed to incubate. The unbound effector is separable from any effector/receptor complex so formed. One then simply measures the amount of each (e.g., versus a control to which no candidate substance has been added). This measurement may be made at various time points where velocity data is desired.
From this, one determines the ability of the candidate substance to alter or modify the function of the receptor.
Numerous techniques are known for separating the effector from efFector/receptor complex, and all such methods are intended to fall within the scope of the invention.
This includes the use of thin layer chromatographic methods (TLC), HPLC, spectrophotometric, gas chromatographiclmass spectrophotometric or NMR
analyses. Another method of separation is to immobilize one of the binding partners on a solid support, and to wash away any unbound material. It is contemplated that any such technique may be employed so long as it is capable of differentiating between the effector and complex, and may be used to determine enzymatic function such as by identifying or quantifying the substrate and product.
A screening assay provides a L66 receptor under conditions suitable for the binding of an agent to the L66 receptor. These conditions include but are not limited to pH, temperature, tonicity, the presence of relevant cofactors, and relevant modifications to the polypeptide such as glycosylation or lipidation. It is contemplated that the receptor can be expressed 'and utilized in a prokaryotic or eukaryotic cell.
The host cell expressing the L66 receptor can be used whole or the receptor can be isolated from the host cell. The L66 receptor can be membrane bound in the membrane of the host cell or it can be free in the cytosol of the host cell. The host cell can also be fractionated into sub-cellular fractions where the receptor can be found. For example, cells expressing the receptor can be fractionated into the nuclei, the endoplasmic reticulum, vesicles, or the membrane surfaces of the cell.
pH is preferably from about a value of 6.0 to a value of about 8.0, more preferably from about a value of about 6.8 to a value of about 7:8, and most preferably, about 7.4. In a preferred embodiment, temperature is from about 20°C degrees to about 50°C degrees more preferably, from about 30°C degrees to about 40°C degrees and even more preferably about 37°C degrees. Osmolality is preferably from about 5 milliosmols per liter (mosm%L) to about 400 mosm/l, and more preferably, from about 200 milliosmols per liter to about 400 mosm/l and, even more preferably from about 290 mosm/L to about 310 mosm/L. The presence of cofactors can be required for the proper functioning of the L66 receptor. Typical cofactors include sodium, potassium, calcium, magnesium, and chloride. In addition, small, non-peptide molecules, known as prosthetic groups may also be required. Other biological conditions needed for receptor function are well-known in the art.
It is well-known in the art that proteins can be reconstituted in artificial membranes, vesicles or liposomes. (Danboldt et a1.,1990). The present invention contemplates that the receptor can be incorporated into artificial membranes, vesicles or liposomes. The reconstituted receptor can be utilized in screening assays.
It is further contemplated that a receptor of the present invention can be coupled to a solid support, e.g., to agarose beads, polyacrylamide beads, polyacrylic, sepharose beads or other solid matrices capable of being coupled to polypeptides. Well-known coupling agents include cyanogen bromide (CNBr), carbonyldiimidazole, tosyl chloride, diaminopimelimidate, and glutaraldehyde.
In a typical screening assay for identifying candidate substances, one employs the same recombinant expression host as the starting source for obtaining the receptor polypeptide, generally prepared in the form of a crude homogenate. Recombinant cells expressing the receptor are washed and homogenized to prepare a crude polypeptide homogenate in a desirable buffer such as disclosed herein. In a typical assay, an amount of polypeptide from the cell homogenate, is placed into a small volume of an appropriate assay buffer at an appropriate pH. Candidate substances, such as agonists and antagonists, are added to the admixture in convenient concentrations and the interaction between the candidate substance and the receptor polypeptide is monitored.
Where one uses an appropriate known substrate for the L66 receptor, one can, in the foregoing manner, obtain a baseline activity for the recombinantly produced receptor. Then, to test for inhibitors or modifiers of the receptor function, one can incorporate into the admixture a candidate substance whose effect on the L66 receptor is unknown. By comparing reactions which are carried out in the presence or absence of the candidate substance, one can then obtain information regarding the effect of the candidate substance on the normal function of the receptor.
Accordingly, this aspect of the present invention will provide those of skill in the art with methodology that allows for the identification of candidate substances having the ability to modify the action of nuclear receptor polypeptides in one or more manners.
Additionally, screening assays for the testing of candidate substances are designed to allow the determination of structure-activity relationships of agonists or antagonists with the receptors, e.g., comparisons of binding between naturally-occurring hormones or other substances capable of interacting with or otherwise modulating the receptor; or comparison of the activity caused by the binding of such molecules to the receptor.
In certain aspects, the polypeptides of the invention are crystallized in order to carry out x-ray crystallographic studies as a means of evaluating interactions with candidate substances or other molecules with the nuclear receptor polypeptide.
For instance, the purified recombinant polypeptides of the invention, when crystallized in a suitable form, are amenable to detection of intra-molecular interactions by x-ray crystallography. In another aspect, the structure of the polypeptides can be determined using nuclear magnetic resonance.
PHARMACEUTICAL COMPOSITION:
This invention provides a pharmaceutical composition comprising an effective amount of a agonist or antagonist drug identified by the method described herein and a pharmaceutically acceptable carrier. Such drugs and carrier can be administered by various routes, for example oral, subcutaneous, intramuscular, intravenous or intracerebral. The preferred route of administration would be oral at daily doses of about 0.01 -100 mg/kg.
This invention provides a method of treating metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases wherein the abnormality is improved by reducing the activity of L66 receptor or blocking the binding of ligands to a L66 receptor, which method comprises administering an effective amount of the antagonist-containing pharmaceutical composition described above to suppress the subject's appetite. Similarly, the invention also provides methods for treating diseases and conditions resulting from metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases, which method comprises administering an effective amount of an agonist-containing pharmaceutical composition described above.
_ 32 TRANSFORMATION OF CELLS AND DRUG SCREENING
The recombinant expression constructs of the present invention are useful in molecular biology to transform cells which do not ordinarily express L66 to thereafter express this receptor.
Such cells are useful as intermediates for making cellular preparations useful for receptor binding assays, which are in turn useful for drug screening. Drugs identified from such receptor assays can be used for the treatment of metabolic disorders, immunological indications, hormonal dysfunctions, and/or neurosystemic diseases.
The recombinant expression constructs of the present invention are also useful in gene therapy. Cloned genes of the present invention, or fragments thereof, may also be used in gene therapy carried out by homologous recombination or site-directed mutagenesis. See generally Thomas & Capecchi, Cell 51, 503-512 (1987);
Bertling, Bioscience Reports 7, 107-112 (1987); Smithies et al., Nature 317, 230-234 (1985).
Oligonucleotides of the present invention are useful as diagnostic tools for probing L66 expression in tissues. For example, tissues are probed in situ with oligonucleotide probes carrying detectable groups by conventional autoradiographic techniques, as explained in greater detail in the Examples below, to investigate native expression of this receptor or pathological conditions relating thereto. Further, chromosomes can be probed to investigate the presence or absence of the L66, and potential pathological conditions related thereto, as also illustrated by the Examples below. Probes according to the invention should generally be at least about 15 nucleotides in length to prevent binding to random sequences, but, under the appropriate circumstances may be smaller (see above for details on hybridization).
POLYPEPTIDE
Another aspect of the invention includes an antibody specifically reactive with the protein or any part of the protein according to the invention (SEQ ID NO. 3, 24 or 19) and or a polypeptide encoded by the nucleotide sequence of the nuclear receptor L66 (see also figures). (The term "antibody" refers to intact molecules as well as fragments thereof, such as Fa, F(ab)2, and Fv, which are capable of binding the epitopic determinant.) By using immunogens derived from the polypeptide according to the invention (SEQ ID NO. 3, 24, 19) and/or encoded by the nucleic acids according to the invention, anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (E, Howell & D. Lane. Antibodies: A
Laboratory Manual. Cold Spring Harbor Laboratory (1988)).
A polyclonal antibody is prepared by immunizing a mammal, such as a mouse, a hamster or rabbit with an immunogenic form of the polypeptide, i.e. the human polypeptide of the present invention, and collecting antisera from that immunized animal. Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for production of polyclonal antibodies.
As an immunizing antigen, fusion proteins, intact polypeptides or fragments containing small peptides of interest can be used. They can be derived by expression from a cDNA transfected in a host cell with subsequent recovering of the protein/peptide or peptides can be synthesized chemically (e.g. oligopeptides with 10-15 residues in length). Important tools for monitoring the function of the gene according to the present invention, i.e. encoded by a sequence according to SEQ ID
NO. 1, 4, 24 or 17 (or portions thereof or splice variants thereof) are antibodies against various domains of the protein according to the invention. Various Oligopeptides from the N- and C-terminal sequences and the DBD/hinge region of the protein can be used as antigens.
A given polypeptide or polynucleotide may vary in its immunogenicity. It is often necessary to couple the immunogen (e.g. the polypeptide) with a carrier, Commonly used carriers that are chemically coupled to peptides include bovine serum albumin (BSA) and keyhole limpet hemocyanin (ICLH). The coupled peptide is then used to immunize the animal in the presence of an adjuvant, a non-specific stimulator of the immune response in order to enhance immunogenicity. The production of polyclonal antibodies is monitored by detection of antibody titers in plasma or serum at various time points following immunization. Standard ELISA or other immunoassays can be 34 _ used with the immunogen as antigen to assess the levels of antibodies. When a desired level of immunogenicity is obtained, the immunized animal may be bled and the serum isolated, stored and purified.
To produce monoclonal antibodies, antibody-producing cells (e.g. spleen cells) from an immunized animal (preferably mouse or rat) are fused by standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to yield hybridoma cells. Where the immunized animal is a mouse, a preferred myeloma cell is the murine NS-1 myeloma cell. Such techniques are well known in the art, and include, for example, the hybridoma technique (originally developed by Kohler &
Milstein.
Nature 256: 495-497 (1975)), the human B cell hybridoma technique (Kozbar et al.
Immunology Today 4:72 (1983)), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al. Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, Inc. pp. 77-96 (1985)).
The fused spleen/myeloma cells are cultured in a selective medium to select fused spleen/myeloma cells from the parental cells. Fused cells are separated from the mixture of non-fused parental cells, for example, by the addition of agents that block the de novo synthesis of nucleotides in the tissue culture media. This culturing provides a population of hybridomas from which specific hybridomas are selected.
Typically, selection of hybridomas is performed by culturing the cells by single-clone dilution in microtiter plates, followed by testing the individual clonal supernatants for reactivity with an antigen-polypeptide. The selected clones may then be propagated indefinitely to provide the monoclonal antibody in convenient quantity.
The creation of antibodies which specifically bind the polypeptide according to the invention (SEQ ID NO. 3, 24 or 19) and/or encoded by the nucleotide sequence of the nuclear receptor L66 or its complement (SEQ ID NO. 1, 4, 17 or 22) provides an important utility in immunolocalization studies, and may play an important role in the diagnosis and treatment of receptor disorders. The antibodies may be employed to identify tissues, organs, and cells which express or the nuclear receptor L66.
Antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate nuclear receptor L66 protein levels in tissue or from cells in bodily fluid as part of a clinical testing procedure.
Monoclonal antibodies provided by the present invention are also produced by recombinant genetic methods well known to those of skill in the art, and the present invention encompasses antibodies made by such methods that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide.
The present invention encompasses fragments of the antibody that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide. Such fragments are produced by any number of methods, including but not limited to proteolytic cleavage, chemical synthesis or preparation of such fragments by means of genetic engineering technology. The present invention also encompasses single-chain antibodies that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide made by methods known to those of skilled in the art.
CHIMERIC ANTIBODIES AND OTHER TYPES OF ANTIBODIES:
The invention also includes chimeric antibodies, comprised of light chain and heavy chain peptides immunologically reactive to an epitope that is a mammalian nuclear L66 receptor protein or peptide. The chimeric antibodies embodied in the present invention include those that are derived from naturally occurring antibodies as well as chimeric antibodies made by means of genetic engineering technology well known to those of skill in the art.
Also included are methods for the generation of antibodies against L66 which rely on the use of phage display systems and related systems, such as described in Hoogenboom HR, de Bruine AP, Hufton SE, Hoet RM, Arends JW, Roovers RC, Immunotechnology 1998 Jun;4(1 ):1-20, and references therein.
The present invention also encompasses an epitope of a mammalian nuclear L66 receptor protein or peptide that is comprised of sequences and/or a conformation of sequences present in the mammalian nuclear L66 receptor protein or peptide molecule. This epitope may be naturally occurring, or may be the result of proteolytic cleavage of the mammalian nuclear L66 receptor protein or peptide molecule and isolation of an epitope-containing peptide or may be obtained by synthesis of an epitope-containing peptide using method of genetic engineering technology and synthesized by genetically engineered prokaryotic or eukaryotic cells.
Antisense oligonucleotides are short single stranded DNA or RNA molecules which may be used to block the availability of the L66 receptor messenger. Synthetic derivatives of ribonucleotides or deoxyribonucleotides and/or PNAs (see above) are equally possible.
The sequence of an antisense oligonucleotide is at least partially complementary to the sequence (or the gene) of interest. The complementarity of the sequence is in any case high enough to enable the antisense oligonucleotide to bind to the nucleic acid according to the invention or parts fihereof..Many examples exist in which the binding of oligonucleotides to the target sequence interFere with the biological function of the targeted sequence (Brysch W, Schlingensiepen KH, Design and application of antisense oligonucleotides in cell culture, in vivo, and as therapeutic agents, Cell Mol Neurobiol 1994 Oct;14(5):557-68; Wagner RW, Gene inhibition using antisense oligodeoxynucleotides, Nature 1994 Nov 24;372(6504):333-5 or Brysch W, Magal E, Louis JC, Kunst M, Klinger l, Schlingensiepen R, Schlingensiepen KH Inhibition of p185c-erbB-2 proto-oncogene expression by antisense oligodeoxynucleotides down-regulates p185-associated tyrosine-kinase activity and strongly inhibits mammary tumor-cell proliferation, Cancer Gene Ther 1994 Jun;1(2):99-105 or Monia BP, Johnston JF, Ecker DJ, Zounes MA, Lima WF, Freier SM Selective inhibition of mutant Ha-ras mRNA expression by antisense oligonucleotides, J Biol Chem 1992 Oct 5;267(28):19954-62 or Bertram J, Palfner K, Killian M, Brysch W, Schlingensiepen KH, Hiddemann W, Kneba M, Reversal of multiple drug resistance in vitro by phosphorothioate oligonucleotides and ribozymes, Anticancer Drugs 1995 Feb;6(1):124-34) This interference occurs in most instances at the level of translation, i.e.
through the inhibition of the translational machinery by oligonucleotides fihat bind to mRNA, however, two other mechanisms of interference with a given gene's function by oligonucleotides can also be envisioned, (i) the functional interference with the transcription of a gene through formation of a triple helix at the level of genomic DNA
and the interference of oligonucleotides with the function of RNA molecules that are executing at least part of their biological function in the untranslated form (Kochetkova M, Shannon MF, Triplex-forming oligonucleotides and their use in the analysis of gene transcription. Methods Mol Biol 2000;130:189-201 Rainer B.
Lanz1, Neil J. McKenna1, Sergio A. Onate1, Urs Albrecht2, Jiemin Wong1, Sophia Y.
Tsai1, Ming-Jer Tsai1, and Bert W. O'Malley A Steroid Receptor Coactivator, SRA, Functions as an RNA and Is Present in an SRC-1 Complex Cell, Vol. 97, 17-27, April, 1999).
Antisense oligonucleotides can be conjugated to different other molecules in order to deliver them to the cell or tissue expressing L66. For instance the antisense oligonucleotide can be conjugated to a carrier protein (e.g. ferritin) in order to direct the oligonucleotide towards the desired target tissue, i.e. in case of ferritin predominantly to the liver.
Antisense expression constructs are expression vector systems that allow the expression - either inducible or uninducible - of a complementary sequence to the L66 sequences according to the invention. The potential possibility of such an approach has been demonstrated in many different model systems (von Ruden T, Gilboa E, Inhibition of human T-cell leukemia virus type I replication in primary human T cells that express antisense RNA, J Virol 1989 Feb;63(2):677-82; Nemir M, Bhattacharyya D, Li X, Singh K, Mukherjee AB, Mukherjee BB, Targeted inhibition of osteopontin expression in the mammary gland causes abnormal morphogenesis and lactation deficiency, J Biol Chem 2000 Jan 14;275(2):969-76; Ma L, Gauville C, Berthois Y, Millot G, Johnson GR, Calvo F,Antisense expression for amphiregulin suppresses tumorigenicity of a transformed human breast epithelial cell line, Oncogene 1999 Nov 11;18(47):6513-20; Refolo LM, Eckman C, Prada CM, Yager D, Sambamurti K, Mehta N, Hardy J, Younkin SG, Antisense-induced reduction of presenilin 1 expression selectively increases the production of amyloid beta42 in _ _ _ ~ 38 transfected cells, J Neurochem 1999 Dec;73(6):2383-8; Buckley NJ, Abogadie FC, Brown DA, Dayrell M, Caulfield MP, Delmas P, Haley JE, Use of antisense expression plasmids to attenuate G-protein expression in primary neurons, Methods Enzymol 2000;314:136-48).
According to the invention an antisense expression construct can be constructed with virtually any expression vector capable of fulfilling at least the basic requirements known to those skilled in the art.
In one embodiment of the invention retroviral expression systems or tissue specific gene expression systems are preferred.
Current standard technologies for delivering antisense constructs are perFormed through a conjugation of constructs with liposomes and related, complex-forming compounds, which are transferred via electroporation techniques or via particle-mediated "gene gun" technologies into the cell. Other techniques may be envisioned by one skilled in the art.
Microinjection still plays a major role in most gene transfer techniques for the generation of germ-line mutants expressing foreign DNA (including antisense RNA
constructs) and is preferred embodiment of the present invention.
Ribozymes are either RNA molecules (Gibson SA, Pellenz C, Hutchison RE, Davey FR, Shillitoe EJ, Induction of apoptosis in oral cancer cells by an anti-bcl-2 ribozyme delivered by an adenovirus vector, Clin Cancer Res 2000 Jan;6(1 ):213-22;
Folini M, Colella G, Villa R, Lualdi S, Daidone MG, Zaffaroni N, Inhibition of Telomerase Activity by a Hammerhead Ribozyme Targeting the RNA Component of Telomerase in Human Melanoma Cells, J Invest Dermatol 2000 Feb;114(2):259-267; Halatsch ME, Schmidt U, Botefur IC, Holland JF, Ohnuma T, Marked inhibition of glioblastoma target cell tumorigenicity in vitro by retrovirus-mediated transfer of a hairpin ribozyme against deletion-mutant epidermal growth factor receptor messenger RNA, J
Neurosurg 2000 Feb;92(2):297-305; Ohmichi T, Kool ET, The virtues of self binding:
high sequence specificity for RNA cleavage by self-processed hammerhead ribozymes, Nucleic Acids Res 2000 Feb 1;28(3):776-783) or DNA molecules (Li J, Zheng W, Kwon AH, Lu Y, In vitro selection and characterization of a highly efficient Zn(II)-dependent RNA-cleaving deoxyribozyme; Nucleic Acids Res 2000 Jan 15;28(2):481-488) that have catalytic activity. The catalytic activity located in one part of the RNA (or DNA) molecule can be "targeted" to a specific sequence of interest by fusing the enzymatically active RNA molecule sequence with a short stretch of RNA
(or DNA) sequence that is complementary to the L66 transcript. Such a construcfi will, when introduced into a cell either physically or via gene transfer of a ribozyme expression construct find the L66 sequence (our sequence of interest) and bind via its sequence-specific part to said sequence. The catalytic activity attached to the consfiruct, usually associated with a special nucleic acid structure (people distinguish so called "hammerhead" structures and "hairpin" structures), will then cleave the targeted RNA. The targeted mRNA will be destroyed and cannot be translated efficiently, thus the protein encoded by the mRNA derived from L66 will not be expressed or at least will be expressed at significantly reduced amounts.
In a preferred embodiment the invention covers inducible ribozyme constructs (Koizumi M, Soukup GA, Kerr JN, Breaker RR, Allosteric selection of ribozymes that respond to the second messengers cGMP and cAMP, Nat Struct Biol 1999 Nov;6(11 ):1062-1071 ).
In a further preferred embodiment the invention concerns the use of "bivalent"
ribozymes (multimers of catalytically active nucleic acids) as described in (Hamada M, Kuwabara T, Warashina M, Nakayama A, Taira K, Specificity of novel allosterically trans- and cis-activated connected maxizymes that are designed to suppress BCR-ABL expression FEBS Lett 1999 Nov 12;461(1-2);77-85).
Also provided by the present invention are non-human transgenic animals grown from germ cells transformed with the L66 nucleic acid sequence according to the invention and fihat express the L66 receptor according to the invention and offspring and descendants thereof. Also provided are transgenic non-human mammals comprising a homologous recombination knockout of the native L66 receptor, as well as transgenic non-human mammals grown from germ cells transformed with nucleic acid antisense to the L66 nucleic acid of the invention and offspring and descendants thereof. Further included as part of the present invention are transgenic animals which the native L66 receptor has been replaced with the human homolog. Of course, ofFspring and descendants of all of the foregoing transgenic animals are also encompassed by the invention.
Transgenic animals according to the invention can be made using well known 10 techniques with the nucleic acids disclosed herein. E.g., Leder et al., U.S. Patent Nos.4,736,866 and 5,175,383; Hogan et al., Manipulating the Mouse Embryo, A
Laboratory Manual (Cold Spring Harbor Laboratory (1986)); Capecchi, Science 244, 1288 (1989); Zimmer and Gruss, Nature 338, 150 (1989); Kuhn et al., Science 269, 1427 (1995); Katsuki et al., Science 241, 593 (1988); Hasty et al., Nature 350, 243 (1991); Stacey et al., Mot. Ceil Biol. 14, 1009 (1994); Hanks et al., Science 269, 679 (1995); and Marx, Science 269, 636 (1995). Such transgenic animals are useful for screening for and determining the physiological effects of L66 receptor agonists and antagonist.
20 Consequently, such transgenic animals are useful for developing drugs to regulate physiological activities in which L66 participates.
The following examples are provided for illustrative purposes only and are not intended, nor should they be construed, as limiting the invention in any manner.
The novel nuclear receptor sequences disclosed herein may be used for various in silico, i.e. compufier analyses. Such analyses may be for example nuclear receptor 30 specific sequence alignments which permit the identification of domains and even new receptors. The novel domain sequences disclosed herein may be used in order to create domain specific hidden markov models (hmms) or simply as search sequences.
_ . _ 41 In a preferred embodiment this similarity search tool is the BLAST algorithm.( Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10 and the sequence used is one of those disclosed herein.
Another search tool that may be used is FASTA (W. R. Pearson and D. J. Lipman (1988), "Improved Tools for Biological Sequence Analysis", PNAS 85:2444- 2448, and W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP
and FASTA" Methods in Enzymology 183:63- 98).
The invention is not limited to one particular type of search tool. In one embodiment of the invention search tools are used that do not search by sequence similarity but by applying sequence profiles such as a profile generated when applying the Profile Hidden Markov Model.
Profile Hidden Markov Models also called "Hidden Markov Models", here abbreviated as HMM, are statistical models representing the consensus of the primary structure of a sequence family. The profiles use scores specific of the position of amino acids (or nucleotides) and position specific scores for the opening or the expansion of an insertion or deletion. Methods for the creation of profiles, starting from multiple alignments, have been introduced by Taylor (1986), Gribskov et al. (1987), Barton (1990) and Heinikoff (1996).
HMMs provide an utterly probabilistic description of profiles, i.e. Bayes' theory rules the positioning of all probability (evaluation) parameters (compare Krogh et al. 1994, Eddy 1996 and Eddy 1998). The central idea behind this is that a HMM is a finite model describing the probability distribution of an infinite number of possible sequences. The HMM consists of a number of states corresponding with the columns of a multiple alignment as it is usually depicted. Each state emits symbols (remainders) corresponding with the probability of the symbol emission (specific of the respective state), and the states are linked with each other by probabilities of the changing of states. Starting from one specific state, a succession of states is generated by changing from one state to the other in accordance with the probability '_ ' 42 of the changing of states, until a final state has been reached. Each state then emits symbols according to the probability distribution of emissions specific of this state, creating an observable sequence of symbols.
The attribute "hidden" has been derived from the fact that the underlying sequence of states cannot be observed. Only the sequence of symbols is visible. An assessment of the probabilities of changing of states and of emissions (the training of the model) is achieved by dynamic programming algorithms implemented in the HMMER
package.
The sequences according to the invention may be aligned with other nuclear receptor sequences in order to create a multiple sequence alignment which is used as the basis for the creation of a HMM.
If an existing HMM and a sequence are given, the probability that the HMM
could generate the sequence in question, can be calculated. The HMMER package provides a numerical quantity (the Score) in proportion to this probability, i.e. the information content of the sequence indicated as bits, measured according to the HMM.
See also Barton, G.J. (1990): Protein multiple alignment and flexible pattern matching, Methods Ezymol. 183: 403-427, Eddy, S.R. (1996): Hidden markov models. Curr. Opin. Strct. Biol. 6: 361-365, Eddy, S.R. (1998): Profile hidden markov models.Bioinformatics. 14: 755-763, Gribskov, M. McLachlan, A.D. and Eisenberg D.
(1987): Profile analysis: Detection of distantly related proteins. Proc. Natl.
Acad. Sci.
USA. 84: 4355-5358, Heinikoff, S. (1996): Scores for sequence searches and alignment, Curr. Opin. Strct. Biol. 6: 353-360, Krogh, A., Brown, M., Mian, I.S., Sjolander, K. and Haussler, D. (1994): Hidden markov models in computational biology: Applications to protein modelling. J. Mol. Biol. 235: 1501-1531, Taylor, W.R.
(1986): Identification of protein sequence homology by consensus template alignment. J. Mol. Biol. 188: 233-258.
In general the sequence are selected such that a query using a search sequence returns a result consisting of sequences which are at least to a certain degree similar to the query sequence.
In one embodiment of the invention amino acid sequences of the present invention are used to model the three-dimensional structure of the protein. Initially, this involves the comparison of the protein sequence with the sequence of related proteins where the structure is known, such as the human PPARy ligand-binding domain (Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R, Rosenfeld MG, Willson TM, Glass CK, Milburn MV,Nature 1998 Sep 10;395(6698):137-43). The three-dimensionale structure can then be modelled using computer programs. From the three-dimensional structure, binding sites of potential inhibitors or activators can be predicted. It can further be predicted which kinds of molecule might bind there. The predicted substances can then be screened to test their effect on nuclear receptor activity.
EXAMPLES
EXAMPLE 1: CLONING AND EXPRESSION OF THE GENE ACCORDING TO THE
INVENTION
Construction of suitable vectors containing the desired coding and control sequences employs standard ligation and restriction techniques that are well undersfiood in the art. Isolated plasmids, DNA sequences, were synthesized oligonucleotides were cleaved, tailored, and religated in the form desired.
Site-specific DNA cleavage was performed by treatment with the suifiable restriction enzyme (or enzymes) under conditions that are generally understood in the art, and the particulars of which are specified by the manufacturer of these commercially available restriction enzymes.
See, e.g., New England Biolabs, Product Catalog. In general, about 1 pg of plasmid and/or DNA sequence was cleaved by one unit of enzyme in about 20 p1 of buffer solution. Often excess of restriction enzyme was used to ensure complete digestion of the DNA substrate. Incubation times of about one hour to two hours at about 37°C
are workable, although variations are tolerable.
After each incubation, protein was removed by extraction with phenol/chloroform, and may be followed by ether extraction. The nucleic acid was recovered from aqueous fractions by precipitation with ethanol. If desired, size separation of the cleaved fragments was performed by polyacrylamide gel or agarose gel electrophoresis using standard techniques. A general description of size separations is found in Methods in Enzymology 65, 499-560 (1980).
Transformed host cells are cells which have been transformed or transfected with recombinant expression constructs made using recombinant DNA techniques and comprising mammalian nuclear receptor L66 encoding sequences. Preferred host cells for transient transfection are COS-7 cells. Transformed host cells may ordinarily express nuclear receptor L66, but host cells transformed for purposes of cloning or amplifying nucleic acid hybridization probe DNA need not express the nuclear receptor. When expressed, the mammalian nuclear L66 receptor protein was typically located in the host cell membrane.
Cultures of cells derived from multicellular organisms are desirable hosts for recombinant nuclear receptor L66 protein synthesis. In principal, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture.
However, mammalian cells are preferred. Propagation of such cells in cell culture has become a routine procedure. See Tissue Culture (Academic Press, Kruse &
Patterson, Eds., 1973). Examples of useful host cell fines are bacteria cells, insect cells, yeast cells, human 293 cells, VERO and HeLa cells, LMTK- cells, and W1138, BHK, COS-7, CV, and MDCK cell lines. Human 293 cells are preferred.
EXAMPLE 2: NUCLEAR RECEPTOR L66 TISSUE LOCALIZATION:
A multiple tissue northern blot (Clontech, Palo Alto) was hybridized to a labeled probe. The blot contained about 03 to 3 pg of poly A RNA derived from various tissues. Hybridization was carried out in a hybridization solution such as one __ - , 4g_ containing SSC (see Maniatis et al, ibid) at an optimized temperature between 50°c and 70°C, preferably 65°C. The filter was washed and a film exposed for signal detection (see also: Maniatis et al., Molecular Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)).
EXAMPLE 3: NUCLEAR RECEPTOR L66 cDNA ISOLATION FROM HUMAN AND
OTHER ORGANISMS:
A cloning strategy was used to clone the L66 receptor cDNA from specific cDNA
libraries (Clontech, Palo Alto) or alternatively, RNA was obtained from various tissues and used to prepare cDNA expression libraries by using for example an Invitrogen kit. (Invitrogen Corporation, San Diego). For the isolation of the L66 cDNA
clone the chosen library was screened under stringent condition (see definitions above) by using an L66 specific probe. The cDNA insert of positive clones was subsequently sequenced and cloned in a suitable expression vector.
Additionally, full length receptor L66 clones from human and mouse (Mus musculus) was obtained by using RACE PCR technology. In brief, suitable cDNA libraries were constructed or purchased. Following reverse transcription, the first strand cDNA was used directly in RACE PCR reactions using a RACE cDNA amplification kit according to the manufactures protocol (Clontech, Palo Alto). Amplified fragments were purified, cloned and subsequently used for sequence analysis.
To obtain information about the genomic organization (see also figures) of the receptor gene, genomic libraries were screened (commercially available clone libraries were used) with a receptor specific probe under stringent conditions.
Positive clones were isolated and the complete DNA sequence of the putative receptor region was determined by sequence analysis (Maniatis et al., Molecular Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)) (human genomic clone deposited at "Deutsche sammlung von Mikroorganismen and Zellkulturen GmbH (DSMZ, International Depositary Authority under the Budapest Treaty in Germany) under number DSM 14483).
EXAMPLE 4: NUCLEAR RECEPTOR L66 LIGAND BINDING ASSAY:
Saturation ligand binding analysis and ligand competition studies are carried out by using an over expressed L66 receptor-protein which is incubated with a labeled potential ligand at different concentrations. The L66 receptor may be immobilized on suitable surFaces. Various competitors are added in the presence of a labeled ligand.
Bound and unbound ligand is separated be using e.g. gel filtration or charcoal based methods or simply removed by a washing step if the receptor protein L66 is immobilized. Receptor-bound ligand is detected by scintilation counting.
EXAMPLE 5: NUCLEAR RECEPTOR L66 EXPRESSION ASSAY:
Relative quantification was performed in multiplex PCR reactions using the ABI
PRISM~ 7700 Sequence Detection System. Total RNA was used as template and was either ordered from Clontech or Ambion in the case of human normal tissue or was isolated from cell lines that were ordered from the DSMZ (German Collection of Microorganisms and Cell Cultures).
Relative quantification was achieved by normalising the results for the presence of 18S rRNA in the samples and subsequently by standardising the corresponding expression data to the expression levels detected in testis (testis = 1 ). A
standard curve was generated by diluting total RNA from testis tissue 1:10 fold, starting from 100 ng input amount and perForming 7 dilution steps. For the detection of target mRNA in other samples the input amount was always 100 ng total RNA, samples were measured in triplicates.
For the detection of 18S rRNA the endogenous control pre-developed assay reagent "Ribosomal RNA control (18S rRNA)" from Applied biosystems was used. For the detection of the target sequence, PCR primers and the probe were designed according to Applied biosystem's specifications using its Primer Express° software.
PCR primers were ordered from Interactiva (forward primer (SEQ ID NO. 14): CGT
GGG CTA ATG AAT TTT ACC AAG; reverse primer (SEQ ID NO. 15): GGC CCC
ATG GAG AAA TAT CAC T). The probe (SEQ ID NO. 16) CCA ATG AGG ATC AAA
CTG CAC TAC AGA AGG G was ordered from Applied Biosystems and was labelled with FAM at the 5'end and contained a quencher (TAMRA) at the 3' end.
EXAMPLE 6: FORMATION OF L66-RXR OR OTHER PROTEIN COMPLEXES
In order to explore the functional properties of L66, the DNA binding properties of L66 are analyzed. It has previously been shown that RXR is a common heterodimeric partner of various members of the nuclear receptor superfamily required for high affinity DNA binding (Hallenbeck et al., PNAS 89: 5572-5576 (1989)), Kliewer et al., EMBO J. 11: 1419-1435 (1992)). It has also been shown that DNA and ligand binding activities of the Dros. melanogaster ecdysan receptor (EcR) require heterodimer formation with RXR or USP (the homologue of RXR) (O'Malley in Endocrinology 125:
1119-1120 (1989)).
Consequently, it is of interest whether L66 can interact with RXR, or with other members of the family.
A two hybrid system is used. CV-1 cells are transiently transfected with cytomegalovirus promoter driven expression vectors containing the yeast GAL4 DNA
binding domain (DBD) alone, GAL4 linked to L66 LBD (LBD; i.e. GAL4-L66) and the 78 amino acid Herpes virus VP16 transactivation domain (VP) linked to the amon terminal end of the LBDs for human (or mouse) RXRa (VP-RXR), mouse PPARgamma (VP-PPAR) VDR (VP-VDR) and others.
All cells are cotransfected with a luciferase reporter construct containing about 4 copies of the yeast GAL4 upstream activating sequence and a ~i-galactosidase expression vector as internal control.
CV-1 cells are grown in DMEM supplemented with 10% AG1-X8 resin-charcoal stripped calf bovine serum, 50 U/ml penicillin G and 50 pg/ml streptomycin sulfate (DMEM-CBS) at 37°C in 5% C02. One day prior to transfection, cells are plated to 50-80% confluence using a phenol-red free DMEM with 10% resin charcoal stripped _ _ _ . 48 fetal bovine serum (DMEM-FBS). Cells are transfected (with reporter construct (300 ng/105 cells), cytomegalovirus driven receptor (100 ng/105 cells) and (i-galactosidase expression vectors (500 ng/10~ cells) by lipofection using N-~2-(2,3)-dioleoyloxy)propyl-N,N,N-trimethyl ammonium methyl sulfate} according to. the manufacturer s instructions (DOTAP, Boehringer Mannheim).
After 2 hours the liposomes are removed and cells treated for 40 hours with phenol-red free DMEM-FBS containing farnesol as the ligand. Cells are harvested and assayed for luciferase and (i-galactosidase activity.
All points are preferentially perFormed in triplicate and should ideally vary less than 10%. Experiments are repeated a 2-4 times. Data points are normalized for differences in transfection efficiency using ~i-galactosidase, and plotted as relative activity where the untreated reporter is defined to have an activity of 1 unit.
Neither the GAL4 DBD, nor the GAL4-L66 chimera should be capable of stimulating.
transcription from a reporter construct containing the GAL4 upstream activating sequence. Similarly, a fusion protein containing the Herpes virus VP16 firansactivation domain linked to the RXRa-LBD (VP-RXR) should be inactive when expressed alone or with the GAL4 DBD. However, when GAL4-L66 and VP-RXR are coexpressed, the reporter could be activated, indicating that L66 and RXRa interact efficiently in cells. Using similar VP16-LBD fusion proteins, interaction can be tested between L66 and receptors for peroxisome proliferators/fatty acids (PPAR), vitamin D3 (VDR), thyroid hormone, (T3R), retinoic acid (RAR), or other members of the nuclear receptor superfamily (see also US Pat. 6,005,086).
FIGURE CAPTIONS:
FIG. 1:
Fig. 1 shows the cDNA sequence of the L66 gene according to the invention. It also shows the reverse complement thereof.
___ _ _ __ _ -- 49 FIG. 2:
Fig. 1 shows the protein sequence of the L66 gene according to the invention.
FIG. 3:
Fig. 3 shows the domain composition of FXR-~i (L66) comprising a so called DNA
binding domain "DBD" and a ligand binding domain "LBD" which are present in members of the nuclear receptor superfamily. Also shown are the respective probability values for the presence of these domains which as can be seen in the figure are extremely high.
' FIG. 4:
Fig, 4 shows the cDNA sequence from L66 from Mus musculus (SEQ ID NO. 17) as well as its reverse complement (SEQ ID NO. 18) and the protein sequence of L66 from Mus musculus (SEQ ID NO. 19).
FIG. 5:
Fig. 5 shows the DNA sequence (NC Fragment) of a splice variant of L66 (SEQ ID
NO. 4). It also shows the reverse complement of the sequence (SEQ ID NO. 5).
FIG. 6: ' Fig. 6 shows the DNA sequence (PolyA1 Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs from the NC Fragment in so far as the poly-A-tail follows the exons 7 and 7B
directly (SEQ ID NO. 7).
FIG. 7:
Fig. 7 shows the DNA sequence (PolyA2 Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs __ _ , 50 ___ from the NC Fragment in so far as the poly-A-tail follows the exons 7 and 7B
directly (SEQ ID NO. 9).
FIG. 8:
Fig. 8 shows the DNA sequence (GF Fragment, C-terminal end) of a splice varianfi of L66. It also shows the reverse complement of the sequence. This differs from the NC
Fragment in so far as exon.8 is not present (SEQ ID NO. 11 ).
FIG. 9:
Fig. 9 shows the DNA sequence (SF Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs from the NC
Fragment in so far as exon 7 and 8 are not present (SEQ ID NO. 13).
FIG. 10:
Fig. 10 is a schematic representation of the exon distribution within the splice variants found.
FIG. 11:
Fig. 11 shows the result of a expression test on various cell lines and tissues.
FIG. 12:
Fig. 12 shows the genomic DNA sequence for L66 from Mus musculus (SEQ ID NO.
20).
__ " _ _ g1 _ FIG. 13:
Fig. 13 shows the reverse complement of the genomic DNA sequence for L66 from Mus musculus (SEQ ID NO. 21 ).
FIG. 14:
Fig. 14 shows a splice variant cDNA sequence of L66 from Mus musculus (SEQ ID
NO. 22), its reverse complement (SEQ ID NO. 23) and the corresponding protein sequence (SEQ ID NO. 24).
FIG. 15:
Fig. 15 lists all the sequences according to the invention, their respective origin ~as well as in which figures they are depicted.
FIG. 16:
Fig. 16 shows a BLASTP (improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements; Nucleic Acids Res. 2001 Jul 15;29(14):2994-3005.) - alignment of L66 against tremblnew (sequence identifier ~AF384555~AF384555) "NR1 H4"; product: "farnesol receptor"
from Homo sapiens farnesol receptor (NR1 H4) mRNA, complete cds, alternatively spliced. //:gp~AF384555~14326451 gene: "NR1H4"; product: "farnesol receptor";
Homo sapiens farnesol receptor (NR1 H4) mRNA, complete cds, alternatively spliced.
The hit is scoring at : 7e-88 (expectation value) for an alignment length (overlap) 389 having 46 % identities (Scoring matrix : BLOSUM62 (used to infer consensus pattern).
FiG. 17:
__ __52 Fig. 17 A shows the intron - exon composition of the mouse L66 gene (numericals).
Fig. 17 B shows the intron = exon composition of spliceform variant 1 of the mouse L,66 gene (numericals).
FIG. 18:
Fig. 18 A shows the intron - exon composition of the mouse L66 gene (sequences).
Fig. 18 B shows the intron - exon composition of spliceform variant 1 of the mouse L66 gene (sequences).
FIG. 19:
Fig. 19 shows a multiple sequence alignment of the L66 C4 zinc finger domain and shows the conservation of this domain.
FIG. 20:
Fig. 20 19 shows a multiple sequence alignment of the L66 protein and demonstrates the conservation of the L66 Ligand Binding Domain (LBD).
L-0020-01-WO-Ol.ST25 SEQUENCE LISTING
<110> LION Bioscience AG
<120> NOVEL MAMMALIAN NUCLEAR RECEPTOR L66 AND METHODS OF USE
<130> L-0020-01-WO-01 <160> 24 <170> PatentIn version 3.0 <210> 1 <211> 1152 <212> JNA
. . <:213=> . Horric.. sa~~i-ens -_ . _ ; . _ _ - . -<220>
<221> n <222> (1)..(1152) <223> n may be a, c, g or t <400> 1 cctggaataaaaaggtccagaccaacctattcttcctcgagaaataagggacaggaagaa60 ttctgtgtagtttgtggtgataaagcatcaccatcaccatatcattataatgcacttacc120 tgtgaaggttgcaaagaaatacctatggtaaaaaattttaaaacttttttattgggtttt180 tttcaatgtagcatcnnncaaaatgcagtatatagttgcaggaatggtagtcactgtgaa240 atggacatgtacatgcgtagaaaatgtcaagagtgcagactgaaaaagtataaggcagta300 ggaatgttggcagaatgtttgctcacagaaatccaatgtaaattaaagagacttcaaaag360 aactttaaggagaagaatcatttttactctaacatcaaagtggaagaggaaggagtagac420 cacagttttctatcatccaccactagacctggaaaagaaagcatggaactaactgaagag480 gaacatcagctcattaataacattgtggctgctcatcaaaaatataccattcctttagaa540 gaaacaaatttgtatctgcaggaacatacaaatcctgaactgagctttttgcaactctca600 gagacagcagtcctacacatacgtgggctaatgaattttaccaaggggctcccaggattt660 gaaaatttggccaatgaggatcaaactgcactacagaagggatcaaaaactgaagtgata720 Sei to L-0020-01-WO-01.ST25 tttctccatggggcccaactttacaatacaatgataatttccatatgtttgattctaccc780 tatgtttggatgaaaatacattttcgtatcagttttttgggtgttactgaagaatttatt840 acannnctgttttacttctacaaaagaatgagcaaacttgatgtaactaatactgaatat900 gctctgcttgcagcaacaattgttttttcagatcgtccatgccttaaaaataagcaatat960 atggaaaatttannngaaccagttttacaaatattgtataagtattcaaaaatgtatcat1020 ccagaagacccannncattttgcccatctcatatggaagcatactgaactgagaactctg1080 aattataaccattcagaaatacttagcacttggaaaacaaaggaccccaaattggctact1140 t~ta;:U:ctct~~ag . _. . , - . .. . . . . 1152.
<210> 2 <211> 1152 <212> DNA
<213> Homo sapiens <220>
<221> n <222> (1)..(1152) <223> n may be a, c> g or t <400> 2 ctcagagagtaaagtagccaatttggggtcctttgttttccaagtgctaagtatttctga60 atggttataattcagagttctcagttcagtatgcttccatatgagatgggcaaaatgnnn120 tgggtcttctggatgatacatttttgaatacttatacaatatttgtaaaactggttcnnn180 taaattttccatatattgcttatttttaaggcatggacgatctgaaaaaacaattgttgc240 tgcaagcagagcatattcagtattagttacatcaagtttgctcattcttttgtagaagta300 aaacagnnntgtaataaattcttcagtaacacccaaaaaactgatacgaaaatgtatttt360 catccaaacatagggtagaatcaaacatatggaaattatcattgtattgtaaagttgggc420 cccatggagaaatatcacttcagtttttgatcccttctgtagtgcagtttgatcctcatt480 ggccaaattttcaaatcctgggagccccttggtaaaattcattagcccacgtatgtgtag540 Seite 2 L-0020-01-WO-01.ST25 gactgctgtc tctgagagttgcaaaaagctcagttcaggatttgtatgttcctgcagata600 caaatttgtt tcttctaaaggaatggtatatttttgatgagcagccacaatgttattaat660 gagctgatgt tcctcttcagttagttccatgctttcttttccaggtctagtggtggatga720 tagaaaactg tggtctactccttcctcttccactttgatgttagagtaaaaatgattctt780 ctccttaaag ttcttttgaagtctctttaatttacattggatttctgtgagcaaacattc840 tgccaacatt cctactgccttatactttttcagtctgcactcttgacattttctacgcat900 gtacatgtcc atttcacagtgactaccattcctgcaactatatactgcattttgnnngat960 gctacatt~a aaaUaacccaataaaaaGgttttaaaattttttaccataggt~tttcttt.1020.
gcaaccttca caggtaagtgcattataatgatatggtgatggtgatgctttatcaccaca1080 aactacacag aattcttcctgtcccttatttctcgaggaagaataggttggtctggacct1140 ttttattcca gg 1152 <210>3 <211>384 <212>PRT
<213>Homo sapiens <220>.
<221>X
<222>(1)..(384) <223>X may be any amino acid <400>
ProGly IleLysArgSerArg ProThrTyr SerSerArg AsnLys Ser GlyGln GluGluPheCysVal ValCysGly LysAlaSer ProSer Asp ProTyr HisTyrAsnAlaLeu ThrCysGlu CysLysGlu IlePro Gly MetVal LysAsnPheLysThr PheLeuLeu PhePheGln CysSer Gly Sei to 3 L-0020-01-WO-01.ST25 Ile Xaa Gln Asn Ala Val Tyr Ser Cys Arg Asn Gly Ser His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Tyr Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Leu Thr Glu Ile Gln Cys Lys Leu Lys Arg Leu Gln Lys Asn Phe Lys Glu Lys Asn His Phe Tyr Ser Asn Ile Lys Val Glu Glu Glu Gly Val Asp His Ser Phe Leu .. ~ 13g . ...' ..: ' . 1. Z5. , :: .. . .. 140, ' ' . . ..
Ser Ser Thr Thr Arg Pro Gly Lys Glu Ser Met Glu Leu Thr Glu Glu Glu His Gln Leu Ile Asn Asn Ile Val Ala Ala His Gln Lys Ty r Thr Tle Pro Leu Glu Glu Thr Asn Leu Tyr Leu Gln Glu His Thr Asn Pro Glu Leu Ser Phe Leu Gln Leu Ser Glu Thr Ala Val Leu His Ile Arg Gly Leu Met Asn Phe Thr Lys Gly Leu Pro Gly Phe Glu Asn Leu Ala Asn Glu Asp Gln Thr Ala Leu Gln Lys Gly Ser Lys Thr Glu Val Ile Phe Leu His Gly Ala Gln Leu Tyr Asn Thr Met Ile Ile Ser Ile Cys Leu Ile Leu Pro Ty r Val Trp Met Lys Ile His Phe Arg Ile Ser Phe Leu Gly Val Thr Glu Glu Phe Iie Thr Xaa Leu Phe Tyr Phe Tyr Lys Arg Met Ser Lys Leu Asp Val Thr Asn Thr Glu Tyr Ala Leu Leu Ala Seite 4 L-0020-01-WO-01.ST25 Ala Thr Ile Ual Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln Tyr Met Glu Asn Leu Xaa Glu Pro Ual Leu Gln Ile Leu Tyr Lys Tyr Ser Lys Met Tyr His Pro Glu Asp Pro Xaa His Phe Ala His Leu Ile Trp Lys His Thr Glu Leu Arg Thr Leu Asn Tyr Asn His Ser Glu Ile Leu Ser Thr Trp Lys Thr Lys Asp Pro Lys Leu Aia Thr Leu Leu Ser Giu ..<L10> 4 - . . ... . ' , . . , ' : - ' - .- , <211> 2077 <212> DNA
<213> Homo sapiens <400> 4 cctggaataa aaaggtccag accaacctat tcttcctcga gaaataaggg acaggaagaa 60 ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 120 tgtgaaggtt gcaaagcatc aacaaaatgc agtatatagt tgcaggaatg gtagtcactg 180 tgaaatggac atgtacatgc gtagaaaatg tcaagagtgc agactgaaaa agtataaggc 240 agtaggaatg ttggcagaat gtttgcacac agaaatccaa tgtaaattaa agagacttca 300 aaagaacttt aaggagaaga atcattttta ctctaacatc aaagtggaag aggaaggagt 360 agaccacagt tttctatcat ccaccactag acctggaaaa gtgattcagg aaagcatgga 420 actaactgaa gaggaacatc agctcattaa taacattgtg gctgctcatc aaaaatatac 480 cattccttta gaagaaacaa atttctgcag gaacatacaa atcctgaact gagctttttg 540 caactctcag agacagcagt cctacacata cgtgggctaa tgaattttac caaggggctc 600 ccaggatttg aaaatttggc caatgaggat caaactgcac tacagaaggg atcaaaaact 660 gaagtgatat ttctccatgg ggcccaactt tacagtcaga aacaatcagc ctctgaaagt 720 tctgtgagaa tattaaatca ttcagattat acaccaaatt gtcacaatag gagtggtgat 780 Seite 5 L- 0020-01-WO-01.ST25 agaagtcttatttgttctatggaaaaattttacaatgaagaatgtccttctactactcta840 attggtttcaagaagctcatctggaaaatggtgataataatagaacttacctcatacagt900 attgtgactactacataaaataatacatagagatcgtccatgccttaaaaataagcaata960 tatggaaaatttacaagaaccagttttacaaatattgtataagtattcaaaaatgtatca1020 tccagaagacccatagcattttgcccatctcatatggaagcatactgaactgagaac 1077 <210> 5 <211> 1077 <212> DNA
<213> Homo Sapiens <400> 5 gttctcagtt cagtatgctt ccatatgaga tgggcaaaat gctatgggtc ttctggatga 60 tacatttttg aatacttata caatatttgt aaaactggtt cttgtaaatt ttccatatat 120 tgcttatttttaaggcatggacgatctctatgtattattttatgtagtagtcacaatact180 gtatgaggtaagttctattattatcaccattttccagatgagcttcttgaaaccaattag240 agtagtagaaggacattcttcattgtaaaatttttccatagaacaaataagacttctatc300 accactcctattgtgacaatttggtgtataatctgaatgatttaatattctcacagaact360 ttcagaggctgattgtttctgactgtaaagttgggccccatggagaaatatcacttcagt420 ttttgatcccttctgtagtgcagtttgatcctcattggccaaattttcaaatcctgggag480 ccccttggta aaattcatta gcccacgtat gtgtaggact gctgtctctg agagttgcaa 540 aaagctcagt tcaggatttg tatgttcctg cagaaatttg tttcttctaa aggaatggta 600 tatttttgat gagcagccac aatgttatta atgagctgat gttcctcttc agttagttcc 660 atgctttcct gaatcacttt tccaggtcta gtggtggatg atagaaaact gtggtctact 720 ccttcctctt ccactttgat gttagagtaa aaatgattct tctccttaaa gttcttttga 780 agtctcttta atttacattg gatttctgtg tgcaaacatt ctgccaacat tcctactgcc 840 ttatactttt tcagtctgca ctcttgacat tttctacgca tgtacatgtc catttcacag 900 Seite 6 L-0020-01-WO-01.ST25 tgactaccat tcctgcaact atatactgca ttttgttgat gctttgcaac cttcacaggt 960 aagtgcatta taatgatatg gtgatggtga tgctttatca ccacaaacta cacagaattc 1020 ttcctgtccc ttatttctcg aggaagaata ggttggtctg gaccttttta ttccagg 1077 <210>6 <211>331 <2l2>DNA
<213>Homo sapiens <220>
<221> n <222> (1)..(331) .. . <223> ~~n., may, be ,~. ~. u:.:or l~ .. ~ . . . . -<400>
cccaactttacagtcagaaacaatcagcctctgaaagttctgtgagaatattaaatcatt60 cagattatacaccaaattgtcacaataggagtggtgatagaagtcttatttgttctatgg120 aaaaattttacaatgaagaatgtccttctactactctaattggtaatatgactcaatatg180 aaatattatattggatgcaaaaatttgtatataatgtttaactttcttatactgctttga240 gatacaatgataatttccatatgtttgattctaccctatgtttggatgaaaatacatttt300 cgtatcngcccaaaaaaaaaaaaaaaaaaaa 331 <210>7 <211>331 <212>DNA
<213>Homo sapiens <220>
<221> n <222> (1)..(331) <223> "N" may be either A. C. G or T
<400> 7 tttttttttt tttttttttt gggcngatac gaaaatgtat tttcatccaa acatagggta 60 gaatcaaaca tatggaaatt atcattgtat ctcaaagcag tataagaaag ttaaacatta 120 Seite 7 L-0020-01-WO-01.ST25 tatacaaatt tttgcatcca atataatatt tcatattgag tcatattacc aattagagta 180 gtagaaggac attcttcatt gtaaaatttt tccatagaac aaataagact tctatcacca 240 ctcctattgt gacaatttgg tgtataatct gaatgattta atattctcac agaactttca 300 gaggctgatt gtttctgact gtaaagttgg g 331 <210>8 <211>311 <212>DNA
<213>Homo Sapiens <4fl0> ... . ...., .:. ~ . .. .
g ..~
cccaactttacagtcagaaacaatcagcctctgaaagttctgtgagaatattaaatcatt60 cagattatacaccaaa,ttgtcacaataggagtggtgatagaagtcttatttgttctatgg120 aaaaattti;acaatgaagaatgtccttctactactctaattggtaatatgactcaatatg180 aaatattatattggatgcaaaaatttgtatataatgtttaactttcttatactgctttga240 gatacaatgataatttccatatgtttgattctaccctatgtttggatgaggaaaaaaaaa300 aaaaaaaaaa a 311 <210>9 <211>311 <212>DNA
<213>Nomo sapiens <400>
ttttttttttttttttttttcctcatccaaacatagggtagaatcaaacatatggaaatt60 atcattgtatctcaaagcagtataagaaagttaaacattatatacaaatttttgcatcca120 atataatatttcatattgagtcatattaccaattagagtagtagaaggacattcttcatt180 gtaaaatttttccatagaacaaataagacttctatcaccactcctattgtgacaatttgg240 tgtataatctgaatgatttaatattctcacagaactttcagaggctgattgtttctgact300 gtaaagttggg 311 Seite 8 L-0020-01-WO-Ol.ST25 <210>10 <211>425 <212>DNA
<27.3>Homo sapi ens <400> IO
ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 tggggcccaa ctttacagtc agaaacaatc agcctctgaa agttctgtga gaatattaaa 120 tcattcagat tatacaccaa attgtcacaa taggagtggt gatagaagtc ttatttgttc 180 tatggaaaaa ttttacaatg aagaatgtcc ttctactact ctaattgatc gtccatgcct 240 taaaaataag-caatatatgg aaaatttaca agaaccagti ttacaaatat-tgtatuagta 30tJ.
ttcaaaaatg tatcatccag aagacccata gcattttgcc catctcatat ggaagcatac 360 tgaactgaga actctgaatt ataaccattc agaaatactt agcacttgga aaacaaagga 420 cccca 425 <210>11 <211>425 <212>DNA
<213>Homo Sapiens <400> 11 tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 ttgaatactt atacaatatt tgtaaaactg gttcttgtaa attttccata tattgcttat 180 ttttaaggca tggacgatca attagagtag tagaaggaca ttcttcattg taaaattttt 240 ccatagaaca aataagactt ctatcaccac tcctattgtg acaatttggt gtataatctg 300 aatgatttaa tattctcaca gaactttcag aggctgattg tttctgactg taaagttggg 360 ccccatggag aaatatcact tcagtttttg atcccttctg tagtgcagtt tgatcctcat 420 tggcc 425 Seite 9 L-0020-01-WO-01.ST25 <210>12 <211>299 <212>DNA
<213>Homo Sapiens <400> 12 .
ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 tggggcccaa ctttacagtc agaaacaatc agcctctgaa aatcgtccat gccttaaaaa 120 taagcaatat atggaaaatt tacaagaacc agttttacaa atattgtata agtattcaaa 180 aatgtatcat ccagaagacc catagcattt tgcccatctc atatggaagc atactgaact 240 gagaactctg aattataacc attcagaaat acttagcact tggaaaacaa aggacccca 299 <210>13 <211>299 <212>DNA
<213>Nomo Sapiens <400> 13 tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 ttgaatactt atacaatatt tgtaaaactg gttcttgtaa attttccata tattgcttat 180 ttttaaggca tggacgattt tcagaggctg attgtttctg actgtaaagt tgggccccat 240 ggagaaatat cacttcagtt tttgatccct tctgtagtgc agtttgatcc tcattggcc 299 <210> 14 <211> 24 <212> DNA
<213> Artificial <400> 14 cgtgggctaa tgaattttac caag 24 <210> 15 <211> 22 <212> DNA
<213> Artificial Seite 10 L-0020-Ol-WO-Ol.ST25 <400> 15 ggccccatgg agaaatatca ct 22 <210>16 <211>31 <212>DNA
<213>Artificial <400> 16 ccaatgagga tcaaactgca ctacagaagg g 31 <210> _ <~.~~> - .. . . _. . . . , . ~, ~:,n <212>
DNA
<213> musculus Mus <400>
atgttaataaaaccagatattttgccagaacaattccattatcagctgtgtgatacagat60 ttccaagaaccaccctattgtcaatattctaccgctcagtttcctccagcgttacagtcc120 ccatctttacaaagtcatttcaacacacatggcttggatccacagtacagtggaggcagt180 , tggtgtggactcgacgctcgagaatctggtcagtccacttatgtggttgttcacgatgat240 gaagatgaattccctggggcacaaaggtgcagagcaacttgttctttacgctggaagggt300 caagatgacatgctctgcatggtctgcggtgataaggcatcaggatatcactacaatgca360 cttacttgtgaggggtgcaaaggctttttccggcgtagcattaccaagaatgcagtgtat420 tcttgcaagaacggtggtcactgtgaaatggacatgtacatgcgcagaaaatgccaagag480 tgcagactgaagaagtgtaaggcggtggggatgttggcagaatgtttgctcacagagatc540 cagtgtaagtcaaagagacttcgcaagaacttcaagcacgggcctgccctgtaccctgcc600 atccaagtggaagatgaaggagcagacaccaaacacgtgtcatccagcaccagatctggg660 aaaggggttcaggacaacatgactctaactcaagaggaacatcggcttctgaataccata720 gtgactgctcaccaaaaatccatgattcccttgggagaaacaagcaaacttctgcaggag780 ggttccaaccccgaactaagttttctgagactctcagaggtatcagtcctgcacatacaa840 Seite L-0020-01-WO-01.ST25 gggctaatgaagtttaccaagggactcccaggatttgaaaatttaaccactgaggatcag900 gctgcattacagaaggcgtcaaaaactgaagtgatgttccttcatgtagcccagctttat960 ggtgggaaagactcaacctctggaagtactatgagaccagcaaagccctcagctgggaca1020 ctagaggtgcataatcctagcgctgatgaaagtgttcattctccggaaaactttctcaag1080 gaaggctacccttcggctcctctaactgatattactaaagaatttattgcctcactatct1140 tacttctaca gaagaatgag tgaacttcat gtatcggata ctgaatatgc tctgcttacg 1200 gcgacaacag tgcttttctc agatcgtcca tgccttaaaa ataagcagca tatagaaaac 1260 ._ ct~~aagaac c~c~tcctgca- acttttg~tt aagttttcaa .aaat gtacca tcca.g~agac ~-13?_~,,~
ccacagcatt tcgcccacct catagggagg cttactgaac tgagaactct gagtcacagc 1380 cactctgaaa tccttcgcat gtggaaaaca aaggacccca ggttggtgat gttattctct 1440 gagaaatggg atctgcactc attttcctga 1470 <210>18 <211>1470 <212>DNA
<213>Mus musculus <400> 18 tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120 cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300' atgaagttca ctcattcttc tgtagaagta agatagtgag gcaataaatt ctttagtaat 360 atcagttaga ggagccgaag ggtagccttc cttgagaaag ttttccggag aatgaacact 420 ttcatcagcg ctaggattat gcacctctag tgtcccagct gagggctttg ctggtctcat 480 agtacttcca gaggttgagt ctttcccacc ataaagctgg gctacatgaa ggaacatcac 540 Seite 12 L-0020-01-WO-01.ST25 ttcagtttttgacgccttctgtaatgcagcctgatcctcagtggttaaattttcaaatcc600 tgggagtcccttggtaaacttcattagcccttgtatgtgcaggactgatacctctgagag660 tctcagaaaacttagttcggggttggaaccctcctgcagaagtttgcttgtttctcccaa720 gggaatcatggatttttggtgagcagtcactatggtattcagaagccgatgttcctcttg780 agttagagtcatgttgtcctgaacccctttcccagatctggtgctggatgacacgtgttt840 ggtgtctgctccttcatcttccacttggatggcagggtacagggcaggcccgtgcttgaa900 gttcttgcgaagtctctttgacttacactggatctctgtgagcaaacattctgccaacat960 ceccaccgcc.:ttu~acttct~tcagt-ctgca-~tcttggcattttctgcgcatg~Lacatgtc..1-t~20. -.. .
catttcacagtgaccaccgttcttgcaagaatacactgcattcttggtaatgctacgccg1080 gaaaaagcctttgcacccctcacaagtaagtgcattgtagtgatatcctgatgccttatc1140 accgcagaccatgcagagcatgtcatcttgacccttccagcgtaaagaacaagttgctct1200 gcacctttgtgccccagggaattcatcttcatcatcgtgaacaaccacataagtggactg1260 accagattctcgagcgtcgagtccacaccaactgcctccactgtactgtggatccaagcc1320 atgtgtgttgaaatgactttgtaaagatggggactgtaacgctggaggaaactgagcggt1380 agaatattgacaatagggtggttcttggaaatctgtatcacacagctgataatggaattg1440 ttctggcaaaatatctggttttattaacat 1470 <210>19 <211>489 <212>PRT
<213>Mus musculus <400> 19 Met Leu Ile Lys Pro Asp Iie Leu Pro Giu G1n Phe His Tyr Gin Leu Cys Asp Thr Asp Phe Gln Glu Pro Pro Tyr Cys Gln Tyr Ser Thr Ala Seite 13 L-0020-01-WO-01.ST25 Gln Phe Pro Pro Ala Leu Gln Ser Pro Ser Leu Gln Ser His Phe Asn Thr His Gly Leu Asp Pro Gln Tyr Ser Gly Gly Ser Trp Cys Gly Leu Asp Ala Arg Glu Ser Gly Gln Ser Thr Tyr Ual Val Ual His Asp Asp Glu Asp Glu Phe Pro Gly Ala Gln Arg Cys Arg Ala Thr Cys Ser Leu Arg Trp Lys Gly Gln Asp Asp Met Leu Cys Met Ual Cys Gly Asp Lys ._ .,qlU ser C;ly iyr:His-..~wr~;'~~n~Ala Leu Thr Cys Glu Gly. Cys.Lys G1~<:.-., ... . ._.
Phe Phe Arg Arg Ser Ile Thr Lys Asn Ala Ual Tyr Ser Cys Lys Asn Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys.Lys Cys Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Leu Thr Glu Ile Gln Cys Lys Ser Lys Arg Leu Arg Lys Asn Phe Lys His Gly Pro Ala Leu Tyr Pro Ala Ile Gln Val Glu Asp Glu Gly Ala Asp Thr Lys His Ual Ser Ser Ser Thr Arg Ser Gly Lys Gly Ual Gln Asp Asn Met Thr Leu Thr Gln Glu Glu His Arg Leu Leu Asn Thr Ile Ual Thr Ala His Gln Lys Ser Met Ile Pro Leu Gly Glu Thr Ser Lys Leu Leu Gln Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser . 260 265 270 Glu Ual Ser Ual Leu His Ile Gln Gly Leu Met Lys Phe Thr Lys Gly Seite 14 L-0020-01-WO-01.ST25 Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gln Ala Ala Leu Gln Lys Ala Ser Lys Thr Glu Ual Met Phe Leu His Ual Ala Gln Leu Tyr Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro Ser Ala Gly Thr Leu Glu Ual His Asn Pro Ser Ala Asp Glu Ser Ual His Ser Pro Glu Asn Phe Leu Lys Glu Gly Tyr Pro Ser Ala Pro Leu Thr Asp Ile Thr Lys Glu Phe Ile Ala Ser Leu Ser Tyr Phe Tyr Arg Arg Met Ser Glu Leu His Val Ser Asp Thr Glu Tyr Ala Leu Leu Thr Ala Thr Thr Ual Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln His Ile Glu Asn Leu Gln Glu Pro Ual Leu Gln Leu Leu Phe Lys Phe Ser Lys Met Tyr His Pro Glu Asp Pro Gln His Phe Ala His Leu Ile Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser His Ser His Ser Glu Ile Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Ual Met Leu Phe Ser Glu Lys Trp Asp Leu His Ser Phe Ser <210>20 <211>16200 <212>DNA
<213>Mus musculus <400> 20 ctttggggtc atagcattta ttaggtattg ttagttccat ttactgtgtt cctattgtgc 60 Seite 15 L-0020-01-WO-01.ST25 tggacggcaa gtcctgagatggaatagatgtgttgaactaactttctgttgagtataaaa120 cattttgagg tttaaaacaattatgaacttttttgaccttttatgatttgttctttttta180 tgttaataaa accagatattttgccagaacaattccattatcagctgtgtgatacagatt240 tccaagaacc accctattgtcaatattctaccgctcagtttcctccagcgttacagtccc300 catctttaca aagtcatttcaacacacatggcttggatccacagtacagtggaggcagtt360 ggtgtggact cgacgctcgagaatctggtcagtccacttatgtggttgttcacgatgatg420 aagatgaatt ccctggggcacaaaggtgcagagcaacttgttctttacgctggaagggtc480 ~- w aa~a~tgacat~gct~tg~ag ttgcgg atuaggcatcagga-tatcactacaa:~gc . 540 tg: tg ac :
ttacttgtga ggggtgcaaaggtaagggtaaggcgcaaccaaacagacagtgctgcgcac600 gcgcacacgt gtgtatgtgtgtgtgcacgtgcgtgtgcaatcgttataaaacatgagaaa660 caaaaccagg aaaaagtatgtctaaagggatcaatctgtgtgatcaaagataggagaata720 aaatgttaag tttaaggttagtttctttcctttcctttcctttcctttcctttcctttcc780 tttcctttcc ctttccctttccctttccctttcccttcctctttctttctttctttcttt840 ctttctttct ttctttctttctttctttctttctttctttctttctttctctctctctct900 ctctctctct ctctctctctctctctctctctctctctctctccctctctctctctctct960 ctctctctct ctctctctctctctctctctctctctctctctctctctctctctctccct1020 ccctccccccctctctctcttttgaagtagagtatcattgtatagccctggctggcctag1080 acctcactatggagatcaggttggctttgaactcacagatacctgcctgtctctgctttc1140 taggtgctgggattaaaggtgtgggccaccatgcctggaccatttctttctttatttttt1200 tttctcctcccttgtcttccccttcgtcctcatcttctcctcctcatctttcttcttttt1260 ttgagaaaaggtctctctatgcatccctggctgtcctggaactcactgtttagaccaggc1320 tgtccttgaactctcagagatccacctacctttgtatccttactgctggcattaagagca1380 ttcactaccaggtctgtctcttcatttttgatgaagactttcacactttactttaaactc1440 Seite 16 L- 0020-01-WO-01.ST25 gatacatata agcaaacgttttccctatttctagaagaatttgatgctggaacaaaacat1500 gacaatgtaa agatctgtaaaccatgacacctttttatgtggttgataaaatttatacac1560 ttggagaaaa tttgaaataaatgacttctatgattctaagtcaataaattttaaaaagtg1620 tttcgtatag aaagagaacaagctgaaagtcgggaagcaagggccagtgtttgcaatgat1680 atttgtaaag agcggaagcctctctgataaataacagttctgaaacagtctaccactcca1740 gaaatggata aacatcgggatgggaatgcaggccagtggtagagcacttagcacctactt1800 ttggatttga ttctagcaccctctgaaacaaaacaaaacaacccaaacaacagaaaacag1860 tttagtcaat gtaacacaaataaacgtgtttgtttagggtaaaattttacctgtaaatct1920 taaacctcaa cactggcaaagaatagccaa~aggatatgaggtgacagagtcctgtggcga198 0 gtagttaggcaggggtggcctgctcttgacggtgacgggttactgttaaaactgagctga2040 agaacccagcacagttttgaaaacctgagctgagctttttcttccttcgttgaactgaaa2100 tggcgaaactccatctttcaagtgtccactgcaagagagtaagtggtggggctttgggac2160 aactcataggagggagagcaggaaatggcaggtattactgtgtaataaggttcgaaggag2220 gaaaagggatgggaactatctggcaagtgctgggaaagggagagagagcatctgcggatg2280 agggagggagggaccacctttgtgggctgcaactcagactgggagggtttagaaaagaga2340 ggcttttctgaattttttctttattttaggtgaccatgtattctctcaaaggaaactcga2400 gagatgcattgtaggatcttgatattttgtataatgcagttaaaaaaaaaattcttatag2460 gtaatttacctatagtagggtagagagttgtcttaatgtgataatgatactccatttaat2520 ctccagacgttataaagaaagacccagcctcaggatgtcaactgaactatactaactata2580 aagaattatagaattagctaactttccagataaatgtatatttaaaaaaatgtcgggaat2640 aaaaaaaaccctatttaattgaaatactctagttagcttgattactttctataaaggtaa2700 aatgttaataaaaatgagaagtaactaaacatatccatgtcaaactttgaccatctttgt2760 tgttaggctttttccggcgtagcattaccaagaatgcagtgtattcttgcaagaacggtg2820 gtcactgtgaaatggacatgtacatgcgcagaaaatgccaagagtgcagactgaagaagt2880 Seite 17 L-0020-01-WO-01.ST25 gtaaggcggt ggggatgttggcagaatgtaagtgccagtgtgttcgtggcttgcttttct2940 gaggggattt aacttttcccagccttaccttcctgacaaggaggacatgatagtggtggt3000 ggtttttcca gggcaagaatcgcccattgcactagggatgtgctgactgacccctgtgat3060 ttccacaaat gtgggaaactcaggtgcatcatttgtgttagtgaaggactgtattcacac3120 tctcctctat ataataaaaatgaatttaaaaattaataaaaaattaaaaatttccaacct3180 tcttgaggac ataaagcattcttcaagagttattttgaacatggccgtattcgtttgtat3240 atgcacacca cacacatatgtacagtaattgcgccacgtggatggcccattatcaattta3300 . agtag tgattccca taaatgt~tgccctgtt~cta to - ctaat~cttgaaa~tragtca3~a0. .
ttgLt t agtttcatct taattagatg ttctctgaga cctgaagagt agttaggcag ttaagagcat 3420 taggagagga cctgggcccg gtccccagct tccacgtggc ccctcacaat catttataac 3480 tcagttcctg agcatcccag gccctcttct gacctctgca ggtatgagtg ttgtgtgcat 3540 acacacatgc aagcaaaaca cacacataaa ataaaagaaa cctccatagt gatttaaaaa 3600 aggtattgta tgtttctgtt acttaaccta aaaatatata ctgccttgaa aagtgcaaag 3660 ttaaagttag tagttcccaa aaagaaagaa agaggaggag aaagaagaga aaaggaagtc 3720 ttccctttct gagaaggggt tctcagagtc ctaacacact acagtcttag gacaattgaa 3780 gggtaactgg atctctggac ccctgacatc ctcatatgta aagtccacta atagtctcac 3840 gagtgtcagg actcatctct gaatatcaac acattagcat atgagcagtg ggagatacaa 3900 ctcagtccaa agaaaccttt gaaaaagttg tcttttattt tattcattaa tttaatttgt 3960 attcaggatc tcactatgta gctgacctgg aacttgctat gtacgccagg ctggcatcaa 4020 actcagagat ccccctgcct ctgccctcca atttctggct caaaagttgt cttttaacct 4080 gtggcaggca atgcatgtga aagggaagtt ggagtcctct gaggaactct atgcataaaa 4140 tgagtcatcc tacacttaaa gaaacttaaa gaaattctgg caaggtgaag ctcgagagag 4200 ttggtgagca gtcacggtga ttggcgagca taagctggtc cgcaccctgc caccattgct 4260 Seite 18 L-0020-01-WO-01.ST25 tggatctgca cacttcatttacctttcagttgtcctaatactgccatgtgttagtgctct4320 gacggcccct cctcctcttccttccctcattctcctttccctttcctcctttcttttcct4380 cttcctcctc ttccccctcttcttcctccttttcctcctcttcgttctcttcttgtggtt4440 ttccttctca tggtcctgggggttgagttcaggtttctcacatcctagacaatctaccac4500 taagctacac ccatagactcctttcctactaaaaaacactgagttgccatggctggtatt4560 caacgtgaga tcatcttgtaacagtgtcttaagtagctggggttataggtgtttactacc4620 atgttcaata ctcaatatgcttaaaatggctttatttttttgtgtattctgttctctgac4680 aaataataaa ataatatagaggggttttaaggtttttgtttatacttcagctcctcatta4740..
gttttttatt ttataggtgtgtttttaaaaatactttgaaaacgacaggtatattaatag4800 ttggatactt ttagtagctg ctaaaagtac ttagagtagt ccagtccagt ccagtacata 4860 cactacagct cacaaccagc tatagtctca cttcaagggg ctatgacccc tctgtgctct 4920 gtgggcacct gcactcacct gtgcatttcc ccatacagac tcatgcaagt aattataaaa 4980 tatttaaaaa caaacaaaaa caatcctaaa ataaatatta tttgcctttg aaaaagaatt 5040 aaatgctttg tctgacatag gaaaaattga cttagcttat ttttaaaaat cctaagatag 5100 tcttttttaa ccttattaat aaaaccagtg ataatagtca tagataatgt aattctacta 5160 ttctatcaaa tgactttttc acttccaaga tgcaactaat gtgtaaatct cttatgtaaa 5220 aatctaaaca taatttggaa atgaaaaatt aattcttaat ctttgttata actattttta 5280 agcagacaag gtgtctatga agggaaaagc agaataggaa atttaataga tgcagacatg 5340 cacgagagat atctaggact atttactcac tgaagaccat tttaatcaaa ccctacagtg 5400 ttgttttccg agccaaaagt acttaatgtg aattctgtaa tgtcacaggt ttgctcacag 5460 agatccagtg taagtcaaag agacttcgca agaacttcaa gcacgggcct gccctgtacc 5520 ctgccatcca agtggaagat gaaggagcag acaccaaaca cgtgtcatcc agcaccagat 5580 ctgggaaagg ggcaagacgc ttcttccata cagtctcttg tgtctttatg aacagtgtgt 5640 aacaagcaca cattacaggc gccctctgtt tccttatagg ttcaggacaa catgactcta 5700 Seite 19 L-0020-02-WO-01.ST25 actcaagagg aacatcggct tctgaatacc atagtgactg ctcaccaaaa atccatgatt 5760 cccttgggag aaacaagcaa acttgtatgt atccattgct ggcaagaaat aatttgtaat 5820 gaatgatatt aggaaatgat agcaagaaca gtaaaaaaga aacaccttag taattatcag 5880 aaagacatac tatgaatcat ttttatttat ttattttttt tatggaccag ctgcaggagg 5940 gttccaaccc cgaactaagt tttctgagac tctcagaggt atcagtcctg cacatacaag 6000 ggctaatgaa gtttaccaag ggactcccag gtaaaattct gtttcattat tttttaagga 6060 tttatttatt tttatgttta tgtgttatgc atgttttgat tgcaattata tattcatatc 6220 atgtacacaa catattgtgg ~aggccagaag aga~tgttag ~tcccctaga actagagt,.ti .. ..6180 cctaccgatg gctgtgaatt accatgtggg cgttggaacc agagtctctg caagagcaag 6240 acaggctctt aactgctgat ctgtctttct agcccccaat ttttaaaaaa gattatttat 6300 ttatttattt atttatttat ttatttattt atttatctga gtacattgta gctgtacaga 6360 tggttgtgag ccttcatgtg gttgttgaga attgaatttt taggacctct gctcactctg 6420 gtcggccccg ctcgttctgg cccaaagatt tcataagtac actgtagctg tcttcaggtg 6480 caccagaaga gggcgttaga tctcgttacg gatggttgcg agccaccgtg tggttgctgg 6540 gacttgaact cagggctttt taggaagagt agtcagtgct cttacccact gagctatctc 6600 tccagcccca aattttcttt tttttaaagg gatagtatat caaacagaaa tttctagttg 6660 tggtactaga tctatcgctc agaggcagag cacctgtcta atgtgtgtga agccctgggt 6720 tttctagcaa tgttgcttct aactaaaaaa aaaaaaaatg tttatttgtc tttgtcaatt 6780 aatcagatga gtgacaatct aaatatttac aaaaattatt ataatgatgt gggctttctt 6840 agatgaaaca agaaacaaag aaggtttcct ctataatact aaacaagtca cagtttgacc 6900 aaaacttata tataactcag ctttattata tgcttagagg tacagaaact ctttacaaaa 6960 ttacattctt caaggctaat gcagctatgt tgatttgtcc ttgctgtaag aaagtgtaac 7020 ctagtgctga taaaagtgcc ccaaaataga aaaaaagaat tttattcaga tttcaattta 7080 Seite 20 L-0020-01-WO-01.ST25 tatactattg ttattcccca gaagcactta gggttaataa gtaaatattt ttaaacattt 7140 aatttatgaa cgttctcttt tgattatctg tttctttatg ctttatttat ttatttattt 7200 atttattttt aaggatttga aaatttaacc actgaggatc aggctgcatt acagaaggcg 7260 tcaaaaactg aagtgatgtt ccttcatgta gcccagcttt atggtgggaa agactcaacc 7320 tctggaagta aaaagaaaaa cagaatttta ttcaacttca agtacacaca gacagacaga 7380 cagacagaca gacacacaca cacacacaca cacacacaca cacacacacg cacgcacgca 7440 cgcacacact tttaaacact gttttttgga aatagactct catgtagtcc aggctatgta 7500 gaactcagta tgtagctaag gaggatctta aagccctgct gtgtctgctc ttacctgcca 7560 accatcagcc tagagcctct atgcctgcct tctatggtgc tggggacaga agccagggtc 7620 tgtgctcagt aagccagaac tcctaactcc taacatcatc agcctttcat ttatcttttc 7680 attttatttt accagttagt gtaggaacaa atgtctcata gtagctgttt ataaatagtt 7740 tcacattcat ctagcatttt ctctctaccg tgtaattctt ttaaaaattg tatttcttat 7800 tcttgtgaga actgtataca tgtcctaagt gttttttttt ttaaatctta ctcacctcac 7860 ttccttccca ccagctcctt tagaatcccc cacacacctc tctcctaact tcatgtcctt 7920 ccctgctcct tctttttcaa aataacccat tgactctaaa tagtgctgct catatacacg 7980 ggggtggggg tagggtcatc cactgaagca tggccaactt agagggctca gttccctaaa 8040 gagactggat tgtccttccc ctagcagctg ccaacagctt ctcagctgtg ggtggggctt 8100 ctgtgggtgg ggcttctgtg ctccttagac aggctcttcc aatgaggagg acttttatgg 8160 gtgatcaaac ataaaggcgt tggtgattct aagacacaaa gcagctctgg ggacggccgc 8220 atgcttgctt tgaaactcat aaccacgggc tactgtttgt ggaacgtctc tttactgggg 8280 tgaagcccta ttaggttagt gttgaaccaa ctatgcagga cctcacttca ataaaaaaaa 8340 agcaagtatt ttttaaaaaa tcaataaacc atttgaatag agcaggaaag ccatcattgt 8400 ttagtgtgtg aatgcaagag tccaatctgt aaagtcgcat gttggttgag gacatgggaa 8460 gaggcaatag gacaaaatca aaaccaaaag ccaacaatgc tagctgcctt cactagctgt 8520 Sei to 21 L-0020-01-WO-01.ST25 actcgttacagtgcacagccttttaaaatatccttatttattgattgttctctgcaaaca 8580 .
ctagaccagaaggctctaaagggaggcccttgatgacctggtccatgattgcttgagtag 8640 tgtttgcttaaaagtctccacagcctctggtgcactgagtggttcgtggagggtgtgtgt 8700 gtttaagaaagccattcttagtttctatttcattatgcaaaactcattttggagggtctt 8760 ctgtacaatcttttttttttttttacaaaatatgaactaatatctgttttgtgttccagt 8820 aacaaaaatgagaatgatatcattttaggagtttttttttcccccacataatagttatgc 8880 tggatactgaaaagatactgaatttactatttatataatgattaagaaagactatcaaca 8940 tt tgtgtttctcttttgctai~gt~ctatgagaccagcaaagcc~tcag- ctggguc~ct?000: _ ..
tt ~ ..
agaggtgcat aatcctagcg ctgatgaaag tgttcattct ccggaaaact ttctcaagga 9060 aggctaccct tcggctcctc taactggtaa cactgtgcag taggaagcat caccttgggt 9120 ggacacgctt ccttttctcc ccttttcatt gcactacaaa gagaaggcca tgggctttct 9180 cacagctcac aggcgtctct gtggtggtga aagcatttat agacacagcc aatacagttg 9240 ttagtgttgg caggtcatca gtgagaattc tatttgtaac cattttctta ctttggcttc 9300 atcatggcaa ttctttccct atttaattgc gaagccattt ggggcctctg aattattctt 9360 aggatgatga tgtaatctac agccaacatc ccagtatcaa ccacttttat gggtagaagg 9420 gtaagcctta ctgtagataa tagagataca cggagagtgg cttctgtagg ctaaaggctc 9480 tcagttccac agcctcctcc ttggtccacg tgatgacaac ctctctttgt tctggtcctt 9540 ttcagtagca cacctaatgc gcccactctt gaggttcagg cctgggcttc aagggcacaa 9600 caaaggggat tgtctgtgcc ggccatcaca tcggtagcca tcttgaaccc cactctcttt 9660 tcaattttct tacactgttt gagagaagaa aagacaatca caatgtgttt gcttctgtgc 9720 tccattccga tgaaaattca cttatattaa tcttttcaga tattactaaa gaatttattg 9780 cctcactatc ttacttctac agaagaatga gtgaacttca tgtatcggat actgaatatg 9840 ctctgcttac ggcgacaaca gtgcttttct caggtacaga ctgagaagta aggctcggag 9900 Seite 22 L-0020-01-WO-01.ST25 tgatctctgg acaggcaatc actaagttct ttttatcgaa ctatgaactg aagttgaatg 9960 taccactctc atcccatcat gctttgctct ctatgtgcat tcctttaagc ttaccactgg 10020 gttacatctt tcttgcatct taaagaaatg ctttcttgcc ctcatagccc ctcagatata 10080 cgctcccagt ccctgataga gagaggttct cagggagtta gctatattct agtttccacc 10140 ttactcatgc tcactcctga gttcaccaca gtcacatttc actccaccct ctccagtcca 10200 ctacaacctg tcaggatctt caggacatgc tatcactgac tcaactggtc aatgctctgt 10260 cttcatctta cttgccccac tggtgtattt tgtagagagt atcataagca tctgaaagaa 10320 aagaagggga tatgttgttt cctgtgagcc cttcagtgga attttgaagc acggtgaaga 10380 tgtctgatgt gttaggagaa tttctcatgc ttctttatgt cagttcacag tcttcttgac 10440 tatgcgttac aaaccggacc caggttgaag tgagacagtt acttagcatg ggtaacccat 10500 tgctcaagcc agcgacgtct tcctagggga ccattgtcct gtgtccttca ccgattcggc 10560 catgttt,gtg ttcttttata ttctaacttg ctgtgatatc agggttcgtg aatgttcact 10620 aggtatgtgc atctgaagtt gaatcacata ggctccagag cagctccatt atattgtcca 10680 cttttattat gctttattcc aagaagtttt ggaagcaact gtgtgacaaa caggataaag 10740 gatgtagaag gctggaggga cagcagtgtt ggctatcatc tgaatctgaa tatttggaaa 10800 atcagctcag actgttacct agggacaggt cactctggtt ggaccataaa actcttcttt 10860 ctttcagttc agtgtaggga attaagctac aaattctgga tggtgcagta ccttagagcc 10920 ctggtgttct ggctgtgtga ggctttcccc atctctcagg agtgggactg ataacgcctg 10980 tggccatcat cctttatcct tcttggtggg tcttttaaaa caaggactgt gtcatgctct 11040 ctacatttaa gaatttcaat ggggccacag gcgcccctag attataaagt tctcataaag 11100 atatgttcat cacatccctt cagctgatct ttgggtttgc agggctaata tttgattccc 11160 aacaatttac cttttcataa tcttgccatt tctacttact gtttagatgg aaaagagtaa 11220 gttctccttg tagactgcag tccacctaac aacttgcctc cagccttgtt aacagtacca 11280 caagcagaac taaaatcatg caaagaattg cagaagcatg gcagtctcag gaagtttcca 1134D
Seite 23 L-0020-01-WO-01.ST25 ctccaaaaaa accccaaaac aaaaacaaaa acaaaaacaa aaacaaaaaa ccccacccac 11400 tcactaactc acataggtgg tttgtttcct tttgttggac atttacaatt ctatctatct 11460 atctatctat ctatctatct atctatctat ctatctatct atctatctat ctatctatcc 11520 tactatctat ctatctatct atctatctat ctatctatct atctatctat ctatctatct 11580 atctacctac ctacctatct atctatctat ctatctatct atctatctat cctactatct 11640 atctatctat ctatctatct atctatctat ctatctatct atccatccac tgtgtgtgtg 11700 tacatgtcat gcttgtggag gtcagaggac aacctatggg aattgtttct ctctttccac 11760 c~tgtgtgtt ctggggttc-a g~tagtcagg Ctgggtgata-gcatcttcec tgctgggctaw.11820. . -, tcccactggc agctttgttg tttcacggaa tctctggaac atcaccctct taatttctct 11880 gttcattggc tgttctttct gtgccttctt tattctgtca tcttcgtctc tcacaatgac 11940 aaatatttaa gttgtcctag gacccagttt tcttttaaat tattttatga ttttatgcac 12000 atgtttcgcc tgcatctatg caccatggtg tgcctcacgc tgctgagttc agaagatatc 12060 aggtcccttg ggacttgact tacagaggtt gtgacaaggc ccagttttct tctgcactgg 12120 cttccttcct gattccatcc taactcttgg cattaaaggc ctgtgtggat tcgatactcc 12180 cagacgtatc tgtgtgctcc cccagaatgt attaaatagc ctcctcaagt tttccatgta 12240 accgtcaaga aggtgtatca aatttaaaat gcccaaagct ggactcttga cacctttata 12300 ccttagcttt tggtattcct cccccatctg gaccagacca tagcttcgtc ttccaaattg 12360 catgtgcaac tttctttttt aatccttagt ttcttttctt tctcttatat tcatatcaca 12420 atgcacccgt tctgtttatt caaccctaag atatacccag cacttaaccg cttcctaccg 12480 tgtatagctt gggcagcgta ggtgaatcca cttttctaac tggattttct gactctatgc 12540 tttatagact ctctctgtat tagtacacag aggggttccc tggaaatata agtcagatga 12600 tggtatgcct ctggaactgt cgatggcttc ctagtttgca tagattaaaa ccaaaggtcc 12660 gggggggggg ggctggagag atggctcagt ggttaagagc agtgactgct cttccaaagg 12720 Seite 24 L-0020-01-WO-01.ST25 tcctgagttc aattcccagc aaccatatgg tggctcacaa ccatctgtaa tgagatctga 12780 ctccatcttc tggagtgtct gaagacagct acagtgtact cacatataat aaaaataaat 12840 aaatcttaaa aaaaaaaacc aaaggtcctt ctattggtct agtttgtctt gctccttttt 12900 catcctcgcc catccacatt gctttgacac aggtttccct tgatctcgca ggtttctgct 12960 acatgctctg ccatttttga tgcatctttt gccccctttc cttggatgcc cacatgatgc 13020 catcttacac tgttttgcca ttttatctac ttgttttttt cttttacctt tttgggtttg 13080 ggttaaatgt tgttgtatcc ctgattaatc tttacaaata gctaccaccc ccccccaaga 13140 gtcttattta tttctgatta ctttttttaa tggcacatga ttattttgtt tttgcttctt 13200 taaaaaaacc tttttcattc attgataaaa agcacttact ttccttgatg tgtatgtctc 13260 ctttgtctaa atgtgcaccc cttgtgtgct gtgagtccac aagaaggtgt catatcccct 13320 ggaactggag ttacagatgg ccgtgagcca ccatgtgggt gtggaaatca aaccccagta 13380 cttgggaaga gcagccagtg ctcttagctg ttgataggtc tttccagact gtttttactt 13440 ctgaagatgt gtgtaagtct ctccctctcc acttcttcca cattaagaat ataaggacaa 13500 taaagggaca caatcttgcc tgtgagtttc attgtggtat agttagcaag acgactggat 13560 gaatgaaggc tgcctaagcc ttcggaactt cctggttagc tcatttctta cgtaggaacc 13620 caagaagtct cacgctttac ctctgttgac tgtagagagg ttacttacta ctattttaaa 13680 gtttcaggca agcgtttatg gctgttatat ccccaacctt actcaccgat ccctttcttc 13740 ccatgtgact ttatggaaat tacttaactc cctacatttc agggagctaa tctgaaaaaa 13800 atgtgataaa aatagaatgt agctcaatag aatatagtga agattaagta aagtcatgaa 13860 caaaaagaat ttagttagaa aacctaatgt gtattagggt tctgcaagta cttatgattt 13920 tataactctg ttaataactc attgttttaa gtgcattcat tattcccacc actttgaatc 13980 attattaaga taaatatgat gtttttcaga tcgtccatgc cttaaaaata agcagcatat 14040 agaaaaccta caagaaccag tcctgcaact tttgtttaag ttttcaaaaa tgtaccatcc 14100 agaagaccca cagcatttcg cccacctcat agggaggctt actgaactga gaactctgag 14160 Seite 25 L-0020-01-WO-01.ST25 tcacagccac tctgaaatcc ttcgcatgtg gaaaacaaag gaccccaggt tggtgatgtt 14220 attctctgag aaatgggatc tgcactcatt ttcctgaaaa tttacagtgg tgagacttgc 14280 cttaaaattt caaaatcaaa attggatgaa tgatttgaaa aaaatgtgat aactcaaaat 14340 cattgagtta tcacacacca ccttaaaatg gagatccaaa aggctcatca gggtttgtgg 14400 aactttcaac tcttagaaga tgaacactcc aggattcatc tcaagattgg atgcaaagga 14460 aagaaaaatg gagccatgtg gtggcatagg ctctgtagtt gagagacaga gacaagagga 14520 ccccatgttt aaggcagtcc tgggctatgt agtgagacct catctcagaa gaaaaaagag 14580 _ .. ua~aactgat ggcatatt atgtctceaa tcetcaccta a:~gaaactaa t~ctTTtgtt,g ~~5~:~-.
ttgttgttta gctgaaacac ataatgtgta aatgtgaaca caggtaggtt atggttagga 14700 taaggcagtt tgtatgcttc actggttaag gacacatgga tggtgtagca gctaatgctt 14760 agaacctctt cttcatgaat gattaacgag gaacaagcct tttcaaattg tatctgcttt 14820 atatttttaa taagtatgtg ataattgtac agaatagtca atattgtaca gaatatttca 14880 atatgggaat ttcatgtgta tatattagat atttctggcg tatttgccac ccaccctgtg 14940 atgcccacac tgccaatgcc cttctcatcc ccctaacatt tcccttcctc tgccttgccc 15000 taaaacacca gattcatacg tgcaagaaaa cattttttaa acaaatttta ttagatattt 15060 tcttcattta caattcaaat gctatcccaa aagtccctta ttccctcccc ttatcctgct 15120 ctccaaccca cccacttctg cttcctggcc ctggtattcc cctgtattgg ggtatataat 15180 ctttgcaaga ccaagggcct ctcctcccaa cgatggtcca ctaggccatc tgcaagaaaa 15240 catttgattc cagtctgagt ctgttttatt tcctttgagt atgactatct cgtcaatatc 15300 tgcaaatgac ataatttcat tcctcttaag gatggaaagg actggattgt gtatgtatat 15360 ctgtatatat tattttattt atccacctgc tgatgacacc tagtttgatt ctgtatcttg 15420 gccactgtga atcgtgctat aataaaccta tgtaggtttt tagattttat gtggctcttg 15480 attctttcag gtgtactccc agaagtggta tggtttatat ggtagctcta tttttagatt 15540 Seite 26 L-0020-01-WO-01.ST25 tgaagggaac tatgtagtga ttttcacagt gattaccttc ttctcaatag tgtataaaga 15600 ttaatttgcc ttcaaaattc ttgccaattt ttgtttttcc ctttcctttt ctttttaatt 15660 aaaaaagtgt gtgtgtgtgt gcgtgcatgc caaagaacat tgtggacatc ggaggacagc 15720 tctgtggaat acattctctc cttccacctt tatgggtggt ctcctctgac cccctctgcc 15780 atggcctgca tggcttggtt ctggaagcct gattggggtg atgagcgaat gaagaaatga 15840 cagacatgaa gacacaagta cagaaaagct gagattgggt ggtgtctaag tttctttttt 15900 attgctataa agagacacca ggaccttatg aaagaaagca ttaaattggg ggcttgctta 15960 cagtcttaaa gattgcgtcc atgatgatca tggcaggtag catggcacca ggcatgcact 1602 0 ggaacagtat ctgaagagct tacattctga tcggcaagtt ggaggcaggg ggtgggggga 16080 gaagaaagtg gctgatgggg cctgggacag gatttggaaa tctcaaagcc catccctagt 16140 gacacacctc ctccaacgag gccacaccca gtcctcctca aataggtcca ccaagtaggg 16200 <210> 21 <211> 16200 <212> DNA
<213> Mus musculus <400> 21 ccctacttgg tggacctatt tgaggaggac tgggtgtggc ctcgttggag gaggtgtgtc 60 actagggatg ggctttgaga tttccaaatc ctgtcccagg ccccatcagc cactttcttc 120 tccccccacc ccctgcctcc aacttgccga tcagaatgta agctcttcag atactgttcc 180 agtgcatgcctggtgccatgctacctgccatgatcatcatggacgcaatctttaagactg240 taagcaagcccccaatttaatgctttctttcataaggtcctggtgtctctttatagcaat300 aaaaaagaaacttagacaccacccaatctcagcttttctgtacttgtgtcttcatgtctg360 tcatttcttcattcgctcatcaccccdatcaggcttccagaaccaagccatgcaggccat420 ggcagagggggtcagaggagaccacccataaaggtggaaggagagaatgtattccacaga480 gctgtcctccgatgtccacaatgttctttggcatgcacgcacacacacacacactttttt540 Sei to 27 L-0020-01-WO-01.ST25 aattaaaaag aaaaggaaag ggaaaaacaa aaattggcaa gaattttgaa ggcaaattaa 600 tctttataca ctattgagaa gaaggtaatc actgtgaaaa tcactacata gttcccttca 660 aatctaaaaa tagagctacc atataaacca taccacttct gggagtacac ctgaaagaat 720 caagagccac ataaaatcta aaaacctaca taggtttatt atagcacgat tcacagtggc 780 caagatacag aatcaaacta ggtgtcatca gcaggtggat aaataaaata atatatacag 840 atatacatac acaatccagt cctttccatc cttaagagga atgaaattat gtcatttgca 900 gatattgacg agatagtcat actcaaagga aataaaacag actcagactg gaatcaaatg 960 ttttcttgca gatggcctag tggaccatcg ttgggaggag aggcccttgg tcttgcaaag 1020 attatatacc ccaatacagg ggaataccag ggccaggaag cagaagtggg tgggttggag 1080 agcaggataa ggggagggaa taagggactt ttgggatagc atttgaattg taaatgaaga 1140 aaatatctaa taaaatttgt ttaaaaaatg ttttcttgca cgtatgaatc tggtgtttta 1200 gggcaaggca gaggaaggga aatgttaggg ggatgagaag ggcattggca gtgtgggcat 1260 cacagggtgg gtggcaaata cgccagaaat atctaatata tacacatgaa attcccatat 1320 tgaaatattc tgtacaatat tgactattct gtacaattat cacatactta ttaaaaatat 1380 aaagcagata caatttgaaa aggcttgttc ctcgttaatc attcatgaag aagaggttct 1440 aagcattagc tgctacacca tccatgtgtc cttaaccagt gaagcataca aactgcctta 1500 tcctaaccat aacctacctg tgttcacatt tacacattat gtgtttcagc taaacaacaa 1560 caacaaaaaa ttagtttctt taggtgagga ttggagacat aatgatggcc atcagttctt 1620 ctcttttttc ttctgagatg aggtctcact acatagccca ggactgcctt aaacatgggg 1680 tcctcttgtc tctgtctctc aactacagag cctatgccac cacatggctc catttttctt 1740 tcctttgcat ccaatcttga gatgaatcct ggagtgttca tcttctaaga gttgaaagtt 1800 ccacaaaccc tgatgagcct tttggatctc cattttaagg tggtgtgtga taactcaatg 1860 attttgagtt atcacatttt tttcaaatca ttcatccaat tttgattttg aaattttaag 1920 gcaagtctca ccactgtaaa ttttcaggaa aatgagtgca gatcccattt ctcagagaat 1980 Seite 28 L-0020-01-WO-01.ST25 aacatcacca acctggggtc ctttgttttc cacatgcgaa ggatttcaga gtggctgtga 2040 ctcagagttc tcagttcagt aagcctccct atgaggtggg cgaaatgctg tgggtcttct 2100 ggatggtaca tttttgaaaa cttaaacaaa agttgcagga ctggttcttg taggttttct 2160 atatgctgct tatttttaag gcatggacga tctgaaaaac atcatattta tcttaataat 2220 gattcaaagt ggtgggaata atgaatgcac ttaaaacaat gagttattaa cagagttata 2280 aaatcataag tacttgcaga accctaatac acattaggtt ttctaactaa attctttttg 2340 ttcatgactt tacttaatct tcactatatt ctattgagct acattctatt tttatcacat 2400 ttttttca~a tta~ctccct gaaatgtagg gag~taagta atttccataa agtc~catgg 2450 _ .
gaagaaaggg atcggtgagt aaggttgggg atataacagc cataaacgct tgcctgaaac 2520 tttaaaatag tagtaagtaa cctctctaca gtcaacagag gtaaagcgtg agacttcttg 2580 ggttcctacg taagaaatga gctaaccagg aagttccgaa ggcttaggca gccttcattc 2640 atccagtcgt cttgctaact ataccacaat gaaactcaca ggcaagattg tgtcccttta 2700 ttgtccttat attcttaatg tggaagaagt ggagagggag agacttacac acatcttcag 2760 aagtaaaaac agtctggaaa gacctatcaa cagctaagag cactggctgc tcttcccaag 2820 tactggggtt tgatttccac acccacatgg tggctcacgg ccatctgtaa ctccagttcc 2880 aggggatatg acaccttctt gtggactcac agcacacaag gggtgcacat ttagacaaag 2940 gagacataca catcaaggaa agtaagtgct ttttatcaat gaatgaaaaa ggttttttta 3000 aagaagcaaa aacaaaataa tcatgtgcca ttaaaaaaag taatcagaaa taaataagac 3060 tcttgggggg gggtggtagc tatttgtaaa gattaatcag ggatacaaca acatttaacc 3120 caaacccaaa aaggtaaaag aaaaaaacaa gtagataaaa tggcaaaaca gtgtaagatg 3180 gcatcatgtg ggcatccaag gaaagggggc aaaagatgca tcaaaaatgg cagagcatgt 3240 agcagaaacc tgcgagatca agggaaacct gtgtcaaagc aatgtggatg ggcgaggatg 3300 aaaaaggagc aagacaaact agaccaatag aaggaccttt ggtttttttt tttaagattt 3360 Seite 29 L-0020-Ol-WO-01.ST25 atttattttt attatatgtgagtacactgtagctgtcttcagacactccagaagatggag3420 tcagatctca ttacagatggttgtgagccaccatatggttgctgggaattgaactcagga3480 cctttggaag agcagtcactgctcttaaccactgagccatctctccagcccccccccccc3540 ggacctttgg ttttaatctatgcaaactaggaagccatcgacagttccagaggcatacca3600 tcatctgact tatatttccagggaacccctctgtgtactaatacagagagagtctataaa3660 gcatagagtc agaaaatccagttagaaaagtggattcacctacgctgcccaagctataca3720 cggtaggaag cggttaagtgctgggtatatcttagggttgaataaacagaacgggtgcat3780 tgtgatatga atataagagaaagaaaagaaactaaggattaaaaaagaaagttgcacatg3840 caatttggaa gacgaagctatggtctggtccagatgggggaggaataccaaaagctaagg3900 tataaaggtg tcaagagtccagctttgggcattttaaatttgatacaccttcttgacggt3960 tacatggaaaacttgaggaggctatttaatacattctgggggagcacacagat~cgtctg4020 ggagtatcgaatccacacaggcctttaatgccaagagttaggatggaatcag~~aggaag4080 ccagtgcagaagaaaactgggccttgtcacaacctctgtaagtcaagtcccaagggacct4140 gatatcttctgaactcagcagcgtgaggcacaccatggtgcatagatgcaggcgaaacat4200 gtgcataaaatcataaaataatttaaaagaaaactgggtcctaggacaacttaaatattt4260 gtcattgtgagagacgaagatgacagaataaagaaggcacagaaagaacagccaatgaac4320 agagaaattaagagggtgatgttccagagattccgtgaaacaacaaagctgccagtggga4380 tagcccagcagggaagatgctatcacccagcctgactacctgaaccccagaacacacatg4440 gtggaaagagagaaacaattcccataggttgtcctctgacctccacaagcatgacatgta4500 cacacacacagtggatggatagatagatagatagatagatagatagatagatagatagat4560 agatagtaggatagatagatagatagatagatagatagatagataggtaggtaggtagat4620 agatagatagatagatagatagatagatagatagatagatagatagatagatagatagta4680 ggatagatagatagatagatagatagatagatagatagatagatagatagatagatagat4740 agatagatagaattgtaaatgtccaacaaaaggaaacaaaccacctatgtgagttagtga4800 Seite 30 L-0020-Ol-WO-01.ST25 gtgggtgggg ttttttgttt ttgtttttgt ttttgttttt gttttggggt ttttttggag 4860 tggaaacttc ctgagactgc catgcttctg caattctttg catgatttta gttctgcttg 4920 tggtactgtt aacaaggctg gaggcaagtt gttaggtgga ctgcagtcta caaggagaac 4980 ttactctttt ccatctaaac agtaagtaga aatggcaaga ttatgaaaag gtaaattgtt 5040 gggaatcaaa tattagccct gcaaacccaa agatcagctg aagggatgtg atgaacatat 5100 ctttatgaga actttataat ctaggggcgc ctgtggcccc attgaaattc ttaaatgtag 5160 agagcatgac acagtccttg ttttaaaaga cccaccaaga aggataaagg atgatggcca 5220 ca~g~:gttat cagtcccact cctgagagat gg~gaaagcc t~acacagcc agaacaccag .5?$~_-, .
ggctctaagg tactgcacca tccagaattt gtagcttaat tccctacact gaactgaaag 5340 aaagaagagt tttatggtcc aaccagagtg acctgtccct aggtaacagt ctgagctgat 5400 tttccaaata ttcagattca gatgatagcc aacactgctg tccctccagc cttctacatc 5460 ctttatcctg tttgtcacac agttgcttcc aaaacttctt ggaataaagc ataataaaag 5520 tggacaatat aatggagctg ctctggagcc tatgtgattc aacttcagat gcacatacct 5580 agtgaacatt cacgaaccct gatatcacag caagttagaa tataaaagaa cacaaacatg 5640 gccgaatcgg tgaaggacac aggacaatgg tcccctagga agacgtcgct ggcttgagca 5700 atgggttacc catgctaagt aactgtctca cttcaacctg ggtccggttt gtaacgcata 5760 gtcaagaaga ctgtgaactg acataaagaa gcatgagaaa ttctcctaac acatcagaca 5820 tcttcaccgt gcttcaaaat tccactgaag ggctcacagg aaacaacata tccccttctt. 5880 ttctttcaga tgcttatgat actctctaca aaatacacca gtggggcaag taagatgaag 5940 acagagcatt gaccagttga gtcagtgata gcatgtcctg aagatcctga caggttgtag 6000 tggactggag agggtggagt gaaatgtgac tgtggtgaac tcaggagtga gcatgagtaa 6060 ggtggaaact agaatatagc taactccctg agaacctctc tctatcaggg actgggagcg 6120 tatatctgag gggctatgag ggcaagaaag catttcttta agatgcaaga aagatgtaac 6180 Seite 31 L-0020-01-WO-01.ST25 ccagtggtaa gcttaaagga atgcacatag agagcaaagc atgatgggat gagagtggta 6240 cattcaactt cagttcatag ttcgataaaa agaacttagt gattgcctgt ccagagatca 6300 ctccgagcct tacttctcag tctgtacctg agaaaagcac tgttgtcgcc gtaagcagag 6360 catattcagt atccgataca tgaagttcac tcattcttct gtagaagtaa gatagtgagg 6420 caataaattc tttagtaata tctgaaaaga ttaatataag tgaattttca tcggaatgga 6480 gcacagaagc aaacacattg tgattgtctt ttcttctctc aaacagtgta agaaaattga 6540 aaagagagtg gggttcaaga tggctaccga tgtgatggcc ggcacagaca atcccctttg 6600 ttgtgccctt gaagcccagg cctgaacctc aagagtgggc gcattaggtg tgctactgaa 6660 aaggaccaga acaaagagag gttgtcatca cgtgg~accaa ggaggaggct gtggaactga 6720 gagcctttag cctacagaag ccactctccg tgtatctcta ttatctacag taaggcttac 6780 ccttctaccc ataaaagtgg ttgatactgg gatgttggct gtagattaca tcatcatcct 6840 aagaataatt cagaggcccc aaatggcttc gcaattaaat agggaaagaa ttgccatgat 6900 gaagccaaag taagaaaatg gttacaaata gaattctcac tgatgacctg ccaacactaa 6960 caactgtatt ggctgtgtct ataaatgctt tcaccaccac agagacgcct gtgagctgtg 7020 agaaagccca tggccttctc tttgtagtgc aatgaaaagg ggagaaaagg aagcgtgtcc 7080 acccaaggtg atgcttccta ctgcacagtg ttaccagtta gaggagccga agggtagcct 7140 tccttgagaa agttttccgg agaatgaaca ctttcatcag cgctaggatt atgcacctct 7200 agtgtcccag ctgagggctt tgctggtctc atagtaccta gcaaaagaga aaaacacaaa 7260 tgttgatagt ctttcttaat cattatataa atagtaaatt cagtatcttt tcagtatcca 7320 gcataactat tatgtggggg aaaaaaaaac tcctaaaatg atatcattct catttttgtt 7380 actggaacac aaaacagata ttagttcata ttttgtaaaa aaaaaaaaaa gattgtacag 7440 aagaccctcc aaaatgagtt ttgcataatg aaatagaaac taagaatggc tttcttaaac 7500 acacacaccc tccacgaacc actcagtgca ccagaggctg tggagacttt taagcaaaca 7560 ctactcaagc aatcatggac caggtcatca agggcctccc tttagagcct tctggtctag 7620 Seite 32 L-0020-01-W0-01.ST25 tgtttgcaga gaacaatcaa taaataagga tattttaaaa ggctgtgcac tgtaacgagt 7680 acagctagtg aaggcagcta gcattgttgg cttttggttt tgattttgtc ctattgcctc 7740 ttcccatgtc ctcaaccaac atgcgacttt acagattgga ctcttgcatt cacacactaa 7800 acaatgatggctttcctgctctattcaaatggtttattgattttttaaaaaatacttgct7860 ttttttttattgaagtgaggtcctgcatagttggttcaacactaacctaatagggcttca7920 ccccagtaaagagacgttccacaaacagta.gcccgtggttatgagtttcaaagcaagcat7980 gcggccgtccccagagctgctttgtgtcttagaatcaccaacgcctttatgtttgatcac8040 ~~cat~aaaagtrctcctcatt~gga.agagcctg~cc~~a~jgga9c.ar~aga.agcc.cca~~c~ca.;_,8100 a~~ ,. ..
aagccccacccacagctgagaagctgttggcagctgctaggggaaggacaatccagtctc8160 tttagggaac tgagccctct aagttggcca tgcttcagtg gatgacccta cccccacccc 8220 cgtgtatatg agcagcacta tttagagtca atgggttatt ttgaaaaaga aggagcaggg 8280 aaggacatga agttaggaga gaggtgtgtg ggggattcta aaggagctgg tgggaaggaa 8340 gtgaggtgag taagatttaa aaaaaaaaac acttaggaca tgtatacagt tctcacaaga 8400 ataagaaata caatttttaa aagaattaca cggtagagag aaaatgctag atgaatgtga 8460 aactatttat aaacagctac tatgagacat ttgttcctac actaactggt aaaataaaat 8520 gaaaagataa atgaaaggct gatgatgtta ggagttagga gttctggctt actgagcaca 8580 gaccctggct tctgtcccca gcaccataga aggcaggcat agaggctcta ggctgatggt 8640 tggcaggtaa gagcagacac agcagggctt taagatcctc cttagctaca tactgagttc 8700 tacatagcct ggactacatg agagtctatt tccaaaaaac agtgtttaaa agtgtgtgcg 8760 tgcgtgcgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtc tgtctgtctg 8820 tctgtctgtc tgtgtgtact tgaagttgaa taaaattctg tttttctttt tacttccaga 8880 ggttgagtct ttcccaccat aaagctgggc tacatgaagg aacatcactt cagtttttga 8940 cgccttctgt aatgcagcct gatcctcagt ggttaaattt tcaaatcctt aaaaataaat 9000 Seite 33 L-0020-01-WO-01.ST25 aaataaataa ataaataaag cataaagaaa cagataatca aaagagaacg ttcataaatt 9060 aaatgtttaa aaatatttac ttattaaccc taagtgcttc tggggaataa caatagtata 9120 taaattgaaa tctgaataaa attctttttt tctattttgg ggcactttta tcagcactag 9180 gttacacttt cttacagcaa ggacaaatca acatagctgc attagccttg aagaatgtaa 9240 ttttgtaaag agtttctgta cctctaagca tataataaag ctgagttata tataagtttt 9300 ggtcaaactg tgacttgttt agtattatag aggaaacctt ctttgtttct tgtttcatct 9360 aagaaagccc acatcattat aataattttt gtaaatattt agattgtcac tcatctgatt 9420 aattgacaaa gacaaataaa catttttttt ttttttagtt agaagcaaca ttgctagaaa 9480 ~acccagggct~tcacacacat tagacaggtg~ctctgcctct gagcgataga tctagtacca ,9540 caactagaaa tttctgtttg atatactatc cctttaaaaa aaagaaaatt tggggctgga 9600 gagatagctc agtgggtaag agcactgact actcttccta aaaagccctg agttcaagtc 9660 ccagcaacca cacggtggct cgcaaccatc cgtaacgaga tctaacgccc tcttctggtg 9720 cacctgaaga cagctacagt gtacttatga aatctttggg ccagaacgag cggggccgac 9780 cagagtgagc agaggtccta aaaattcaat tctcaacaac cacatgaagg ctcacaacca 9840 tctgtacagc tacaatgtac tcagataaat aaataaataa ataaataaat aaataaataa 9900 ataaataatc ttttttaaaa attgggggct agaaagacag atcagcagtt aagagcctgt 9960 cttgctcttg cagagactct ggttccaacg cccacatggt aattcacagc catcggtagg 10020 aaactctagt tctaggggac caaacactct cttctggcct ccacaatatg ttgtgtacat 10080 gatatgaata tataattgca atcaaaacat gcataacaca taaacataaa aataaataaa 10140 tccttaaaaa ataatgaaac agaattttac ctgggagtcc cttggtaaac ttcattagcc 10200 cttgtatgtg caggactgat acctctgaga gtctcagaaa acttagttcg gggttggaac 10260 cctcctgcag ctggtccata aaaaaaataa ataaataaaa atgattcata gtatgtcttt 10320 ctgataatta ctaaggtgtt tcttttttac tgttcttgct atcatttcct aatatcattc 10380 attacaaatt atttcttgcc agcaatggat acatacaagt ttgcttgttt ctcccaaggg 10440 Seite 34 L-0020-Ol-WO-01.ST25 aatcatggat ttttggtgag cagtcactat ggtattcaga agccgatgtt cctcttgagt 10500 tagagtcatg ttgtcctgaa cctataagga aacagagggc gcctgtaatg tgtgcttgtt 10560 acacactgtt cataaagaca caagagactg tatggaagaa gcgtcttgcc cctttcccag 10620 atctggtgct ggatgacacg tgtttggtgt ctgctccttc atcttccact tggatggcag 10680 ggtacagggc aggcccgtgc ttgaagttct tgcgaagtct ctttgactta cactggatct 10740 ctgtgagcaa~ acctgtgaca ttacagaatt cacattaagt acttttggct cggaaaacaa 10800 cactgtaggg tttgattaaa atggtcttca gtgagtaaat agtcctagat atctctcgtg 10860 .. .. . ~~tgtctgca tetattaaat ttcc~attct. ;gcttttcc:ct.. tcataga,cac .ctt,gtctgct ~ln~~.~: - - '. .. ..
taaaaatagt tataacaaag attaagaatt aatttttcat ttccaaatta tgtttagatt 10980 tttacataag agatttacac attagttgca tcttggaagt gaaaaagtca tttgatagaa 11040 tagtagaatt acattatcta tgactattat cactggtttt attaataagg ttaaaaaaga 11100' ctatcttagg atttttaaaa ataagctaag tcaatttttc ctatgtcaga caaagcattt 11160 aattcttttt caaaggcaaa taatatttat tttaggattg tttttgtttg tttttaaata 11220 ttttataatt acttgcatga gtctgtatgg ggaaatgcac aggtgagtgc aggtgcccac 11280 agagcacaga ggggtcatag ccccttgaag tgagactata gctggttgtg agctgtagtg 11340 tatgtactgg actggactgg actactctaa gtacttttag cagctactaa aagtatccaa 11400 ctattaatat acctgtcgtt ttcaaagtat ttttaaaaac acacctataa aataaaaaac 11460 taatgaggag ctgaagtata aacaaaaacc ttaaaacccc tctatattat tttattattt 11520 gtcagagaac agaatacaca aaaaaataaa gccattttaa gcatattgag tattgaacat 11580 ggtagtaaac acctataacc ccagctactt aagacactgt tacaagatga tctcacgttg 11640 aataccagcc atggcaactc agtgtttttt agtaggaaag gagtctatgg gtgtagctta 11700 gtggtagatt gtctaggatg tgagaaacct gaactcaacc cccaggacca tgagaaggaa 11760 aaccacaaga agagaacgaa gaggaggaaa aggaggaaga agagggggaa gaggaggaag 11820 Seite 35 L-0020-Ol-WO-01.ST25 aggaaaagaa aggaggaaag ggaaaggaga atgagggaag gaagaggagg aggggccgtc 11880 agagcactaa cacatggcag tattaggaca actgaaaggt aaatgaagtg tgcagatcca 11940 agcaatggtg gcagggtgcg gaccagctta tgctcgccaa tcaccgtgac tgctcaccaa 12000 ctctctcgag cttcaccttg ccagaatttc tttaagtttc tttaagtgta ggatgactca 12060 ttttatgcat agagttcctc agaggactcc aacttccctt tcacatgcat tgcctgccac 12120 aggttaaaag acaacttttg agccagaaat tggagggcag aggcaggggg atctctgagt 12180 ttgatgccag cctggcgtac atagcaagtt ccaggtcagc tacatagtga gatcctgaat 12240 acaaattaaa ttaatgaata aaataaaaga caactttttc aaaggtttct ttggactgag 12300 ttgtatctcc cactgctcat atgctaatgtngttgatattc agagatgagt cctgacactc 12360 gtgagactat tagtggactt tacatatgag gatgtcaggg gtccagagat ccagttaccc 12420 ttcaattgtc ctaagactgt agtgtgttag gactctgaga accccttctc agaaagggaa 12480 gacttccttt tctcttcttt ctcctcctct ttctttcttt ttgggaacta ctaactttaa 12540 ctttgcactt ttcaaggcag tatatatttt taggttaagt aacagaaaca tacaatacct 12600 tttttaaatc actatggagg tttcttttat tttatgtgtg tgttttgctt gcatgtgtgt 12660 atgcacacaa cactcatacc tgcagaggtc agaagagggc ctgggatgct caggaactga 12720 gttataaatg attgtgaggg gccacgtgga ~agctggggac cgggcccagg tcctctccta 12780 atgctcttaa ctgcctaact actcttcagg tctcagagaa catctaatta agatgaaact 12840 atgactgatt tcaaaattag agcaatatag aacagggcaa catttatggg aatcactact 12900 taaattgata atgggccatc cacgtggcgc aattactgta catatgtgtg tggtgtgcat 12960 atacaaacga atacggccat gttcaaaata actcttgaag aatgctttat gtcctcaaga 13020 aggttggaaa tttttaattt tttattaatt tttaaattca tttttattat atagaggaga 13080 gtgtgaatac agtccttcac taacacaaat gatgcacctg agtttcccac atttgtggaa 13140 atcacagggg tcagtcagca catccctagt gcaatgggcg attcttgccc tggaaaaacc 13200 accaccacta tcatgtcctc cttgtcagga aggtaaggct gggaaaagtt aaatcccctc 13260 Seite 36 L-0020-Ol-WO-01.ST25 agaaaagcaa gccacgaaca cactggcact tacattctgc caacatcccc accgccttac 13320 acttcttcag tctgcactct tggcattttc tgcgcatgta catgtccatt tcacagtgac 13380 caccgttctt gcaagaatac actgcattct tggtaatgct acgccggaaa aagcctaaca 13440 acaaagatgg tcaaagtttg acatggatat gtttagttac ttctcatttt tattaacatt 13500 ttacctttat agaaagtaat caagctaact agagtatttc aattaaatag ggtttttttt 13560 attcccgaca tttttttaaa tatacattta tctggaaagt tagctaattc tataattctt 13620 tatagttagt atagttcagt tgacatcctg aggctgggtc tttctttata acgtctggag 13680 .. - a~aaaat.gga-'gtatcattat cacatt?aga caracvtc~tcta..ccctactata ggGaaal:i:ac~ i:~7.40. ..
ctataagaat ttttttttta actgcattat acaaaatatc aagatcctac aatgcatctc 13800 tcgagtttcc tttgagagaa tacatggtca cctaaaataa agaaaaaatt cagaaaagcc 13860 tctcttttct aaaccctccc agtctgagtt gcagcccaca aaggtggtcc ctccctccct 13920 catccgcaga tgctctctct ccctttccca gcacttgcca gatagttccc atcccttttc 13980 ctccttcgaa ccttattaca cagtaatacc tgccatttcc tgctctccct cctatgagtt 14040 gtcccaaagc cccaccactt actctcttgc agtggacact tgaaagatgg agtttcgcca 14100 tttcagttca acgaaggaag aaaaagctca gctcaggttt tcaaaactgt gctgggttct 1416 0 tcagctcagt tttaacagta acccgtcacc gtcaagagca ggccacccct gcctaactac 14220 tcgccacagg actctgtcac ctcatatcct ttggctattc tttgccagtg ttgaggttta 14280 agatttacag gtaaaatttt accctaaaca aacacgttta tttgtgttac attgactaaa 14340 ctgttttctg ttgtttgggt tgttttgttt tgtttcagag ggtgctagaa tcaaatccaa 14400 aagtaggtgc taagtgctct accactggcc tgcattccca tcccgatgtt tatccatttc 14460 tggagtggta gactgtttca gaactgttat ttatcagaga ggcttccgct ctttacaaat 14520 atcattgcaa acactggccc ttgcttcccg actttcagct tgttctcttt ctatacgaaa 14580 cactttttaa aatttattga cttagaatca tagaagtcat ttatttcaaa ttttctccaa 14640 Seite 37 L-0020-01-WO-01.ST25 gtgtataaat tttatcaacc acataaaaag gtgtcatggt ttacagatct ttacattgtc 14700 atgttttgtt ccagcatcaa attcttctag aaatagggaa aacgtttgct tatatgtatc 14760 gagtttaaag taaagtgtga aagtcttcat caaaaatgaa gagacagacc tggtagtgaa 14820 tgctcttaat gccagcagta aggatacaaa ggtaggtgga tctctgagag ttcaaggaca 14880 gcctggtcta aacagtgagt tccaggacag ccagggatgc atagagagac cttttctcaa 14940 aaaaagaaga aagatgagga ggagaagatg aggacgaagg ggaagacaag ggaggagaaa 15000 aaaaaataaa gaaagaaatg gtccaggcat ggtggcccac acctttaatc ccagcaccta 15060 gaaagcagag acaggcaggt atctgtgagt tcaaagccaa cctgatctcc atagtgaggt 15120 ctaggccagc cagggctata caatgatact ctacttcaaa agagagagag ggggggaggg 15180 ~agggagagag agagagagag agagagagag agagagagag agagagagag agagagagag 15240 agagagagag agagagggag agagagagag agagagagag agagagagag agagagagag 15300 agagagagag agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 15360 aaagaaagaa agaaagaaag aggaagggaa agggaaaggg aaagggaaag ggaaaggaaa 15420 ggaaaggaaa ggaaaggaaa ggaaaggaaa ggaaagaaac taaccttaaa cttaacattt 15480 tattctccta tctttgatca cacagattga tccctttaga catacttttt cctggttttg 15540 tttctcatgt tttataacga ttgcacacgc acgtgcacac acacatacac acgtgtgcgc 15600 gtgcgcagca ctgtctgttt ggttgcgcct tacccttacc tttgcacccc tcacaagtaa 15660 gtgcattgta gtgatatcct gatgccttat caccgcagac catgcagagc atgtcatctt 15720 gacccttcca gcgtaaagaa caagttgctc tgcacctttg tgccccaggg aattcatctt 15780 catcatcgtg aacaaccaca taagtggact gaccagattc tcgagcgtcg agtccacacc 15840 aactgcctcc actgtactgt ggatccaagc catgtgtgtt gaaatgactt tgtaaagatg 15900 gggactgtaa cgctggagga aactgagcgg tagaatattg acaatagggt ggttcttgga 15960 aatctgtatc acacagctga taatggaatt gttctggcaa aatatctggt tttattaaca 16020 taaaaaagaa ~caaatcataa aaggtcaaaa aagttcataa ttgttttaaa cctcaaaatg 16080 Seite 38 L-0020-01-WO-Ol.ST25 ttttatactc aacagaaagt tagttcaaca catctattcc atctcaggac ttgccgtcca 16140 gcacaatagg aacacagtaa atggaactaa caatacctaa taaatgctat gaccccaaag 16200 <210>22 <211>1428 <212>DNA
<213>Mus musculus <400> 22 atgttaataa aaccagatat tttgccagaa caattccatt atcagctgtg tgatacagat 60 ttccaagaac caccctattg tcaatattct accgctcagt ttcctccagc gttacagtcc 120 ccatctttac aaagtcattt caacacacat ggcttggatc cacagtacag tggaggcagt 180 tggtgtggac tcgacgctcg agaatctggt cagtccactt atgtggttgt tcacgatgat 240 gaagatgaat tccctggggc acaaaggtgc agagcaactt gttctttacg ctggaagggt 300 caagatgaca tgctctgcat ggtctgcggt gataaggcat caggatatca ctacaatgca 360 cttacttgtgaggggtgcaaaggctttttccggcgtagcattaccaagaatgcagtgtat 420 tcttgcaagaacggtggtcactgtgaaatggacatgtacatgcgcagaaaatgccaagag 480 tgcagactgaagaagtgtaaggcggtggggatgttggcagaatgtttgctcacagagatc 540 cagtgtaagtcaaagagacttcgcaagaacttcaagcacgggcctgccctgtaccctgcc 600 atccaagtggaagatgaaggagcagacaccaaacacgtgtcatccagcaccagatctggg 660 aaaggggttc aggacaacat gactctaact caagaggaac atcggcttct gaataccata 720 gtgactgctc accaaaaatc catgattccc ttgggagaaa caagcaaact tctgcaggag 780 ggttccaacc ccgaactaag ttttctgaga ctctcagagg tatcagtcct gcacatacaa 840 gggctaatga agtttaccaa gggactccca ggatttgaaa atttaaccac tgaggatcag 900 gctgcattac agaaggcgtc aaaaactgaa gtgatgttcc ttcatgtagc ccagctttat 960 ggtgggaaag actcaacctc tggaagtact atgagaccag caaagccctc agctgggaca 1020 ctagaggtgc ataatcctag cgctgatgaa agtgttcatt ctccggaaaa ctttctcaag 1080 Seite 39 L-0020-01-WO-01.ST25 gaaggctacc cttcggctcc tctaactgga agaatgagtg aacttcatgt atcggatact 1140 gaatatgctc tgcttacggc gacaacagtg cttttctcag atcgtccatg ccttaaaaat 1200 aagcagcata tagaaaacct acaagaacca gtcctgcaac ttttgtttaa gttttcaaaa 1260 atgtaccatc cagaagaccc acagcatttc gcccacctca tagggaggct tactgaactg 1320 agaactctga gtcacagcca ctctgaaatc cttcgcatgt ggaaaacaaa ggaccccagg 1380 ttggtgatgt tattctctga gaaatgggat ctgcactcat tttcctga 1428 <210> 23 . . . , ,.
<212> DNA
<213> Mus musculus <400> 23 tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120 cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300 atgaagttca ctcattcttc cagttagagg agccgaaggg tagccttcct tgagaaagtt 360 ttccggagaa tgaacacttt catcagcgct aggattatgc acctctagtg tcccagctga 420 gggctttgct ggtctcatag tacttccaga ggttgagtct ttcccaccat aaagctgggc 480 tacatgaagg aacatcactt cagtttttga cgccttctgt aatgcagcct gatcctcagt 540 ggttaaattt tcaaatcctg ggagtccctt ggtaaacttc attagccctt gtatgtgcag 600 gactgatacc tctgagagtc tcagaaaact tagttcgggg ttggaaccct cctgcagaag 660 tttgcttgtt tctcccaagg gaatcatgga tttttggtga gcagtcacta tggtattcag 720 aagccgatgt tcctcttgag ttagagtcat gttgtcctga acccctttcc cagatctggt 780 gctggatgac acgtgtttgg tgtctgctcc ttcatcttcc acttggatgg cagggtacag 840 Seite 40 L-0020-Ol-WO-O1.ST25 ggcaggcccg tgcttgaagt tcttgcgaag tctctttgac ttacactgga tctctgtgag 900 caaacattct gccaacatcc ccaccgcctt acacttcttc agtctgcact cttggcattt 960 tctgcgcatg tacatgtcca tttcacagtg accaccgttc ttgcaagaat acactgcatt 1020 cttggtaatg ctacgccgga aaaagccttt gcacccctca caagtaagtg cattgtagtg 1080 atatcctgat gccttatcac cgcagaccat gcagagcatg tcatcttgac ccttccagcg 1140 taaagaacaa gttgctctgc acctttgtgc cccagggaat tcatcttcat catcgtgaac 1200 .
aaccacataa gtggactgac cagattctcg agcgtcgagt ccacaccaac tgcctccact 1260 gta.Gt9tg9a.. t cGa~.gceat.- gtgtgttc~aa ~~tgacattgt aa~gavg~fJg ac'~Lgt~at~gc~ . 1:j20.: . . ..
tggaggaaac tgagcggtag aatattgaca atagggtggt tcttggaaat ctgtatcaca 1380 cagctgataa tggaattgtt ctggcaaaat atctggtttt attaacat 1428 <210>24 <211>475 <212>PRT
<213>Mus musculus <400> 24 Met Leu Ile Lys Pro Asp Ile Leu Pro Glu Gln Phe His Tyr Gln Leu Cys Asp Thr Asp Phe Gln Glu Pro Pro Tyr Cys Gln Tyr Ser Thr Ala Gln Phe Pro Pro Ala Leu Gln Ser Pro Ser Leu Gln Ser His Phe Asn Thr His Gly Leu Asp Pro Gln Tyr Ser Gly Gly Ser Trp Cys Gly Leu Asp Ala Arg Glu Ser Gly Gln Ser Thr Tyr Val Val Val His Asp Asp Glu Asp Glu Phe Pro Gly Ala Gln Arg Cys Arg Ala Thr Cys Ser Leu Seite 41 L-0020-Ol-WO-Ol.ST25 Arg Trp Lys Gly Gln Asp Asp Met Leu Cys Met Ual Cys Gly Asp Lys Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Ile Thr Lys Asn Ala Ual Tyr Ser Cys Lys Asn Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Cys Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Lei. ihr~ Glu ~tle::Gl;s. Cys.-Lys,:Se~-r.:i ys~A.r~:c~ I_e~.t ArYgr.Lys.~A~n phe_..~4.y~ ..- .. . .... : : ~ _ ~ ..~
His Gly Pro Ala Leu Tyr Pro Ala Ile Gln Val Glu Asp Glu Gly Ala 195 200 ~ 205 Asp Thr Lys His Ual Ser Ser Ser Thr Arg Ser Gly Lys Gly Ual Gln Asp Asn Met Thr Leu Thr Gln Glu Glu His Arg Leu Leu Asn Thr Ile Ual Thr Ala His Gln Lys Ser Met Ile Pro Leu Gly Glu Thr Ser Lys Leu Leu Gln Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser Glu Ual Ser Ual Leu His Ile Gln Gly Leu Met Lys Phe Thr Lys Gly 275 280 ~ 285 Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gln Ala Ala Leu Gln Lys Ala Ser Lys Thr Glu Ual Met Phe Leu His Ual Ala Gln Leu Tyr Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro Ser Ala Gly Thr Leu Glu Ual His Asn Pro Ser Ala Asp Glu Ser Ual Seite 42 L-0020-Ol-WO-01.ST25 His Ser Pro Glu Asn Phe Leu Lys Glu Gly Tyr Pro Ser Ala Pro Leu Thr Gly Arg Met Ser Glu Leu His Ual Ser Asp Thr Glu Tyr Ala Leu Leu Thr Ala Thr Thr Ual Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln His Ile Glu Asn Leu Gln Glu Pro Ual Leu Gln Leu Leu Phe Lys Phe Ser Lys Met Tyr His Pro Glu Asp Pro Gln Nis Phe Ala His Leu Il~e~ Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser~His Ser His Ser Glu Ile Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Val Met Leu 450 455 . 460 Phe Ser Glu Lys Trp Asp Leu His Ser Phe Ser Seite 43
BACKGROUND OF THE INVENTION
Multicellular organisms are dependent on advanced mechanisms of information transfer between cells and body compartments. The information that is transmitted can be highly complex and can result in the alteration of genetic programs involved in cellular differentiation, proliferation, or reproduction. The signals, or hormones, are often simple molecules, such as peptides, fatty acid, or cholesterol derivatives.
Many of these signals produce their effects by ultimately changing the transcription of specific genes. One well-studied group of proteins that mediate a cell's response to a variety of signals is the family of transcription factors lenown as nuclear receptors, hereinafter referred to often as "NR". Members of this group include receptors for steroid hormones, vitamin D, ecdysone, cis and trans retinoic acid, thyroid hormone, bile acids, cholesterol-derivatives, fatty acids (and other peroxisomal proliferators), as well as so-called orphan receptors, proteins that are structurally similar to other members of this group, but for which no ligands are known (Escriva, H. et al., Ligand binding was acquired during evolution of nuclear receptors, PNAS, 94, 6803 -6808, 1997). Orphan receptors may be indicative of unknown signaling pathways in the cell or may be nuclear receptors that function without ligand activation. The activation of transcription by some of these orphan receptors may occur in the absence of an exogenous ligand and/or through signal transduction pathways originating from the cell surface (Mangelsdorf, D. J. et al., The nuclear receptor superfamily: the second decade, Cell 83, 835-839, 1995).
In general, three functional domains have been defined in NRs. An amino terminal domain is believed to have some regulatory function. A DNA-binding domain hereinafter referred to as "DBD" usually comprises two zinc finger elements and recognizes a specific Hormone Responsive Element hereinafter referred to as "HRE"
within the promoters of responsive genes. Specific amino acid residues in the "DBD"
have been shown to confer DNA sequence binding specificity (Schena, M. &
Yamamoto, K.R., Mammalian Glucocorticoid Receptor Derivatives Enhance Transcription in Yeast, Science, 241:965-967, 1988). A Ligand-binding-domain hereinafter referred to as "LBD" is at the carboxy-terminal region of known NRs. In °:
the absence of hormone, the LBD appears to interfere with the interaction of the DBD
with its HRE. Hormone binding seems to result in a conformational change in the NR
and thus opens this interference (Brzozowski et al., Molecular basis of agonism and antagonism in the oestogen receptor, Nature, 389, 753 - 758, 1997; Wagner et al., A
structural role for hormone in the fihyroid hormone receptor, Nature, 378, 690 - 697.
1995). A NR without the HBD constitutively activates transcription but at a low level.
Both the amino-terminal domain and the LBD appear to have firanscription activation functions hereinafter referred to as "TAF". Acidic residues in the amino-terminal domains of some nuclear receptors may be important for these transcription factors to interact with RNA polymerase. TAF activity may be dependent on interactions with other protein factors or nuclear components (Diamond et al., Transcription Factor Interactions: Selectors of Positive or Negative Regulation from a Single DNA
Element, Science, 249:1266-1272 , 1990). Certain oncoproteins (e.g., c-Jun and c-Fos) can show synergistic or antagonistic activity with glucocorticoid receptors (GR) in transfected cells. Furthermore, the receptors for estrogen and vitamins A
and D, and fatty acids have been shown to interact, either physically or functionally, with the Jun and Fos components of AP-1 in the transactivation of steroid- or AP-1 regulated genes.
Coactivators or transcriptional activators are proposed to bridge between sequence specific transcription factors, the basal transcription machinery and in addition to influence the chromatin structure of a target cell. Several proteins like SRC-1, ACTR, and Grip1 interact with NRs in a ligand enhanced manner (Heery et al., A
signature motif in transcriptional coactivators mediates binding to nuclear receptors, Nature, 387, 733 - 736; Heinzel et al., A complex containing N-CoR, mSin3 and histone deacetylase mediates transcriptional repression, Nature 387, 43 - 47, 1997).
Furthermore, the physical interaction with negative receptor-interacting proteins or corepressors has been demonstrated (Xu et al., Coactivator and Corepressor complexes in nuclear receptor function, Curr Opin Genet Dev, 9 (2), 140 - 147, 1999).
Nuclear receptor modulators like steroid hormones affect the growth and function of specific cells by binding to intracellular receptors and forming nuclear receptor-ligand complexes. Nuclear receptor-hormone complexes then interact with a hormone response element (HRE) in, the control region of specific genes and alter specific gene expression.
Over the past decade, new members of the nuclear hormone gene family have been identified that lack known ligands. These orphan receptors can be used to uncover signaling molecules that regulate yet unidentified physiological networks.
Some of these orphan receptors are constitutively active in transactivate target genes without the need to interact with a ligand (MangelsdorFet al., 1995).
Farnesoid X Receptor alpha (hereinafter FXR-a) is a prototypical type 2 nuclear receptor (US Pat. 6,005,086) which activates genes upon binding to promoter region of target genes in a heterodimeric fashion with Retinoid X Receptor (hereinafter RXR, Forman et al., Cell, 81, 687-93, 1995).. The relevant physiological ligands of FXR-a seem to be bile acids. The most potent is chenodeoxycholic acid, which regulates the expression of several genes that participate in bile acid homeostisis. Farnesoid, originally described fio activate the rat ortholog at high concentration does not activate the human or mouse receptor. It is highly expressed in the liver, intestine and kidney. Like LXR-a FXR-a is involved in intracrine signaling.
The relevant physiological ligands of NR1 H4 (as FXR-a is also called) seem to be bile acids (Makishima et al., Science, 284, 1362-65, 1999; Parks et al., Science, 284, 1365-68, 1999). The most potent is chenodeoxycholic acid, which regulates the expression of several genes that participate in bile acid homeostasis.
Consequently, FXR-a is proposed to be a nuclear bile acid sensor. As a result, it modulates both, the synthetic output of bile acids in the liver and their recycling in the intestine (by regulating bile acid binding proteins?. Its is also activated by retoinic acid and TTNPB at supraphisiological concentration. Furthermore, it regulates the conversion of dietry cholesterol into bile acids by regulation the metabolizing genes like CYP7-a. This is a feed back regulation since the receptor is activated by bile acids.
Through its regulatory function in cholesterol and bile acid metabolism an FXR-a homologue could serve as a target for cholesterol lowering drugs and exert beneficial effects in diseases like artheriosclerosis and other metabolic disorders.
It was thus an object of the present invention to provide for a novel nuclear receptor.
In a preferred embodiment of the invention it was an object to provide for a homologue of FXR-a. It was an object of the present invention to provide for means of producing this receptor as well as means of screening for agonists and antagonists to the receptor. Further objects of the invention are outlined below.
SUMMARY OF THE INVENTION
The present invention provides, inter alia, a novel nuclear receptor protein.
In a preferred embodiment of the invention a novel FXR-a homologue is provided for.
Also provided is the nucleic acid sequence encoding this novel nuclear receptor protein, as well as compounds and methods for using this protein and its nucleic acid sequence.
The present invention provides a novel proteins, nucleic acids, and methods useful for developing and identifying compounds for the treatment of such diseases and disorders as metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases and in preferred embodiments, high cholesterol and artheriosclerosis as well as other metabolic disorders.
Identified and disclosed herein is the protein sequence for a novel nuclear receptor and the nucleic acid sequence encoding this nuclear receptor L66, which we call the L66 nuclear receptor (or simply "L66") receptor, or also FXR-(i.
The importance of this discovery is manifested in the effects of FXR-~3 to modulate genes involved in cellular functions like regulation of metabolism and cell homeostasis, cell proliferation and differentiation, pathological cellular aberrations, or cellular defense mechanisms including tumor development, i.e. cancer.
Thus, this L66 protein is useful for screening for L66 agonists and antagonist activity for controlling these conditions.
In one aspect of the present invention, we provide isolated nucleic acid sequences for a novel receptor, the L66 receptor. In particular, we provide the cDNA
sequences, protein sequences as well as the genomic sequences encoding the human L66 receptor, as well as the cDNA sequence, protein sequence and genomic sequence of the Mus musculus (mouse) receptor.
These nucleic acid sequences have a variety of uses. For example, they are useful for making vectors and for transforming cells, both of which are ultimately useful for production of the L66 protein.
They are also useful as scientific research tools for developing nucleic acid probes for determining L66 expression levels, e.g., to identify diseased or otherwise abnormal states. They are useful for developing analytical tools such as anti sense oligonucleotides for selectively inhibiting expression of the L66 gene to determine physiological responses.
In another aspect of the present invention, we provide a homogenous composition comprising the L66 protein. The protein is useful for screening drugs for agonist and antagonist activity, and, therefore, for screening for drugs useful in regulating physiological responses associated with L66. Specifically, antagonists to the receptor could be used to treat diseases and disorders as metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases and in preferred embodiments, high cholesterol and artheriosclerosis as well as other metabolic disorders, whereas agonists could be used for the treatment of these conditions. The proteins are also useful for developing antibodies for detection of the protein.
Flowing from the foregoing are a number of other aspects of the invention, including (a) vectors, such as plasmids, comprising the L66 nuclear receptor nucleic acid sequence that may further comprise additional regulatory elements, e.g., promoters, (b) transformed cells that express the L66, (c) nucleic acid probes, (d) antisense oligonucleotides, (e) agonists, (f) antagonists, and (g) transgenic mammals.
Further aspects of the invention comprise methods for making and using the foregoing compounds and compositions.
The foregoing merely summarizes certain aspects of the present invention and is not intended, nor should it be construed, to limit the invention in any manner.
All patents and other publications recited herein are hereby incorporated by reference in their entirety.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
THE L66 PROTEIN AND NUCLEIC ACID:
The present invention comprises a novel member of the nuclear receptor superfamily which the inventors herein refer to as L66. Particularly preferred embodiments of the L66 receptor are those having an amino acid sequence substantially the same as SEQ ID NO. 3, 24 and/or 19. Examination of the amino acid sequence confirms that the present protein is indeed a member of the nuclear receptor family (see also US
6,005,086) which is closely related to FXR (see also figure 3 for the domain composition). The carboxy-terminal ligand binding domain "LBD" of L66 is a complex region encoding subdomains for ligand binding, often dimerization and transcriptional activation.
The nucleic acids claimed above may be present in various forms, i.e. as an RNA, DNA, cDNA or as genomic DNA.
As used herein, if reference to L66, the L66 receptor, the nuclear receptor L66 or the L66 nuclear receptor is made it is meant as a reference to any protein having an amino acid sequence substantially the same as SEQ ID NO.: 3, 24 and/or 19.
The present invention also comprises nucleic acid sequences encoding the L66 receptor, which nucleic acid sequences are substantially the same as SEQ ID
NO.
1 and SEQ ID NO. 17 (cDNAs) or splice variants thereof 4, 6, 8, 10, 12, 22, their sequence complements SEQ ID NO. 2 and 18 or complements of said splice variants 5, 7, 9, 11, 13, 23. SEQ ID NO 4encodes the human cDNA L66 receptor and is a preferred embodiment. SEQ OD NO. 17 (cDNA) encodes the mouse receptor (Mus musculus). SEQ ID NO. 20 represents the genomic sequence of L66 locus from the mouse (Mus musculus) (see also Fig. 15)1n one embodiment, the nucleic acid sequence according to the invention comprises the sequence according to SEQ ID
NO. 1, 4, 6, 8, 10, 12, 17, 20( 4 to 12 representing various splice variants of the human L66) and/or 22 or portions thereof, in a preferred embodiment nucleic acid sequence according to the invention consists of the sequence according to SEQ
ID
NO. 1, 4, 6, 8, 10 ( 4 to 10 representing various splice variants of the human L66) and/or 17 or portions thereof.
Herein the "complement" refers to the complementary strand of the nucleic acid according to the invention, thus the strand that would hybridize to the nucleic acid according to the invention. in accordance with standard biological terminology all DNA sequences herein are however written in 5'-3' orientation, thus the if a complement is mentioned (see also figures) it is actually a "reverse"
complement (as also stated in the figures). For simplification purposes they may however sometimes be referred to simply as "complements".
As used herein, a protein "having an amino acid sequence substantially the same or similar as SEQ ID NO x" (where "x" is the number of one of the protein sequences recited in the Sequence Listing) means a profiein whose amino acid sequence is the same as SEQ ID NO x or differs only in a way such that at least 50% of the residues compared in a sequence alignment with SEQ ID NO. x are identical, preferably 75%
of the residues are identical, even more preferably 95% of the residues are identical and most preferably at least 98% of the residues are identical.
Those skilled in the art will appreciate that conservative substitutions of amino acids can be made without significantly diminishing the protein's affinity for interacting proteins, DNA binding sites, L66 receptor modulators, e.g. small molecular hydrophobic compounds, or RNA.
Other substitutions may be made that increase the protein's affinity for these compounds. Making and identifying such proteins is a routine matter given the teachings herein, and can be accomplished, for example, by altering the nucleic acid sequence encoding the protein (as disclosed herein), inserting it into a vector, transforming a cell, expressing the nucleic acid sequence, and measuring the binding affinity of the resulting protein, all as taught herein.
As used herein the term "a molecule having a nucleotide sequence substantially the same as SEQ ID NO y" (wherein "y" is the number of one of the protein-encoding nucleotide sequences listed in the Sequence Listing) means a nucleic acid encoding a protein "having an amino acid sequence substantially the same as SEQ ID NO
y+I"
(wherein "y+I" is the number of the amino acid sequence for which nucleotide sequence "y" codes) as defined above. This definition is intended to encompass natural allelic variations in the L66 sequence. Cloned nucleic acid provided by the present invention may encode L66 protein of any species of origin, including (but not limited to), for example, mouse, rat, rabbit, cat, dog, primate, and human.
Preferably, the nucleic acid provided by the invention encodes L66 receptors of mammalian, preferably mouse and most preferably human origin.
_ 9 _ Preferably, the L66 receptors proteins provided by the invention are of mammalian, more preferably mouse and most preferably human origin.
The inventors have found (see figures and examples) that the L66 gene in humans is expressed primarily in testis. In mice however expression may be observed also in other tissues.
IDENTIFICATION OF VARIANTS AND HOMOLOGUES AS WELL AS USE OF
PROBES:
Nucleic acid hybridization probes provided by the invention are nucleic acids consisting essentially of the nucleotide sequences complementary to any sequence depicted in SEQ ID NO. 1, 4, 6, 8, 10, 12, 17, 20, 22, 2, 5, 7, 9, 11, 13, 18, 21 and/or 23 or a part thereof and that are effective in nucleic acid hybridization Nucleic acid hybridization probes provided by the invention are nucleic acids capable of detecting i.e. hybridizing to the gene encoding the polypeptides according to SEQ
ID NO. 3, 24 and/or 19.
Nucleic acid probes are useful for detecting L66 gene expression in cells and tissues using techniques well-known in the art, including, but not limited to, Northern blot hybridization, in situ hybridization, and Southern hybridization to reverse transcriptase - polymerase chain reaction product DNAs. The probes provided by the present invention, including oligonucleotide probes derived therefrom, are also useful for Southern hybridization of mammalian, preferably human, genomic DNA for screening for restriction fragment length polymorphism (RFLP) associated with certain genetic disorders. As used herein, the term complementary means a nucleic acid having a sequence that is sufficiently complementary in the Watson-Crick sense to a target nucleic acid to bind to the target under physiological conditions or experimental conditions those skilled in the art routinely use when employing probes.
It is understood in the art that a nucleic acid sequence will hybridize with a complementary nucleic acid sequence under high stringent conditions as defined herein, even though some mismatches maybe present. Such closely matched, but not perfectly complementary sequences are also encompassed by the present invention. For example, differences may occur through genetic code degeneracy, or by naturally occurring or man made mutations and such mismatched sequences would still be encompassed by the present claimed invention.
Preferably, the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO. 1, 4, 17, and/or 22 or splice variants thereof) and/or their complements can be used to derive oligonucleotide fragments (probes) of various length. If the probe is used to detect a human L66 sequence most preferably a complement of SEQ ID NO. 1 and/or 4 or its complement is used. If the probe is supposed to detect a mouse or a rodent sequence probes complementary to SEQ ID NO. 17 and/or 22, or their respective complements are preferred. Stretches of 17 to 30 nucleotides are used frequently but depending on the screening parameters longer sequences as 40, 50, 100, 150 up to the full length of the sequence may be used. Those probes can be synthesized chemically and are obtained readily from commercial oligonucleotide providers. Chemical synthesis has improved over the years and chemical synthesis of oligonucleotides as long as 100-200 bases is possible. The field might advance further to allow chemical synthesis of even longer fragments. Alternatively, probes can also be obtained by biochemical de novo synthesis of single stranded DNA.
In this case the nucleotide sequence of the nuclear receptor L66 or its complement (see figures) serve as a template and the corresponding complementary strand is synthesized. A variety of standard fiechniques such as nick translation or primer extension from specific primers or short random oligonucleotides can be used to synthesize the probe (Molecular Cloning: A Laboratory Manual (3 Volume Set) by Joseph Sambrook, David W. Russell, Joe Sambrook, 2100 pages 3rd edition (January 15, 2001; . Molecular cloning: a laboratory manual. Cold Spring Harbor Press, Cold Spring Harbor, 1989)). Nucleic acid reproduction technologies exemplified by the polymerise chain reaction (Saiki, R.K. et al. Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerise. Science 239, 487-491 (1988)) are commonly applied to synthesize probes. In the case of techniques using specific primers the nucleic acid sequence of the nuclear receptor L66 or its complement is not only used as a template in the biochemical reaction but also to derive the specific primers which are needed to prime the reaction.
In some cases one might also consider to use the nucleic acid sequence of the cofactor or its complement as a template to synthesize an RNA probe. A
promoter sequence for a DNA-dependent RNA polymerase has to be introduced at the 5'-end of sequence. As an example this can be done by cloning the sequence infio a vector which carries the respective promoter sequence. It is also possible to introduce the needed sequence by synthesizing a primer with the needed promoter in the form of a 5' "tail". The chemical synthesis of a RNA probe is another option.
Appropriate means are available to detect the event of a hybridization. There is a wide variety of labels and detection systems, e.g. radioactive isotopes, fluorescent, or chemiluminescent molecules which can be linked to the probe. Furthermore, there are methods of introducing haptens which can be detected by antibodies or other ligands such as the avidin/biotin high affinity binding system.
Hybridization can take place in solution or on solid phase or in combinations of the two, e.g. hybridization in solution and subsequent capture of the hybridization product onto a solid phase by immobilized antibodies or by ligand coated magnetic beads.
Hybridization probes act by forming selectively duplex molecules with complementary stretches of a sequence of a gene or a cDNA. The selectivity of the process can be controlled by varying the conditions of hybridization. To select sequences which are identical highly homologous to the sequence of interest stringent conditions for the hybridization will be used, e.g. low salt in the range of 0.02 M to 0.15 M
salt and/or high temperatures in the range from 50°C degrees centigrade to 70°C degrees centigrade. Stringency can be further improved by the addition of formamide to the hybridization solution. The use of stringent conditions which means that only little mismatch or a complete match will lead to a hybridization product would be used to isolate closely related members of the same gene family. Thus, as used herein stringent hybridization conditions are those where between 0.02 M to 0.15 M
salt and/or high temperatures in the range from 50°C degrees centigrade to 70°C
degrees centigrade are applied.
12_ The use of highly stringent conditions or conditions of "high stringency"
means that only very little mismatch or a complete match which lead to a hybridization product would be used to isolate very closely related members of the same gene family.
Thus, as used herein highly stringent hybridization conditions are those where between 0.02 - 0.3 M salt and 65°C degrees centigrade are applied for about 5 to 18 hours of hybridization time and additionally, the sample filters are washed twice for about 15 minutes each at between 60°C - 65°C degrees centigrade, wherein the first washing fluid contains about 0.1 M salt (NaCI and/or Sodium Citrate) and the second contains only about 0.02 M salfi (NaCI and/,or Sodium Citrate). In a preferred embodiment the following conditions are considered to be highly stringent:
Hybridisation in a buffer containing 2 x SSC (0.03 M Sodium Citrate, 0.3 M
NaCI) at 65°C - 68°C degrees centigrade for 12 hours, followed by a washing step for 15 minutes in 0.5 x SSC, 0.1 % SDS, and a washing step for 15 minutes at 65°C degrees centigrade in 0.1 x SSC, 0.1 % SDS.
Less stringent hybridization conditions, e.g. 0.15 M salt - 1 M salt and/or temperatures from 22°C degrees centigrade to 56°C degrees centigrade are applied in order to detect functionally equivalent genes in the same species or for orthologous sequences from other species.
Unspecific hybridization products are removed by washing the reaction products repeatedly in 2 x SSC solution and increasing the temperature.
DEGENERATE PCR AND CLONING OF HOMOLOGUES
The nucleotide sequence of the nuclear receptor L66 or its complement can be used to design primers for a polymerase chain reaction. Due to the degeneracy of the genetic code the respective amino acid sequence is used to design oligonucleotides in which varying bases coding for the same amino acid are included. Numerous design rules for degenerate primers have been published (Compton et al, 1990).
As in hybridization there are a number of factors known to vary the stringency of the PCR. The most important parameter is the annealing temperature. To allow annealing of primers with imperFect matches annealing temperatures are often much lower than the standard annealing temperature of 55°C, e.g. 35°C
to 52°C degrees can be chosen. PCR reaction products can be cloned. Either the PCR product is cloned directly, with reagents and protocols from commercial manufacturers (e.g.
from Invitrogen, San Diego, USA). Alternatively, restriction sites can be introduced intro the PCR product via a 5'-tail of the PCR primers and used for cloning.
Primers for the amplification of the entire or partial pieces of the L66 gene or mRNA, or for reverse transcription may be designed making use of the sequences according to the invention, i.e. those depicted in the figures' below.
GENETIC VARIANTS
Fragments from the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO.
1, 4 or the mouse, ,i.e. 17 or 22) or their complements can be used to cover the whole sequence with overlapping sets of PCR primers. Also the genomic sequences may be used (see figures for sequences). These primers are used to produce PCR
products using genomic DNA from a human diversity panel of healthy individuals or genomic DNA from individuals which are phenotypically conspicuous. Also the genomic sequences may be used, i.e. that of the human clone as deposited by the applicant (deposit number DSM 14483 ) or that of the mouse according to SEQ ID
NO. 20 (or the complement thereof). The I?CR products can be screened for polymorphisms, for example by denaturing gradient gel electrophoresis, binding to proteins detecting mismatches or cleaving heteroduplices or by denaturing high-performance liquid chromatography. Products which display mutations need to be sequenced to identify the nature of the mutation. Alternatively, PCR products can be sequenced directly opnitting the mutation screening step to identify genetic polymorphisms. If genetic variants are identified and are associated wifih a discrete phenotype, these genetic variations can be included in diagnostic assays. The normal variation of the human population is of interest in designing screening assays as some variants might interact better or worse with a respective lead substance (a pharmacodynamic application). Polymorphisms or mutations which can be correlated to phenotypic outcome are a tool to extend the knowledge and the commercial applicability of the nucleotide sequence of the nuclear receptor L66 or its complement or their gene product, as variants might have a slightly different molecular behavior or desired properties. Disease-causing mutations or polymorphisms allow the replacement of this disease inducing gene copy with a wild-type copy by means of gene therapy approaches and/or the modulation of the activity of the gene product by drugs.
PREPARATION OF POLYNUCLEOTIDES:
DNA which encodes receptor L66 may be obtained, in view of the instant disclosure, by chemical synthesis, by screening reverse transcripts of mRNA from appropriate cells or cell line cultures, by screening genomic libraries from appropriate cells, or by combinations of fihese procedures, as illustrated below.
Screening of mRNA or genomic DNA may be carried out with oligonucleotide probes generated from the L66 nucleotide sequences information provided herein.
Probes may be labeled with a detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent group in accordance with known procedures and used in conventional hybridization assays, as described in greater detail in the Examples below. Alternatively, the L66 nucleotide sequence may be obtained by use of the polymerase chain reaction (PCR) procedure, with the PCR oligonucleotide primers being produced from the L66 nucleotide sequences provided herein.
Upon purification or synthesis, the nucleic acid according to the invention may be labeled, e.g. for use as a probe.
As single and differential labeling agents and methods, any agents and methods which are known in the art can be used. For example, single and differential labels may consist of fihe group comprising enzymes such as ~i-galactosidase, alkaline phosphatase and peroxidase, enzyme substrates, coenzymes, dyes, chromophores, fluorescent, chemiluminescent and bioluminescent labels such as FITC, CyS, Cy5.5, Cy7, Texas-Red and IRD40(Chen et al. (1993), J. Chromatog. A 652: 355-360 and Kambara et al. (1992), Electrophoresis 13: 542-546), ligands or haptens such as biotin, and radioactive isotopes such as 3H, 355, 32P 1251 and'4C.
EXPRESSION OF THE L66 PROTEIN/POLYPETIDE:
The nuclear receptor L66 nucleic acid or polypeptide may be synthesized in host cells transformed with a recombinant expression construct comprising a nucleic acid encoding the nuclear receptor L66.
Such a recombinant expression construct can also be comprised of a vector that is a replicable DNA construct.
Amplification vectors do not require expression control domains. All that is needed is fihe ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants. See, Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd Edition, Cold Spring Harbor Press, New York, 1989).
An expression vector comprises a polynucleotide operatively linked to a prokaryotic promoter. Alternatively, an expression vector is a polynucleotide operatively linked to an enhancer-promoter that is a eukaryotic promoter, and the expression vector further has a polyadenylation signal that is positioned 3' of the carboxy-terminal amino acid and within a transcriptional unit of the encoded polypeptide. A
promoter is a region of a DNA molecule typically within about 500 nucleotide pairs in front of (upstream of) the point at which transcription begins (i.e., a transcription start site). In general, a vector contains a replicon and control sequences which are derived from species compatible with the host cell. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells.
Another type of discrete trahscription regulatory sequence element is an enhancer.
An enhancer provides specificity of time, location and expression level for a particular encoding region (e.g., gene). A major function of an enhancer is to increase the level of transcription of a coding sequence in a cell.
As used herein, the phrase "enhancer-promoter" means a composite unit that contains both enhancer and promoter elements. An enhancer-promoter is operatively linked to a coding sequence that encodes at least one gene product.
An enhancer-promoter used in a vector construct of the present invention may be any enhancer-promoter that drives expression in a prokaryotic or eukaryotic cell to be transformed/transfected.
A coding sequence of an expression vector is operatively linked to a transcription terminating region. RNA polymerase transcribes an encoding DNA sequence through a site where polyadenylation occurs.
An expression vector comprises a polynucleotide that encodes a nuclear receptor L66 polypeptide. Such a polynucleotide is meant to include a sequence of nucleotide bases encoding a nuclear receptor L66 polypeptide sufficient in length to distinguish said segment from a polynucleotide segment encoding a non- nuclear receptor polypeptide.
A polypeptide of the invention may also encode biologically functional polypeptides or peptides which have variant amino acid sequences, such as with changes selected based on considerations such as the relative hydropathic score of fihe amino acids being exchanged.
These variant sequences are those isolated from natural sources or induced in the sequences disclosed herein using a mutagenic procedure such as site-directed mutagenesis.
Furthermore, an expression vector of the present invention may contain regulatory elements for optimized translation of the polypeptide in prokaryotic or eukaryotic systems. This sequences are operatively located around the transcription start site and are most likely similar to ribosome recognition sites like prokaryotic ribosome binding sites (RBS) or eukaryotic Kozak sequences as known in the art (Kozak M., Initiation of translation in prokaryotes and eukaryotes. Gene 234, 187-208 (1999).
An expression vector of the present invention is useful both as a means for preparing quantities of the nuclear receptor L66 polypeptide-encoding DNA itself, and as a means for preparing the encoded nuclear receptor L66 polypeptide and peptides.
It is contemplated that where nuclear receptor L66 polypeptides of the invention are made by recombinant means, one may employ either prokaryotic or eukaryotic expression vectors as shuttle systems.
Where expression of recombinant nuclear receptor L66 polypeptides is desired and a eukaryotic host is contemplated, it is most desirable to employ a vector such as a plasmid, that incorporates a eukaryotic origin of replication. Additionally, for the purposes of expression in eukaryotic systems, one desires to position the nuclear receptor L66 encoding sequence or if desired parts thereof (SEQ ID NO. 1, 4, 6, 8, 10, 12, 17, and/or 22) adjacent to and under the control of an effective eukaryotic promoter, To bring a coding sequence under control of a promoter, whether it is eukaryotic or prokaryotic, what is generally needed is to position the 5' end of the translation initiation site of the proper translational reading frame of the polypeptide between about 1 and about 2000 nucleotides 3' of or downsfiream with respect to the promoter chosen.
Furthermore, where eukaryotic expression is anticipated, one would typically desire to incorporate into the transcriptional unit which includes the nuclear receptor L66 polypeptide, an appropriate polyadenylation side.
The invention provides homogeneous compositions of mammalian nuclear receptor L66 polypeptide produced by transformed prokaryotic or eukaryotic cells as provided herein. Such homogeneous compositions are intended to be comprised of mammalian nuclear receptor L66 protein that comprises at least 90% of the protein in such homogenous composition. The invention also provides membrane preparation from cells expressing mammalian nuclear receptor L66 polypeptide as the result of transformation with a recombinant expression construct, as described here.
Within the scope of the present invention the terms recombinant protein or coding sequence both also include tagged versions of the protein depicted in SEQ ID
NO. 3, 19 and/or 22, and/or encoded by the nucleic acids according to the invention and fusion proteins of said proteins or parts thereof such as splice variants with any other recombinant protein. Tagged versions here means that small epitopes of 3-20 amino acids are added fio the original protein by extending the coding sequence either at the 5'or the 3'terminus leading to N-terminal or C-terminal extended proteins respectively, or that such small epitopes are included elsewhere in the protein. The same applies for fusion proteins where the added sequences are coding for longer proteins, varying between 2 and 100 kDa. Tags and fusion proteins are usually used to facilitate purification of recombinant proteins by specific antibodies or affinity matrices or to increase solubility of recombinanfi proteins within the expression host.
Fusion proteins are also of major use as essential parts of yeast two hybrid screens for interaction partners of recombinant proteins.
Tags used in the scope of the present invention may include but are not limited to the following: EEF (alpha Tubulin), B-tag (QYPALT), E tag (GAPVPYPDPLEPR) c-myc Tag (EQKLISEEDL), Flag epitope (DYKDDDDK, HA tag (YPYDVPDYA), 6 or 10 x His Tag, HSV (QPELAPEDPED), Pk-Tag (GKPIPNPLLGLDST), protein C
(EDQVDPRLIDGK), T7 (MASMTGGQQMG), VSV-G (YTDIEMNRLGK), Fusion proteines may include Thioredoxin, Glutathiontransferase (GST), Maltose binding Protein (MBP), Cellulose Binding protein (CBD), chitin binding protein, ubiquitin, the Fc part of Immunoglobulins, and the IgG binding domain of Staphylococcus aureus protein A. These examples of course are illustrative and not limiting and the standard amino acid one letter code was used above.
For expression of recombinant proteins in living cells or organisms, vector constructs harboring recombinant L66 nuclear receptor as set forth in SEQ ID NO. 1 or 17 are transformed or transfected into appropriate host cells. Preferably, a recombinant host cell of the present invention is transfected with a polynucleotide of SEQ ID
NO. 1, 4, 22 or 17.
Means of transforming or transfecting cells with exogenous polynucleotide such as DNA molecules are well known in the art and include techniques such as calcium-phosphate- or DEAE-dextran-mediated transfection, protoplast fusion, electroporation, liposome mediated transfection, direct microinjection and virus infection (Sambrook et al., 1989).
The most frequently applied technique for transformation of prokaryotic cells is transformation of bacterial cells after treatment with calciumchloride to increase permeability (Dagert & Ehrlich, 1979), but a variety of other methods is also available for one skilled in the art.
The most widely used method for transfection of eukaryotic cells is transfection mediated by either calcium phosphate or DEAE-dextran. Although the mechanism remains obscure, it is believed that the transfected DNA enters the cytoplasm of the cell by endocytosis and is transported to the nucleus. Depending on the cell type, up to 90% of a population of cultured cells may be transfected at any one time.
Because of its high efficiency, transfection mediated by calcium phosphate or DEAE-dextran is the method of choice for studies requiring transient expression of the foreign nucleic acid in large numbers of cells. Calcium phosphate-mediated transfection is also used to establish cell lines that integrate copies of the foreign DNA, which are usually arranged in head-to-tail tandem arrays into the host cell genome.
In the protoplast fusion method, protoplasts derived from bacteria carrying high numbers of copies of a plasmid of interest are mixed directly with cultured mammalian cells. After fusion of the cell membranes (usually with polyethylene glycol), the contents of the bacterium are delivered into the cytoplasm of the mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast fusion is not as efficient as transfection for many of the cell lines that are commonly used for transient expression assays, but it is useful for cell lines in which endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple copies of the plasmid DNA tandemly integrated into the host chromosome.
The application of brief, high-voltage electric pulses to a variety of mammalian and plant cells leads to the formation of nanometer sized pores in the plasma membrane.
DNA is taken directly into the cell cytoplasm either through these pores or as a consequence of the redistribution of membrane components that accompanies closure of the pores. Electroporation may be extremely efficient and may be used both for transient expression of cloned genes and for establishment of cell lines that carry integrated copies of the gene of interest. Electroporation, in contrast to calcium phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell lines that carry one, or at most a few, integrated copies of the foreign DNA.
Liposome transfection involves encapsulation of DNA and RNA within liposomes, followed by fusion of the liposomes with the cell membrane. The mechanism of how DNA is delivered into the cell is unclear but transfection efficiencies may be as high as 90%.
Direct microinjection of a DNA molecule into nuclei has the advantage of not exposing DNA to cellular compartments such as !ow-pH endosomes. Microinjection is therefore used primarily as a method to establish lines of cells that carry integrated copies of the DNA of interest.
The use of adenovirus as a vector for cell transfection is well known in the art.
Adenovirus vector-mediated cell transfection has been reported for various cells (Stratford-Perricaudet et al., 1992), A transfected cell may be prokaryotic or eukaryotic, transfection may be transient or stable. Where it is of interest to produce a full length human or mouse L66 protein, cultured mammalian mouse, or human cells are of particular interest.
!n another aspect, the recombinant host cells of the present invention are prokaryotic host cells. In addition to prokaryotes, eukaryotic microbes, such as yeast may also be used illustrative examples for suitable cells and organisms for expression of recombinant proteins are belonging to but not limited to the following examples:
Insect cells, such as Drosophila Sf21, SF9 cells or others, Expression strains of Escherichia coli, such as XL1 blue, BRL21, M15, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Hansenlua polymorpha and Pichia pastoris strains, immortalized mammalian cell lines such as AtT-20, VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, and W138, BHK, COSM6, COS-7, 293 and MDCK
cells, BHK-21 cells, Att 20HeLa cells, HeK 294, T47 D cells and others.
Expression of recombinant proteins within the scope of this invenfiion can also be performed in vitro. This may occur by a two step procedure, thereby producing first mRNA by in vitro transcription of an apt polynucleotide construct followed by in vitro translation with convenient cellular extracts. These cellular extracts may be reticulocyte lysates but are not limited to this type. In vitro transcription may be performed by T7 or SP6 DNA polymerase or any other RNA polymerase which can recognize per se or with the help of accessory factors the promoter sequence contained in the recombinant DNA construct of choice. Alternatively one of the recently made available one step coupled transcription/translation systems may be used for in vitro translation of DNA coding for the proteins of this invention, e.g. from Roche Molecular Biochemicals. One illustrative but not limiting example for such a system is the TNT~ T7 Quick System by Promega.
Expression of recombinant proteins in transfected cell may occur constitutively or upon induction. Procedures depend on the Cell/vector combination used and are well known in the art. In all.cases, transfected cells are maintained for a period of time sufficient for expression of the recombinant L66 nuclear receptor protein. A
suitable maintenance time depends strongly on the cell type and organism used and is easily ascertainable by one skilled in the art. Typically, maintenance time is from about 2 hours to about 14 days. For the same reasons and for sake of protein stability and solubility incubation temperatures during maintenance time may vary from 20°C to 42 °C.
Recombinant proteins are recovered or collected either from the transfected cells or the medium in which those cells are cultured. Recovery comprises cell disruption, isolation and purification of the recombinant protein. Isolation and purification techniques for polypeptides are well-known in the art and include such procedures as precipitation, filtration, chromatography, elecfirophoresis and the like.
In a preferred embodiment, purification includes but is not limited to affinity purification of tagged or non-tagged recombinant proteins. This is a well established robust technique easily adapted to any tagged protein by one skilled in the art. For affinity purification of tagged proteins, small molecules such as gluthathione, maltose or chitin, specific proteins such as the IgG binding domain of Staphylococcus aureus protein A, antibodies or specific chelates which bind with high affinity to the tag of the recombinant protein are employed. For afFinity purification of non-tagged proteins 22_ .
specific monoclonal or polyclonal antibodies, which were raised against said protein, can be used. Alternatively immobilized specific interactors of said protein may be employed for affinity purification. Interactors include native or recombinant proteins as well as native or artificial specific low molecular weight ligands.
CHEMICAL SYNTHESIS OF THE POLYPEPTIDE ACCORDING TO THE
INVENTION:
Alternatively, the protein itself may be produced using chemical methods to synthesize any of the amino acid sequences according to the invention or that is encoded by the nucleotide sequences according to the invention (SEQ ID NO. 1, 4, 17 or 22) and/or a portion thereof and/or splice variants thereof. For example, peptide synthesis can be performed using conventional Merrifield solid phase f-Moc or t-Boc chemistry or various solid-phase techniques (Roberge, J. Y, et al. (1995) Science 269: 202-204) and automated synthesis may be achieved, for example, using the ABI
431A Peptide Synthesizer (Perkin Elmer). The newly synthesized peptides) may be substantially purified by preparative high perFormance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, WH Freeman and Co., New York, N.Y.). The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure;
Creighton, supra). Additionally, the amino acid sequences according to the invention, i.e. SEQ ID NO. 3, 24 or 19 or the sequence that is encoded by SEQ ID NO. 1, 4, 17 or 22 or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide.
SCREENING ASSAYS
The invention also concerns a method for screening for agents which are capable of inhibiting the cellular function of the nuclear receptor L66 comprising the steps of contacting one or more candidate agents with a polypeptide according to the invention, removing unbound agents) and' detecting whether the agents) interact with the polypeptide of the nuclear receptor.
_ 23 The invention also concerns method for inhibiting the cellular function of the nuclear receptor L66, comprising the steps of contacting a cell with a binding agent of a polypeptide previously identified as outlined herein whereby the cellular function of L66 is inhibited.
Such a binding agent may be an antibody., RNA. an anti-sense oligonucleotide.
a ribozyme or one of substances shown below or identified in a respective assay as disclosed herein.
In still a further embodiment, the present invention concerns a method for identifying new nuclear receptor inhibitory or stimulatory substances, which may be termed as "candidate substances". It is contemplated that this screening technique proves useful in the general identification of compounds that serve the purpose of inhibiting or stimulating nuclear receptor activity.
In one embodiment of the invention the following subsfiances are disclosed as potential interactors of the nuclear receptor according to the invention:
Steroids: dexamethasone-t-butylacetate, RU486, progesterone, 17-aipha-hydroxyprogesterone, 1,16-alpha dimethylpregnenolone, 17-alpha-hydroxypregnenonlone, pregnenonlone, 5beta-pregnane-3,20-dione, pregnenonlone-16-carbonitrile, 5beta-pregnane-3,20-dione, androstanol, corticosterone, dehydroepiandrosterone, dihydroxytestosterone, estradiol, cortisol, cortisone, dihydroxytestosterone.
Other substances: transnonachlor, chlordane, spironolactone, cyproterone acetate, rifampicin, nefipine, diethylstilbestrol, coumesterol, clotrimazole, lovastatin, phenoarbital, pthalic acid, nonylphenol, 1,4-bis(2-(3,5-dichloropyridyloxy1))benzene, This also includes the use of heteromultimeric complexes of the nuclear receptor with other proteins, such as heterodimeric complexes with RXR, or any other binding partner.
Accordingly, in screening assays to identify pharmaceuticals agents which affect nuclear receptor activity, it is proposed that compounds isolated from natural sources, such as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or even extracts from animal sources, or marine, forest or soil samples, may be assayed for the presence of potentially useful pharmaceutical agents.
It will be understood that that the pharmaceutical agents to be screened could also be derived from chemical compositions or man-made compounds. The candidate substances can could also include monoclonal or polyclonal antibodies, peptides or proteins, such as those derived from recombinant DNA technology or by other means, including chemical peptide synthesis. The active compounds may include fragments or parts or derivatives of naturally-occurring compounds or may be only found as active combinations of known compounds which are otherwise inactive.
We anticipate that such screens will in some cases lead to the isolation of agonists of nuclear receptors, in other cases to the isolation of antagonists. In other instances, substances will be identified that have mixed agonistic and antagonistic effects, or affect nuclear receptors in any other way.
CELL BASED ASSAYS
To identify a candidate substance capable of influencing L66 nuclear receptor activity, one first obtains a recombinant cell line. One designs the cell line in such a way that the activity of the nuclear receptor leads to the expression of a protein which has an easily detectable phenotype ( a reporter), such as luciferase, fluorescent proteins such as green or red fluorescent protein, beta-galactosidase, alpha-galactosidase, beta-lactamase, chloramphenicol-acetyl-transferase, beta-glucuronidase, or any protein which can be detected by a secondary reagent such as an antibody.
Methods for detecting proteins using antibodies, such as ELISA assays, are well known to those skilled in the art.
Here, the amount of reporter protein present reflects the transcriptional activity of the nuclear receptor. This recombinant cell line is then screened for the effect of _25 substances on the expression of the reporters, thus measuring the effect of these substances on the activity of the nuclear receptor. These substances can be derived from natural sources, such as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic cell extracts, or even extracts from animal sources, or marine, forest or soil samples, maybe assayed for the presence of potentially useful pharmaceutical agents. It will be understood that that the pharmaceutical agents to be screened may be derived from chemical compositions or man-made compounds.
The candidate substances can also include monoclonal or polyclonal antibodies, peptides or proteins, such as those derived from recombinant DNA technology or by other means, including chemical peptide synthesis. The active compounds may include fragments or parts or derivatives of naturally-occurring compounds or may be only found as active combinations of known compounds which are otherwise inactive.
In general the assay comprises, contacting a suitable cell containing a reporter under the control of the L66 nuclear receptor with a test compound, monitoring said host cell for the expression of the reporter gene, wherein expression of the reporter reflects the transcriptional activity of the nuclear receptor L66, and therefore reflects effects of the compound on the nuclear receptor.
In other embodiments of the invention assays are included where measuring the activity of a dimer of the nuclear receptor L66 and another protein, such as RXR
takes place. Further included are assays aiming at the identification of compounds which specifically influence only the monomeric, homodimeric or homomultimeric form of the nuclear receptor, or influencing only multimeric forms of the nuclear receptor. Such assays include measuring the effect of a compound on the nuclear receptor in the absence of a binding partner, and measuring the effect of a compound on the nuclear receptor in the presence of a binding partner, such as RXR. One skilled in the art will find numerous more assays which are equally covered by the invention.
A cell line where the activity of a nuclear receptor determines the expression of a reporter can be obtained by creating a fusion gene driving the expression of a fusion protein. consisting of the ligand-binding domain of the L66 nuclear receptor fused to the DNA binding domain of a transcription factor with a known specificity for a given DNA sequence (the binding site). This DNA sequence (the binding site) can then be inserted in one or multiple copies before (5') to the promoter driving the expression of the reporter. Transcription factors useful for this approach include bacterial proteins, such as IexA, yeast proteins, such as Gal4, mammalian proteins such as NFkappaB
or NFAT, the glucocorticoid receptor, the estrogen receptor, or plant proteins. The binding sites for these proteins can all be used in combination with the appropriate transcription factor to generate a useful reporter assay.
Another way to screen for inhibitors is to identify binding sites on DNA for the L66 nuclear receptor, and operatively link this binding site to a promoter operatively linked to a reporter gene. Included among others are binding sites for heterodimers of the L66 nuclear receptor with another protein, such as RXR.
Furthermore, transgenic animals described in the invention can be used to derive cell lines useful for cellular screening assays.
Cell lines useful for such an assay include many different kinds of cells, including prokaryotic, animal, fungal, plant and human cells. Yeasfi cells can be used in this assay, including Saccharomyces cerevisiae and Schizosaccharomyces pombe cells.
Another way to build cellular assays to measure the effect of compounds is the use of the yeast two hybrid system (see for example see, for example, U.S. Pat.
No.
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol.
Chem.
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al.
(1993) Oncogene 8:1693-1696; PCT Publication No. WO 94/10300, and U.S. Pat.
No. 5,667,973), and or possible variants of the basic two hybrid system as discussed e.g in Vidal M, Legrain P, Nucleic Acids Res. 1999 Feb 15;27(4):919-29.
Briefly, the two hybrid assay relies on reconstituting in vivo a functional transcriptional activator protein from two separate fusion proteins. In particular, the method makes use of chimeric genes which express hybrid proteins. To illustrate, a first hybrid gene comprises the coding sequence for a DNA-binding domain of a transcriptional activator fused in frame to the coding sequence for a TI polypeptide. The second hybrid protein encodes a transcriptional activation domain fused in frame to a sample gene from a cDNA library. If the bait and sample hybrid proteins are able to interact, e.g., form a TI-dependent complex, they bring into close proximity the two domains of the transcriptional activator. This proximity is sufficient to cause transcription of a reporter gene which is operably linked to a transcriptional regulatory site responsive to the transcriptional activator, and expression of the reporter gene can be detected and used to score for the interaction of the TI and sample proteins.
In such assays, one primarily measures the effect of a compound on a given interaction involving the L66 nuclear receptor and a binding protein. In a preferred embodiment of the invention systems using other hosts such as prokaryotes as E.
coli, or eukaryotic mammalian cells are described.
Two hybrid systems using hybrid protein fusions with other proteins than transcription factors, including enzymes such as beta-galactosidase or dihydrofolate reductase may also be applied. These assays are useful both to monitor the effect of a compound, including peptides, proteins or nucleic acids on an interaction of a nuclear receptor with a given binding partner, as well as to identify novel proteins or nucleic acids interacting with the nuclear receptor.
Monitoring the influence of compounds on cells may be applied not only in basic drug screening, but also in clinical trials. In such clinical trials, the expression of a panel of genes may be used as a "read out" of a particular drug's therapeutic effect.
CELL-FREE ASSAYS
Recombinant forms of the polypeptide according to SEQ ID NO. 3, 24 or 19 or as encoded by the nucleic acids according to the invention can be used in cell-free screening assays aiming at the isolation of compounds affecting the activity of nuclear receptors. In such an assay, the nuclear receptor polypeptide is brought into contact with a substance to test if the substance has an effect on the activity of the L66 receptor.
_ 28 The detection of an interaction between an agent and a receptor may be accomplished through techniques well-known in the art. These techniques include but are not limited to centrifugation, chromatography, electrophoresis and spectroscopy. The use of isotopically labeled reagents in conjunction with these techniques or alone is also contemplated. Commonly used radioactive isotopes include 3H, 14C, . 2ZNa, 32P~ 33P~ 355 45Ca~ 60C0~ 1251, and 1311. Commonly used stable isotopes include '2H, .13 C, 15N, 1s0.
For example, if an agent binds to the receptor of the present invention, the binding may be detected by using radiolabeled agent or radiolabeled receptor. Briefly, if radiolabeled agent or radiolabeled receptor is utilized, the agent-receptor complex may be detected by liquid scintillation or by exposure to x-ray film or phosho-imaging devices.
One way to screen for substances affecting nuclear receptor activity is to measure the effect of the binding of nuclear receptors to ligands, such as cofactors, activators, repressors, DNA, RNA, proteins, antibodies, peptides or other substances, including chemical compounds known to affect receptor activity. Assays measuring the binding of a protein to a ligand are well known in the art, such as ELISA assays, FRET
assays, bandshift assays, plasmon-resonance based assays, scintilllation proximity assays, fluorescence polarization assays.
In one example, a mixture containing the L66 polypeptide, effector and candidate substance is allowed to incubate. The unbound effector is separable from any effector/receptor complex so formed. One then simply measures the amount of each (e.g., versus a control to which no candidate substance has been added). This measurement may be made at various time points where velocity data is desired.
From this, one determines the ability of the candidate substance to alter or modify the function of the receptor.
Numerous techniques are known for separating the effector from efFector/receptor complex, and all such methods are intended to fall within the scope of the invention.
This includes the use of thin layer chromatographic methods (TLC), HPLC, spectrophotometric, gas chromatographiclmass spectrophotometric or NMR
analyses. Another method of separation is to immobilize one of the binding partners on a solid support, and to wash away any unbound material. It is contemplated that any such technique may be employed so long as it is capable of differentiating between the effector and complex, and may be used to determine enzymatic function such as by identifying or quantifying the substrate and product.
A screening assay provides a L66 receptor under conditions suitable for the binding of an agent to the L66 receptor. These conditions include but are not limited to pH, temperature, tonicity, the presence of relevant cofactors, and relevant modifications to the polypeptide such as glycosylation or lipidation. It is contemplated that the receptor can be expressed 'and utilized in a prokaryotic or eukaryotic cell.
The host cell expressing the L66 receptor can be used whole or the receptor can be isolated from the host cell. The L66 receptor can be membrane bound in the membrane of the host cell or it can be free in the cytosol of the host cell. The host cell can also be fractionated into sub-cellular fractions where the receptor can be found. For example, cells expressing the receptor can be fractionated into the nuclei, the endoplasmic reticulum, vesicles, or the membrane surfaces of the cell.
pH is preferably from about a value of 6.0 to a value of about 8.0, more preferably from about a value of about 6.8 to a value of about 7:8, and most preferably, about 7.4. In a preferred embodiment, temperature is from about 20°C degrees to about 50°C degrees more preferably, from about 30°C degrees to about 40°C degrees and even more preferably about 37°C degrees. Osmolality is preferably from about 5 milliosmols per liter (mosm%L) to about 400 mosm/l, and more preferably, from about 200 milliosmols per liter to about 400 mosm/l and, even more preferably from about 290 mosm/L to about 310 mosm/L. The presence of cofactors can be required for the proper functioning of the L66 receptor. Typical cofactors include sodium, potassium, calcium, magnesium, and chloride. In addition, small, non-peptide molecules, known as prosthetic groups may also be required. Other biological conditions needed for receptor function are well-known in the art.
It is well-known in the art that proteins can be reconstituted in artificial membranes, vesicles or liposomes. (Danboldt et a1.,1990). The present invention contemplates that the receptor can be incorporated into artificial membranes, vesicles or liposomes. The reconstituted receptor can be utilized in screening assays.
It is further contemplated that a receptor of the present invention can be coupled to a solid support, e.g., to agarose beads, polyacrylamide beads, polyacrylic, sepharose beads or other solid matrices capable of being coupled to polypeptides. Well-known coupling agents include cyanogen bromide (CNBr), carbonyldiimidazole, tosyl chloride, diaminopimelimidate, and glutaraldehyde.
In a typical screening assay for identifying candidate substances, one employs the same recombinant expression host as the starting source for obtaining the receptor polypeptide, generally prepared in the form of a crude homogenate. Recombinant cells expressing the receptor are washed and homogenized to prepare a crude polypeptide homogenate in a desirable buffer such as disclosed herein. In a typical assay, an amount of polypeptide from the cell homogenate, is placed into a small volume of an appropriate assay buffer at an appropriate pH. Candidate substances, such as agonists and antagonists, are added to the admixture in convenient concentrations and the interaction between the candidate substance and the receptor polypeptide is monitored.
Where one uses an appropriate known substrate for the L66 receptor, one can, in the foregoing manner, obtain a baseline activity for the recombinantly produced receptor. Then, to test for inhibitors or modifiers of the receptor function, one can incorporate into the admixture a candidate substance whose effect on the L66 receptor is unknown. By comparing reactions which are carried out in the presence or absence of the candidate substance, one can then obtain information regarding the effect of the candidate substance on the normal function of the receptor.
Accordingly, this aspect of the present invention will provide those of skill in the art with methodology that allows for the identification of candidate substances having the ability to modify the action of nuclear receptor polypeptides in one or more manners.
Additionally, screening assays for the testing of candidate substances are designed to allow the determination of structure-activity relationships of agonists or antagonists with the receptors, e.g., comparisons of binding between naturally-occurring hormones or other substances capable of interacting with or otherwise modulating the receptor; or comparison of the activity caused by the binding of such molecules to the receptor.
In certain aspects, the polypeptides of the invention are crystallized in order to carry out x-ray crystallographic studies as a means of evaluating interactions with candidate substances or other molecules with the nuclear receptor polypeptide.
For instance, the purified recombinant polypeptides of the invention, when crystallized in a suitable form, are amenable to detection of intra-molecular interactions by x-ray crystallography. In another aspect, the structure of the polypeptides can be determined using nuclear magnetic resonance.
PHARMACEUTICAL COMPOSITION:
This invention provides a pharmaceutical composition comprising an effective amount of a agonist or antagonist drug identified by the method described herein and a pharmaceutically acceptable carrier. Such drugs and carrier can be administered by various routes, for example oral, subcutaneous, intramuscular, intravenous or intracerebral. The preferred route of administration would be oral at daily doses of about 0.01 -100 mg/kg.
This invention provides a method of treating metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases wherein the abnormality is improved by reducing the activity of L66 receptor or blocking the binding of ligands to a L66 receptor, which method comprises administering an effective amount of the antagonist-containing pharmaceutical composition described above to suppress the subject's appetite. Similarly, the invention also provides methods for treating diseases and conditions resulting from metabolic disorders, immunological indications, hormonal dysfunctions, neurosystemic diseases, which method comprises administering an effective amount of an agonist-containing pharmaceutical composition described above.
_ 32 TRANSFORMATION OF CELLS AND DRUG SCREENING
The recombinant expression constructs of the present invention are useful in molecular biology to transform cells which do not ordinarily express L66 to thereafter express this receptor.
Such cells are useful as intermediates for making cellular preparations useful for receptor binding assays, which are in turn useful for drug screening. Drugs identified from such receptor assays can be used for the treatment of metabolic disorders, immunological indications, hormonal dysfunctions, and/or neurosystemic diseases.
The recombinant expression constructs of the present invention are also useful in gene therapy. Cloned genes of the present invention, or fragments thereof, may also be used in gene therapy carried out by homologous recombination or site-directed mutagenesis. See generally Thomas & Capecchi, Cell 51, 503-512 (1987);
Bertling, Bioscience Reports 7, 107-112 (1987); Smithies et al., Nature 317, 230-234 (1985).
Oligonucleotides of the present invention are useful as diagnostic tools for probing L66 expression in tissues. For example, tissues are probed in situ with oligonucleotide probes carrying detectable groups by conventional autoradiographic techniques, as explained in greater detail in the Examples below, to investigate native expression of this receptor or pathological conditions relating thereto. Further, chromosomes can be probed to investigate the presence or absence of the L66, and potential pathological conditions related thereto, as also illustrated by the Examples below. Probes according to the invention should generally be at least about 15 nucleotides in length to prevent binding to random sequences, but, under the appropriate circumstances may be smaller (see above for details on hybridization).
POLYPEPTIDE
Another aspect of the invention includes an antibody specifically reactive with the protein or any part of the protein according to the invention (SEQ ID NO. 3, 24 or 19) and or a polypeptide encoded by the nucleotide sequence of the nuclear receptor L66 (see also figures). (The term "antibody" refers to intact molecules as well as fragments thereof, such as Fa, F(ab)2, and Fv, which are capable of binding the epitopic determinant.) By using immunogens derived from the polypeptide according to the invention (SEQ ID NO. 3, 24, 19) and/or encoded by the nucleic acids according to the invention, anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (E, Howell & D. Lane. Antibodies: A
Laboratory Manual. Cold Spring Harbor Laboratory (1988)).
A polyclonal antibody is prepared by immunizing a mammal, such as a mouse, a hamster or rabbit with an immunogenic form of the polypeptide, i.e. the human polypeptide of the present invention, and collecting antisera from that immunized animal. Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for production of polyclonal antibodies.
As an immunizing antigen, fusion proteins, intact polypeptides or fragments containing small peptides of interest can be used. They can be derived by expression from a cDNA transfected in a host cell with subsequent recovering of the protein/peptide or peptides can be synthesized chemically (e.g. oligopeptides with 10-15 residues in length). Important tools for monitoring the function of the gene according to the present invention, i.e. encoded by a sequence according to SEQ ID
NO. 1, 4, 24 or 17 (or portions thereof or splice variants thereof) are antibodies against various domains of the protein according to the invention. Various Oligopeptides from the N- and C-terminal sequences and the DBD/hinge region of the protein can be used as antigens.
A given polypeptide or polynucleotide may vary in its immunogenicity. It is often necessary to couple the immunogen (e.g. the polypeptide) with a carrier, Commonly used carriers that are chemically coupled to peptides include bovine serum albumin (BSA) and keyhole limpet hemocyanin (ICLH). The coupled peptide is then used to immunize the animal in the presence of an adjuvant, a non-specific stimulator of the immune response in order to enhance immunogenicity. The production of polyclonal antibodies is monitored by detection of antibody titers in plasma or serum at various time points following immunization. Standard ELISA or other immunoassays can be 34 _ used with the immunogen as antigen to assess the levels of antibodies. When a desired level of immunogenicity is obtained, the immunized animal may be bled and the serum isolated, stored and purified.
To produce monoclonal antibodies, antibody-producing cells (e.g. spleen cells) from an immunized animal (preferably mouse or rat) are fused by standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to yield hybridoma cells. Where the immunized animal is a mouse, a preferred myeloma cell is the murine NS-1 myeloma cell. Such techniques are well known in the art, and include, for example, the hybridoma technique (originally developed by Kohler &
Milstein.
Nature 256: 495-497 (1975)), the human B cell hybridoma technique (Kozbar et al.
Immunology Today 4:72 (1983)), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al. Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, Inc. pp. 77-96 (1985)).
The fused spleen/myeloma cells are cultured in a selective medium to select fused spleen/myeloma cells from the parental cells. Fused cells are separated from the mixture of non-fused parental cells, for example, by the addition of agents that block the de novo synthesis of nucleotides in the tissue culture media. This culturing provides a population of hybridomas from which specific hybridomas are selected.
Typically, selection of hybridomas is performed by culturing the cells by single-clone dilution in microtiter plates, followed by testing the individual clonal supernatants for reactivity with an antigen-polypeptide. The selected clones may then be propagated indefinitely to provide the monoclonal antibody in convenient quantity.
The creation of antibodies which specifically bind the polypeptide according to the invention (SEQ ID NO. 3, 24 or 19) and/or encoded by the nucleotide sequence of the nuclear receptor L66 or its complement (SEQ ID NO. 1, 4, 17 or 22) provides an important utility in immunolocalization studies, and may play an important role in the diagnosis and treatment of receptor disorders. The antibodies may be employed to identify tissues, organs, and cells which express or the nuclear receptor L66.
Antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate nuclear receptor L66 protein levels in tissue or from cells in bodily fluid as part of a clinical testing procedure.
Monoclonal antibodies provided by the present invention are also produced by recombinant genetic methods well known to those of skill in the art, and the present invention encompasses antibodies made by such methods that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide.
The present invention encompasses fragments of the antibody that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide. Such fragments are produced by any number of methods, including but not limited to proteolytic cleavage, chemical synthesis or preparation of such fragments by means of genetic engineering technology. The present invention also encompasses single-chain antibodies that are immunologically reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide made by methods known to those of skilled in the art.
CHIMERIC ANTIBODIES AND OTHER TYPES OF ANTIBODIES:
The invention also includes chimeric antibodies, comprised of light chain and heavy chain peptides immunologically reactive to an epitope that is a mammalian nuclear L66 receptor protein or peptide. The chimeric antibodies embodied in the present invention include those that are derived from naturally occurring antibodies as well as chimeric antibodies made by means of genetic engineering technology well known to those of skill in the art.
Also included are methods for the generation of antibodies against L66 which rely on the use of phage display systems and related systems, such as described in Hoogenboom HR, de Bruine AP, Hufton SE, Hoet RM, Arends JW, Roovers RC, Immunotechnology 1998 Jun;4(1 ):1-20, and references therein.
The present invention also encompasses an epitope of a mammalian nuclear L66 receptor protein or peptide that is comprised of sequences and/or a conformation of sequences present in the mammalian nuclear L66 receptor protein or peptide molecule. This epitope may be naturally occurring, or may be the result of proteolytic cleavage of the mammalian nuclear L66 receptor protein or peptide molecule and isolation of an epitope-containing peptide or may be obtained by synthesis of an epitope-containing peptide using method of genetic engineering technology and synthesized by genetically engineered prokaryotic or eukaryotic cells.
Antisense oligonucleotides are short single stranded DNA or RNA molecules which may be used to block the availability of the L66 receptor messenger. Synthetic derivatives of ribonucleotides or deoxyribonucleotides and/or PNAs (see above) are equally possible.
The sequence of an antisense oligonucleotide is at least partially complementary to the sequence (or the gene) of interest. The complementarity of the sequence is in any case high enough to enable the antisense oligonucleotide to bind to the nucleic acid according to the invention or parts fihereof..Many examples exist in which the binding of oligonucleotides to the target sequence interFere with the biological function of the targeted sequence (Brysch W, Schlingensiepen KH, Design and application of antisense oligonucleotides in cell culture, in vivo, and as therapeutic agents, Cell Mol Neurobiol 1994 Oct;14(5):557-68; Wagner RW, Gene inhibition using antisense oligodeoxynucleotides, Nature 1994 Nov 24;372(6504):333-5 or Brysch W, Magal E, Louis JC, Kunst M, Klinger l, Schlingensiepen R, Schlingensiepen KH Inhibition of p185c-erbB-2 proto-oncogene expression by antisense oligodeoxynucleotides down-regulates p185-associated tyrosine-kinase activity and strongly inhibits mammary tumor-cell proliferation, Cancer Gene Ther 1994 Jun;1(2):99-105 or Monia BP, Johnston JF, Ecker DJ, Zounes MA, Lima WF, Freier SM Selective inhibition of mutant Ha-ras mRNA expression by antisense oligonucleotides, J Biol Chem 1992 Oct 5;267(28):19954-62 or Bertram J, Palfner K, Killian M, Brysch W, Schlingensiepen KH, Hiddemann W, Kneba M, Reversal of multiple drug resistance in vitro by phosphorothioate oligonucleotides and ribozymes, Anticancer Drugs 1995 Feb;6(1):124-34) This interference occurs in most instances at the level of translation, i.e.
through the inhibition of the translational machinery by oligonucleotides fihat bind to mRNA, however, two other mechanisms of interference with a given gene's function by oligonucleotides can also be envisioned, (i) the functional interference with the transcription of a gene through formation of a triple helix at the level of genomic DNA
and the interference of oligonucleotides with the function of RNA molecules that are executing at least part of their biological function in the untranslated form (Kochetkova M, Shannon MF, Triplex-forming oligonucleotides and their use in the analysis of gene transcription. Methods Mol Biol 2000;130:189-201 Rainer B.
Lanz1, Neil J. McKenna1, Sergio A. Onate1, Urs Albrecht2, Jiemin Wong1, Sophia Y.
Tsai1, Ming-Jer Tsai1, and Bert W. O'Malley A Steroid Receptor Coactivator, SRA, Functions as an RNA and Is Present in an SRC-1 Complex Cell, Vol. 97, 17-27, April, 1999).
Antisense oligonucleotides can be conjugated to different other molecules in order to deliver them to the cell or tissue expressing L66. For instance the antisense oligonucleotide can be conjugated to a carrier protein (e.g. ferritin) in order to direct the oligonucleotide towards the desired target tissue, i.e. in case of ferritin predominantly to the liver.
Antisense expression constructs are expression vector systems that allow the expression - either inducible or uninducible - of a complementary sequence to the L66 sequences according to the invention. The potential possibility of such an approach has been demonstrated in many different model systems (von Ruden T, Gilboa E, Inhibition of human T-cell leukemia virus type I replication in primary human T cells that express antisense RNA, J Virol 1989 Feb;63(2):677-82; Nemir M, Bhattacharyya D, Li X, Singh K, Mukherjee AB, Mukherjee BB, Targeted inhibition of osteopontin expression in the mammary gland causes abnormal morphogenesis and lactation deficiency, J Biol Chem 2000 Jan 14;275(2):969-76; Ma L, Gauville C, Berthois Y, Millot G, Johnson GR, Calvo F,Antisense expression for amphiregulin suppresses tumorigenicity of a transformed human breast epithelial cell line, Oncogene 1999 Nov 11;18(47):6513-20; Refolo LM, Eckman C, Prada CM, Yager D, Sambamurti K, Mehta N, Hardy J, Younkin SG, Antisense-induced reduction of presenilin 1 expression selectively increases the production of amyloid beta42 in _ _ _ ~ 38 transfected cells, J Neurochem 1999 Dec;73(6):2383-8; Buckley NJ, Abogadie FC, Brown DA, Dayrell M, Caulfield MP, Delmas P, Haley JE, Use of antisense expression plasmids to attenuate G-protein expression in primary neurons, Methods Enzymol 2000;314:136-48).
According to the invention an antisense expression construct can be constructed with virtually any expression vector capable of fulfilling at least the basic requirements known to those skilled in the art.
In one embodiment of the invention retroviral expression systems or tissue specific gene expression systems are preferred.
Current standard technologies for delivering antisense constructs are perFormed through a conjugation of constructs with liposomes and related, complex-forming compounds, which are transferred via electroporation techniques or via particle-mediated "gene gun" technologies into the cell. Other techniques may be envisioned by one skilled in the art.
Microinjection still plays a major role in most gene transfer techniques for the generation of germ-line mutants expressing foreign DNA (including antisense RNA
constructs) and is preferred embodiment of the present invention.
Ribozymes are either RNA molecules (Gibson SA, Pellenz C, Hutchison RE, Davey FR, Shillitoe EJ, Induction of apoptosis in oral cancer cells by an anti-bcl-2 ribozyme delivered by an adenovirus vector, Clin Cancer Res 2000 Jan;6(1 ):213-22;
Folini M, Colella G, Villa R, Lualdi S, Daidone MG, Zaffaroni N, Inhibition of Telomerase Activity by a Hammerhead Ribozyme Targeting the RNA Component of Telomerase in Human Melanoma Cells, J Invest Dermatol 2000 Feb;114(2):259-267; Halatsch ME, Schmidt U, Botefur IC, Holland JF, Ohnuma T, Marked inhibition of glioblastoma target cell tumorigenicity in vitro by retrovirus-mediated transfer of a hairpin ribozyme against deletion-mutant epidermal growth factor receptor messenger RNA, J
Neurosurg 2000 Feb;92(2):297-305; Ohmichi T, Kool ET, The virtues of self binding:
high sequence specificity for RNA cleavage by self-processed hammerhead ribozymes, Nucleic Acids Res 2000 Feb 1;28(3):776-783) or DNA molecules (Li J, Zheng W, Kwon AH, Lu Y, In vitro selection and characterization of a highly efficient Zn(II)-dependent RNA-cleaving deoxyribozyme; Nucleic Acids Res 2000 Jan 15;28(2):481-488) that have catalytic activity. The catalytic activity located in one part of the RNA (or DNA) molecule can be "targeted" to a specific sequence of interest by fusing the enzymatically active RNA molecule sequence with a short stretch of RNA
(or DNA) sequence that is complementary to the L66 transcript. Such a construcfi will, when introduced into a cell either physically or via gene transfer of a ribozyme expression construct find the L66 sequence (our sequence of interest) and bind via its sequence-specific part to said sequence. The catalytic activity attached to the consfiruct, usually associated with a special nucleic acid structure (people distinguish so called "hammerhead" structures and "hairpin" structures), will then cleave the targeted RNA. The targeted mRNA will be destroyed and cannot be translated efficiently, thus the protein encoded by the mRNA derived from L66 will not be expressed or at least will be expressed at significantly reduced amounts.
In a preferred embodiment the invention covers inducible ribozyme constructs (Koizumi M, Soukup GA, Kerr JN, Breaker RR, Allosteric selection of ribozymes that respond to the second messengers cGMP and cAMP, Nat Struct Biol 1999 Nov;6(11 ):1062-1071 ).
In a further preferred embodiment the invention concerns the use of "bivalent"
ribozymes (multimers of catalytically active nucleic acids) as described in (Hamada M, Kuwabara T, Warashina M, Nakayama A, Taira K, Specificity of novel allosterically trans- and cis-activated connected maxizymes that are designed to suppress BCR-ABL expression FEBS Lett 1999 Nov 12;461(1-2);77-85).
Also provided by the present invention are non-human transgenic animals grown from germ cells transformed with the L66 nucleic acid sequence according to the invention and fihat express the L66 receptor according to the invention and offspring and descendants thereof. Also provided are transgenic non-human mammals comprising a homologous recombination knockout of the native L66 receptor, as well as transgenic non-human mammals grown from germ cells transformed with nucleic acid antisense to the L66 nucleic acid of the invention and offspring and descendants thereof. Further included as part of the present invention are transgenic animals which the native L66 receptor has been replaced with the human homolog. Of course, ofFspring and descendants of all of the foregoing transgenic animals are also encompassed by the invention.
Transgenic animals according to the invention can be made using well known 10 techniques with the nucleic acids disclosed herein. E.g., Leder et al., U.S. Patent Nos.4,736,866 and 5,175,383; Hogan et al., Manipulating the Mouse Embryo, A
Laboratory Manual (Cold Spring Harbor Laboratory (1986)); Capecchi, Science 244, 1288 (1989); Zimmer and Gruss, Nature 338, 150 (1989); Kuhn et al., Science 269, 1427 (1995); Katsuki et al., Science 241, 593 (1988); Hasty et al., Nature 350, 243 (1991); Stacey et al., Mot. Ceil Biol. 14, 1009 (1994); Hanks et al., Science 269, 679 (1995); and Marx, Science 269, 636 (1995). Such transgenic animals are useful for screening for and determining the physiological effects of L66 receptor agonists and antagonist.
20 Consequently, such transgenic animals are useful for developing drugs to regulate physiological activities in which L66 participates.
The following examples are provided for illustrative purposes only and are not intended, nor should they be construed, as limiting the invention in any manner.
The novel nuclear receptor sequences disclosed herein may be used for various in silico, i.e. compufier analyses. Such analyses may be for example nuclear receptor 30 specific sequence alignments which permit the identification of domains and even new receptors. The novel domain sequences disclosed herein may be used in order to create domain specific hidden markov models (hmms) or simply as search sequences.
_ . _ 41 In a preferred embodiment this similarity search tool is the BLAST algorithm.( Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10 and the sequence used is one of those disclosed herein.
Another search tool that may be used is FASTA (W. R. Pearson and D. J. Lipman (1988), "Improved Tools for Biological Sequence Analysis", PNAS 85:2444- 2448, and W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP
and FASTA" Methods in Enzymology 183:63- 98).
The invention is not limited to one particular type of search tool. In one embodiment of the invention search tools are used that do not search by sequence similarity but by applying sequence profiles such as a profile generated when applying the Profile Hidden Markov Model.
Profile Hidden Markov Models also called "Hidden Markov Models", here abbreviated as HMM, are statistical models representing the consensus of the primary structure of a sequence family. The profiles use scores specific of the position of amino acids (or nucleotides) and position specific scores for the opening or the expansion of an insertion or deletion. Methods for the creation of profiles, starting from multiple alignments, have been introduced by Taylor (1986), Gribskov et al. (1987), Barton (1990) and Heinikoff (1996).
HMMs provide an utterly probabilistic description of profiles, i.e. Bayes' theory rules the positioning of all probability (evaluation) parameters (compare Krogh et al. 1994, Eddy 1996 and Eddy 1998). The central idea behind this is that a HMM is a finite model describing the probability distribution of an infinite number of possible sequences. The HMM consists of a number of states corresponding with the columns of a multiple alignment as it is usually depicted. Each state emits symbols (remainders) corresponding with the probability of the symbol emission (specific of the respective state), and the states are linked with each other by probabilities of the changing of states. Starting from one specific state, a succession of states is generated by changing from one state to the other in accordance with the probability '_ ' 42 of the changing of states, until a final state has been reached. Each state then emits symbols according to the probability distribution of emissions specific of this state, creating an observable sequence of symbols.
The attribute "hidden" has been derived from the fact that the underlying sequence of states cannot be observed. Only the sequence of symbols is visible. An assessment of the probabilities of changing of states and of emissions (the training of the model) is achieved by dynamic programming algorithms implemented in the HMMER
package.
The sequences according to the invention may be aligned with other nuclear receptor sequences in order to create a multiple sequence alignment which is used as the basis for the creation of a HMM.
If an existing HMM and a sequence are given, the probability that the HMM
could generate the sequence in question, can be calculated. The HMMER package provides a numerical quantity (the Score) in proportion to this probability, i.e. the information content of the sequence indicated as bits, measured according to the HMM.
See also Barton, G.J. (1990): Protein multiple alignment and flexible pattern matching, Methods Ezymol. 183: 403-427, Eddy, S.R. (1996): Hidden markov models. Curr. Opin. Strct. Biol. 6: 361-365, Eddy, S.R. (1998): Profile hidden markov models.Bioinformatics. 14: 755-763, Gribskov, M. McLachlan, A.D. and Eisenberg D.
(1987): Profile analysis: Detection of distantly related proteins. Proc. Natl.
Acad. Sci.
USA. 84: 4355-5358, Heinikoff, S. (1996): Scores for sequence searches and alignment, Curr. Opin. Strct. Biol. 6: 353-360, Krogh, A., Brown, M., Mian, I.S., Sjolander, K. and Haussler, D. (1994): Hidden markov models in computational biology: Applications to protein modelling. J. Mol. Biol. 235: 1501-1531, Taylor, W.R.
(1986): Identification of protein sequence homology by consensus template alignment. J. Mol. Biol. 188: 233-258.
In general the sequence are selected such that a query using a search sequence returns a result consisting of sequences which are at least to a certain degree similar to the query sequence.
In one embodiment of the invention amino acid sequences of the present invention are used to model the three-dimensional structure of the protein. Initially, this involves the comparison of the protein sequence with the sequence of related proteins where the structure is known, such as the human PPARy ligand-binding domain (Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R, Rosenfeld MG, Willson TM, Glass CK, Milburn MV,Nature 1998 Sep 10;395(6698):137-43). The three-dimensionale structure can then be modelled using computer programs. From the three-dimensional structure, binding sites of potential inhibitors or activators can be predicted. It can further be predicted which kinds of molecule might bind there. The predicted substances can then be screened to test their effect on nuclear receptor activity.
EXAMPLES
EXAMPLE 1: CLONING AND EXPRESSION OF THE GENE ACCORDING TO THE
INVENTION
Construction of suitable vectors containing the desired coding and control sequences employs standard ligation and restriction techniques that are well undersfiood in the art. Isolated plasmids, DNA sequences, were synthesized oligonucleotides were cleaved, tailored, and religated in the form desired.
Site-specific DNA cleavage was performed by treatment with the suifiable restriction enzyme (or enzymes) under conditions that are generally understood in the art, and the particulars of which are specified by the manufacturer of these commercially available restriction enzymes.
See, e.g., New England Biolabs, Product Catalog. In general, about 1 pg of plasmid and/or DNA sequence was cleaved by one unit of enzyme in about 20 p1 of buffer solution. Often excess of restriction enzyme was used to ensure complete digestion of the DNA substrate. Incubation times of about one hour to two hours at about 37°C
are workable, although variations are tolerable.
After each incubation, protein was removed by extraction with phenol/chloroform, and may be followed by ether extraction. The nucleic acid was recovered from aqueous fractions by precipitation with ethanol. If desired, size separation of the cleaved fragments was performed by polyacrylamide gel or agarose gel electrophoresis using standard techniques. A general description of size separations is found in Methods in Enzymology 65, 499-560 (1980).
Transformed host cells are cells which have been transformed or transfected with recombinant expression constructs made using recombinant DNA techniques and comprising mammalian nuclear receptor L66 encoding sequences. Preferred host cells for transient transfection are COS-7 cells. Transformed host cells may ordinarily express nuclear receptor L66, but host cells transformed for purposes of cloning or amplifying nucleic acid hybridization probe DNA need not express the nuclear receptor. When expressed, the mammalian nuclear L66 receptor protein was typically located in the host cell membrane.
Cultures of cells derived from multicellular organisms are desirable hosts for recombinant nuclear receptor L66 protein synthesis. In principal, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture.
However, mammalian cells are preferred. Propagation of such cells in cell culture has become a routine procedure. See Tissue Culture (Academic Press, Kruse &
Patterson, Eds., 1973). Examples of useful host cell fines are bacteria cells, insect cells, yeast cells, human 293 cells, VERO and HeLa cells, LMTK- cells, and W1138, BHK, COS-7, CV, and MDCK cell lines. Human 293 cells are preferred.
EXAMPLE 2: NUCLEAR RECEPTOR L66 TISSUE LOCALIZATION:
A multiple tissue northern blot (Clontech, Palo Alto) was hybridized to a labeled probe. The blot contained about 03 to 3 pg of poly A RNA derived from various tissues. Hybridization was carried out in a hybridization solution such as one __ - , 4g_ containing SSC (see Maniatis et al, ibid) at an optimized temperature between 50°c and 70°C, preferably 65°C. The filter was washed and a film exposed for signal detection (see also: Maniatis et al., Molecular Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)).
EXAMPLE 3: NUCLEAR RECEPTOR L66 cDNA ISOLATION FROM HUMAN AND
OTHER ORGANISMS:
A cloning strategy was used to clone the L66 receptor cDNA from specific cDNA
libraries (Clontech, Palo Alto) or alternatively, RNA was obtained from various tissues and used to prepare cDNA expression libraries by using for example an Invitrogen kit. (Invitrogen Corporation, San Diego). For the isolation of the L66 cDNA
clone the chosen library was screened under stringent condition (see definitions above) by using an L66 specific probe. The cDNA insert of positive clones was subsequently sequenced and cloned in a suitable expression vector.
Additionally, full length receptor L66 clones from human and mouse (Mus musculus) was obtained by using RACE PCR technology. In brief, suitable cDNA libraries were constructed or purchased. Following reverse transcription, the first strand cDNA was used directly in RACE PCR reactions using a RACE cDNA amplification kit according to the manufactures protocol (Clontech, Palo Alto). Amplified fragments were purified, cloned and subsequently used for sequence analysis.
To obtain information about the genomic organization (see also figures) of the receptor gene, genomic libraries were screened (commercially available clone libraries were used) with a receptor specific probe under stringent conditions.
Positive clones were isolated and the complete DNA sequence of the putative receptor region was determined by sequence analysis (Maniatis et al., Molecular Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)) (human genomic clone deposited at "Deutsche sammlung von Mikroorganismen and Zellkulturen GmbH (DSMZ, International Depositary Authority under the Budapest Treaty in Germany) under number DSM 14483).
EXAMPLE 4: NUCLEAR RECEPTOR L66 LIGAND BINDING ASSAY:
Saturation ligand binding analysis and ligand competition studies are carried out by using an over expressed L66 receptor-protein which is incubated with a labeled potential ligand at different concentrations. The L66 receptor may be immobilized on suitable surFaces. Various competitors are added in the presence of a labeled ligand.
Bound and unbound ligand is separated be using e.g. gel filtration or charcoal based methods or simply removed by a washing step if the receptor protein L66 is immobilized. Receptor-bound ligand is detected by scintilation counting.
EXAMPLE 5: NUCLEAR RECEPTOR L66 EXPRESSION ASSAY:
Relative quantification was performed in multiplex PCR reactions using the ABI
PRISM~ 7700 Sequence Detection System. Total RNA was used as template and was either ordered from Clontech or Ambion in the case of human normal tissue or was isolated from cell lines that were ordered from the DSMZ (German Collection of Microorganisms and Cell Cultures).
Relative quantification was achieved by normalising the results for the presence of 18S rRNA in the samples and subsequently by standardising the corresponding expression data to the expression levels detected in testis (testis = 1 ). A
standard curve was generated by diluting total RNA from testis tissue 1:10 fold, starting from 100 ng input amount and perForming 7 dilution steps. For the detection of target mRNA in other samples the input amount was always 100 ng total RNA, samples were measured in triplicates.
For the detection of 18S rRNA the endogenous control pre-developed assay reagent "Ribosomal RNA control (18S rRNA)" from Applied biosystems was used. For the detection of the target sequence, PCR primers and the probe were designed according to Applied biosystem's specifications using its Primer Express° software.
PCR primers were ordered from Interactiva (forward primer (SEQ ID NO. 14): CGT
GGG CTA ATG AAT TTT ACC AAG; reverse primer (SEQ ID NO. 15): GGC CCC
ATG GAG AAA TAT CAC T). The probe (SEQ ID NO. 16) CCA ATG AGG ATC AAA
CTG CAC TAC AGA AGG G was ordered from Applied Biosystems and was labelled with FAM at the 5'end and contained a quencher (TAMRA) at the 3' end.
EXAMPLE 6: FORMATION OF L66-RXR OR OTHER PROTEIN COMPLEXES
In order to explore the functional properties of L66, the DNA binding properties of L66 are analyzed. It has previously been shown that RXR is a common heterodimeric partner of various members of the nuclear receptor superfamily required for high affinity DNA binding (Hallenbeck et al., PNAS 89: 5572-5576 (1989)), Kliewer et al., EMBO J. 11: 1419-1435 (1992)). It has also been shown that DNA and ligand binding activities of the Dros. melanogaster ecdysan receptor (EcR) require heterodimer formation with RXR or USP (the homologue of RXR) (O'Malley in Endocrinology 125:
1119-1120 (1989)).
Consequently, it is of interest whether L66 can interact with RXR, or with other members of the family.
A two hybrid system is used. CV-1 cells are transiently transfected with cytomegalovirus promoter driven expression vectors containing the yeast GAL4 DNA
binding domain (DBD) alone, GAL4 linked to L66 LBD (LBD; i.e. GAL4-L66) and the 78 amino acid Herpes virus VP16 transactivation domain (VP) linked to the amon terminal end of the LBDs for human (or mouse) RXRa (VP-RXR), mouse PPARgamma (VP-PPAR) VDR (VP-VDR) and others.
All cells are cotransfected with a luciferase reporter construct containing about 4 copies of the yeast GAL4 upstream activating sequence and a ~i-galactosidase expression vector as internal control.
CV-1 cells are grown in DMEM supplemented with 10% AG1-X8 resin-charcoal stripped calf bovine serum, 50 U/ml penicillin G and 50 pg/ml streptomycin sulfate (DMEM-CBS) at 37°C in 5% C02. One day prior to transfection, cells are plated to 50-80% confluence using a phenol-red free DMEM with 10% resin charcoal stripped _ _ _ . 48 fetal bovine serum (DMEM-FBS). Cells are transfected (with reporter construct (300 ng/105 cells), cytomegalovirus driven receptor (100 ng/105 cells) and (i-galactosidase expression vectors (500 ng/10~ cells) by lipofection using N-~2-(2,3)-dioleoyloxy)propyl-N,N,N-trimethyl ammonium methyl sulfate} according to. the manufacturer s instructions (DOTAP, Boehringer Mannheim).
After 2 hours the liposomes are removed and cells treated for 40 hours with phenol-red free DMEM-FBS containing farnesol as the ligand. Cells are harvested and assayed for luciferase and (i-galactosidase activity.
All points are preferentially perFormed in triplicate and should ideally vary less than 10%. Experiments are repeated a 2-4 times. Data points are normalized for differences in transfection efficiency using ~i-galactosidase, and plotted as relative activity where the untreated reporter is defined to have an activity of 1 unit.
Neither the GAL4 DBD, nor the GAL4-L66 chimera should be capable of stimulating.
transcription from a reporter construct containing the GAL4 upstream activating sequence. Similarly, a fusion protein containing the Herpes virus VP16 firansactivation domain linked to the RXRa-LBD (VP-RXR) should be inactive when expressed alone or with the GAL4 DBD. However, when GAL4-L66 and VP-RXR are coexpressed, the reporter could be activated, indicating that L66 and RXRa interact efficiently in cells. Using similar VP16-LBD fusion proteins, interaction can be tested between L66 and receptors for peroxisome proliferators/fatty acids (PPAR), vitamin D3 (VDR), thyroid hormone, (T3R), retinoic acid (RAR), or other members of the nuclear receptor superfamily (see also US Pat. 6,005,086).
FIGURE CAPTIONS:
FIG. 1:
Fig. 1 shows the cDNA sequence of the L66 gene according to the invention. It also shows the reverse complement thereof.
___ _ _ __ _ -- 49 FIG. 2:
Fig. 1 shows the protein sequence of the L66 gene according to the invention.
FIG. 3:
Fig. 3 shows the domain composition of FXR-~i (L66) comprising a so called DNA
binding domain "DBD" and a ligand binding domain "LBD" which are present in members of the nuclear receptor superfamily. Also shown are the respective probability values for the presence of these domains which as can be seen in the figure are extremely high.
' FIG. 4:
Fig, 4 shows the cDNA sequence from L66 from Mus musculus (SEQ ID NO. 17) as well as its reverse complement (SEQ ID NO. 18) and the protein sequence of L66 from Mus musculus (SEQ ID NO. 19).
FIG. 5:
Fig. 5 shows the DNA sequence (NC Fragment) of a splice variant of L66 (SEQ ID
NO. 4). It also shows the reverse complement of the sequence (SEQ ID NO. 5).
FIG. 6: ' Fig. 6 shows the DNA sequence (PolyA1 Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs from the NC Fragment in so far as the poly-A-tail follows the exons 7 and 7B
directly (SEQ ID NO. 7).
FIG. 7:
Fig. 7 shows the DNA sequence (PolyA2 Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs __ _ , 50 ___ from the NC Fragment in so far as the poly-A-tail follows the exons 7 and 7B
directly (SEQ ID NO. 9).
FIG. 8:
Fig. 8 shows the DNA sequence (GF Fragment, C-terminal end) of a splice varianfi of L66. It also shows the reverse complement of the sequence. This differs from the NC
Fragment in so far as exon.8 is not present (SEQ ID NO. 11 ).
FIG. 9:
Fig. 9 shows the DNA sequence (SF Fragment, C-terminal end) of a splice variant of L66. It also shows the reverse complement of the sequence. This differs from the NC
Fragment in so far as exon 7 and 8 are not present (SEQ ID NO. 13).
FIG. 10:
Fig. 10 is a schematic representation of the exon distribution within the splice variants found.
FIG. 11:
Fig. 11 shows the result of a expression test on various cell lines and tissues.
FIG. 12:
Fig. 12 shows the genomic DNA sequence for L66 from Mus musculus (SEQ ID NO.
20).
__ " _ _ g1 _ FIG. 13:
Fig. 13 shows the reverse complement of the genomic DNA sequence for L66 from Mus musculus (SEQ ID NO. 21 ).
FIG. 14:
Fig. 14 shows a splice variant cDNA sequence of L66 from Mus musculus (SEQ ID
NO. 22), its reverse complement (SEQ ID NO. 23) and the corresponding protein sequence (SEQ ID NO. 24).
FIG. 15:
Fig. 15 lists all the sequences according to the invention, their respective origin ~as well as in which figures they are depicted.
FIG. 16:
Fig. 16 shows a BLASTP (improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements; Nucleic Acids Res. 2001 Jul 15;29(14):2994-3005.) - alignment of L66 against tremblnew (sequence identifier ~AF384555~AF384555) "NR1 H4"; product: "farnesol receptor"
from Homo sapiens farnesol receptor (NR1 H4) mRNA, complete cds, alternatively spliced. //:gp~AF384555~14326451 gene: "NR1H4"; product: "farnesol receptor";
Homo sapiens farnesol receptor (NR1 H4) mRNA, complete cds, alternatively spliced.
The hit is scoring at : 7e-88 (expectation value) for an alignment length (overlap) 389 having 46 % identities (Scoring matrix : BLOSUM62 (used to infer consensus pattern).
FiG. 17:
__ __52 Fig. 17 A shows the intron - exon composition of the mouse L66 gene (numericals).
Fig. 17 B shows the intron = exon composition of spliceform variant 1 of the mouse L,66 gene (numericals).
FIG. 18:
Fig. 18 A shows the intron - exon composition of the mouse L66 gene (sequences).
Fig. 18 B shows the intron - exon composition of spliceform variant 1 of the mouse L66 gene (sequences).
FIG. 19:
Fig. 19 shows a multiple sequence alignment of the L66 C4 zinc finger domain and shows the conservation of this domain.
FIG. 20:
Fig. 20 19 shows a multiple sequence alignment of the L66 protein and demonstrates the conservation of the L66 Ligand Binding Domain (LBD).
L-0020-01-WO-Ol.ST25 SEQUENCE LISTING
<110> LION Bioscience AG
<120> NOVEL MAMMALIAN NUCLEAR RECEPTOR L66 AND METHODS OF USE
<130> L-0020-01-WO-01 <160> 24 <170> PatentIn version 3.0 <210> 1 <211> 1152 <212> JNA
. . <:213=> . Horric.. sa~~i-ens -_ . _ ; . _ _ - . -<220>
<221> n <222> (1)..(1152) <223> n may be a, c, g or t <400> 1 cctggaataaaaaggtccagaccaacctattcttcctcgagaaataagggacaggaagaa60 ttctgtgtagtttgtggtgataaagcatcaccatcaccatatcattataatgcacttacc120 tgtgaaggttgcaaagaaatacctatggtaaaaaattttaaaacttttttattgggtttt180 tttcaatgtagcatcnnncaaaatgcagtatatagttgcaggaatggtagtcactgtgaa240 atggacatgtacatgcgtagaaaatgtcaagagtgcagactgaaaaagtataaggcagta300 ggaatgttggcagaatgtttgctcacagaaatccaatgtaaattaaagagacttcaaaag360 aactttaaggagaagaatcatttttactctaacatcaaagtggaagaggaaggagtagac420 cacagttttctatcatccaccactagacctggaaaagaaagcatggaactaactgaagag480 gaacatcagctcattaataacattgtggctgctcatcaaaaatataccattcctttagaa540 gaaacaaatttgtatctgcaggaacatacaaatcctgaactgagctttttgcaactctca600 gagacagcagtcctacacatacgtgggctaatgaattttaccaaggggctcccaggattt660 gaaaatttggccaatgaggatcaaactgcactacagaagggatcaaaaactgaagtgata720 Sei to L-0020-01-WO-01.ST25 tttctccatggggcccaactttacaatacaatgataatttccatatgtttgattctaccc780 tatgtttggatgaaaatacattttcgtatcagttttttgggtgttactgaagaatttatt840 acannnctgttttacttctacaaaagaatgagcaaacttgatgtaactaatactgaatat900 gctctgcttgcagcaacaattgttttttcagatcgtccatgccttaaaaataagcaatat960 atggaaaatttannngaaccagttttacaaatattgtataagtattcaaaaatgtatcat1020 ccagaagacccannncattttgcccatctcatatggaagcatactgaactgagaactctg1080 aattataaccattcagaaatacttagcacttggaaaacaaaggaccccaaattggctact1140 t~ta;:U:ctct~~ag . _. . , - . .. . . . . 1152.
<210> 2 <211> 1152 <212> DNA
<213> Homo sapiens <220>
<221> n <222> (1)..(1152) <223> n may be a, c> g or t <400> 2 ctcagagagtaaagtagccaatttggggtcctttgttttccaagtgctaagtatttctga60 atggttataattcagagttctcagttcagtatgcttccatatgagatgggcaaaatgnnn120 tgggtcttctggatgatacatttttgaatacttatacaatatttgtaaaactggttcnnn180 taaattttccatatattgcttatttttaaggcatggacgatctgaaaaaacaattgttgc240 tgcaagcagagcatattcagtattagttacatcaagtttgctcattcttttgtagaagta300 aaacagnnntgtaataaattcttcagtaacacccaaaaaactgatacgaaaatgtatttt360 catccaaacatagggtagaatcaaacatatggaaattatcattgtattgtaaagttgggc420 cccatggagaaatatcacttcagtttttgatcccttctgtagtgcagtttgatcctcatt480 ggccaaattttcaaatcctgggagccccttggtaaaattcattagcccacgtatgtgtag540 Seite 2 L-0020-01-WO-01.ST25 gactgctgtc tctgagagttgcaaaaagctcagttcaggatttgtatgttcctgcagata600 caaatttgtt tcttctaaaggaatggtatatttttgatgagcagccacaatgttattaat660 gagctgatgt tcctcttcagttagttccatgctttcttttccaggtctagtggtggatga720 tagaaaactg tggtctactccttcctcttccactttgatgttagagtaaaaatgattctt780 ctccttaaag ttcttttgaagtctctttaatttacattggatttctgtgagcaaacattc840 tgccaacatt cctactgccttatactttttcagtctgcactcttgacattttctacgcat900 gtacatgtcc atttcacagtgactaccattcctgcaactatatactgcattttgnnngat960 gctacatt~a aaaUaacccaataaaaaGgttttaaaattttttaccataggt~tttcttt.1020.
gcaaccttca caggtaagtgcattataatgatatggtgatggtgatgctttatcaccaca1080 aactacacag aattcttcctgtcccttatttctcgaggaagaataggttggtctggacct1140 ttttattcca gg 1152 <210>3 <211>384 <212>PRT
<213>Homo sapiens <220>.
<221>X
<222>(1)..(384) <223>X may be any amino acid <400>
ProGly IleLysArgSerArg ProThrTyr SerSerArg AsnLys Ser GlyGln GluGluPheCysVal ValCysGly LysAlaSer ProSer Asp ProTyr HisTyrAsnAlaLeu ThrCysGlu CysLysGlu IlePro Gly MetVal LysAsnPheLysThr PheLeuLeu PhePheGln CysSer Gly Sei to 3 L-0020-01-WO-01.ST25 Ile Xaa Gln Asn Ala Val Tyr Ser Cys Arg Asn Gly Ser His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Tyr Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Leu Thr Glu Ile Gln Cys Lys Leu Lys Arg Leu Gln Lys Asn Phe Lys Glu Lys Asn His Phe Tyr Ser Asn Ile Lys Val Glu Glu Glu Gly Val Asp His Ser Phe Leu .. ~ 13g . ...' ..: ' . 1. Z5. , :: .. . .. 140, ' ' . . ..
Ser Ser Thr Thr Arg Pro Gly Lys Glu Ser Met Glu Leu Thr Glu Glu Glu His Gln Leu Ile Asn Asn Ile Val Ala Ala His Gln Lys Ty r Thr Tle Pro Leu Glu Glu Thr Asn Leu Tyr Leu Gln Glu His Thr Asn Pro Glu Leu Ser Phe Leu Gln Leu Ser Glu Thr Ala Val Leu His Ile Arg Gly Leu Met Asn Phe Thr Lys Gly Leu Pro Gly Phe Glu Asn Leu Ala Asn Glu Asp Gln Thr Ala Leu Gln Lys Gly Ser Lys Thr Glu Val Ile Phe Leu His Gly Ala Gln Leu Tyr Asn Thr Met Ile Ile Ser Ile Cys Leu Ile Leu Pro Ty r Val Trp Met Lys Ile His Phe Arg Ile Ser Phe Leu Gly Val Thr Glu Glu Phe Iie Thr Xaa Leu Phe Tyr Phe Tyr Lys Arg Met Ser Lys Leu Asp Val Thr Asn Thr Glu Tyr Ala Leu Leu Ala Seite 4 L-0020-01-WO-01.ST25 Ala Thr Ile Ual Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln Tyr Met Glu Asn Leu Xaa Glu Pro Ual Leu Gln Ile Leu Tyr Lys Tyr Ser Lys Met Tyr His Pro Glu Asp Pro Xaa His Phe Ala His Leu Ile Trp Lys His Thr Glu Leu Arg Thr Leu Asn Tyr Asn His Ser Glu Ile Leu Ser Thr Trp Lys Thr Lys Asp Pro Lys Leu Aia Thr Leu Leu Ser Giu ..<L10> 4 - . . ... . ' , . . , ' : - ' - .- , <211> 2077 <212> DNA
<213> Homo sapiens <400> 4 cctggaataa aaaggtccag accaacctat tcttcctcga gaaataaggg acaggaagaa 60 ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 120 tgtgaaggtt gcaaagcatc aacaaaatgc agtatatagt tgcaggaatg gtagtcactg 180 tgaaatggac atgtacatgc gtagaaaatg tcaagagtgc agactgaaaa agtataaggc 240 agtaggaatg ttggcagaat gtttgcacac agaaatccaa tgtaaattaa agagacttca 300 aaagaacttt aaggagaaga atcattttta ctctaacatc aaagtggaag aggaaggagt 360 agaccacagt tttctatcat ccaccactag acctggaaaa gtgattcagg aaagcatgga 420 actaactgaa gaggaacatc agctcattaa taacattgtg gctgctcatc aaaaatatac 480 cattccttta gaagaaacaa atttctgcag gaacatacaa atcctgaact gagctttttg 540 caactctcag agacagcagt cctacacata cgtgggctaa tgaattttac caaggggctc 600 ccaggatttg aaaatttggc caatgaggat caaactgcac tacagaaggg atcaaaaact 660 gaagtgatat ttctccatgg ggcccaactt tacagtcaga aacaatcagc ctctgaaagt 720 tctgtgagaa tattaaatca ttcagattat acaccaaatt gtcacaatag gagtggtgat 780 Seite 5 L- 0020-01-WO-01.ST25 agaagtcttatttgttctatggaaaaattttacaatgaagaatgtccttctactactcta840 attggtttcaagaagctcatctggaaaatggtgataataatagaacttacctcatacagt900 attgtgactactacataaaataatacatagagatcgtccatgccttaaaaataagcaata960 tatggaaaatttacaagaaccagttttacaaatattgtataagtattcaaaaatgtatca1020 tccagaagacccatagcattttgcccatctcatatggaagcatactgaactgagaac 1077 <210> 5 <211> 1077 <212> DNA
<213> Homo Sapiens <400> 5 gttctcagtt cagtatgctt ccatatgaga tgggcaaaat gctatgggtc ttctggatga 60 tacatttttg aatacttata caatatttgt aaaactggtt cttgtaaatt ttccatatat 120 tgcttatttttaaggcatggacgatctctatgtattattttatgtagtagtcacaatact180 gtatgaggtaagttctattattatcaccattttccagatgagcttcttgaaaccaattag240 agtagtagaaggacattcttcattgtaaaatttttccatagaacaaataagacttctatc300 accactcctattgtgacaatttggtgtataatctgaatgatttaatattctcacagaact360 ttcagaggctgattgtttctgactgtaaagttgggccccatggagaaatatcacttcagt420 ttttgatcccttctgtagtgcagtttgatcctcattggccaaattttcaaatcctgggag480 ccccttggta aaattcatta gcccacgtat gtgtaggact gctgtctctg agagttgcaa 540 aaagctcagt tcaggatttg tatgttcctg cagaaatttg tttcttctaa aggaatggta 600 tatttttgat gagcagccac aatgttatta atgagctgat gttcctcttc agttagttcc 660 atgctttcct gaatcacttt tccaggtcta gtggtggatg atagaaaact gtggtctact 720 ccttcctctt ccactttgat gttagagtaa aaatgattct tctccttaaa gttcttttga 780 agtctcttta atttacattg gatttctgtg tgcaaacatt ctgccaacat tcctactgcc 840 ttatactttt tcagtctgca ctcttgacat tttctacgca tgtacatgtc catttcacag 900 Seite 6 L-0020-01-WO-01.ST25 tgactaccat tcctgcaact atatactgca ttttgttgat gctttgcaac cttcacaggt 960 aagtgcatta taatgatatg gtgatggtga tgctttatca ccacaaacta cacagaattc 1020 ttcctgtccc ttatttctcg aggaagaata ggttggtctg gaccttttta ttccagg 1077 <210>6 <211>331 <2l2>DNA
<213>Homo sapiens <220>
<221> n <222> (1)..(331) .. . <223> ~~n., may, be ,~. ~. u:.:or l~ .. ~ . . . . -<400>
cccaactttacagtcagaaacaatcagcctctgaaagttctgtgagaatattaaatcatt60 cagattatacaccaaattgtcacaataggagtggtgatagaagtcttatttgttctatgg120 aaaaattttacaatgaagaatgtccttctactactctaattggtaatatgactcaatatg180 aaatattatattggatgcaaaaatttgtatataatgtttaactttcttatactgctttga240 gatacaatgataatttccatatgtttgattctaccctatgtttggatgaaaatacatttt300 cgtatcngcccaaaaaaaaaaaaaaaaaaaa 331 <210>7 <211>331 <212>DNA
<213>Homo sapiens <220>
<221> n <222> (1)..(331) <223> "N" may be either A. C. G or T
<400> 7 tttttttttt tttttttttt gggcngatac gaaaatgtat tttcatccaa acatagggta 60 gaatcaaaca tatggaaatt atcattgtat ctcaaagcag tataagaaag ttaaacatta 120 Seite 7 L-0020-01-WO-01.ST25 tatacaaatt tttgcatcca atataatatt tcatattgag tcatattacc aattagagta 180 gtagaaggac attcttcatt gtaaaatttt tccatagaac aaataagact tctatcacca 240 ctcctattgt gacaatttgg tgtataatct gaatgattta atattctcac agaactttca 300 gaggctgatt gtttctgact gtaaagttgg g 331 <210>8 <211>311 <212>DNA
<213>Homo Sapiens <4fl0> ... . ...., .:. ~ . .. .
g ..~
cccaactttacagtcagaaacaatcagcctctgaaagttctgtgagaatattaaatcatt60 cagattatacaccaaa,ttgtcacaataggagtggtgatagaagtcttatttgttctatgg120 aaaaattti;acaatgaagaatgtccttctactactctaattggtaatatgactcaatatg180 aaatattatattggatgcaaaaatttgtatataatgtttaactttcttatactgctttga240 gatacaatgataatttccatatgtttgattctaccctatgtttggatgaggaaaaaaaaa300 aaaaaaaaaa a 311 <210>9 <211>311 <212>DNA
<213>Nomo sapiens <400>
ttttttttttttttttttttcctcatccaaacatagggtagaatcaaacatatggaaatt60 atcattgtatctcaaagcagtataagaaagttaaacattatatacaaatttttgcatcca120 atataatatttcatattgagtcatattaccaattagagtagtagaaggacattcttcatt180 gtaaaatttttccatagaacaaataagacttctatcaccactcctattgtgacaatttgg240 tgtataatctgaatgatttaatattctcacagaactttcagaggctgattgtttctgact300 gtaaagttggg 311 Seite 8 L-0020-01-WO-Ol.ST25 <210>10 <211>425 <212>DNA
<27.3>Homo sapi ens <400> IO
ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 tggggcccaa ctttacagtc agaaacaatc agcctctgaa agttctgtga gaatattaaa 120 tcattcagat tatacaccaa attgtcacaa taggagtggt gatagaagtc ttatttgttc 180 tatggaaaaa ttttacaatg aagaatgtcc ttctactact ctaattgatc gtccatgcct 240 taaaaataag-caatatatgg aaaatttaca agaaccagti ttacaaatat-tgtatuagta 30tJ.
ttcaaaaatg tatcatccag aagacccata gcattttgcc catctcatat ggaagcatac 360 tgaactgaga actctgaatt ataaccattc agaaatactt agcacttgga aaacaaagga 420 cccca 425 <210>11 <211>425 <212>DNA
<213>Homo Sapiens <400> 11 tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 ttgaatactt atacaatatt tgtaaaactg gttcttgtaa attttccata tattgcttat 180 ttttaaggca tggacgatca attagagtag tagaaggaca ttcttcattg taaaattttt 240 ccatagaaca aataagactt ctatcaccac tcctattgtg acaatttggt gtataatctg 300 aatgatttaa tattctcaca gaactttcag aggctgattg tttctgactg taaagttggg 360 ccccatggag aaatatcact tcagtttttg atcccttctg tagtgcagtt tgatcctcat 420 tggcc 425 Seite 9 L-0020-01-WO-01.ST25 <210>12 <211>299 <212>DNA
<213>Homo Sapiens <400> 12 .
ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 tggggcccaa ctttacagtc agaaacaatc agcctctgaa aatcgtccat gccttaaaaa 120 taagcaatat atggaaaatt tacaagaacc agttttacaa atattgtata agtattcaaa 180 aatgtatcat ccagaagacc catagcattt tgcccatctc atatggaagc atactgaact 240 gagaactctg aattataacc attcagaaat acttagcact tggaaaacaa aggacccca 299 <210>13 <211>299 <212>DNA
<213>Nomo Sapiens <400> 13 tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 ttgaatactt atacaatatt tgtaaaactg gttcttgtaa attttccata tattgcttat 180 ttttaaggca tggacgattt tcagaggctg attgtttctg actgtaaagt tgggccccat 240 ggagaaatat cacttcagtt tttgatccct tctgtagtgc agtttgatcc tcattggcc 299 <210> 14 <211> 24 <212> DNA
<213> Artificial <400> 14 cgtgggctaa tgaattttac caag 24 <210> 15 <211> 22 <212> DNA
<213> Artificial Seite 10 L-0020-Ol-WO-Ol.ST25 <400> 15 ggccccatgg agaaatatca ct 22 <210>16 <211>31 <212>DNA
<213>Artificial <400> 16 ccaatgagga tcaaactgca ctacagaagg g 31 <210> _ <~.~~> - .. . . _. . . . , . ~, ~:,n <212>
DNA
<213> musculus Mus <400>
atgttaataaaaccagatattttgccagaacaattccattatcagctgtgtgatacagat60 ttccaagaaccaccctattgtcaatattctaccgctcagtttcctccagcgttacagtcc120 ccatctttacaaagtcatttcaacacacatggcttggatccacagtacagtggaggcagt180 , tggtgtggactcgacgctcgagaatctggtcagtccacttatgtggttgttcacgatgat240 gaagatgaattccctggggcacaaaggtgcagagcaacttgttctttacgctggaagggt300 caagatgacatgctctgcatggtctgcggtgataaggcatcaggatatcactacaatgca360 cttacttgtgaggggtgcaaaggctttttccggcgtagcattaccaagaatgcagtgtat420 tcttgcaagaacggtggtcactgtgaaatggacatgtacatgcgcagaaaatgccaagag480 tgcagactgaagaagtgtaaggcggtggggatgttggcagaatgtttgctcacagagatc540 cagtgtaagtcaaagagacttcgcaagaacttcaagcacgggcctgccctgtaccctgcc600 atccaagtggaagatgaaggagcagacaccaaacacgtgtcatccagcaccagatctggg660 aaaggggttcaggacaacatgactctaactcaagaggaacatcggcttctgaataccata720 gtgactgctcaccaaaaatccatgattcccttgggagaaacaagcaaacttctgcaggag780 ggttccaaccccgaactaagttttctgagactctcagaggtatcagtcctgcacatacaa840 Seite L-0020-01-WO-01.ST25 gggctaatgaagtttaccaagggactcccaggatttgaaaatttaaccactgaggatcag900 gctgcattacagaaggcgtcaaaaactgaagtgatgttccttcatgtagcccagctttat960 ggtgggaaagactcaacctctggaagtactatgagaccagcaaagccctcagctgggaca1020 ctagaggtgcataatcctagcgctgatgaaagtgttcattctccggaaaactttctcaag1080 gaaggctacccttcggctcctctaactgatattactaaagaatttattgcctcactatct1140 tacttctaca gaagaatgag tgaacttcat gtatcggata ctgaatatgc tctgcttacg 1200 gcgacaacag tgcttttctc agatcgtcca tgccttaaaa ataagcagca tatagaaaac 1260 ._ ct~~aagaac c~c~tcctgca- acttttg~tt aagttttcaa .aaat gtacca tcca.g~agac ~-13?_~,,~
ccacagcatt tcgcccacct catagggagg cttactgaac tgagaactct gagtcacagc 1380 cactctgaaa tccttcgcat gtggaaaaca aaggacccca ggttggtgat gttattctct 1440 gagaaatggg atctgcactc attttcctga 1470 <210>18 <211>1470 <212>DNA
<213>Mus musculus <400> 18 tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120 cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300' atgaagttca ctcattcttc tgtagaagta agatagtgag gcaataaatt ctttagtaat 360 atcagttaga ggagccgaag ggtagccttc cttgagaaag ttttccggag aatgaacact 420 ttcatcagcg ctaggattat gcacctctag tgtcccagct gagggctttg ctggtctcat 480 agtacttcca gaggttgagt ctttcccacc ataaagctgg gctacatgaa ggaacatcac 540 Seite 12 L-0020-01-WO-01.ST25 ttcagtttttgacgccttctgtaatgcagcctgatcctcagtggttaaattttcaaatcc600 tgggagtcccttggtaaacttcattagcccttgtatgtgcaggactgatacctctgagag660 tctcagaaaacttagttcggggttggaaccctcctgcagaagtttgcttgtttctcccaa720 gggaatcatggatttttggtgagcagtcactatggtattcagaagccgatgttcctcttg780 agttagagtcatgttgtcctgaacccctttcccagatctggtgctggatgacacgtgttt840 ggtgtctgctccttcatcttccacttggatggcagggtacagggcaggcccgtgcttgaa900 gttcttgcgaagtctctttgacttacactggatctctgtgagcaaacattctgccaacat960 ceccaccgcc.:ttu~acttct~tcagt-ctgca-~tcttggcattttctgcgcatg~Lacatgtc..1-t~20. -.. .
catttcacagtgaccaccgttcttgcaagaatacactgcattcttggtaatgctacgccg1080 gaaaaagcctttgcacccctcacaagtaagtgcattgtagtgatatcctgatgccttatc1140 accgcagaccatgcagagcatgtcatcttgacccttccagcgtaaagaacaagttgctct1200 gcacctttgtgccccagggaattcatcttcatcatcgtgaacaaccacataagtggactg1260 accagattctcgagcgtcgagtccacaccaactgcctccactgtactgtggatccaagcc1320 atgtgtgttgaaatgactttgtaaagatggggactgtaacgctggaggaaactgagcggt1380 agaatattgacaatagggtggttcttggaaatctgtatcacacagctgataatggaattg1440 ttctggcaaaatatctggttttattaacat 1470 <210>19 <211>489 <212>PRT
<213>Mus musculus <400> 19 Met Leu Ile Lys Pro Asp Iie Leu Pro Giu G1n Phe His Tyr Gin Leu Cys Asp Thr Asp Phe Gln Glu Pro Pro Tyr Cys Gln Tyr Ser Thr Ala Seite 13 L-0020-01-WO-01.ST25 Gln Phe Pro Pro Ala Leu Gln Ser Pro Ser Leu Gln Ser His Phe Asn Thr His Gly Leu Asp Pro Gln Tyr Ser Gly Gly Ser Trp Cys Gly Leu Asp Ala Arg Glu Ser Gly Gln Ser Thr Tyr Ual Val Ual His Asp Asp Glu Asp Glu Phe Pro Gly Ala Gln Arg Cys Arg Ala Thr Cys Ser Leu Arg Trp Lys Gly Gln Asp Asp Met Leu Cys Met Ual Cys Gly Asp Lys ._ .,qlU ser C;ly iyr:His-..~wr~;'~~n~Ala Leu Thr Cys Glu Gly. Cys.Lys G1~<:.-., ... . ._.
Phe Phe Arg Arg Ser Ile Thr Lys Asn Ala Ual Tyr Ser Cys Lys Asn Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys.Lys Cys Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Leu Thr Glu Ile Gln Cys Lys Ser Lys Arg Leu Arg Lys Asn Phe Lys His Gly Pro Ala Leu Tyr Pro Ala Ile Gln Val Glu Asp Glu Gly Ala Asp Thr Lys His Ual Ser Ser Ser Thr Arg Ser Gly Lys Gly Ual Gln Asp Asn Met Thr Leu Thr Gln Glu Glu His Arg Leu Leu Asn Thr Ile Ual Thr Ala His Gln Lys Ser Met Ile Pro Leu Gly Glu Thr Ser Lys Leu Leu Gln Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser . 260 265 270 Glu Ual Ser Ual Leu His Ile Gln Gly Leu Met Lys Phe Thr Lys Gly Seite 14 L-0020-01-WO-01.ST25 Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gln Ala Ala Leu Gln Lys Ala Ser Lys Thr Glu Ual Met Phe Leu His Ual Ala Gln Leu Tyr Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro Ser Ala Gly Thr Leu Glu Ual His Asn Pro Ser Ala Asp Glu Ser Ual His Ser Pro Glu Asn Phe Leu Lys Glu Gly Tyr Pro Ser Ala Pro Leu Thr Asp Ile Thr Lys Glu Phe Ile Ala Ser Leu Ser Tyr Phe Tyr Arg Arg Met Ser Glu Leu His Val Ser Asp Thr Glu Tyr Ala Leu Leu Thr Ala Thr Thr Ual Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln His Ile Glu Asn Leu Gln Glu Pro Ual Leu Gln Leu Leu Phe Lys Phe Ser Lys Met Tyr His Pro Glu Asp Pro Gln His Phe Ala His Leu Ile Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser His Ser His Ser Glu Ile Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Ual Met Leu Phe Ser Glu Lys Trp Asp Leu His Ser Phe Ser <210>20 <211>16200 <212>DNA
<213>Mus musculus <400> 20 ctttggggtc atagcattta ttaggtattg ttagttccat ttactgtgtt cctattgtgc 60 Seite 15 L-0020-01-WO-01.ST25 tggacggcaa gtcctgagatggaatagatgtgttgaactaactttctgttgagtataaaa120 cattttgagg tttaaaacaattatgaacttttttgaccttttatgatttgttctttttta180 tgttaataaa accagatattttgccagaacaattccattatcagctgtgtgatacagatt240 tccaagaacc accctattgtcaatattctaccgctcagtttcctccagcgttacagtccc300 catctttaca aagtcatttcaacacacatggcttggatccacagtacagtggaggcagtt360 ggtgtggact cgacgctcgagaatctggtcagtccacttatgtggttgttcacgatgatg420 aagatgaatt ccctggggcacaaaggtgcagagcaacttgttctttacgctggaagggtc480 ~- w aa~a~tgacat~gct~tg~ag ttgcgg atuaggcatcagga-tatcactacaa:~gc . 540 tg: tg ac :
ttacttgtga ggggtgcaaaggtaagggtaaggcgcaaccaaacagacagtgctgcgcac600 gcgcacacgt gtgtatgtgtgtgtgcacgtgcgtgtgcaatcgttataaaacatgagaaa660 caaaaccagg aaaaagtatgtctaaagggatcaatctgtgtgatcaaagataggagaata720 aaatgttaag tttaaggttagtttctttcctttcctttcctttcctttcctttcctttcc780 tttcctttcc ctttccctttccctttccctttcccttcctctttctttctttctttcttt840 ctttctttct ttctttctttctttctttctttctttctttctttctttctctctctctct900 ctctctctct ctctctctctctctctctctctctctctctctccctctctctctctctct960 ctctctctct ctctctctctctctctctctctctctctctctctctctctctctctccct1020 ccctccccccctctctctcttttgaagtagagtatcattgtatagccctggctggcctag1080 acctcactatggagatcaggttggctttgaactcacagatacctgcctgtctctgctttc1140 taggtgctgggattaaaggtgtgggccaccatgcctggaccatttctttctttatttttt1200 tttctcctcccttgtcttccccttcgtcctcatcttctcctcctcatctttcttcttttt1260 ttgagaaaaggtctctctatgcatccctggctgtcctggaactcactgtttagaccaggc1320 tgtccttgaactctcagagatccacctacctttgtatccttactgctggcattaagagca1380 ttcactaccaggtctgtctcttcatttttgatgaagactttcacactttactttaaactc1440 Seite 16 L- 0020-01-WO-01.ST25 gatacatata agcaaacgttttccctatttctagaagaatttgatgctggaacaaaacat1500 gacaatgtaa agatctgtaaaccatgacacctttttatgtggttgataaaatttatacac1560 ttggagaaaa tttgaaataaatgacttctatgattctaagtcaataaattttaaaaagtg1620 tttcgtatag aaagagaacaagctgaaagtcgggaagcaagggccagtgtttgcaatgat1680 atttgtaaag agcggaagcctctctgataaataacagttctgaaacagtctaccactcca1740 gaaatggata aacatcgggatgggaatgcaggccagtggtagagcacttagcacctactt1800 ttggatttga ttctagcaccctctgaaacaaaacaaaacaacccaaacaacagaaaacag1860 tttagtcaat gtaacacaaataaacgtgtttgtttagggtaaaattttacctgtaaatct1920 taaacctcaa cactggcaaagaatagccaa~aggatatgaggtgacagagtcctgtggcga198 0 gtagttaggcaggggtggcctgctcttgacggtgacgggttactgttaaaactgagctga2040 agaacccagcacagttttgaaaacctgagctgagctttttcttccttcgttgaactgaaa2100 tggcgaaactccatctttcaagtgtccactgcaagagagtaagtggtggggctttgggac2160 aactcataggagggagagcaggaaatggcaggtattactgtgtaataaggttcgaaggag2220 gaaaagggatgggaactatctggcaagtgctgggaaagggagagagagcatctgcggatg2280 agggagggagggaccacctttgtgggctgcaactcagactgggagggtttagaaaagaga2340 ggcttttctgaattttttctttattttaggtgaccatgtattctctcaaaggaaactcga2400 gagatgcattgtaggatcttgatattttgtataatgcagttaaaaaaaaaattcttatag2460 gtaatttacctatagtagggtagagagttgtcttaatgtgataatgatactccatttaat2520 ctccagacgttataaagaaagacccagcctcaggatgtcaactgaactatactaactata2580 aagaattatagaattagctaactttccagataaatgtatatttaaaaaaatgtcgggaat2640 aaaaaaaaccctatttaattgaaatactctagttagcttgattactttctataaaggtaa2700 aatgttaataaaaatgagaagtaactaaacatatccatgtcaaactttgaccatctttgt2760 tgttaggctttttccggcgtagcattaccaagaatgcagtgtattcttgcaagaacggtg2820 gtcactgtgaaatggacatgtacatgcgcagaaaatgccaagagtgcagactgaagaagt2880 Seite 17 L-0020-01-WO-01.ST25 gtaaggcggt ggggatgttggcagaatgtaagtgccagtgtgttcgtggcttgcttttct2940 gaggggattt aacttttcccagccttaccttcctgacaaggaggacatgatagtggtggt3000 ggtttttcca gggcaagaatcgcccattgcactagggatgtgctgactgacccctgtgat3060 ttccacaaat gtgggaaactcaggtgcatcatttgtgttagtgaaggactgtattcacac3120 tctcctctat ataataaaaatgaatttaaaaattaataaaaaattaaaaatttccaacct3180 tcttgaggac ataaagcattcttcaagagttattttgaacatggccgtattcgtttgtat3240 atgcacacca cacacatatgtacagtaattgcgccacgtggatggcccattatcaattta3300 . agtag tgattccca taaatgt~tgccctgtt~cta to - ctaat~cttgaaa~tragtca3~a0. .
ttgLt t agtttcatct taattagatg ttctctgaga cctgaagagt agttaggcag ttaagagcat 3420 taggagagga cctgggcccg gtccccagct tccacgtggc ccctcacaat catttataac 3480 tcagttcctg agcatcccag gccctcttct gacctctgca ggtatgagtg ttgtgtgcat 3540 acacacatgc aagcaaaaca cacacataaa ataaaagaaa cctccatagt gatttaaaaa 3600 aggtattgta tgtttctgtt acttaaccta aaaatatata ctgccttgaa aagtgcaaag 3660 ttaaagttag tagttcccaa aaagaaagaa agaggaggag aaagaagaga aaaggaagtc 3720 ttccctttct gagaaggggt tctcagagtc ctaacacact acagtcttag gacaattgaa 3780 gggtaactgg atctctggac ccctgacatc ctcatatgta aagtccacta atagtctcac 3840 gagtgtcagg actcatctct gaatatcaac acattagcat atgagcagtg ggagatacaa 3900 ctcagtccaa agaaaccttt gaaaaagttg tcttttattt tattcattaa tttaatttgt 3960 attcaggatc tcactatgta gctgacctgg aacttgctat gtacgccagg ctggcatcaa 4020 actcagagat ccccctgcct ctgccctcca atttctggct caaaagttgt cttttaacct 4080 gtggcaggca atgcatgtga aagggaagtt ggagtcctct gaggaactct atgcataaaa 4140 tgagtcatcc tacacttaaa gaaacttaaa gaaattctgg caaggtgaag ctcgagagag 4200 ttggtgagca gtcacggtga ttggcgagca taagctggtc cgcaccctgc caccattgct 4260 Seite 18 L-0020-01-WO-01.ST25 tggatctgca cacttcatttacctttcagttgtcctaatactgccatgtgttagtgctct4320 gacggcccct cctcctcttccttccctcattctcctttccctttcctcctttcttttcct4380 cttcctcctc ttccccctcttcttcctccttttcctcctcttcgttctcttcttgtggtt4440 ttccttctca tggtcctgggggttgagttcaggtttctcacatcctagacaatctaccac4500 taagctacac ccatagactcctttcctactaaaaaacactgagttgccatggctggtatt4560 caacgtgaga tcatcttgtaacagtgtcttaagtagctggggttataggtgtttactacc4620 atgttcaata ctcaatatgcttaaaatggctttatttttttgtgtattctgttctctgac4680 aaataataaa ataatatagaggggttttaaggtttttgtttatacttcagctcctcatta4740..
gttttttatt ttataggtgtgtttttaaaaatactttgaaaacgacaggtatattaatag4800 ttggatactt ttagtagctg ctaaaagtac ttagagtagt ccagtccagt ccagtacata 4860 cactacagct cacaaccagc tatagtctca cttcaagggg ctatgacccc tctgtgctct 4920 gtgggcacct gcactcacct gtgcatttcc ccatacagac tcatgcaagt aattataaaa 4980 tatttaaaaa caaacaaaaa caatcctaaa ataaatatta tttgcctttg aaaaagaatt 5040 aaatgctttg tctgacatag gaaaaattga cttagcttat ttttaaaaat cctaagatag 5100 tcttttttaa ccttattaat aaaaccagtg ataatagtca tagataatgt aattctacta 5160 ttctatcaaa tgactttttc acttccaaga tgcaactaat gtgtaaatct cttatgtaaa 5220 aatctaaaca taatttggaa atgaaaaatt aattcttaat ctttgttata actattttta 5280 agcagacaag gtgtctatga agggaaaagc agaataggaa atttaataga tgcagacatg 5340 cacgagagat atctaggact atttactcac tgaagaccat tttaatcaaa ccctacagtg 5400 ttgttttccg agccaaaagt acttaatgtg aattctgtaa tgtcacaggt ttgctcacag 5460 agatccagtg taagtcaaag agacttcgca agaacttcaa gcacgggcct gccctgtacc 5520 ctgccatcca agtggaagat gaaggagcag acaccaaaca cgtgtcatcc agcaccagat 5580 ctgggaaagg ggcaagacgc ttcttccata cagtctcttg tgtctttatg aacagtgtgt 5640 aacaagcaca cattacaggc gccctctgtt tccttatagg ttcaggacaa catgactcta 5700 Seite 19 L-0020-02-WO-01.ST25 actcaagagg aacatcggct tctgaatacc atagtgactg ctcaccaaaa atccatgatt 5760 cccttgggag aaacaagcaa acttgtatgt atccattgct ggcaagaaat aatttgtaat 5820 gaatgatatt aggaaatgat agcaagaaca gtaaaaaaga aacaccttag taattatcag 5880 aaagacatac tatgaatcat ttttatttat ttattttttt tatggaccag ctgcaggagg 5940 gttccaaccc cgaactaagt tttctgagac tctcagaggt atcagtcctg cacatacaag 6000 ggctaatgaa gtttaccaag ggactcccag gtaaaattct gtttcattat tttttaagga 6060 tttatttatt tttatgttta tgtgttatgc atgttttgat tgcaattata tattcatatc 6220 atgtacacaa catattgtgg ~aggccagaag aga~tgttag ~tcccctaga actagagt,.ti .. ..6180 cctaccgatg gctgtgaatt accatgtggg cgttggaacc agagtctctg caagagcaag 6240 acaggctctt aactgctgat ctgtctttct agcccccaat ttttaaaaaa gattatttat 6300 ttatttattt atttatttat ttatttattt atttatctga gtacattgta gctgtacaga 6360 tggttgtgag ccttcatgtg gttgttgaga attgaatttt taggacctct gctcactctg 6420 gtcggccccg ctcgttctgg cccaaagatt tcataagtac actgtagctg tcttcaggtg 6480 caccagaaga gggcgttaga tctcgttacg gatggttgcg agccaccgtg tggttgctgg 6540 gacttgaact cagggctttt taggaagagt agtcagtgct cttacccact gagctatctc 6600 tccagcccca aattttcttt tttttaaagg gatagtatat caaacagaaa tttctagttg 6660 tggtactaga tctatcgctc agaggcagag cacctgtcta atgtgtgtga agccctgggt 6720 tttctagcaa tgttgcttct aactaaaaaa aaaaaaaatg tttatttgtc tttgtcaatt 6780 aatcagatga gtgacaatct aaatatttac aaaaattatt ataatgatgt gggctttctt 6840 agatgaaaca agaaacaaag aaggtttcct ctataatact aaacaagtca cagtttgacc 6900 aaaacttata tataactcag ctttattata tgcttagagg tacagaaact ctttacaaaa 6960 ttacattctt caaggctaat gcagctatgt tgatttgtcc ttgctgtaag aaagtgtaac 7020 ctagtgctga taaaagtgcc ccaaaataga aaaaaagaat tttattcaga tttcaattta 7080 Seite 20 L-0020-01-WO-01.ST25 tatactattg ttattcccca gaagcactta gggttaataa gtaaatattt ttaaacattt 7140 aatttatgaa cgttctcttt tgattatctg tttctttatg ctttatttat ttatttattt 7200 atttattttt aaggatttga aaatttaacc actgaggatc aggctgcatt acagaaggcg 7260 tcaaaaactg aagtgatgtt ccttcatgta gcccagcttt atggtgggaa agactcaacc 7320 tctggaagta aaaagaaaaa cagaatttta ttcaacttca agtacacaca gacagacaga 7380 cagacagaca gacacacaca cacacacaca cacacacaca cacacacacg cacgcacgca 7440 cgcacacact tttaaacact gttttttgga aatagactct catgtagtcc aggctatgta 7500 gaactcagta tgtagctaag gaggatctta aagccctgct gtgtctgctc ttacctgcca 7560 accatcagcc tagagcctct atgcctgcct tctatggtgc tggggacaga agccagggtc 7620 tgtgctcagt aagccagaac tcctaactcc taacatcatc agcctttcat ttatcttttc 7680 attttatttt accagttagt gtaggaacaa atgtctcata gtagctgttt ataaatagtt 7740 tcacattcat ctagcatttt ctctctaccg tgtaattctt ttaaaaattg tatttcttat 7800 tcttgtgaga actgtataca tgtcctaagt gttttttttt ttaaatctta ctcacctcac 7860 ttccttccca ccagctcctt tagaatcccc cacacacctc tctcctaact tcatgtcctt 7920 ccctgctcct tctttttcaa aataacccat tgactctaaa tagtgctgct catatacacg 7980 ggggtggggg tagggtcatc cactgaagca tggccaactt agagggctca gttccctaaa 8040 gagactggat tgtccttccc ctagcagctg ccaacagctt ctcagctgtg ggtggggctt 8100 ctgtgggtgg ggcttctgtg ctccttagac aggctcttcc aatgaggagg acttttatgg 8160 gtgatcaaac ataaaggcgt tggtgattct aagacacaaa gcagctctgg ggacggccgc 8220 atgcttgctt tgaaactcat aaccacgggc tactgtttgt ggaacgtctc tttactgggg 8280 tgaagcccta ttaggttagt gttgaaccaa ctatgcagga cctcacttca ataaaaaaaa 8340 agcaagtatt ttttaaaaaa tcaataaacc atttgaatag agcaggaaag ccatcattgt 8400 ttagtgtgtg aatgcaagag tccaatctgt aaagtcgcat gttggttgag gacatgggaa 8460 gaggcaatag gacaaaatca aaaccaaaag ccaacaatgc tagctgcctt cactagctgt 8520 Sei to 21 L-0020-01-WO-01.ST25 actcgttacagtgcacagccttttaaaatatccttatttattgattgttctctgcaaaca 8580 .
ctagaccagaaggctctaaagggaggcccttgatgacctggtccatgattgcttgagtag 8640 tgtttgcttaaaagtctccacagcctctggtgcactgagtggttcgtggagggtgtgtgt 8700 gtttaagaaagccattcttagtttctatttcattatgcaaaactcattttggagggtctt 8760 ctgtacaatcttttttttttttttacaaaatatgaactaatatctgttttgtgttccagt 8820 aacaaaaatgagaatgatatcattttaggagtttttttttcccccacataatagttatgc 8880 tggatactgaaaagatactgaatttactatttatataatgattaagaaagactatcaaca 8940 tt tgtgtttctcttttgctai~gt~ctatgagaccagcaaagcc~tcag- ctggguc~ct?000: _ ..
tt ~ ..
agaggtgcat aatcctagcg ctgatgaaag tgttcattct ccggaaaact ttctcaagga 9060 aggctaccct tcggctcctc taactggtaa cactgtgcag taggaagcat caccttgggt 9120 ggacacgctt ccttttctcc ccttttcatt gcactacaaa gagaaggcca tgggctttct 9180 cacagctcac aggcgtctct gtggtggtga aagcatttat agacacagcc aatacagttg 9240 ttagtgttgg caggtcatca gtgagaattc tatttgtaac cattttctta ctttggcttc 9300 atcatggcaa ttctttccct atttaattgc gaagccattt ggggcctctg aattattctt 9360 aggatgatga tgtaatctac agccaacatc ccagtatcaa ccacttttat gggtagaagg 9420 gtaagcctta ctgtagataa tagagataca cggagagtgg cttctgtagg ctaaaggctc 9480 tcagttccac agcctcctcc ttggtccacg tgatgacaac ctctctttgt tctggtcctt 9540 ttcagtagca cacctaatgc gcccactctt gaggttcagg cctgggcttc aagggcacaa 9600 caaaggggat tgtctgtgcc ggccatcaca tcggtagcca tcttgaaccc cactctcttt 9660 tcaattttct tacactgttt gagagaagaa aagacaatca caatgtgttt gcttctgtgc 9720 tccattccga tgaaaattca cttatattaa tcttttcaga tattactaaa gaatttattg 9780 cctcactatc ttacttctac agaagaatga gtgaacttca tgtatcggat actgaatatg 9840 ctctgcttac ggcgacaaca gtgcttttct caggtacaga ctgagaagta aggctcggag 9900 Seite 22 L-0020-01-WO-01.ST25 tgatctctgg acaggcaatc actaagttct ttttatcgaa ctatgaactg aagttgaatg 9960 taccactctc atcccatcat gctttgctct ctatgtgcat tcctttaagc ttaccactgg 10020 gttacatctt tcttgcatct taaagaaatg ctttcttgcc ctcatagccc ctcagatata 10080 cgctcccagt ccctgataga gagaggttct cagggagtta gctatattct agtttccacc 10140 ttactcatgc tcactcctga gttcaccaca gtcacatttc actccaccct ctccagtcca 10200 ctacaacctg tcaggatctt caggacatgc tatcactgac tcaactggtc aatgctctgt 10260 cttcatctta cttgccccac tggtgtattt tgtagagagt atcataagca tctgaaagaa 10320 aagaagggga tatgttgttt cctgtgagcc cttcagtgga attttgaagc acggtgaaga 10380 tgtctgatgt gttaggagaa tttctcatgc ttctttatgt cagttcacag tcttcttgac 10440 tatgcgttac aaaccggacc caggttgaag tgagacagtt acttagcatg ggtaacccat 10500 tgctcaagcc agcgacgtct tcctagggga ccattgtcct gtgtccttca ccgattcggc 10560 catgttt,gtg ttcttttata ttctaacttg ctgtgatatc agggttcgtg aatgttcact 10620 aggtatgtgc atctgaagtt gaatcacata ggctccagag cagctccatt atattgtcca 10680 cttttattat gctttattcc aagaagtttt ggaagcaact gtgtgacaaa caggataaag 10740 gatgtagaag gctggaggga cagcagtgtt ggctatcatc tgaatctgaa tatttggaaa 10800 atcagctcag actgttacct agggacaggt cactctggtt ggaccataaa actcttcttt 10860 ctttcagttc agtgtaggga attaagctac aaattctgga tggtgcagta ccttagagcc 10920 ctggtgttct ggctgtgtga ggctttcccc atctctcagg agtgggactg ataacgcctg 10980 tggccatcat cctttatcct tcttggtggg tcttttaaaa caaggactgt gtcatgctct 11040 ctacatttaa gaatttcaat ggggccacag gcgcccctag attataaagt tctcataaag 11100 atatgttcat cacatccctt cagctgatct ttgggtttgc agggctaata tttgattccc 11160 aacaatttac cttttcataa tcttgccatt tctacttact gtttagatgg aaaagagtaa 11220 gttctccttg tagactgcag tccacctaac aacttgcctc cagccttgtt aacagtacca 11280 caagcagaac taaaatcatg caaagaattg cagaagcatg gcagtctcag gaagtttcca 1134D
Seite 23 L-0020-01-WO-01.ST25 ctccaaaaaa accccaaaac aaaaacaaaa acaaaaacaa aaacaaaaaa ccccacccac 11400 tcactaactc acataggtgg tttgtttcct tttgttggac atttacaatt ctatctatct 11460 atctatctat ctatctatct atctatctat ctatctatct atctatctat ctatctatcc 11520 tactatctat ctatctatct atctatctat ctatctatct atctatctat ctatctatct 11580 atctacctac ctacctatct atctatctat ctatctatct atctatctat cctactatct 11640 atctatctat ctatctatct atctatctat ctatctatct atccatccac tgtgtgtgtg 11700 tacatgtcat gcttgtggag gtcagaggac aacctatggg aattgtttct ctctttccac 11760 c~tgtgtgtt ctggggttc-a g~tagtcagg Ctgggtgata-gcatcttcec tgctgggctaw.11820. . -, tcccactggc agctttgttg tttcacggaa tctctggaac atcaccctct taatttctct 11880 gttcattggc tgttctttct gtgccttctt tattctgtca tcttcgtctc tcacaatgac 11940 aaatatttaa gttgtcctag gacccagttt tcttttaaat tattttatga ttttatgcac 12000 atgtttcgcc tgcatctatg caccatggtg tgcctcacgc tgctgagttc agaagatatc 12060 aggtcccttg ggacttgact tacagaggtt gtgacaaggc ccagttttct tctgcactgg 12120 cttccttcct gattccatcc taactcttgg cattaaaggc ctgtgtggat tcgatactcc 12180 cagacgtatc tgtgtgctcc cccagaatgt attaaatagc ctcctcaagt tttccatgta 12240 accgtcaaga aggtgtatca aatttaaaat gcccaaagct ggactcttga cacctttata 12300 ccttagcttt tggtattcct cccccatctg gaccagacca tagcttcgtc ttccaaattg 12360 catgtgcaac tttctttttt aatccttagt ttcttttctt tctcttatat tcatatcaca 12420 atgcacccgt tctgtttatt caaccctaag atatacccag cacttaaccg cttcctaccg 12480 tgtatagctt gggcagcgta ggtgaatcca cttttctaac tggattttct gactctatgc 12540 tttatagact ctctctgtat tagtacacag aggggttccc tggaaatata agtcagatga 12600 tggtatgcct ctggaactgt cgatggcttc ctagtttgca tagattaaaa ccaaaggtcc 12660 gggggggggg ggctggagag atggctcagt ggttaagagc agtgactgct cttccaaagg 12720 Seite 24 L-0020-01-WO-01.ST25 tcctgagttc aattcccagc aaccatatgg tggctcacaa ccatctgtaa tgagatctga 12780 ctccatcttc tggagtgtct gaagacagct acagtgtact cacatataat aaaaataaat 12840 aaatcttaaa aaaaaaaacc aaaggtcctt ctattggtct agtttgtctt gctccttttt 12900 catcctcgcc catccacatt gctttgacac aggtttccct tgatctcgca ggtttctgct 12960 acatgctctg ccatttttga tgcatctttt gccccctttc cttggatgcc cacatgatgc 13020 catcttacac tgttttgcca ttttatctac ttgttttttt cttttacctt tttgggtttg 13080 ggttaaatgt tgttgtatcc ctgattaatc tttacaaata gctaccaccc ccccccaaga 13140 gtcttattta tttctgatta ctttttttaa tggcacatga ttattttgtt tttgcttctt 13200 taaaaaaacc tttttcattc attgataaaa agcacttact ttccttgatg tgtatgtctc 13260 ctttgtctaa atgtgcaccc cttgtgtgct gtgagtccac aagaaggtgt catatcccct 13320 ggaactggag ttacagatgg ccgtgagcca ccatgtgggt gtggaaatca aaccccagta 13380 cttgggaaga gcagccagtg ctcttagctg ttgataggtc tttccagact gtttttactt 13440 ctgaagatgt gtgtaagtct ctccctctcc acttcttcca cattaagaat ataaggacaa 13500 taaagggaca caatcttgcc tgtgagtttc attgtggtat agttagcaag acgactggat 13560 gaatgaaggc tgcctaagcc ttcggaactt cctggttagc tcatttctta cgtaggaacc 13620 caagaagtct cacgctttac ctctgttgac tgtagagagg ttacttacta ctattttaaa 13680 gtttcaggca agcgtttatg gctgttatat ccccaacctt actcaccgat ccctttcttc 13740 ccatgtgact ttatggaaat tacttaactc cctacatttc agggagctaa tctgaaaaaa 13800 atgtgataaa aatagaatgt agctcaatag aatatagtga agattaagta aagtcatgaa 13860 caaaaagaat ttagttagaa aacctaatgt gtattagggt tctgcaagta cttatgattt 13920 tataactctg ttaataactc attgttttaa gtgcattcat tattcccacc actttgaatc 13980 attattaaga taaatatgat gtttttcaga tcgtccatgc cttaaaaata agcagcatat 14040 agaaaaccta caagaaccag tcctgcaact tttgtttaag ttttcaaaaa tgtaccatcc 14100 agaagaccca cagcatttcg cccacctcat agggaggctt actgaactga gaactctgag 14160 Seite 25 L-0020-01-WO-01.ST25 tcacagccac tctgaaatcc ttcgcatgtg gaaaacaaag gaccccaggt tggtgatgtt 14220 attctctgag aaatgggatc tgcactcatt ttcctgaaaa tttacagtgg tgagacttgc 14280 cttaaaattt caaaatcaaa attggatgaa tgatttgaaa aaaatgtgat aactcaaaat 14340 cattgagtta tcacacacca ccttaaaatg gagatccaaa aggctcatca gggtttgtgg 14400 aactttcaac tcttagaaga tgaacactcc aggattcatc tcaagattgg atgcaaagga 14460 aagaaaaatg gagccatgtg gtggcatagg ctctgtagtt gagagacaga gacaagagga 14520 ccccatgttt aaggcagtcc tgggctatgt agtgagacct catctcagaa gaaaaaagag 14580 _ .. ua~aactgat ggcatatt atgtctceaa tcetcaccta a:~gaaactaa t~ctTTtgtt,g ~~5~:~-.
ttgttgttta gctgaaacac ataatgtgta aatgtgaaca caggtaggtt atggttagga 14700 taaggcagtt tgtatgcttc actggttaag gacacatgga tggtgtagca gctaatgctt 14760 agaacctctt cttcatgaat gattaacgag gaacaagcct tttcaaattg tatctgcttt 14820 atatttttaa taagtatgtg ataattgtac agaatagtca atattgtaca gaatatttca 14880 atatgggaat ttcatgtgta tatattagat atttctggcg tatttgccac ccaccctgtg 14940 atgcccacac tgccaatgcc cttctcatcc ccctaacatt tcccttcctc tgccttgccc 15000 taaaacacca gattcatacg tgcaagaaaa cattttttaa acaaatttta ttagatattt 15060 tcttcattta caattcaaat gctatcccaa aagtccctta ttccctcccc ttatcctgct 15120 ctccaaccca cccacttctg cttcctggcc ctggtattcc cctgtattgg ggtatataat 15180 ctttgcaaga ccaagggcct ctcctcccaa cgatggtcca ctaggccatc tgcaagaaaa 15240 catttgattc cagtctgagt ctgttttatt tcctttgagt atgactatct cgtcaatatc 15300 tgcaaatgac ataatttcat tcctcttaag gatggaaagg actggattgt gtatgtatat 15360 ctgtatatat tattttattt atccacctgc tgatgacacc tagtttgatt ctgtatcttg 15420 gccactgtga atcgtgctat aataaaccta tgtaggtttt tagattttat gtggctcttg 15480 attctttcag gtgtactccc agaagtggta tggtttatat ggtagctcta tttttagatt 15540 Seite 26 L-0020-01-WO-01.ST25 tgaagggaac tatgtagtga ttttcacagt gattaccttc ttctcaatag tgtataaaga 15600 ttaatttgcc ttcaaaattc ttgccaattt ttgtttttcc ctttcctttt ctttttaatt 15660 aaaaaagtgt gtgtgtgtgt gcgtgcatgc caaagaacat tgtggacatc ggaggacagc 15720 tctgtggaat acattctctc cttccacctt tatgggtggt ctcctctgac cccctctgcc 15780 atggcctgca tggcttggtt ctggaagcct gattggggtg atgagcgaat gaagaaatga 15840 cagacatgaa gacacaagta cagaaaagct gagattgggt ggtgtctaag tttctttttt 15900 attgctataa agagacacca ggaccttatg aaagaaagca ttaaattggg ggcttgctta 15960 cagtcttaaa gattgcgtcc atgatgatca tggcaggtag catggcacca ggcatgcact 1602 0 ggaacagtat ctgaagagct tacattctga tcggcaagtt ggaggcaggg ggtgggggga 16080 gaagaaagtg gctgatgggg cctgggacag gatttggaaa tctcaaagcc catccctagt 16140 gacacacctc ctccaacgag gccacaccca gtcctcctca aataggtcca ccaagtaggg 16200 <210> 21 <211> 16200 <212> DNA
<213> Mus musculus <400> 21 ccctacttgg tggacctatt tgaggaggac tgggtgtggc ctcgttggag gaggtgtgtc 60 actagggatg ggctttgaga tttccaaatc ctgtcccagg ccccatcagc cactttcttc 120 tccccccacc ccctgcctcc aacttgccga tcagaatgta agctcttcag atactgttcc 180 agtgcatgcctggtgccatgctacctgccatgatcatcatggacgcaatctttaagactg240 taagcaagcccccaatttaatgctttctttcataaggtcctggtgtctctttatagcaat300 aaaaaagaaacttagacaccacccaatctcagcttttctgtacttgtgtcttcatgtctg360 tcatttcttcattcgctcatcaccccdatcaggcttccagaaccaagccatgcaggccat420 ggcagagggggtcagaggagaccacccataaaggtggaaggagagaatgtattccacaga480 gctgtcctccgatgtccacaatgttctttggcatgcacgcacacacacacacactttttt540 Sei to 27 L-0020-01-WO-01.ST25 aattaaaaag aaaaggaaag ggaaaaacaa aaattggcaa gaattttgaa ggcaaattaa 600 tctttataca ctattgagaa gaaggtaatc actgtgaaaa tcactacata gttcccttca 660 aatctaaaaa tagagctacc atataaacca taccacttct gggagtacac ctgaaagaat 720 caagagccac ataaaatcta aaaacctaca taggtttatt atagcacgat tcacagtggc 780 caagatacag aatcaaacta ggtgtcatca gcaggtggat aaataaaata atatatacag 840 atatacatac acaatccagt cctttccatc cttaagagga atgaaattat gtcatttgca 900 gatattgacg agatagtcat actcaaagga aataaaacag actcagactg gaatcaaatg 960 ttttcttgca gatggcctag tggaccatcg ttgggaggag aggcccttgg tcttgcaaag 1020 attatatacc ccaatacagg ggaataccag ggccaggaag cagaagtggg tgggttggag 1080 agcaggataa ggggagggaa taagggactt ttgggatagc atttgaattg taaatgaaga 1140 aaatatctaa taaaatttgt ttaaaaaatg ttttcttgca cgtatgaatc tggtgtttta 1200 gggcaaggca gaggaaggga aatgttaggg ggatgagaag ggcattggca gtgtgggcat 1260 cacagggtgg gtggcaaata cgccagaaat atctaatata tacacatgaa attcccatat 1320 tgaaatattc tgtacaatat tgactattct gtacaattat cacatactta ttaaaaatat 1380 aaagcagata caatttgaaa aggcttgttc ctcgttaatc attcatgaag aagaggttct 1440 aagcattagc tgctacacca tccatgtgtc cttaaccagt gaagcataca aactgcctta 1500 tcctaaccat aacctacctg tgttcacatt tacacattat gtgtttcagc taaacaacaa 1560 caacaaaaaa ttagtttctt taggtgagga ttggagacat aatgatggcc atcagttctt 1620 ctcttttttc ttctgagatg aggtctcact acatagccca ggactgcctt aaacatgggg 1680 tcctcttgtc tctgtctctc aactacagag cctatgccac cacatggctc catttttctt 1740 tcctttgcat ccaatcttga gatgaatcct ggagtgttca tcttctaaga gttgaaagtt 1800 ccacaaaccc tgatgagcct tttggatctc cattttaagg tggtgtgtga taactcaatg 1860 attttgagtt atcacatttt tttcaaatca ttcatccaat tttgattttg aaattttaag 1920 gcaagtctca ccactgtaaa ttttcaggaa aatgagtgca gatcccattt ctcagagaat 1980 Seite 28 L-0020-01-WO-01.ST25 aacatcacca acctggggtc ctttgttttc cacatgcgaa ggatttcaga gtggctgtga 2040 ctcagagttc tcagttcagt aagcctccct atgaggtggg cgaaatgctg tgggtcttct 2100 ggatggtaca tttttgaaaa cttaaacaaa agttgcagga ctggttcttg taggttttct 2160 atatgctgct tatttttaag gcatggacga tctgaaaaac atcatattta tcttaataat 2220 gattcaaagt ggtgggaata atgaatgcac ttaaaacaat gagttattaa cagagttata 2280 aaatcataag tacttgcaga accctaatac acattaggtt ttctaactaa attctttttg 2340 ttcatgactt tacttaatct tcactatatt ctattgagct acattctatt tttatcacat 2400 ttttttca~a tta~ctccct gaaatgtagg gag~taagta atttccataa agtc~catgg 2450 _ .
gaagaaaggg atcggtgagt aaggttgggg atataacagc cataaacgct tgcctgaaac 2520 tttaaaatag tagtaagtaa cctctctaca gtcaacagag gtaaagcgtg agacttcttg 2580 ggttcctacg taagaaatga gctaaccagg aagttccgaa ggcttaggca gccttcattc 2640 atccagtcgt cttgctaact ataccacaat gaaactcaca ggcaagattg tgtcccttta 2700 ttgtccttat attcttaatg tggaagaagt ggagagggag agacttacac acatcttcag 2760 aagtaaaaac agtctggaaa gacctatcaa cagctaagag cactggctgc tcttcccaag 2820 tactggggtt tgatttccac acccacatgg tggctcacgg ccatctgtaa ctccagttcc 2880 aggggatatg acaccttctt gtggactcac agcacacaag gggtgcacat ttagacaaag 2940 gagacataca catcaaggaa agtaagtgct ttttatcaat gaatgaaaaa ggttttttta 3000 aagaagcaaa aacaaaataa tcatgtgcca ttaaaaaaag taatcagaaa taaataagac 3060 tcttgggggg gggtggtagc tatttgtaaa gattaatcag ggatacaaca acatttaacc 3120 caaacccaaa aaggtaaaag aaaaaaacaa gtagataaaa tggcaaaaca gtgtaagatg 3180 gcatcatgtg ggcatccaag gaaagggggc aaaagatgca tcaaaaatgg cagagcatgt 3240 agcagaaacc tgcgagatca agggaaacct gtgtcaaagc aatgtggatg ggcgaggatg 3300 aaaaaggagc aagacaaact agaccaatag aaggaccttt ggtttttttt tttaagattt 3360 Seite 29 L-0020-Ol-WO-01.ST25 atttattttt attatatgtgagtacactgtagctgtcttcagacactccagaagatggag3420 tcagatctca ttacagatggttgtgagccaccatatggttgctgggaattgaactcagga3480 cctttggaag agcagtcactgctcttaaccactgagccatctctccagcccccccccccc3540 ggacctttgg ttttaatctatgcaaactaggaagccatcgacagttccagaggcatacca3600 tcatctgact tatatttccagggaacccctctgtgtactaatacagagagagtctataaa3660 gcatagagtc agaaaatccagttagaaaagtggattcacctacgctgcccaagctataca3720 cggtaggaag cggttaagtgctgggtatatcttagggttgaataaacagaacgggtgcat3780 tgtgatatga atataagagaaagaaaagaaactaaggattaaaaaagaaagttgcacatg3840 caatttggaa gacgaagctatggtctggtccagatgggggaggaataccaaaagctaagg3900 tataaaggtg tcaagagtccagctttgggcattttaaatttgatacaccttcttgacggt3960 tacatggaaaacttgaggaggctatttaatacattctgggggagcacacagat~cgtctg4020 ggagtatcgaatccacacaggcctttaatgccaagagttaggatggaatcag~~aggaag4080 ccagtgcagaagaaaactgggccttgtcacaacctctgtaagtcaagtcccaagggacct4140 gatatcttctgaactcagcagcgtgaggcacaccatggtgcatagatgcaggcgaaacat4200 gtgcataaaatcataaaataatttaaaagaaaactgggtcctaggacaacttaaatattt4260 gtcattgtgagagacgaagatgacagaataaagaaggcacagaaagaacagccaatgaac4320 agagaaattaagagggtgatgttccagagattccgtgaaacaacaaagctgccagtggga4380 tagcccagcagggaagatgctatcacccagcctgactacctgaaccccagaacacacatg4440 gtggaaagagagaaacaattcccataggttgtcctctgacctccacaagcatgacatgta4500 cacacacacagtggatggatagatagatagatagatagatagatagatagatagatagat4560 agatagtaggatagatagatagatagatagatagatagatagataggtaggtaggtagat4620 agatagatagatagatagatagatagatagatagatagatagatagatagatagatagta4680 ggatagatagatagatagatagatagatagatagatagatagatagatagatagatagat4740 agatagatagaattgtaaatgtccaacaaaaggaaacaaaccacctatgtgagttagtga4800 Seite 30 L-0020-Ol-WO-01.ST25 gtgggtgggg ttttttgttt ttgtttttgt ttttgttttt gttttggggt ttttttggag 4860 tggaaacttc ctgagactgc catgcttctg caattctttg catgatttta gttctgcttg 4920 tggtactgtt aacaaggctg gaggcaagtt gttaggtgga ctgcagtcta caaggagaac 4980 ttactctttt ccatctaaac agtaagtaga aatggcaaga ttatgaaaag gtaaattgtt 5040 gggaatcaaa tattagccct gcaaacccaa agatcagctg aagggatgtg atgaacatat 5100 ctttatgaga actttataat ctaggggcgc ctgtggcccc attgaaattc ttaaatgtag 5160 agagcatgac acagtccttg ttttaaaaga cccaccaaga aggataaagg atgatggcca 5220 ca~g~:gttat cagtcccact cctgagagat gg~gaaagcc t~acacagcc agaacaccag .5?$~_-, .
ggctctaagg tactgcacca tccagaattt gtagcttaat tccctacact gaactgaaag 5340 aaagaagagt tttatggtcc aaccagagtg acctgtccct aggtaacagt ctgagctgat 5400 tttccaaata ttcagattca gatgatagcc aacactgctg tccctccagc cttctacatc 5460 ctttatcctg tttgtcacac agttgcttcc aaaacttctt ggaataaagc ataataaaag 5520 tggacaatat aatggagctg ctctggagcc tatgtgattc aacttcagat gcacatacct 5580 agtgaacatt cacgaaccct gatatcacag caagttagaa tataaaagaa cacaaacatg 5640 gccgaatcgg tgaaggacac aggacaatgg tcccctagga agacgtcgct ggcttgagca 5700 atgggttacc catgctaagt aactgtctca cttcaacctg ggtccggttt gtaacgcata 5760 gtcaagaaga ctgtgaactg acataaagaa gcatgagaaa ttctcctaac acatcagaca 5820 tcttcaccgt gcttcaaaat tccactgaag ggctcacagg aaacaacata tccccttctt. 5880 ttctttcaga tgcttatgat actctctaca aaatacacca gtggggcaag taagatgaag 5940 acagagcatt gaccagttga gtcagtgata gcatgtcctg aagatcctga caggttgtag 6000 tggactggag agggtggagt gaaatgtgac tgtggtgaac tcaggagtga gcatgagtaa 6060 ggtggaaact agaatatagc taactccctg agaacctctc tctatcaggg actgggagcg 6120 tatatctgag gggctatgag ggcaagaaag catttcttta agatgcaaga aagatgtaac 6180 Seite 31 L-0020-01-WO-01.ST25 ccagtggtaa gcttaaagga atgcacatag agagcaaagc atgatgggat gagagtggta 6240 cattcaactt cagttcatag ttcgataaaa agaacttagt gattgcctgt ccagagatca 6300 ctccgagcct tacttctcag tctgtacctg agaaaagcac tgttgtcgcc gtaagcagag 6360 catattcagt atccgataca tgaagttcac tcattcttct gtagaagtaa gatagtgagg 6420 caataaattc tttagtaata tctgaaaaga ttaatataag tgaattttca tcggaatgga 6480 gcacagaagc aaacacattg tgattgtctt ttcttctctc aaacagtgta agaaaattga 6540 aaagagagtg gggttcaaga tggctaccga tgtgatggcc ggcacagaca atcccctttg 6600 ttgtgccctt gaagcccagg cctgaacctc aagagtgggc gcattaggtg tgctactgaa 6660 aaggaccaga acaaagagag gttgtcatca cgtgg~accaa ggaggaggct gtggaactga 6720 gagcctttag cctacagaag ccactctccg tgtatctcta ttatctacag taaggcttac 6780 ccttctaccc ataaaagtgg ttgatactgg gatgttggct gtagattaca tcatcatcct 6840 aagaataatt cagaggcccc aaatggcttc gcaattaaat agggaaagaa ttgccatgat 6900 gaagccaaag taagaaaatg gttacaaata gaattctcac tgatgacctg ccaacactaa 6960 caactgtatt ggctgtgtct ataaatgctt tcaccaccac agagacgcct gtgagctgtg 7020 agaaagccca tggccttctc tttgtagtgc aatgaaaagg ggagaaaagg aagcgtgtcc 7080 acccaaggtg atgcttccta ctgcacagtg ttaccagtta gaggagccga agggtagcct 7140 tccttgagaa agttttccgg agaatgaaca ctttcatcag cgctaggatt atgcacctct 7200 agtgtcccag ctgagggctt tgctggtctc atagtaccta gcaaaagaga aaaacacaaa 7260 tgttgatagt ctttcttaat cattatataa atagtaaatt cagtatcttt tcagtatcca 7320 gcataactat tatgtggggg aaaaaaaaac tcctaaaatg atatcattct catttttgtt 7380 actggaacac aaaacagata ttagttcata ttttgtaaaa aaaaaaaaaa gattgtacag 7440 aagaccctcc aaaatgagtt ttgcataatg aaatagaaac taagaatggc tttcttaaac 7500 acacacaccc tccacgaacc actcagtgca ccagaggctg tggagacttt taagcaaaca 7560 ctactcaagc aatcatggac caggtcatca agggcctccc tttagagcct tctggtctag 7620 Seite 32 L-0020-01-W0-01.ST25 tgtttgcaga gaacaatcaa taaataagga tattttaaaa ggctgtgcac tgtaacgagt 7680 acagctagtg aaggcagcta gcattgttgg cttttggttt tgattttgtc ctattgcctc 7740 ttcccatgtc ctcaaccaac atgcgacttt acagattgga ctcttgcatt cacacactaa 7800 acaatgatggctttcctgctctattcaaatggtttattgattttttaaaaaatacttgct7860 ttttttttattgaagtgaggtcctgcatagttggttcaacactaacctaatagggcttca7920 ccccagtaaagagacgttccacaaacagta.gcccgtggttatgagtttcaaagcaagcat7980 gcggccgtccccagagctgctttgtgtcttagaatcaccaacgcctttatgtttgatcac8040 ~~cat~aaaagtrctcctcatt~gga.agagcctg~cc~~a~jgga9c.ar~aga.agcc.cca~~c~ca.;_,8100 a~~ ,. ..
aagccccacccacagctgagaagctgttggcagctgctaggggaaggacaatccagtctc8160 tttagggaac tgagccctct aagttggcca tgcttcagtg gatgacccta cccccacccc 8220 cgtgtatatg agcagcacta tttagagtca atgggttatt ttgaaaaaga aggagcaggg 8280 aaggacatga agttaggaga gaggtgtgtg ggggattcta aaggagctgg tgggaaggaa 8340 gtgaggtgag taagatttaa aaaaaaaaac acttaggaca tgtatacagt tctcacaaga 8400 ataagaaata caatttttaa aagaattaca cggtagagag aaaatgctag atgaatgtga 8460 aactatttat aaacagctac tatgagacat ttgttcctac actaactggt aaaataaaat 8520 gaaaagataa atgaaaggct gatgatgtta ggagttagga gttctggctt actgagcaca 8580 gaccctggct tctgtcccca gcaccataga aggcaggcat agaggctcta ggctgatggt 8640 tggcaggtaa gagcagacac agcagggctt taagatcctc cttagctaca tactgagttc 8700 tacatagcct ggactacatg agagtctatt tccaaaaaac agtgtttaaa agtgtgtgcg 8760 tgcgtgcgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtc tgtctgtctg 8820 tctgtctgtc tgtgtgtact tgaagttgaa taaaattctg tttttctttt tacttccaga 8880 ggttgagtct ttcccaccat aaagctgggc tacatgaagg aacatcactt cagtttttga 8940 cgccttctgt aatgcagcct gatcctcagt ggttaaattt tcaaatcctt aaaaataaat 9000 Seite 33 L-0020-01-WO-01.ST25 aaataaataa ataaataaag cataaagaaa cagataatca aaagagaacg ttcataaatt 9060 aaatgtttaa aaatatttac ttattaaccc taagtgcttc tggggaataa caatagtata 9120 taaattgaaa tctgaataaa attctttttt tctattttgg ggcactttta tcagcactag 9180 gttacacttt cttacagcaa ggacaaatca acatagctgc attagccttg aagaatgtaa 9240 ttttgtaaag agtttctgta cctctaagca tataataaag ctgagttata tataagtttt 9300 ggtcaaactg tgacttgttt agtattatag aggaaacctt ctttgtttct tgtttcatct 9360 aagaaagccc acatcattat aataattttt gtaaatattt agattgtcac tcatctgatt 9420 aattgacaaa gacaaataaa catttttttt ttttttagtt agaagcaaca ttgctagaaa 9480 ~acccagggct~tcacacacat tagacaggtg~ctctgcctct gagcgataga tctagtacca ,9540 caactagaaa tttctgtttg atatactatc cctttaaaaa aaagaaaatt tggggctgga 9600 gagatagctc agtgggtaag agcactgact actcttccta aaaagccctg agttcaagtc 9660 ccagcaacca cacggtggct cgcaaccatc cgtaacgaga tctaacgccc tcttctggtg 9720 cacctgaaga cagctacagt gtacttatga aatctttggg ccagaacgag cggggccgac 9780 cagagtgagc agaggtccta aaaattcaat tctcaacaac cacatgaagg ctcacaacca 9840 tctgtacagc tacaatgtac tcagataaat aaataaataa ataaataaat aaataaataa 9900 ataaataatc ttttttaaaa attgggggct agaaagacag atcagcagtt aagagcctgt 9960 cttgctcttg cagagactct ggttccaacg cccacatggt aattcacagc catcggtagg 10020 aaactctagt tctaggggac caaacactct cttctggcct ccacaatatg ttgtgtacat 10080 gatatgaata tataattgca atcaaaacat gcataacaca taaacataaa aataaataaa 10140 tccttaaaaa ataatgaaac agaattttac ctgggagtcc cttggtaaac ttcattagcc 10200 cttgtatgtg caggactgat acctctgaga gtctcagaaa acttagttcg gggttggaac 10260 cctcctgcag ctggtccata aaaaaaataa ataaataaaa atgattcata gtatgtcttt 10320 ctgataatta ctaaggtgtt tcttttttac tgttcttgct atcatttcct aatatcattc 10380 attacaaatt atttcttgcc agcaatggat acatacaagt ttgcttgttt ctcccaaggg 10440 Seite 34 L-0020-Ol-WO-01.ST25 aatcatggat ttttggtgag cagtcactat ggtattcaga agccgatgtt cctcttgagt 10500 tagagtcatg ttgtcctgaa cctataagga aacagagggc gcctgtaatg tgtgcttgtt 10560 acacactgtt cataaagaca caagagactg tatggaagaa gcgtcttgcc cctttcccag 10620 atctggtgct ggatgacacg tgtttggtgt ctgctccttc atcttccact tggatggcag 10680 ggtacagggc aggcccgtgc ttgaagttct tgcgaagtct ctttgactta cactggatct 10740 ctgtgagcaa~ acctgtgaca ttacagaatt cacattaagt acttttggct cggaaaacaa 10800 cactgtaggg tttgattaaa atggtcttca gtgagtaaat agtcctagat atctctcgtg 10860 .. .. . ~~tgtctgca tetattaaat ttcc~attct. ;gcttttcc:ct.. tcataga,cac .ctt,gtctgct ~ln~~.~: - - '. .. ..
taaaaatagt tataacaaag attaagaatt aatttttcat ttccaaatta tgtttagatt 10980 tttacataag agatttacac attagttgca tcttggaagt gaaaaagtca tttgatagaa 11040 tagtagaatt acattatcta tgactattat cactggtttt attaataagg ttaaaaaaga 11100' ctatcttagg atttttaaaa ataagctaag tcaatttttc ctatgtcaga caaagcattt 11160 aattcttttt caaaggcaaa taatatttat tttaggattg tttttgtttg tttttaaata 11220 ttttataatt acttgcatga gtctgtatgg ggaaatgcac aggtgagtgc aggtgcccac 11280 agagcacaga ggggtcatag ccccttgaag tgagactata gctggttgtg agctgtagtg 11340 tatgtactgg actggactgg actactctaa gtacttttag cagctactaa aagtatccaa 11400 ctattaatat acctgtcgtt ttcaaagtat ttttaaaaac acacctataa aataaaaaac 11460 taatgaggag ctgaagtata aacaaaaacc ttaaaacccc tctatattat tttattattt 11520 gtcagagaac agaatacaca aaaaaataaa gccattttaa gcatattgag tattgaacat 11580 ggtagtaaac acctataacc ccagctactt aagacactgt tacaagatga tctcacgttg 11640 aataccagcc atggcaactc agtgtttttt agtaggaaag gagtctatgg gtgtagctta 11700 gtggtagatt gtctaggatg tgagaaacct gaactcaacc cccaggacca tgagaaggaa 11760 aaccacaaga agagaacgaa gaggaggaaa aggaggaaga agagggggaa gaggaggaag 11820 Seite 35 L-0020-Ol-WO-01.ST25 aggaaaagaa aggaggaaag ggaaaggaga atgagggaag gaagaggagg aggggccgtc 11880 agagcactaa cacatggcag tattaggaca actgaaaggt aaatgaagtg tgcagatcca 11940 agcaatggtg gcagggtgcg gaccagctta tgctcgccaa tcaccgtgac tgctcaccaa 12000 ctctctcgag cttcaccttg ccagaatttc tttaagtttc tttaagtgta ggatgactca 12060 ttttatgcat agagttcctc agaggactcc aacttccctt tcacatgcat tgcctgccac 12120 aggttaaaag acaacttttg agccagaaat tggagggcag aggcaggggg atctctgagt 12180 ttgatgccag cctggcgtac atagcaagtt ccaggtcagc tacatagtga gatcctgaat 12240 acaaattaaa ttaatgaata aaataaaaga caactttttc aaaggtttct ttggactgag 12300 ttgtatctcc cactgctcat atgctaatgtngttgatattc agagatgagt cctgacactc 12360 gtgagactat tagtggactt tacatatgag gatgtcaggg gtccagagat ccagttaccc 12420 ttcaattgtc ctaagactgt agtgtgttag gactctgaga accccttctc agaaagggaa 12480 gacttccttt tctcttcttt ctcctcctct ttctttcttt ttgggaacta ctaactttaa 12540 ctttgcactt ttcaaggcag tatatatttt taggttaagt aacagaaaca tacaatacct 12600 tttttaaatc actatggagg tttcttttat tttatgtgtg tgttttgctt gcatgtgtgt 12660 atgcacacaa cactcatacc tgcagaggtc agaagagggc ctgggatgct caggaactga 12720 gttataaatg attgtgaggg gccacgtgga ~agctggggac cgggcccagg tcctctccta 12780 atgctcttaa ctgcctaact actcttcagg tctcagagaa catctaatta agatgaaact 12840 atgactgatt tcaaaattag agcaatatag aacagggcaa catttatggg aatcactact 12900 taaattgata atgggccatc cacgtggcgc aattactgta catatgtgtg tggtgtgcat 12960 atacaaacga atacggccat gttcaaaata actcttgaag aatgctttat gtcctcaaga 13020 aggttggaaa tttttaattt tttattaatt tttaaattca tttttattat atagaggaga 13080 gtgtgaatac agtccttcac taacacaaat gatgcacctg agtttcccac atttgtggaa 13140 atcacagggg tcagtcagca catccctagt gcaatgggcg attcttgccc tggaaaaacc 13200 accaccacta tcatgtcctc cttgtcagga aggtaaggct gggaaaagtt aaatcccctc 13260 Seite 36 L-0020-Ol-WO-01.ST25 agaaaagcaa gccacgaaca cactggcact tacattctgc caacatcccc accgccttac 13320 acttcttcag tctgcactct tggcattttc tgcgcatgta catgtccatt tcacagtgac 13380 caccgttctt gcaagaatac actgcattct tggtaatgct acgccggaaa aagcctaaca 13440 acaaagatgg tcaaagtttg acatggatat gtttagttac ttctcatttt tattaacatt 13500 ttacctttat agaaagtaat caagctaact agagtatttc aattaaatag ggtttttttt 13560 attcccgaca tttttttaaa tatacattta tctggaaagt tagctaattc tataattctt 13620 tatagttagt atagttcagt tgacatcctg aggctgggtc tttctttata acgtctggag 13680 .. - a~aaaat.gga-'gtatcattat cacatt?aga caracvtc~tcta..ccctactata ggGaaal:i:ac~ i:~7.40. ..
ctataagaat ttttttttta actgcattat acaaaatatc aagatcctac aatgcatctc 13800 tcgagtttcc tttgagagaa tacatggtca cctaaaataa agaaaaaatt cagaaaagcc 13860 tctcttttct aaaccctccc agtctgagtt gcagcccaca aaggtggtcc ctccctccct 13920 catccgcaga tgctctctct ccctttccca gcacttgcca gatagttccc atcccttttc 13980 ctccttcgaa ccttattaca cagtaatacc tgccatttcc tgctctccct cctatgagtt 14040 gtcccaaagc cccaccactt actctcttgc agtggacact tgaaagatgg agtttcgcca 14100 tttcagttca acgaaggaag aaaaagctca gctcaggttt tcaaaactgt gctgggttct 1416 0 tcagctcagt tttaacagta acccgtcacc gtcaagagca ggccacccct gcctaactac 14220 tcgccacagg actctgtcac ctcatatcct ttggctattc tttgccagtg ttgaggttta 14280 agatttacag gtaaaatttt accctaaaca aacacgttta tttgtgttac attgactaaa 14340 ctgttttctg ttgtttgggt tgttttgttt tgtttcagag ggtgctagaa tcaaatccaa 14400 aagtaggtgc taagtgctct accactggcc tgcattccca tcccgatgtt tatccatttc 14460 tggagtggta gactgtttca gaactgttat ttatcagaga ggcttccgct ctttacaaat 14520 atcattgcaa acactggccc ttgcttcccg actttcagct tgttctcttt ctatacgaaa 14580 cactttttaa aatttattga cttagaatca tagaagtcat ttatttcaaa ttttctccaa 14640 Seite 37 L-0020-01-WO-01.ST25 gtgtataaat tttatcaacc acataaaaag gtgtcatggt ttacagatct ttacattgtc 14700 atgttttgtt ccagcatcaa attcttctag aaatagggaa aacgtttgct tatatgtatc 14760 gagtttaaag taaagtgtga aagtcttcat caaaaatgaa gagacagacc tggtagtgaa 14820 tgctcttaat gccagcagta aggatacaaa ggtaggtgga tctctgagag ttcaaggaca 14880 gcctggtcta aacagtgagt tccaggacag ccagggatgc atagagagac cttttctcaa 14940 aaaaagaaga aagatgagga ggagaagatg aggacgaagg ggaagacaag ggaggagaaa 15000 aaaaaataaa gaaagaaatg gtccaggcat ggtggcccac acctttaatc ccagcaccta 15060 gaaagcagag acaggcaggt atctgtgagt tcaaagccaa cctgatctcc atagtgaggt 15120 ctaggccagc cagggctata caatgatact ctacttcaaa agagagagag ggggggaggg 15180 ~agggagagag agagagagag agagagagag agagagagag agagagagag agagagagag 15240 agagagagag agagagggag agagagagag agagagagag agagagagag agagagagag 15300 agagagagag agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 15360 aaagaaagaa agaaagaaag aggaagggaa agggaaaggg aaagggaaag ggaaaggaaa 15420 ggaaaggaaa ggaaaggaaa ggaaaggaaa ggaaagaaac taaccttaaa cttaacattt 15480 tattctccta tctttgatca cacagattga tccctttaga catacttttt cctggttttg 15540 tttctcatgt tttataacga ttgcacacgc acgtgcacac acacatacac acgtgtgcgc 15600 gtgcgcagca ctgtctgttt ggttgcgcct tacccttacc tttgcacccc tcacaagtaa 15660 gtgcattgta gtgatatcct gatgccttat caccgcagac catgcagagc atgtcatctt 15720 gacccttcca gcgtaaagaa caagttgctc tgcacctttg tgccccaggg aattcatctt 15780 catcatcgtg aacaaccaca taagtggact gaccagattc tcgagcgtcg agtccacacc 15840 aactgcctcc actgtactgt ggatccaagc catgtgtgtt gaaatgactt tgtaaagatg 15900 gggactgtaa cgctggagga aactgagcgg tagaatattg acaatagggt ggttcttgga 15960 aatctgtatc acacagctga taatggaatt gttctggcaa aatatctggt tttattaaca 16020 taaaaaagaa ~caaatcataa aaggtcaaaa aagttcataa ttgttttaaa cctcaaaatg 16080 Seite 38 L-0020-01-WO-Ol.ST25 ttttatactc aacagaaagt tagttcaaca catctattcc atctcaggac ttgccgtcca 16140 gcacaatagg aacacagtaa atggaactaa caatacctaa taaatgctat gaccccaaag 16200 <210>22 <211>1428 <212>DNA
<213>Mus musculus <400> 22 atgttaataa aaccagatat tttgccagaa caattccatt atcagctgtg tgatacagat 60 ttccaagaac caccctattg tcaatattct accgctcagt ttcctccagc gttacagtcc 120 ccatctttac aaagtcattt caacacacat ggcttggatc cacagtacag tggaggcagt 180 tggtgtggac tcgacgctcg agaatctggt cagtccactt atgtggttgt tcacgatgat 240 gaagatgaat tccctggggc acaaaggtgc agagcaactt gttctttacg ctggaagggt 300 caagatgaca tgctctgcat ggtctgcggt gataaggcat caggatatca ctacaatgca 360 cttacttgtgaggggtgcaaaggctttttccggcgtagcattaccaagaatgcagtgtat 420 tcttgcaagaacggtggtcactgtgaaatggacatgtacatgcgcagaaaatgccaagag 480 tgcagactgaagaagtgtaaggcggtggggatgttggcagaatgtttgctcacagagatc 540 cagtgtaagtcaaagagacttcgcaagaacttcaagcacgggcctgccctgtaccctgcc 600 atccaagtggaagatgaaggagcagacaccaaacacgtgtcatccagcaccagatctggg 660 aaaggggttc aggacaacat gactctaact caagaggaac atcggcttct gaataccata 720 gtgactgctc accaaaaatc catgattccc ttgggagaaa caagcaaact tctgcaggag 780 ggttccaacc ccgaactaag ttttctgaga ctctcagagg tatcagtcct gcacatacaa 840 gggctaatga agtttaccaa gggactccca ggatttgaaa atttaaccac tgaggatcag 900 gctgcattac agaaggcgtc aaaaactgaa gtgatgttcc ttcatgtagc ccagctttat 960 ggtgggaaag actcaacctc tggaagtact atgagaccag caaagccctc agctgggaca 1020 ctagaggtgc ataatcctag cgctgatgaa agtgttcatt ctccggaaaa ctttctcaag 1080 Seite 39 L-0020-01-WO-01.ST25 gaaggctacc cttcggctcc tctaactgga agaatgagtg aacttcatgt atcggatact 1140 gaatatgctc tgcttacggc gacaacagtg cttttctcag atcgtccatg ccttaaaaat 1200 aagcagcata tagaaaacct acaagaacca gtcctgcaac ttttgtttaa gttttcaaaa 1260 atgtaccatc cagaagaccc acagcatttc gcccacctca tagggaggct tactgaactg 1320 agaactctga gtcacagcca ctctgaaatc cttcgcatgt ggaaaacaaa ggaccccagg 1380 ttggtgatgt tattctctga gaaatgggat ctgcactcat tttcctga 1428 <210> 23 . . . , ,.
<212> DNA
<213> Mus musculus <400> 23 tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120 cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300 atgaagttca ctcattcttc cagttagagg agccgaaggg tagccttcct tgagaaagtt 360 ttccggagaa tgaacacttt catcagcgct aggattatgc acctctagtg tcccagctga 420 gggctttgct ggtctcatag tacttccaga ggttgagtct ttcccaccat aaagctgggc 480 tacatgaagg aacatcactt cagtttttga cgccttctgt aatgcagcct gatcctcagt 540 ggttaaattt tcaaatcctg ggagtccctt ggtaaacttc attagccctt gtatgtgcag 600 gactgatacc tctgagagtc tcagaaaact tagttcgggg ttggaaccct cctgcagaag 660 tttgcttgtt tctcccaagg gaatcatgga tttttggtga gcagtcacta tggtattcag 720 aagccgatgt tcctcttgag ttagagtcat gttgtcctga acccctttcc cagatctggt 780 gctggatgac acgtgtttgg tgtctgctcc ttcatcttcc acttggatgg cagggtacag 840 Seite 40 L-0020-Ol-WO-O1.ST25 ggcaggcccg tgcttgaagt tcttgcgaag tctctttgac ttacactgga tctctgtgag 900 caaacattct gccaacatcc ccaccgcctt acacttcttc agtctgcact cttggcattt 960 tctgcgcatg tacatgtcca tttcacagtg accaccgttc ttgcaagaat acactgcatt 1020 cttggtaatg ctacgccgga aaaagccttt gcacccctca caagtaagtg cattgtagtg 1080 atatcctgat gccttatcac cgcagaccat gcagagcatg tcatcttgac ccttccagcg 1140 taaagaacaa gttgctctgc acctttgtgc cccagggaat tcatcttcat catcgtgaac 1200 .
aaccacataa gtggactgac cagattctcg agcgtcgagt ccacaccaac tgcctccact 1260 gta.Gt9tg9a.. t cGa~.gceat.- gtgtgttc~aa ~~tgacattgt aa~gavg~fJg ac'~Lgt~at~gc~ . 1:j20.: . . ..
tggaggaaac tgagcggtag aatattgaca atagggtggt tcttggaaat ctgtatcaca 1380 cagctgataa tggaattgtt ctggcaaaat atctggtttt attaacat 1428 <210>24 <211>475 <212>PRT
<213>Mus musculus <400> 24 Met Leu Ile Lys Pro Asp Ile Leu Pro Glu Gln Phe His Tyr Gln Leu Cys Asp Thr Asp Phe Gln Glu Pro Pro Tyr Cys Gln Tyr Ser Thr Ala Gln Phe Pro Pro Ala Leu Gln Ser Pro Ser Leu Gln Ser His Phe Asn Thr His Gly Leu Asp Pro Gln Tyr Ser Gly Gly Ser Trp Cys Gly Leu Asp Ala Arg Glu Ser Gly Gln Ser Thr Tyr Val Val Val His Asp Asp Glu Asp Glu Phe Pro Gly Ala Gln Arg Cys Arg Ala Thr Cys Ser Leu Seite 41 L-0020-Ol-WO-Ol.ST25 Arg Trp Lys Gly Gln Asp Asp Met Leu Cys Met Ual Cys Gly Asp Lys Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Ile Thr Lys Asn Ala Ual Tyr Ser Cys Lys Asn Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Cys Lys Ala Ual Gly Met Leu Ala Glu Cys Leu Lei. ihr~ Glu ~tle::Gl;s. Cys.-Lys,:Se~-r.:i ys~A.r~:c~ I_e~.t ArYgr.Lys.~A~n phe_..~4.y~ ..- .. . .... : : ~ _ ~ ..~
His Gly Pro Ala Leu Tyr Pro Ala Ile Gln Val Glu Asp Glu Gly Ala 195 200 ~ 205 Asp Thr Lys His Ual Ser Ser Ser Thr Arg Ser Gly Lys Gly Ual Gln Asp Asn Met Thr Leu Thr Gln Glu Glu His Arg Leu Leu Asn Thr Ile Ual Thr Ala His Gln Lys Ser Met Ile Pro Leu Gly Glu Thr Ser Lys Leu Leu Gln Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser Glu Ual Ser Ual Leu His Ile Gln Gly Leu Met Lys Phe Thr Lys Gly 275 280 ~ 285 Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gln Ala Ala Leu Gln Lys Ala Ser Lys Thr Glu Ual Met Phe Leu His Ual Ala Gln Leu Tyr Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro Ser Ala Gly Thr Leu Glu Ual His Asn Pro Ser Ala Asp Glu Ser Ual Seite 42 L-0020-Ol-WO-01.ST25 His Ser Pro Glu Asn Phe Leu Lys Glu Gly Tyr Pro Ser Ala Pro Leu Thr Gly Arg Met Ser Glu Leu His Ual Ser Asp Thr Glu Tyr Ala Leu Leu Thr Ala Thr Thr Ual Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gln His Ile Glu Asn Leu Gln Glu Pro Ual Leu Gln Leu Leu Phe Lys Phe Ser Lys Met Tyr His Pro Glu Asp Pro Gln Nis Phe Ala His Leu Il~e~ Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser~His Ser His Ser Glu Ile Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Val Met Leu 450 455 . 460 Phe Ser Glu Lys Trp Asp Leu His Ser Phe Ser Seite 43
Claims (21)
1. A nucleic acid molecule coding for a nuclear receptor which is selected from the group comprising:
a) the nucleotide sequences set forth in SEQ ID NOs: 1, 4, 6, 8, 10, 12, 17, and/or 22;
b) or complement thereof as set forth in SEQ ID NOs; 2, 5, 7, 9, 11, 13, 18, and/or 23;
c) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence which is the complement of the nucleotide sequence of SEQ ID NOs: 1, 4, 6, 8, 10, 12, 17, 20 and/or 22 under conditions of high stringency, and d) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence which is the complement of the nucleotide sequence of SEQ ID NOs: 2, 5, 7, 9, 11, 13, 18, 21 and/or 23 under conditions of high stringency.
a) the nucleotide sequences set forth in SEQ ID NOs: 1, 4, 6, 8, 10, 12, 17, and/or 22;
b) or complement thereof as set forth in SEQ ID NOs; 2, 5, 7, 9, 11, 13, 18, and/or 23;
c) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence which is the complement of the nucleotide sequence of SEQ ID NOs: 1, 4, 6, 8, 10, 12, 17, 20 and/or 22 under conditions of high stringency, and d) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence which is the complement of the nucleotide sequence of SEQ ID NOs: 2, 5, 7, 9, 11, 13, 18, 21 and/or 23 under conditions of high stringency.
2. The nucleic acid molecule of claim 1 which is genomic DNA.
3. The nucleic acid molecule of claim 1 which is cDNA.
4. The nucleic acid molecule of claim 1 which is RNA.
5. A nucleic acid molecule comprising the. nucleic acid molecule of any of claims 1 to 4 and a label attached thereto.
6. A vector comprising the nucleic acid molecule of claim 1.
7. The vector of claim 6, which is an expression vector.
8. A host cell transfected with the vector of claim 6 or 7.
9. A host cell transfected with the expression vector of claim 7.
10. A method of producing a polypeptide comprising the step of culturing the host cell of claim 9 in an appropriate culture medium to, thereby, produce the polypeptide.
11.An isolated polypeptide encoded by any portion of the nucleic acid of claim 1.
12.A polypeptide selected from the group comprising:
the amino acid sequences set forth in SEQ ID NO: 3, 24 and/or 19.
the amino acid sequences set forth in SEQ ID NO: 3, 24 and/or 19.
13.A method for screening for agents which are capable of inhibiting the cellular function of the nuclear receptor L66 comprising the steps of:
a) contacting one or more candidate agents with a polypeptide according to claim 11 or 12 b) removing unbound agent(s) c) detecting whether the agent(s) interact with the polypeptide of the nuclear receptor.
a) contacting one or more candidate agents with a polypeptide according to claim 11 or 12 b) removing unbound agent(s) c) detecting whether the agent(s) interact with the polypeptide of the nuclear receptor.
14.A method for inhibiting the cellular function of the nuclear receptor L66, comprising the steps of:
a) contacting a cell with a binding agent of a polypeptide according to claim 11 or 12, whereby the cellular function of L66 is inhibited.
a) contacting a cell with a binding agent of a polypeptide according to claim 11 or 12, whereby the cellular function of L66 is inhibited.
15. Method according to claim 14, characterized in that the binding agent is an antibody.
16. Method according to claim 14, characterized in that the binding agent is RNA.
17. Method according to claim 14, characterized in that the binding agent is an anti-sense oligonucleotide.
18. Method according to claim 14, characterized in that the binding agent is a ribozyme.
19. Method according to claim 14, characterized in that the cell is in a body.
20. Use of a nucleic acid or protein sequence according to SEQ ID NO.: 1, 4, 6, 8, 10, 12, 17, 20, 22, 2, 5, 7, 9, 11, 13, 18, 21, 3, 19, 24 and/or 23 for the construction of multiple nuclear receptor specific sequence alignments.
21. Use of the sequences according to claim 20 for the construction of protein sequence alignments.
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP00120370 | 2000-09-16 | ||
| EP00120370.2 | 2000-09-16 | ||
| EP01111658 | 2001-05-14 | ||
| EP01111658.9 | 2001-05-14 | ||
| PCT/EP2001/010323 WO2002022817A2 (en) | 2000-09-16 | 2001-09-07 | Nuclear receptor l66 and methods of use |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CA2420932A1 true CA2420932A1 (en) | 2002-03-21 |
Family
ID=26071406
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA002420932A Abandoned CA2420932A1 (en) | 2000-09-16 | 2001-09-07 | Novel mammalian nuclear receptor l66 and methods of use |
Country Status (4)
| Country | Link |
|---|---|
| EP (1) | EP1317542A2 (en) |
| AU (1) | AU2002213893A1 (en) |
| CA (1) | CA2420932A1 (en) |
| WO (1) | WO2002022817A2 (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030109467A1 (en) * | 2001-11-15 | 2003-06-12 | Isis Pharmaceuticals Inc. | Antisense modulation of human FXR expression |
| AU2003219028A1 (en) * | 2002-04-13 | 2003-10-27 | Lion Bioscience Ag | NON-PRIMATE FXRb AS A LANOSTEROL SENSING NUCLEAR HORMONE RECEPTOR AND RELATED USES |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6005086A (en) * | 1995-01-13 | 1999-12-21 | The Salk Institute For Biological Studies | Farnesoid activated receptor polypeptides, and nucleic acid encoding the same |
| US5932699A (en) * | 1995-01-13 | 1999-08-03 | The General Hospital Corporation | Retinoid X receptor-interacting polypeptides |
| US5728548A (en) * | 1995-06-29 | 1998-03-17 | Genetics Institute, Inc. | Retinoid receptor-1 (RR1) and DNA encoding RR1 |
| AU780658B2 (en) * | 1999-03-26 | 2005-04-07 | City Of Hope | Method of affecting cholesterol catabolism using nuclear bile acid receptor |
-
2001
- 2001-09-07 CA CA002420932A patent/CA2420932A1/en not_active Abandoned
- 2001-09-07 AU AU2002213893A patent/AU2002213893A1/en not_active Abandoned
- 2001-09-07 WO PCT/EP2001/010323 patent/WO2002022817A2/en not_active Ceased
- 2001-09-07 EP EP01982261A patent/EP1317542A2/en not_active Withdrawn
Also Published As
| Publication number | Publication date |
|---|---|
| WO2002022817A3 (en) | 2003-03-27 |
| EP1317542A2 (en) | 2003-06-11 |
| WO2002022817A2 (en) | 2002-03-21 |
| AU2002213893A1 (en) | 2002-03-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5639616A (en) | Isolated nucleic acid encoding a ubiquitous nuclear receptor | |
| US5861274A (en) | Nucleic acids encoding peroxisome proliferator-activated receptor | |
| HU221411B1 (en) | Novel estrogen receptor | |
| JP2002501083A (en) | Ligand receptors and their uses | |
| AU764310B2 (en) | Human G-protein coupled receptor | |
| US20060154335A1 (en) | Novel members of the steroid/thyroid superfamily and uses therefor | |
| AU4728599A (en) | 14273 receptor, a g-protein coupled receptor | |
| JP2001525178A (en) | Method for searching for agonist and antagonist for human 11cb splice variant | |
| JP2002517195A (en) | G protein-coupled receptor named 2871 receptor | |
| WO2003018746A2 (en) | Estrogen receptor alpha variants and methods of detection thereof | |
| CA2420932A1 (en) | Novel mammalian nuclear receptor l66 and methods of use | |
| US6465627B2 (en) | DAX-1 protein, methods for production and use thereof | |
| AU779993B2 (en) | Novel human G-protein coupled receptor | |
| WO1997004810A9 (en) | Dax-1 protein, methods for production and use thereof | |
| US20030119726A1 (en) | P-glycoproteins and uses thereof | |
| WO2002077229A2 (en) | Cofactors of the liver x receptor alpha and methods of use | |
| EP1371662A1 (en) | Novel variant of the cellular receptor LXR alpha | |
| JPH11127872A (en) | New intranuclear receptor protein, gene and its use | |
| EP0812354A2 (en) | HUMAN RETINOID X RECEPTOR - GAMMA (hRXR-GAMMA) | |
| EP1278839A2 (en) | Isolated human g-protein coupled receptors, nucleic acid molecules encoding human gpcr proteins, and uses thereof | |
| WO2002070697A1 (en) | Mammalian nuclear receptor l67 and methods of use | |
| EP1371661A1 (en) | Variants of the Constitutive Activated of Retinoid acid response receptor (CAR) in its ligand binding domain (LBD) | |
| WO2002018420A2 (en) | Cofactors of the pregnane x receptor and methods of use | |
| WO2002053591A1 (en) | Mammalian nuclear receptor cofactor cf12 and methods of use | |
| WO2003087140A1 (en) | NON-PRIMATE FXRb AS A LANOSTEROL SENSING NUCLEAR HORMONE RECEPTOR AND RELATED USES |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FZDE | Dead |