MX2007006330A - Production of proteins - Google Patents
Production of proteinsInfo
- Publication number
- MX2007006330A MX2007006330A MXMX/A/2007/006330A MX2007006330A MX2007006330A MX 2007006330 A MX2007006330 A MX 2007006330A MX 2007006330 A MX2007006330 A MX 2007006330A MX 2007006330 A MX2007006330 A MX 2007006330A
- Authority
- MX
- Mexico
- Prior art keywords
- sequence
- protein
- cells
- nucleic acid
- fusion protein
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 102
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 85
- 238000004519 manufacturing process Methods 0.000 title claims description 9
- 210000004027 cell Anatomy 0.000 claims abstract description 99
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 93
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 90
- 241000196324 Embryophyta Species 0.000 claims abstract description 40
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 35
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 28
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 28
- 238000000034 method Methods 0.000 claims abstract description 24
- 239000013598 vector Substances 0.000 claims abstract description 23
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 19
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 63
- 150000007523 nucleic acids Chemical group 0.000 claims description 61
- 229920002494 Zein Polymers 0.000 claims description 33
- 108010055615 Zein Proteins 0.000 claims description 32
- 239000005019 zein Substances 0.000 claims description 29
- 229940093612 zein Drugs 0.000 claims description 29
- 108060006613 prolamin Proteins 0.000 claims description 23
- 102000039446 nucleic acids Human genes 0.000 claims description 16
- 108020004707 nucleic acids Proteins 0.000 claims description 16
- 229920001184 polypeptide Polymers 0.000 claims description 15
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 14
- 230000001939 inductive effect Effects 0.000 claims description 13
- 239000013604 expression vector Substances 0.000 claims description 12
- 210000005253 yeast cell Anatomy 0.000 claims description 12
- 238000000429 assembly Methods 0.000 claims description 9
- 230000000712 assembly Effects 0.000 claims description 9
- 241001465754 Metazoa Species 0.000 claims description 8
- 230000001105 regulatory effect Effects 0.000 claims description 8
- 241000195493 Cryptophyta Species 0.000 claims description 7
- 235000007164 Oryza sativa Nutrition 0.000 claims description 7
- 101000723835 Zea mays 16 kDa gamma-zein Proteins 0.000 claims description 7
- 101001040871 Zea mays Glutelin-2 Proteins 0.000 claims description 7
- 235000009566 rice Nutrition 0.000 claims description 7
- 239000002773 nucleotide Substances 0.000 claims description 4
- 125000003729 nucleotide group Chemical group 0.000 claims description 4
- 230000001131 transforming effect Effects 0.000 claims description 2
- RWRDLPDLKQPQOW-UHFFFAOYSA-N Pyrrolidine Chemical compound C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 claims 4
- 240000007594 Oryza sativa Species 0.000 claims 2
- 230000002538 fungal effect Effects 0.000 claims 1
- 210000003527 eukaryotic cell Anatomy 0.000 abstract description 9
- 230000009466 transformation Effects 0.000 abstract description 8
- 230000015572 biosynthetic process Effects 0.000 abstract description 7
- 230000006698 induction Effects 0.000 abstract description 5
- 235000018102 proteins Nutrition 0.000 description 75
- 102000053602 DNA Human genes 0.000 description 21
- 102000002265 Human Growth Hormone Human genes 0.000 description 20
- 108010000521 Human Growth Hormone Proteins 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 18
- 239000000854 Human Growth Hormone Substances 0.000 description 18
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 18
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 17
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 17
- 238000009825 accumulation Methods 0.000 description 15
- 230000031787 nutrient reservoir activity Effects 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- 150000001413 amino acids Chemical group 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 11
- 241000235058 Komagataella pastoris Species 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 description 9
- 108020004511 Recombinant DNA Proteins 0.000 description 8
- 240000008042 Zea mays Species 0.000 description 8
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 8
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 8
- 125000000539 amino acid group Chemical group 0.000 description 8
- 235000005822 corn Nutrition 0.000 description 8
- 239000002609 medium Substances 0.000 description 8
- 210000004102 animal cell Anatomy 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 210000004962 mammalian cell Anatomy 0.000 description 7
- 210000003463 organelle Anatomy 0.000 description 7
- 241000233866 Fungi Species 0.000 description 6
- 241000124008 Mammalia Species 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 241000701447 unidentified baculovirus Species 0.000 description 6
- 102000055006 Calcitonin Human genes 0.000 description 5
- 108060001064 Calcitonin Proteins 0.000 description 5
- 241000238631 Hexapoda Species 0.000 description 5
- 241000209094 Oryza Species 0.000 description 5
- 229960004015 calcitonin Drugs 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 4
- 241000725303 Human immunodeficiency virus Species 0.000 description 4
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 101710182846 Polyhedrin Proteins 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 4
- 108700005078 Synthetic Genes Proteins 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 230000002255 enzymatic effect Effects 0.000 description 4
- 238000003119 immunoblot Methods 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- XXRYFVCIMARHRS-UHFFFAOYSA-N propan-2-yl n-dimethoxyphosphorylcarbamate Chemical compound COP(=O)(OC)NC(=O)OC(C)C XXRYFVCIMARHRS-UHFFFAOYSA-N 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 238000001262 western blot Methods 0.000 description 4
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 4
- 108010025188 Alcohol oxidase Proteins 0.000 description 3
- 241000201370 Autographa californica nucleopolyhedrovirus Species 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 102000009024 Epidermal Growth Factor Human genes 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 241000235648 Pichia Species 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 235000013339 cereals Nutrition 0.000 description 3
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 3
- 210000004748 cultured cell Anatomy 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 235000016709 nutrition Nutrition 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- IGAZHQIYONOHQN-UHFFFAOYSA-N Alexa Fluor 555 Chemical compound C=12C=CC(=N)C(S(O)(=O)=O)=C2OC2=C(S(O)(=O)=O)C(N)=CC=C2C=1C1=CC=C(C(O)=O)C=C1C(O)=O IGAZHQIYONOHQN-UHFFFAOYSA-N 0.000 description 2
- 102000008186 Collagen Human genes 0.000 description 2
- 108010035532 Collagen Proteins 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010013369 Enteropeptidase Proteins 0.000 description 2
- 102100029727 Enteropeptidase Human genes 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 108010074860 Factor Xa Proteins 0.000 description 2
- 208000005577 Gastroenteritis Diseases 0.000 description 2
- 102000004366 Glucosidases Human genes 0.000 description 2
- 108010056771 Glucosidases Proteins 0.000 description 2
- 102000018997 Growth Hormone Human genes 0.000 description 2
- 108010051696 Growth Hormone Proteins 0.000 description 2
- 101000798114 Homo sapiens Lactotransferrin Proteins 0.000 description 2
- 102000004157 Hydrolases Human genes 0.000 description 2
- 108090000604 Hydrolases Proteins 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 101710118538 Protease Proteins 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 108010016634 Seed Storage Proteins Proteins 0.000 description 2
- 241000256251 Spodoptera frugiperda Species 0.000 description 2
- 241000255985 Trichoplusia Species 0.000 description 2
- 241000269370 Xenopus <genus> Species 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 230000000443 biocontrol Effects 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 229920001436 collagen Polymers 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000004624 confocal microscopy Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000013256 coordination polymer Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 230000008021 deposition Effects 0.000 description 2
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000000122 growth hormone Substances 0.000 description 2
- 208000002672 hepatitis B Diseases 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 102000050459 human LTF Human genes 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 239000003262 industrial enzyme Substances 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 210000004898 n-terminal fragment Anatomy 0.000 description 2
- 210000000287 oocyte Anatomy 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 108010079892 phosphoglycerol kinase Proteins 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 239000012474 protein marker Substances 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000004960 subcellular localization Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- -1 zein gamma Chemical class 0.000 description 2
- CYNAPIVXKRLDER-LBPRGKRZSA-N (2s)-2-benzamido-3-(4-hydroxy-3-nitrophenyl)propanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)C=1C=CC=CC=1)C1=CC=C(O)C([N+]([O-])=O)=C1 CYNAPIVXKRLDER-LBPRGKRZSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- HZWWPUTXBJEENE-UHFFFAOYSA-N 5-amino-2-[[1-[5-amino-2-[[1-[2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoic acid Chemical compound C1CCC(C(=O)NC(CCC(N)=O)C(=O)N2C(CCC2)C(=O)NC(CCC(N)=O)C(O)=O)N1C(=O)C(N)CC1=CC=C(O)C=C1 HZWWPUTXBJEENE-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 239000012103 Alexa Fluor 488 Substances 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 241001203868 Autographa californica Species 0.000 description 1
- 101150071434 BAR1 gene Proteins 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102100037642 Elongation factor G, mitochondrial Human genes 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 108010001498 Galectin 1 Proteins 0.000 description 1
- 102100021736 Galectin-1 Human genes 0.000 description 1
- 102100024637 Galectin-10 Human genes 0.000 description 1
- 101001011019 Gallus gallus Gallinacin-10 Proteins 0.000 description 1
- 101001011021 Gallus gallus Gallinacin-12 Proteins 0.000 description 1
- 108010061711 Gliadin Proteins 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 101150009006 HIS3 gene Proteins 0.000 description 1
- 101150069554 HIS4 gene Proteins 0.000 description 1
- 101000880344 Homo sapiens Elongation factor G, mitochondrial Proteins 0.000 description 1
- 206010062767 Hypophysitis Diseases 0.000 description 1
- 101150045458 KEX2 gene Proteins 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 229910009891 LiAc Inorganic materials 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 108010030544 Peptidyl-Lys metalloendopeptidase Proteins 0.000 description 1
- 241000425347 Phyla <beetle> Species 0.000 description 1
- 241001637516 Polygonia c-album Species 0.000 description 1
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 1
- 101100226845 Strongylocentrotus purpuratus EGF2 gene Proteins 0.000 description 1
- 101100226846 Strongylocentrotus purpuratus EGF3 gene Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- XKRFYHLGVUSROY-UHFFFAOYSA-N argon Substances [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 229960003773 calcitonin (salmon synthetic) Drugs 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 229960005188 collagen Drugs 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000006303 immediate early viral mRNA transcription Effects 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 210000003635 pituitary gland Anatomy 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 210000002729 polyribosome Anatomy 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 210000004777 protein coat Anatomy 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000000164 protein isolation Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010068072 salmon calcitonin Proteins 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000007222 ypd medium Substances 0.000 description 1
Abstract
A method for forming a fusion protein that is expressed as a recombinant protein body-like assembly in host eukaryotic cells and organisms other than higher plants as host systems is disclosed. More particularly, peptides and proteins are fused to protein sequences that mediate the induction of recombinant protein body-like assembly (RPBLA) formation, are stably expressed and accumulated in these host cells after transformation with an appropriate vector. Methods for preparing the fusion protein are also disclosed.
Description
"PROTEIN PRODUCTION"
Technical Field The present invention relates to the production of peptides and recombinant proteins in eukaryotic cells and organisms other than higher plants as host systems. More specifically, the peptides and proteins are fused into protein sequences that mediate the induction of the formation of recombinant protein bodies (ESCPR) assemblies, and are expressed and accumulated with stability in these host systems after being transformed with a vector suitable. Prior Art The production of recombinant proteins for therapeutic, nutritional or industrial uses has enjoyed great success during the past decade. It has been shown that different eukaryotic cells and organisms are capable of producing active therapeutic compounds based on proteins. Unfortunately, the high costs that are often derived from low production levels of recombinant proteins and / or protein isolation and purification procedures can invalidate their industrial application. Active research is carried out to improve both production levels and purification procedures from different approaches. A new technology has been developed based on the fusion of the domain of a seed storage protein of a plant with the protein of interest (WO 2004/003207) to increase the stability and accumulation of recombinant proteins in higher plants. These storage proteins are specific to plant seeds, in which they accumulate with stability in protein bodies (Galili et al., 1993, Trends Cell Biol 3: 437-442). The storage proteins are inserted into the lumen of the endoplasmic reticulum (ER) by means of a signal peptide, and assembled in the endoplasmic reticulum by developing specific organelles called protein bodies derived from ER (CP-RE), or in the vacuoles of protein storage
(VAP) (Okita and Rogers 1996 Annu. Rev. Plant Physiol Mol. Biol. 47: 327-50;
Herman and Larkins 1999 Plant Cell 1 1: 601 -613; Sanderfoot and Raikel 1999 Plant Cell 1 1: 629-642). It has also been described that recombinant storage proteins gather in protein-like organelles (CP) in different host systems of plants, such as Xenopus oocytes and yeast. The expression of cereal prolamins (the most abundant cereal storage proteins) has been described in Xenopus oocytes after injection of the corresponding mRNAs. This system has been used as a model to study the targeting properties of these storage proteins (Simón et al., 1990, Plant Cell 2: 941-950, Altschuler et al., 1993, Plant Cell 5: 443- 450; Torrent et al., 1994, Plant 192: 512-518) and to test the possibility of modifying the a-zein of 19 kDa, a prolamin of corn, by introducing the essential amino acids lysine and tryptophan in their sequence, without altering its stability (Wallace et al., 1988, Science 240: 662-664). Zeins, the complex group of corn prolamines, have also been produced in yeast for various purposes. Coraggio et al., 1988, Eur J Cell Biol 47: 165-172, expressed native a-zeins and modified yeast to study the target determinants of this protein. Kim et al., 2002, Plant Cell 14: 655-672, studied the possible interactions of a, ß,? and d-zeins that lead to the formation of protein bodies. To answer this question, they transformed the yeast cells with cDNAs that encode these proteins. In addition, the authors constructed zein-GFP fusion proteins to determine the subcellular localization of zein proteins in the yeast cells. The yeast cells, then, were used as a model expression system to study the properties of zein. It should be noted that Kim et al., 2002, Plant Cell 14: 655-672, concluded that yeast is not a good model for studying zein interactions because zeins, by themselves, did not accumulate well in the leaven transformed. Yeast cells were also used as a model to study the mechanisms that control the transport and deposition of protein bodies from wheat storage proteins called gliadins
(Rosenberg et al., 1993, Plant Physiol 102: 61-69).
In the present document we demonstrate that the fusion of a sequence of the protein that mediates the induction of assemblies similar to recombinant protein bodies (ESCPR), such as, for example, prolamins or prolamin domains with a peptide or protein of interest (target) it also mediates the accumulation of said ESCPR in cells of organisms such as fungi (including yeast), algae and animals. Interestingly, these fusion proteins accumulate with stability in animal cells, within structures in organelles similar to protein bodies.
COMPENDIUM OF THE INVENTION
The present invention provides a system and method for producing a fusion protein containing a protein body-inducing sequence (CPIS) and a peptide or protein of interest (in this document they are often called "polypeptides" collectively) in cells eukaryotes that do not belong to higher plants, such as animal cells, fungi and algae, as well as in cultured cells of animals, fungi and algae in which the fusion proteins containing the peptide or protein of interest accumulate with stability in the form of assemblies similar to recombinant protein bodies (ESCPR). The CPIS are capable of mediating the induction of ESCPR formation and the entry of proteins and / or accumulation in these organelles, such as, for example, the natural and modified sequences of the storage protein sequences with a peptide or protein. interest (targets). The present invention provides, inter alia, a method for producing a product of interest in the form of a fusion protein, in eukaryotic cells that do not belong to higher plants as a host system that has been transformed with a nucleic acid sequence comprising a portion of nucleic acid encoding the CPIS and a portion of nucleic acid encoding a polypeptide product of interest. In a particular embodiment, the nucleic acid sequence used for the transformation comprises (i) a nucleic acid sequence encoding a CPIS and (ii) a nucleic acid sequence comprising the nucleotide sequence encoding a product of interest. In one embodiment, the 3 'end of the nucleic acid sequence (i) is linked to the 5' end of said nucleic acid sequence (ii). In another embodiment, the 5 'end of the nucleic acid sequence (i) is linked to the 3' end of the nucleic acid sequence (ii). Thus, the CPIS sequence may be at the N-terminal or C-terminal end of the fusion protein. In another specific embodiment, the nucleic acid sequence used for transformation comprises, in addition to the nucleic acid sequences above or mentioned (i) and (ii), a nucleic acid sequence comprising the nucleotide sequence encoding the sequence amino acid spacer The amino acid spacer sequence can be a cleavable, or non-cleavable, amino acid sequence by enzymatic or chemical means. In a particular embodiment, the nucleic acid sequence (iii) is placed between the nucleic acid sequences (i)
and (ii), for example, the 3 'end of the nucleic acid sequence (iii) is linked to the 5' end of said nucleic acid sequence (ii). In another embodiment, the 5 'end of said nucleic acid sequence (iii) is linked to the 3' end of the nucleic acid sequence (i). Also, in a specific embodiment, the nucleic acid sequence used
for the transformation encodes a specifically cuttable sequence and is as defined in the patent application WO 2004003207, granted together with the present application. Furthermore, in another embodiment, the nucleic acid is in accordance with the patent application WO 2004003207, in which the nucleic acid sequence encoding the amino acid sequence that is specifically cleavable by enzymatic or chemical means is absent. In a further embodiment, the fusion proteins can be a direct fusion between the CPIS and the peptide or protein of interest. In another embodiment, the method of the invention further comprises the isolation and purification of the fusion protein. In another further embodiment, the protein of interest is fused to a natural or modified storage protein, such as for example the natural or modified prolamines or the prolamin domains. Examples of proteins of interest are any protein with therapeutic, nutritional, biocontrol or industrial use. Some illustrative proteins and peptides are, for example, hormones such as calcitonin, growth hormone and the like, antibodies such as monoclonal antibodies and fragments thereof, antigens such as those which are useful for vaccines against the human immunodeficiency virus (HIV).; surface or nuclear proteins of hepatitis B, gastroenteritis, coronavirus and the like, protease, inhibitors, antibiotics, collagen, human lactoferrin, cytokines, industrial enzymes, such as hydrolases, glucosidases, oxido-red uctases, etc.
BRIEF DESCRIPTION OF THE FIGURES
In the figures that form an integral part of this description: Figure 1 is a photograph of an SDS / PAGE analysis showing the accumulation of fusion proteins, including calcitonin (Ct), human growth hormone (hGH) and factor of epidermal growth (EGF) as individual interest proteins linked to the RX3 protein body-inducing sequence derived from gamma zein in cells of transfected mammals. The fusion proteins RX3-Ct, RX3-hGH and RX3-EGF accumulated in the cultured cells of transfected mammals 293T, CHO and Cosi are shown. Equivalent amounts of transfected mammalian cells were extracted at 44 hours post-transfection, and the corresponding soluble total proteins were analyzed with electrophoresis and Western blot technique using the anti-zein gamma antibody. Also included are schematic representations of the constructs encoding the fusion proteins RX3-Ct, RX3-hGH and RX3-EGF. "c" = cells; "m" = medium. Molecular weight markers (in kDa) are indicated on the left. Figure 2 is a six panel photograph (A-F) showing the location of the fusion proteins in the ESCPR within transfected cells. Confocal microscopy was used to show Cosí cells expressing RX3-CT (Fig. 2A), CHO cells expressing RX3-Ct (Fig. 2B) and RX3-EGF (Fig. 2C), and 293T cells coexpressing RX3-Ct and the protein marker DsRed2-RE (Fig. 2D, Fig. 2E and Fig.
2F). The fusion proteins derived from RX3 were immunolocalized with anti-zein gamma serum in structures similar to protein bodies (Figs 2A-2D) and in the endoplasmic reticulum (see arrow in Fig. 2A). Fig. 2E shows the colored ER with the fluorescent red protein marker DsRed2-RE. Fig. 2F shows an overlay of Figures 2D and 2E and represents the colocalization of RX3-Q with DsRed2 in the RE and in the CPLS. The figures embedded in Figures 2A and 2C show very enlarged images of the CPLS. Bars: 1 micrometer. Figure 3 on two panels (A and B) illustrates the accumulation of fusion proteins in transformed yeast cells. Figure 3A shows the accumulation of fusion proteins RX-EGF (Slot C1 17) and RX-hGH (Slot c1 18) in transformed Saccharomyces. Equivalent amounts of total cell protein extracts and immunoblot media were analyzed using specific antibodies. Cells transformed with the plasmid pYX243 without insert were used as control (C). The lower part of each panel contains a schematic representation of the constructs encoding the fusion proteins RX3-hGH and RX3-EGF. Figure 3B illustrates the accumulation of hGH and RX3-hGH containing fusion proteins in Pichia pastoris using two different signal peptides. Equivalent amounts of total cell protein extracts and transformed media were analyzed with immunoblot using specific antibodies. C1 and C2 indicate cells transformed, respectively, with plasmids pPIC9 and pPIC3.5K used as control. Schematic representations of the c135 and c121 constructs encoding the RX3-hGH fusion protein and the c136 construct encoding hGH are shown in the lower part of the panels. Molecular weight markers (in kDa) are indicated to the left of the figures, "and" = yeast cells; "m" = medium; "SPg" = zein gamma signal peptide; "RX3" = N-terminal sequence of proline-rich gamma zein, "EGF" = epidermal growth factor; "hGH" = human growth hormone; "Afprepro" = peptide factor alpha prepro. The present invention offers several benefits and advantages. One benefit is that their use allows the relatively simple and rapid expression of a desired recombinant protein in a chosen eukaryotic cell that does not belong to higher plants. An advantage of the invention is that it provides a source of recombinant protein easily obtainable and purifiable due to expression in RBPLA. Other additional benefits and advantages will be apparent to those skilled in the art from the following description.DETAILED DESCRIPTION OF THE INVENTION
The contemplated recombinant proteins are fusion proteins that form assemblages similar to recombinant protein bodies (ESCPR) in the host cells in which they are expressed. The formation of ESCPR is induced with storage protein domains that form high density deposits within cells. These dense deposits can accumulate cytosol, in an organelle of the endomembrane system, mitochondria, plastids or can be secreted. Assemblies similar to recombinant protein bodies have a predetermined density that can differ between different fusion proteins, but which is known for a specific fusion protein being prepared. This predetermined density of the ESCPR is usually higher than that of almost all the endogenous proteins of host cells present in the homogenate, and is usually from about 1.1 to 1.35 g / ml. The high density of the new ESCPR is due to the general ability of recombinant fusion proteins to form multimers and accumulate. The contemplated RBPLAs are expressed in eukaryotes that do not belong to higher plants and are usually characterized by their densities as described above. When expressed in animal cells, ESCPRs usually have a spherical shape, with diameters of approximately 1 micrometer (μ) and have a surrounding membrane. These fusion proteins comprise two polypeptide sequences linked directly or indirectly by a peptide bond, wherein a sequence is a protein body-inducing sequence (CPIS) linked to a polypeptide product (e.g., peptide or protein) of interest (target ). CPIS are sequences of protein or peptide amino acids that mediate the induction of ESCPR formation and the entry and / or accumulation of proteins in organelles. A CPIS and the host cell are preferably of different biological phyla. Thus, the CPIS is normally of a higher plant, a spermatophyte, while the host cell is a eukaryotic different from a spermatophyte and can be an animal cell, such as for example mammalian or insect cells, a fungus or yeast, or a cell of seaweed, all of which are of different edges than the spermatophytes. Exemplary CPIS examples, but not restrictives, are storage proteins or modified storage proteins, such as prolamines or modified prolamines, prolamin domains or modified prolamin domains. Prolamines are analyzed in Shewry et al., 2002 J. Exp. Bot. 53 (370): 947-958. Preferred CPISs are those of prolamin compounds such as zein gamma, alpha zein or rice prolamin mentioned below. Gamma zein, a corn storage protein whose DNA and amino acid residue sequences are shown below, is one of four corn prolamines and accounts for 10 to 15 percent of the total protein found in the corn. endosperm of corn. Like other cereal prolamins, alpha- and zein gammas are biosynthesized in polysomes bound with membranes on the cytoplasmic side of the rough ER, they gather within the light and then they are sequestered in the protein bodies derived from the ER (Herman et al., 1999 Plant Cell 1: 601-613; Ludevid et al., 1984 Plant Mol. Biol. 3: 277-234; Torrent et al., 1986 Plant Mol. Biol. 7: 93-403). The gamma zein is composed of four characteristic domains: i) a signal peptide of 19 amino acids, ii) a repeating domain containing eight units of hexapeptide PPPVHL (SEQ ID NO: 1) (53 aa), iii) the ProX domain where the proline residues alternate with other amino acids (29 aa) and iv) the C-terminal hydrophobic domain rich in cysteine (111 aa). The ability of the gamma zein to constitute ESCPR derived from the ER is not restricted to the seeds. In fact, when the gamma zein gene was constitutively expressed in the Arabidopsis transgenic plants, the storage protein accumulated within the CPLS derived from the ER in the mesophilic cells of the leaves (Geli et al., 1994 Plant Cell 6: 191 1 -1922). When looking for the signal responsible for the deposition of gamma zein in the protein bodies derived from the ER (prolamins do not have the KDEL signal), it has been shown that the proline-rich N-terminal domain including the tandem repeat domain was necessary for the retention of the ER and that the C-terminal domain participated in the formation of the protein bodies. However, the mechanisms by which these domains promote the composition of protein bodies are still unknown. In the same way that protein bodies are only properly named in seeds, similar structures produced in other plant organs and in non-superior plants are generally called recombinant protein bodies (ESCPR) assemblies. Other useful prolamin-type sequences are shown by way of illustration in the following table along with their GenBank identifiers.
Other useful sequences are obtained by a BLAST search in the complete non-redundant database of GenBank including CDS translations + PDB + SwissProt + PIR + PRF (excluding environmental samples) as described in Altschul et al., 1997 Nucleic Acids Res. 25 : 3389-3402 using an unknown such as those shown below: sequence of the RX3 protein (SEQ ID NO: 2), alpha-zein protein sequence (SEQ ID NO: 3), and prolamin protein sequence of the rice (SEQ ID NO: 4). A prolamin modified by way of illustration comprises (a) a sequence of signal peptides, (b) a sequence of one or more copies of the repeated hexapeptide domain PPPVHL (SEQ ID NO: 1) of the gamma zein protein, the entire domain contains eight hexapeptide units; and (c) a sequence of all or part of the ProX domain of the gamma zein. Some illustrative specific modified prolamines are the polypeptides identified below as R3, RX3 and P4 whose DNA sequences and amino acid residues are also shown below. Particularly preferred prolamines are gamma zein and its component parts as disclosed in published application WO2004003207, rP13 rice protein and 22 kDa alpha zein of corn and its N-terminal fragment. The DNA sequences and amino acid residues of the gamma, rice and zein alpha proteins are shown below: - Zein gamma (27 KD) SEQ ID NO: 5 (DNA sequence) and SEQ ID NO: 6 (protein sequence).
- RX3 SEQ ID NO: 7 (DNA sequence) and SEQ ID NO: 8 (protein sequence) - R3 SEQ ID NO: 9 (DNA sequence) and SEQ ID NO: 10 (protein sequence)
- P4 SEQ ID NO: 1 1 (DNA sequence) and SEQ ID NO: 12 (protein sequence)
- X10 SEQ ID NO: 13 (DNA sequence) and SEQ ID NO: 14 (protein sequence)
- rP13 (13 kD rice prolamin homologous to the clone - (GenBank AB016504) Sha et al., 1996 Biosci Biotechnol Biochem 60 (2): 335-337; Wen et al., 1993 Plant Physiol. 101 (3 ): 1 115-1 1 16; Kawagoe et al., 2005 Plant Cell 17 (4): 1 141-1 153; Mullins et al., 2004 J. Agrie. Food Chem. 52 (8): 2242-2246; Mitsukawa et al., 1999 Biosci Biotechnol Biochem 63 (11): 1851-1858) SEQ ID NO: 15 (protein sequence) and SEQ ID NO: 16 (DNA sequence). - 22aZt [N-terminal fragment of the 22kD corn zein zein - (GenBank V01475) Kim et al., 2002 Plant Cell 14 (3): 655-672; Woo et al., 2001 Plant Cell 13 (10): 2297-2317; Matsushima et al., 1997 Biochim. Biophys. Acta 1339 (1): 14-22; Thompson et al., 1992 Plant Mol. Biol. 18 (4): 827-833]. SEQ ID NO: 17 (protein sequence) and SEQ ID NO: 18 (DNA sequence)
Examples of proteins of interest include any protein having therapeutic, nutritional, biocontrol or industrial uses, such as for example monoclonal antibodies (mAb such as IgC, IgM, IgA, etc.) and fragments thereof, antigens for vaccines ( human immunodeficiency virus, HIV, pre-surface antigens, surface and core of hepatitis B, coronaviruses of gastroenteritis, etc.), hormones (calcitonin, growth hormone, etc.), protease inhibitors, antibiotics, collagen , human lactoferrin, cytokines, industrial enzymes (hydrolases, glucosidases, oxido-reductases, and the like).
Exemplary sequences of DNA and amino acid residues for proteins of interest are, for example, Salmon Calcitonin, BAC57417 [SEQ ID NO: 19 (protein sequence) and SEQ ID NO: 20 (DNA sequence)]; hEGF, construction based on AAF85790 without the signal peptide, [SEQ ID NO: 21 (protein sequence) and SEQ ID NO: 22 (DNA sequence)]; and hGH, construction based on P01241 without the signal peptide [SEQ ID NO: 23 (protein sequence), SEQ ID NO: 24 (DNA sequence)]. In another embodiment, the recombinant fusion protein comprises in addition to the CPIS and product sequences of interest, a spacer sequence of amino acids. The spacing sequence of amino acids can be a cuttable sequence, or non-cuttable, by enzymatic or chemical means. In a specific embodiment, the amino acid spacer sequence is placed between the CPIS and the product of interest. An illustrative amino acid sequence is cuttable with a protease such as an enterokinase, endoprotease Arg-C, endoprotease Glu-C, endoprotease Lys-C, Factor Xa and the like. Alternatively, an amino acid sequence that is specifically cleavable with a chemical reagent, such as, for example, cyanogen bromide that cuts into the methionine residues, is encoded. In a further embodiment, the nucleic acid sequence used for transformation is as disclosed in co-pending patent application WO 2004003207. Furthermore, in another embodiment, the nucleic acid sequence is as disclosed in patent application WO 2004003207, but the nucleic acid sequence encoding the cleavable amino acid sequence is absent. In a preferred embodiment, the fusion proteins are prepared according to a method comprising the transformation of a host system of eukaryotic cells different from higher plants, such as an animal, culture of animal cells, fungi / yeast, insects or algae with a sequence of nucleic acid (DNA or RNA) comprising (i) a first nucleic acid encoding a CPIS that is operatively linked in the same reading frame to (ii) a second nucleic acid sequence comprising the nucleotide sequence encoding a product of polypeptide interest, i.e., the nucleic acid sequence encoding the CPIS is chemically linked (peptide linkage) to the sequence encoding the polypeptide of interest so that both polypeptides are expressed from their own reading patterns. The host cell is maintained for a period of time and under culture conditions suitable for expression of the fusion protein and composition of the fusion protein expressed in recombinant protein bodies-like assemblies (ESCPR). After expression, the resulting fusion protein accumulates in the transformed host system in the form of assembly similar to high protein recombinant protein bodies similar to protein bodies. The fusion protein can then be recovered from the host cells or the host cells containing the fusion protein can be used as desired, for example for an animal feed containing an added nutrient or supplement. The fusion protein can be isolated as part of the ESCPR or free of ESCPR. The culture conditions suitable for expression of the fusion protein are typically different for each type of host cell. However, these conditions are known to those skilled in the art and are easily determined. Similarly, the duration of maintenance may differ according to the host cells and according to the amount of fusion protein to be prepared. Again, these conditions are well known and can be easily determined in specific situations. In addition, specific growing conditions can be obtained from the documents cited in this document. In one embodiment, the 3 'end of the first nucleic acid sequence (i) is linked to the 5' end of the second nucleic acid sequence (i). In another embodiment, the 5 'end of the first nucleic acid sequence (i) is linked to the 3' end of the second nucleic acid sequence (ii). In another embodiment, the CPIS comprises a storage protein or a modified storage protein, a fragment or a modified fragment thereof or a newly designed proline-rich sequence. In another specific embodiment, a fusion protein is prepared according to a method comprising transforming the host cell system, for example an animal, animal cell culture, fungi / yeast or algae, with a nucleic acid sequence comprising, in addition to the aforementioned nucleic acid sequences (i) and (ii), a nucleic acid sequence (ii) in the pattern encoding the amino acid spacer sequence. The spacing amino acid sequence may be a cuttable or non-cutting sequence, by enzymatic or chemical means, as explained above. In a specific embodiment, the nucleic acid sequence (iii) is placed between said nucleic acid sequences (i) and (ii), for example, the 3 'end of the third nucleic acid sequence (iii) is linked to the extreme 5 'of the second nucleic acid sequence (ii). In another embodiment, the 5 'end of the third nucleic acid sequence (iii) is linked to the 3"end of the second nucleic acid sequence (ii).
A sequence (segment) of nucleic acid encoding a fusion protein molecule previously described or a complement of said coding sequence is also contemplated herein. Such a nucleic acid segment is present in isolated and purified form in some preferred embodiments. In living organisms, the sequence of amino acid residues of a protein or polypeptide is directly related by genetic code to the deoxyribonucleic acid (DNA) sequence of the gene encoding the protein. Thus, by means of the well-known degeneracy of the genetic code, DNA sequences and their corresponding RNA sequences (nucleic acids) can be prepared as desired which encode the same amino acid residue sequences of fusion proteins, but which are quite different of a genetic sequence described above to prevent the two sequences from hybridizing under highly restrictive conditions, although they do hybridize under moderate restrictive conditions. Highly restrictive conditions can be defined as those comprising hybridization at a temperature of about 50 ° -55 ° C in 6XSSC and a final wash at a temperature of 68 ° C in 1-3XSSC. Moderate stringent conditions comprise hybridization at a temperature of about 50 ° C to about 65 ° C in 0.2 or 0.3 M NaCl, followed by washing at about 50 ° C to about 55 ° C in 0.2X SSC, 0.1% SDS (sodium dodecyl sulfate). A nucleic sequence (DNA sequence or RNA sequence) that (1) codes for itself, or its complement encodes, a fusion protein containing a protein-producing sequence (CPIS) and a polypeptide of interest, as well it is contemplated in the present. As is well known, a nucleic acid sequence such as the contemplated nucleic acid sequence is expressed when it is operably linked to a suitable promoter in a suitable expression system as described elsewhere herein. Different hosts often have preferences for a specific codon to be used to encode a specific amino acid residue. Such codon preferences are well known and a DNA sequence encoding a desired fusion protein sequence can be altered, using for example in vitro mutagenesis, so that codons preferred by the host are used for a specific host in the that the fusion protein must be expressed. Also contemplated in this invention is a recombinant nucleic acid molecule such as a DNA molecule, comprising a vector containing one or more regulatory sequences (control elements) as a suitable promoter to drive the expression of the gene in an organism of eukaryotic host cells compatible operably linked to an exogenous segment of nucleic acid (e.g., a segment or DNA sequence) that defines a gene encoding a contemplated fusion protein, as described above. More specifically, a recombinant DNA molecule comprising a vector comprising a promoter to drive expression of the fusion protein in cells of the host organism operably linked to a DNA segment defining a gene encoding an inducing sequence is also contemplated. of protein bodies (CPIS) linked to a polypeptide of interest. Said recombinant DNA molecule, after suitable transfection and expression in a eukaryotic host cell, produces a fusion protein contemplated in the form of ESCPR. As is well known in the art, as long as the required nucleic acid sequence (eg, DNA) is present including the start and end signals, additional base pairs may be present at either end of the DNA segment and that segment may be used to express the protein. This, of course, presupposes the absence in the segment of an operably linked DNA sequence that represses the expression, expressing an additional product that consumes the fusion protein that is desired to express, expresses a product that consumes a desired reaction product produced by said desired fusion protein, or otherwise interferes with the expression of the gene of the DNA segment. Thus, as long as the DNA segment is free of said interfering DNA sequences, a DNA segment of the invention can have from about 500 to about 15,000 base pairs in length. The maximum size of a recombinant DNA molecule, especially an expression vector, is determined primarily by the convenience and size of the vector that a host cell can accept, given that all the minimum DNA sequences required for replication and expression , when desired, are present. The minimum sizes of vectors are well known. Said long sequences of non-5 DNA are preferred, but can be used. A DNA segment encoding a fusion protein described above can be synthesized with chemical techniques, such as for example the phosphotriester method of Matteucci et al. (1981) J. Am. Chem. Soc, 103: 3185. Of course, by chemical synthesizing the coding sequence, any desired modifications can be made simply by substitution of the appropriate bases by those encoding the native amino acid residue sequence. However, DNA segments are preferred, among them the sequences specifically described herein. The DNA segments containing a gene encoding the fusion protein are preferably obtained from recombinant DNA molecules (plasmid vectors) containing said gene. A vector that directs the expression of a fusion protein gene in a host cell is referred to herein as an "expression vector". An expression vector contains expression control elements, including
the promoter. The gene encoding the fusion protein is operably linked to the expression vector to allow the promoter sequence to direct the RNA polymerase linkage and the expression of the gene encoding the fusion protein. To express the gene encoding the polypeptides, promoters that are inducible, viral, synthetic and constitutive are described as described by Poszkowski et al. 25 (1989) EMBO J., 3: 2719 and Odell et al. (1985) Nature, 313: 810, as well as those that are regulated in time, regulated in space and regulated spatiotemporally as described in Chua et al. (1989) Science, 244: 174-181. The expression vectors compatible with eukaryotic cells, such as those that are compatible with yeast cells or those compatible with cells of
mammals, algae or insects and the like, are contemplated in the present. Such expression vectors can also be used to form the recombinant DNA molecules of the present invention. Vectors for use in yeasts such as S. cerevisiae or Pichia pastoris may be episomal or integrating, as is well known. Expression vectors of eukaryotic cells are well known in the art and there are several commercial suppliers. Typically, said vectors contain one or more convenient restriction sites for the insertion of the desired DNA segment and promoter sequences. Optionally, said vectors contain a specific selectable marker for use in eukaryotic cells. Examples of promoters for use in S. cerevisiae are the phosphoglycerol kinase (PGK) promoter for S. cerevisiae and the GAL 10 and GAL 1 divergent promoters, while the alcohol oxidase (AOX1) gene is a useful promoter for the Pichia pastoris An illustrative expression of a fusion protein in S. cerevisiae and Pichia pastoris is shown below. The production of a fusion protein by recombinant DNA expression in mammalian cells is illustrated hereinafter using a recombinant DNA vector expressing the fusion protein gene in Chinese hamster ovary (CHO) host cells, cells Hosts of the Cosí and human 293T. This is accomplished using procedures well known in the art and are described in more detail in Sambrook et al., Molecular Cloninq: A Laboratorv Manual, 2nd ed., Cold Spring Harbor Laboratories (1989). An insect cell system can also be used to express a contemplated fusion protein. For example, in one such system the nuclear polyhedrosis virus of Autographa californica (AcNPV) or baculovirus is used as an expression vector of foreign genes in Spodoptera frugiperda cells or Trichoplusia larvae. The sequences encoding a fusion protein can be cloned into a non-essential region of the virus, such as the polyhedrin gene, and placed under the control of the polyhedrin promoter. Successful insertion of a fusion protein sequence renders the polyhedrin gene inactive and produces a recombinant virus lacking a protein coat. The recombinant viruses can then be used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which the fusion protein can be expressed. E. Engelhard et al. (1994) Proc. Nati Acad. Sci., USA, 91: 3224-3227; and V. Luckow, Insect Cell Expression Technology, pp. 183-218, in Protein Enqineerinq: Principies and Practice, J.L. Cleland et al. eds., Wiley-Liss, Inc, 1996). Heterologous genes placed under the control of the polyhedrin promoter of Autographa californica nuclear polyhedrosis virus (AcNPV) are often expressed at high levels during the late stages of infection. Recombinant baculoviruses containing the fusion protein gene are constructed using the baculovirus transporter vector system (Luckow et al (1993) J. Virol., 67: 4566-4579], sold commercially as the expression system of baculovirus Bac-to-Bac (Life Technologies) Stocks of recombinant viruses are prepared and the expression of the recombinant protein is monitored with normal protocols (O'Reilly et al., Baculovirus Expression Vectors: A Laboratory Manual, WH Freeman and Company , New York, 1992; and King et al., The Baculovirus Expression System: A Laboratory Guide, Chapman & Hall, London, 1992). The choice of which expression vector and which promoter is operably linked to a gene encoding fusion protein depends directly on the desired functional properties, for example the location and time of expression of proteins, and the host cell to be transformed. These are well-known limitations inherent in the technique of building recombinant DNA molecules. However, a useful vector for practicing the present invention can direct the replication, and preferably also the expression (for an expression vector) of the gene of the fusion protein included in the DNA segment to which it is operatively linked. The expressed ESCPR and its fusion proteins can be obtained from the host cells expressing them by common means used in biochemical or biological recovery. As the ESCPR are dense with respect to other proteins present in the host cells, the ESCPR are especially sptible to being collected by centrifugation of a cellular homogenate. The fusion proteins can be obtained from the ESCPR collected by dissolving the surrounding membrane in a buffer solution containing a reducing agent such as 2-mercaptoethanol.
Without further elaboration, it is believed that a person skilled in the art can, using the preceding description and the examples detailed below, use the present invention to its fullest extent. The following preferred specific embodiments should, therefore, be interpreted merely as illustrative and not in any way limiting the remainder of the description.
Example 1: Accumulation of fusion proteins in cells of transfected mammals Synthetic genes corresponding to mature calcitonin (Ct) and EGF sequences as well as the cDNA encoding the hGH sequence were fused to the N-terminal gamma zein sequence which encodes RX3 (WO2004003207) and were introduced into the vector pcDNA3.1 (Invitrogen) to obtain the constructs p3.1 RXCt, p3.1 RXEGF and p3.1 RXhGH. These constructs encoding the fusion proteins RX3-Ct, RX3-EGF and RX3-hGH were introduced into mammalian 293T, Cosi and CHO cells by the lipofectamine-based transfection procedure (Invitrogen). 293T and Cosi cells transfected with plasmid pECFP-N1 (Clontech) containing the genetic sequence of an improved and modified fluorescent cyan GFP were used as controls. The accumulation of fusion proteins in the transiently transfected cells was analyzed with Western blot, using antibodies against the gamma zein. After 44 hours of transfection, the total soluble proteins were extracted with buffer solution A (100 mM Tris-HCl pH 8.0, 150 mM NaCl, 5 mM EDTA, 0.5% SDS, Triton X-100 0.5 %, 2-mercaptoethanol 2% and protease inhibitors). The aliquots of cell incubation media were precipitated and stored at -20 ° C. Proteins extracted from equivalent amounts of transfected cells were separated by electrophoresis by polyacrylamide gel and transferred to nitrocellulose sheets. As can be seen from the results shown in Figure 1, the three fusion proteins analyzed RX3-Ct, RX3-EGF and RX3-hGH, accumulated very efficiently regardless of the type of cultured cells selected for expression: compare the pattern of accumulated RX3-hGH in both 293T and CHO cells and the RX3-EGF pattern in CHO and Cosí cells. The fusion proteins were observed in the protein extracts corresponding to the transfected cells (slots c) and no immunoreactive band was detected in the cell culture media (slots m). This observation suggests that the RX3 domain is capable of composing and retaining the fusion proteins in the endomembrane compartment.
These results illustrate how fusion proteins derived from RX3 are composed and accumulated in the endomembrane system in the three types of mammalian cells analyzed (human 293T cells, Cosí monkey cells and CHO hamster cells), suggesting that an efficient accumulation of a desired protein can be achieved in any mammalian cell or organism chosen by means of fusion with the RX3 domain.
Example 2: Subcellular localization of fusion proteins in transfected mammalian cells To determine whether the N-terminal gamma zein RX3 sequence was able to induce recombinant protein-like assemblies in mammalian cells, the location of the RX3 fusion proteins -Ct and RX3-EGF was analyzed with immunocytochemistry using confocal microscopy. The transfected cells were fixed for 10 minutes in 3% paraformaldehyde and after washing with phosphate buffer, they were incubated with gamma zein antiserum (1/700 solution) for 1 hour. The non-immune serum was used as a control. The primary antibodies were detected with conjugated anti-rabbit antibodies in Alexa Fluor 488 or Alexa Fluor 555 dyes (molecular probes). Micrographs of the transfected cells were obtained using a laser confocal microscope (Leica TCS SP, Heidelberg, Germany) equipped with spectrophotometers for selection of the wavelength of the emission band. The green fluorescent images were collected at a 488 nm excitation with the Argon ion laser using a defined emission window at 495-535 nm. Red fluorescent images were collected after an excitation of 543 nm with a HeNe laser and emission window of 550-600. The optical sections were 0, 5 μ ?? of thickness. Digital images and projections were recorded with confocal microscope software. Fig. 2 shows the confocal projections of cells transfected with p3.1Ct-p3.1 RXEGF. As shown in the figure, the corresponding fusion proteins, RX3-Ct and RX3EGF, were detected in the endoplasmic reticulum (ER, arrow in Figure 2A) indicating that the signal peptide of gamma zein is functional in mammalian cells in which mediates the translocation of the fusion protein to ER. The samples incubated with the non-immune serum as control did not show any significant fluorescence (not shown). It is important to note that it is surprising that the fusion proteins appear preferentially accumulated in large spots that appear surrounded by a membrane (see figure embedded in Figure 2A). These structures, which are absent in non-transfected cells, are comparable in size to the plant bodies of plants, with diameters of approximately 1 micrometer (figures embedded in A and C). This result is not only surprising because animal cells can reproduce the organelle of storage CP described in plants, but because of the large amount of ESCPR observed in all transfected cells, indicating the capacity for efficient accumulation of these cells. In addition, the different types of transfected cells showed the same localization and accumulation patterns as the fusion proteins (see Figures 2A, 2B and 2D), and this pattern appears independently of the target fused with the CPIS (Figures 2B and 2C). The cells were co-transfected with plasmid pDsRed2-RE (Clontech) containing the sequence for a fluorescent protein used as a marker of RE to analyze the subcellular origin of the induced CPLS. It is interesting, as can be seen in Figures 2D, 2E and 2F, that the RX3-Ct and the RE marker are colocalized in the ER and in assemblages similar to protein bodies, indicating the origin in ER of the ESCPR induced in cells of mammals as it occurs in plant cells.
Example 3: Accumulation of fusion proteins in transformed yeast cells Sequences encoding EGF and hGH were fused in the NX-terminal gamma zein RX3 coding sequence (WO2004003207) and introduced into vector pYX243 (R &D Systems) to obtain constructs d 17 and c1 18. These constructs encoding the fusion proteins RX-EGF and RX-hGH were introduced into Saccharomyces cerevisiae. Expression analyzes were performed by growth of transformants in a galactose-containing medium and equivalent amounts of cells and media were analyzed with SDS-PAGE and immunoblot using specific antibodies against the expressed recombinant proteins. As can be seen in Figure 3A both fusion proteins RX3-EGF (slots c1 17) and RX3-hGH (slots c1 18) accumulated in the yeast cells, and no traces of protein were detected in the media. The accumulation of hGH fusions and those derived from hGH was also studied in the yeast Pichia pastoris that was transformed with the c135 and c121 constructs (which encode the fusion protein RX3-hGH) and c136 (which encodes the hGH protein, see schematic representation in Figure 3B). The transformants that accumulate the highest levels of recombinant proteins were selected. Two different signal peptides were used to express the fusion protein, the zein gamma signal peptide (Fig. 3B, SPg) and the peptide factor alpha prepro (Fig. 3B, Afprepro). In addition, a secretion control was also analyzed using the peptide alpha factor prepro of the fused yeast directly in hGH (Fig. 3B, c136). The total proteins of the cells and media were analyzed with Western blot using antibodies specific against hGH cultured in rabbit. As expected, hGH was secreted into the medium (Fig. 3B, slot c136 / m) when the Afprepro peptide was used. In contrast, the RX3-hGH fusion protein accumulated within the yeast cells regardless of the signal peptide used. As can be seen in fig. 3B, the fusion protein was inside cells in both cases, when the yeast Afprepro peptide was used (slot c135 / y) and when the zein gamma signal peptide (slot c121 / y) was used, no traces were detected of the protein in the media (not shown). Thus, the proline-rich N-terminal domain of the gamma zein was sufficient to mediate the retention of protein in the endomembrane compartment of the yeast cells, and more specifically a dense fraction corresponding to CP-like structures that could be separated by centrifugation . The results obtained in Saccharomyces cerevisiae and Pichia pastoris are examples of other eukaryotic organisms apart from plants and the animal kingdom in which fusion proteins containing a seed storage protein are efficiently compiled and accumulated in CP-like structures.
Experimental procedures
Plasmid constructs for transfection of mammals. The synthetic gene corresponding to the mature calcitonin sequence (Ct) was obtained as described (patent application WO2004003207). The synthetic gene encoding the 53 amino acids of active hEGF was obtained by the PCR method of overlapping the primer, using 4 oligonucleotides of about 60 bases, with 20 overlapping bases. The hEGF synthetic cDNA included a 5 'linker sequence corresponding to the specific cleavage site of Factor Xa. The oligonucleotides [EGF1 (SEQ ID NO: 25); EGF2 (SEQ ID NO: 26); EGF3 (SEQ ID NO: 27); and EGF4 (SEQ ID NO: 28) were purified with denaturing polyacrylamide gel. The cDNA sequence encoding the 191 amino acids of human growth hormone (hGH) was obtained from cDNA of the human pituitary gland (Clontech, BDBiosciences) by PCR using the oligonucleotides GH5 (SEQ ID N °: 29) and GH3 (SEQ ID N °: 30) which include the sequence corresponding to the enterokinase cutting site. Synthetic genes that correspond to mature calcitonin (Ct), WO2004003207) and hEGF sequences as well as the cDNA encoding hGH were fused to the coding sequence of N-terminal gamma zein RX3 (patent WO2004003207) and introduced into pUC18. The Sall-BamHI restriction fragments of the plasmids derived from pUC18, pUC18RXCt, pUC18RX01 and pUC18RX06, containing the corresponding fusion protein sequences of RX3-Ct, RX3-EGF and RX3-hGH were introduced into the vector pcDNA3.1- (Invitrogen) restricted with Xho l-Bam Hl. In the resulting constructs called p3.1 RX3CT, p3.1 RX3EGF and p3.1 RX3hGH, the fusion protein sequences were under the CMV promoter and the pA BGH terminator.
Plasmid constructs for transformation of yeast Vectors and host strains: Saccharomyces cerevisiae strain (genotype Mata his3 Ieu2 met15 ura3 bar1 :: URA3) was transformed by the use of constructs derived from vector pYX243 (promoter GAL, LEU2, AmpR, from R &D Systems). The strain of Pichia pastoris GS1 15. { his4) and the vectors pPIC9 and pPIC3.5K (promoter AOX1, HIS4, AmpR) were from Invitrogen life tech.
Plasmid constructs: Sall restriction fragments (blunt-ended) -BamHI from the plasmids described above derived from pUC18, pUC18RX01 and pUC18RX06, containing the corresponding fusion protein sequences RX3-EGF and RX3-hGH were introduced into vector pYX243 (R &D Systems) restricted with EcoRI (blunt ends) -Bam Hl. In the resulting constructs that are called, respectively, d 17 and d 18, the fusion protein sequences were under the control of the GAL inducible promoter. Sall restriction fragments (blunt ends) -BamHI (blunt ends) of the plasmids derived from pUC18, pUC18RX01 and pUC18RX06, were introduced into the vector pPIC3.5K (Invitrogen) restricted with Notl (blunt ends) -EcoRI (blunt ends) to obtain plasmids c120 and c121 to transform the Pichia Pastoris. Plasmid pPIC9 (Invitrogen) was used to analyze the expression of fusion proteins using a yeast signal peptide, the alpha prepro peptide of Saccharomyces. The sequences flanked with Xhol-Notl encoding the hGH and RX3-hGH proteins were obtained by PCR using pUC18RX06 as template and the oligonucleotides af06 (SEQ ID NO: 31), afRX (SEQ ID NO: 32) and 06Not (SEQ ID N °: 33). These sequences contained the sequence encoding the KEX2 site necessary for efficient cleavage of the alpha prepro peptide (Invitrogen, Pichia expression kit). PCR products were cloned in pPIC9 restricted with Xhol-Notl, resulting in plasmids c135 and c136 containing, respectively, the sequences of RX3-hGH and hGH proteins fused to the peptide factor alpha prepro.
Transformation of the yeast Saccharomyces cerevisiae strain (Ieu2) was transformed with the plasmids constructs c1 17 and c118 with the LiAc process (Ito et al 1983, J. Bacteriol 153: 163-168) and transformants were selected in Leu plates. . Expression analyzes were done by the growth of transformants in a galactose-containing medium. The strain of Pichia pastoris GS1 15. { his4) was transformed with the Pichia EasyComp Kit (from Invitrogen life tech.) with plasmids c120 and c121 Sacl linearized and coated in an RDB His- medium. The Mut phenotypes were determined by streaking the colonies on MD and MM agar plates. Expression tests were performed by growth of transformants in YPD medium for two days. Thereafter, the cells were pelleted and suspended in an MM medium for another 48 hours and methanol was added to a final concentration of 0.5% for 24 hours. The transformants that accumulate the highest levels of recombinant proteins were selected. The recipes of the media were made as described by Invitrogen (Pichia expression Kit).
Extraction of yeast proteins and Western blot S. cerevisiae and P. pastoris expressing recombinant fusion proteins were pelleted. The aliquots of the respective incubation media were precipitated and stored at -20 ° C for further analysis. The cell pellets were also frozen and after thawing, the cells were disrupted with normal procedures using glass beads and H medium (50 mM HCI-Tris pH 8.0, 150 mM NaCl, 5 mM EDTA, 200 mM DTT and inhibitors of protease). Equivalent amounts of cells and media were analyzed with SDS-PAGE and immunoblot using specific antibodies against the expressed recombinant proteins. Each of the patents and articles cited in this document are incorporated by reference. The use of the article "one" is intended to include one or more. The above description and the examples are intended to be illustrative and are not to be taken as limiting. Still other variations are possible within the spirit and scope of this invention and those skilled in the art will easily deduce them by themselves.
Claims (20)
1. A eukaryotic host cell of non-superior plants containing recombinant fusion protein within assemblies similar to recombinant protein bodies (ESCPR), said fusion protein containing two interlinked sequences in which a sequence is a heterologous protein body-inducing sequence with with respect to the host cell and the other is the sequence of a product of interest.
2. The host cell according to claim 1 wherein the density of the ESCPR is from about 1.1 to about 1.35 g / ml.
3. The host cell according to claim 1 wherein said fusion protein further includes a linker sequence between the protein body-inducing sequence and the sequence of the product of interest.
4. The host cell according to claim 1 wherein the inducing sequence of the proteinaceous body comprises a prolamine or a modified prolamin.
5. The host cell of claim 4 wherein the protein body inducing sequence is a prolamin sequence.
6. The host cell according to claim 5 wherein the prolamin sequence is zein gamma, alpha zein or the prolamin of rice.
7. A method for producing a fusion protein comprising the following steps: (a) transforming eukaryotic host cells that do not belong to higher plants with a nucleic acid sequence comprising (i) a first nucleic acid encoding a protein body-inducing sequence (CPIS) which is operably linked in the same reading frame to (ii) a second nucleic acid sequence comprising the nucleotide sequence encoding a product of interest polypeptide; and (b) maintaining the transformed host cells for a period of time and under culture conditions suitable for expression of the fusion protein and within assemblies similar to recombinant protein bodies (ESCPR).
8. The method according to claim 7, wherein the 3 'end of the first nucleic acid sequence (i) is linked to the 5' end of the second nucleic acid sequence (i).
9. The method according to claim 7 wherein said nucleic acid encodes the linker sequence between the CPIS and the polypeptide product of interest.
10. The method according to claim 7 wherein said host cells are fungal cells, in particular yeast cells.
11. The method according to claim 7 wherein said host cells are algae cells.
12. The method according to claim 7 wherein said host cells are cells of animals, in particular cells of mammalian animals.
13. The method according to claim 7 which includes the additional step of recovering the expressed fusion protein.
14. The method according to claim 7, wherein the protein body-inducing sequence comprises a prolamine or a modified prolamin.
15. The method according to claim 14 wherein the prolamin sequence is a modified prolamin comprising (a) a signal peptide sequence, (b) a sequence of one or more copies of the repeated hexapeptide domain PPPVHL (SEQ ID NO: 1) ) of the gamma zein protein, and (c) a sequence of all or part of the ProX domain of the gamma zein.
16. The method according to claim 14, wherein the prolamin sequence is zein gamma, alpha zein or the prolamin of rice.
17. The method according to claim 7, wherein the nucleic acid sequence used to transform the host cells is present in an expression vector that includes one or more regulatory sequences.
18. The method according to claim 17 wherein said one or more regulatory sequences include a promoter.
19. A recombinant nucleic acid molecule comprising a vector containing one or more regulatory sequences operably linked to an exogenous nucleic acid segment that defines a gene encoding a fusion protein comprising (i) a protein body-inducing sequence that is linked a (ii) a product of polypeptide interest.
20. The nucleic acid according to claim 19 wherein said one or more regulatory sequences of said vector includes a promoter suitable for directing the expression of the gene in a compatible eukaryotic host cell.
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB0426160.8 | 2004-11-29 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MX2007006330A true MX2007006330A (en) | 2008-10-03 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2005308921B2 (en) | Production of proteins | |
| RU2194758C2 (en) | Method of preparing heterogeneous polypeptide in yeast saccharomyces cerevisiae | |
| US11312968B2 (en) | Yeast strains and methods for producing collagen | |
| US6410264B1 (en) | Pichia pastoris gene sequences and methods for their use | |
| EP3438125A1 (en) | Yeast strains and methods for controlling hydroxylation of recombinant collagen | |
| CA2278523C (en) | Rna binding protein and binding site useful for expression of recombinant molecules | |
| JPH10501413A (en) | Synthetic leader peptide sequences | |
| EP1687428A1 (en) | Processes for making acylated insulin | |
| US20120142090A1 (en) | Protein Expression System | |
| Koganesawa et al. | Expression and purification of a small cytokine growth-blocking peptide from armyworm Pseudaletia separata by an optimized fermentation method using the methylotrophic yeast Pichia pastoris | |
| US6861237B2 (en) | Production of heterologous polypeptides in yeast | |
| ES2258797T3 (en) | PROTEINS WITH N-TERMINAL EXTENSION EXPRESSED IN LEAVING. | |
| EP3027752B1 (en) | Signal sequence for protein expression in pichia pastoris | |
| MX2007006330A (en) | Production of proteins | |
| US20080227154A1 (en) | Protein Expression | |
| Kozuma et al. | Human serum albumin production in silkworm Bombyx mori fourth instar larvae | |
| WO2001042485A1 (en) | Sequence and method for increasing protein expression in cellular expression systems | |
| CA2175068C (en) | The recombinant production of proteins in yeast | |
| EP2369000A1 (en) | Production of peptides and proteins by accumulation in mitochondria | |
| EP1339856A1 (en) | Production of heterologous polypeptides in yeast |