US20140186890A1 - Selection of host cells expressing protein at high levels - Google Patents
Selection of host cells expressing protein at high levels Download PDFInfo
- Publication number
- US20140186890A1 US20140186890A1 US14/165,374 US201414165374A US2014186890A1 US 20140186890 A1 US20140186890 A1 US 20140186890A1 US 201414165374 A US201414165374 A US 201414165374A US 2014186890 A1 US2014186890 A1 US 2014186890A1
- Authority
- US
- United States
- Prior art keywords
- sequence
- selectable marker
- start codon
- interest
- colonies
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108090000623 proteins and genes Proteins 0.000 title claims description 317
- 102000004169 proteins and genes Human genes 0.000 title claims description 140
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 247
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 233
- 229920001184 polypeptide Polymers 0.000 claims abstract description 229
- 239000003550 marker Substances 0.000 claims abstract description 211
- 238000013518 transcription Methods 0.000 claims abstract description 112
- 230000035897 transcription Effects 0.000 claims abstract description 110
- 238000013519 translation Methods 0.000 claims abstract description 106
- 108020004414 DNA Proteins 0.000 claims abstract description 84
- 238000000034 method Methods 0.000 claims abstract description 74
- 108091026890 Coding region Proteins 0.000 claims abstract description 50
- 230000014621 translational initiation Effects 0.000 claims abstract description 44
- 238000004519 manufacturing process Methods 0.000 claims abstract description 30
- 102000053602 DNA Human genes 0.000 claims abstract description 29
- 238000012258 culturing Methods 0.000 claims abstract description 20
- 230000014509 gene expression Effects 0.000 claims description 302
- 210000004027 cell Anatomy 0.000 claims description 277
- 108091081024 Start codon Proteins 0.000 claims description 192
- 230000003115 biocidal effect Effects 0.000 claims description 119
- 101710197208 Regulatory protein cro Proteins 0.000 claims description 73
- 238000011144 upstream manufacturing Methods 0.000 claims description 62
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 claims description 50
- 229930182817 methionine Natural products 0.000 claims description 35
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 34
- 229930189065 blasticidin Natural products 0.000 claims description 32
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 28
- 239000003795 chemical substances by application Substances 0.000 claims description 26
- 229950010131 puromycin Drugs 0.000 claims description 25
- 108091033380 Coding strand Proteins 0.000 claims description 24
- 108010077544 Chromatin Proteins 0.000 claims description 23
- 210000003483 chromatin Anatomy 0.000 claims description 23
- 229930193140 Neomycin Natural products 0.000 claims description 21
- 229960004927 neomycin Drugs 0.000 claims description 21
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 claims description 18
- 229960000485 methotrexate Drugs 0.000 claims description 18
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 17
- 229940024606 amino acid Drugs 0.000 claims description 17
- 150000001413 amino acids Chemical class 0.000 claims description 17
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 17
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 15
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 12
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 11
- 239000004473 Threonine Substances 0.000 claims description 11
- 239000004474 valine Substances 0.000 claims description 9
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 8
- 230000005030 transcription termination Effects 0.000 claims description 8
- 239000011159 matrix material Substances 0.000 claims description 7
- 239000003242 anti bacterial agent Substances 0.000 claims description 6
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 claims description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 5
- 239000012212 insulator Substances 0.000 claims description 5
- 229960000310 isoleucine Drugs 0.000 claims description 5
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 5
- 229930027917 kanamycin Natural products 0.000 claims description 5
- 229960000318 kanamycin Drugs 0.000 claims description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 claims description 5
- 229930182823 kanamycin A Natural products 0.000 claims description 5
- 230000009422 growth inhibiting effect Effects 0.000 claims description 4
- 231100000518 lethal Toxicity 0.000 claims description 4
- 230000001665 lethal effect Effects 0.000 claims description 4
- 239000002243 precursor Substances 0.000 claims description 4
- 125000000539 amino acid group Chemical group 0.000 claims description 3
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 claims description 3
- SXTAYKAGBXMACB-UHFFFAOYSA-N methionine S-imide-S-oxide Natural products CS(=N)(=O)CCC(N)C(O)=O SXTAYKAGBXMACB-UHFFFAOYSA-N 0.000 claims description 3
- 108091033319 polynucleotide Proteins 0.000 claims 4
- 102000040430 polynucleotide Human genes 0.000 claims 4
- 239000002157 polynucleotide Substances 0.000 claims 4
- 239000012228 culture supernatant Substances 0.000 claims 2
- 238000003306 harvesting Methods 0.000 claims 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 claims 1
- 230000035772 mutation Effects 0.000 abstract description 38
- 150000007523 nucleic acids Chemical group 0.000 abstract description 31
- 230000007423 decrease Effects 0.000 abstract description 20
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 13
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 136
- 108010084455 Zeocin Proteins 0.000 description 131
- 235000018102 proteins Nutrition 0.000 description 125
- 230000014616 translation Effects 0.000 description 102
- 239000013612 plasmid Substances 0.000 description 83
- 238000001890 transfection Methods 0.000 description 78
- 241000701022 Cytomegalovirus Species 0.000 description 67
- 108700026244 Open Reading Frames Proteins 0.000 description 56
- 239000012634 fragment Substances 0.000 description 47
- 102000018651 Epithelial Cell Adhesion Molecule Human genes 0.000 description 46
- 108010066687 Epithelial Cell Adhesion Molecule Proteins 0.000 description 46
- 230000000694 effects Effects 0.000 description 46
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 35
- 235000006109 methionine Nutrition 0.000 description 35
- 108020004705 Codon Proteins 0.000 description 33
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 32
- 239000000047 product Substances 0.000 description 30
- 238000012360 testing method Methods 0.000 description 29
- 101150074155 DHFR gene Proteins 0.000 description 28
- 108700019146 Transgenes Proteins 0.000 description 27
- 239000003623 enhancer Substances 0.000 description 27
- 102000039446 nucleic acids Human genes 0.000 description 23
- 108020004707 nucleic acids Proteins 0.000 description 23
- 239000013598 vector Substances 0.000 description 23
- 239000002609 medium Substances 0.000 description 22
- 239000001963 growth medium Substances 0.000 description 21
- 108020004999 messenger RNA Proteins 0.000 description 21
- 125000006850 spacer group Chemical group 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 17
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 17
- 238000002474 experimental method Methods 0.000 description 17
- 238000007792 addition Methods 0.000 description 16
- 235000001014 amino acid Nutrition 0.000 description 16
- 239000002773 nucleotide Substances 0.000 description 14
- 125000003729 nucleotide group Chemical group 0.000 description 14
- 210000003705 ribosome Anatomy 0.000 description 14
- 230000003247 decreasing effect Effects 0.000 description 12
- 239000012737 fresh medium Substances 0.000 description 12
- 239000000463 material Substances 0.000 description 12
- 230000001965 increasing effect Effects 0.000 description 10
- 108700008625 Reporter Genes Proteins 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 238000011534 incubation Methods 0.000 description 9
- 230000000644 propagated effect Effects 0.000 description 9
- 230000003321 amplification Effects 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 230000009286 beneficial effect Effects 0.000 description 8
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 239000005090 green fluorescent protein Substances 0.000 description 8
- 230000000670 limiting effect Effects 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 8
- 108060003951 Immunoglobulin Proteins 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 239000002299 complementary DNA Substances 0.000 description 7
- 230000001419 dependent effect Effects 0.000 description 7
- 102000018358 immunoglobulin Human genes 0.000 description 7
- 238000010348 incorporation Methods 0.000 description 7
- 230000002103 transcriptional effect Effects 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 238000012054 celltiter-glo Methods 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 230000001976 improved effect Effects 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 238000010187 selection method Methods 0.000 description 6
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 5
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 102000007469 Actins Human genes 0.000 description 4
- 108010085238 Actins Proteins 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 101710196632 LexA repressor Proteins 0.000 description 4
- 108060001084 Luciferase Proteins 0.000 description 4
- 101710118186 Neomycin resistance protein Proteins 0.000 description 4
- 229930182555 Penicillin Natural products 0.000 description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 4
- 108020005067 RNA Splice Sites Proteins 0.000 description 4
- 125000003275 alpha amino acid group Chemical group 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 230000030279 gene silencing Effects 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 229940049954 penicillin Drugs 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 241000287828 Gallus gallus Species 0.000 description 3
- 101001035782 Gallus gallus Hemoglobin subunit beta Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- 102000016943 Muramidase Human genes 0.000 description 3
- 108010014251 Muramidase Proteins 0.000 description 3
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000012894 fetal calf serum Substances 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 229960000274 lysozyme Drugs 0.000 description 3
- 235000010335 lysozyme Nutrition 0.000 description 3
- 239000004325 lysozyme Substances 0.000 description 3
- 150000002742 methionines Chemical class 0.000 description 3
- 201000000050 myeloid neoplasm Diseases 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- SGKRLCUYIXIAHR-AKNGSSGZSA-N (4s,4ar,5s,5ar,6r,12ar)-4-(dimethylamino)-1,5,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4a,5,5a,6-tetrahydro-4h-tetracene-2-carboxamide Chemical compound C1=CC=C2[C@H](C)[C@@H]([C@H](O)[C@@H]3[C@](C(O)=C(C(N)=O)C(=O)[C@H]3N(C)C)(O)C3=O)C3=C(O)C2=C1O SGKRLCUYIXIAHR-AKNGSSGZSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 101000669028 Homo sapiens Zinc phosphodiesterase ELAC protein 2 Proteins 0.000 description 2
- 239000012097 Lipofectamine 2000 Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 101100278853 Mus musculus Dhfr gene Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 102000010292 Peptide Elongation Factor 1 Human genes 0.000 description 2
- 108010077524 Peptide Elongation Factor 1 Proteins 0.000 description 2
- 102100037935 Polyubiquitin-C Human genes 0.000 description 2
- 108010056354 Ubiquitin C Proteins 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 102100039877 Zinc phosphodiesterase ELAC protein 2 Human genes 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000003466 anti-cipated effect Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000003292 diminished effect Effects 0.000 description 2
- 229960003722 doxycycline Drugs 0.000 description 2
- 229940088598 enzyme Drugs 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 230000001771 impaired effect Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 101150066555 lacZ gene Proteins 0.000 description 2
- 238000001823 molecular biology technique Methods 0.000 description 2
- 210000000299 nuclear matrix Anatomy 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 230000008092 positive effect Effects 0.000 description 2
- LXNHXLLTXMVWPM-UHFFFAOYSA-N pyridoxine Chemical compound CC1=NC=C(CO)C(CO)=C1O LXNHXLLTXMVWPM-UHFFFAOYSA-N 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000030787 translation reinitiation Effects 0.000 description 2
- IEDVJHCEMCRBQM-UHFFFAOYSA-N trimethoprim Chemical compound COC1=C(OC)C(OC)=CC(CC=2C(=NC(N)=NC=2)N)=C1 IEDVJHCEMCRBQM-UHFFFAOYSA-N 0.000 description 2
- 229960001082 trimethoprim Drugs 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 101710154825 Aminoglycoside 3'-phosphotransferase Proteins 0.000 description 1
- 101100519158 Arabidopsis thaliana PCR2 gene Proteins 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 108700020787 Drosophila Pc Proteins 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 102000001690 Factor VIII Human genes 0.000 description 1
- 108010054218 Factor VIII Proteins 0.000 description 1
- 101100383920 Fragaria ananassa MCSI gene Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 1
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 1
- 108010034791 Heterochromatin Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 241001135569 Human adenovirus 5 Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000006496 Immunoglobulin Heavy Chains Human genes 0.000 description 1
- 108010019476 Immunoglobulin Heavy Chains Proteins 0.000 description 1
- 102000013463 Immunoglobulin Light Chains Human genes 0.000 description 1
- 108010065825 Immunoglobulin Light Chains Proteins 0.000 description 1
- 102000012745 Immunoglobulin Subunits Human genes 0.000 description 1
- 108010079585 Immunoglobulin Subunits Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- 108091036060 Linker DNA Proteins 0.000 description 1
- 241001045988 Neogene Species 0.000 description 1
- 102000012425 Polycomb-Group Proteins Human genes 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 210000001776 amniocyte Anatomy 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- 229960000301 factor viii Drugs 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 102000005396 glutamine synthetase Human genes 0.000 description 1
- 108020002326 glutamine synthetase Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 210000004458 heterochromatin Anatomy 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000013067 intermediate product Substances 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 101150091879 neo gene Proteins 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000010412 perfusion Effects 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 235000008160 pyridoxine Nutrition 0.000 description 1
- 239000011677 pyridoxine Substances 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000007790 scraping Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000012679 serum free medium Substances 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 229940011671 vitamin b6 Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/30—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants from tumour cells
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
Definitions
- the disclosure relates to the field of molecular biology and biotechnology. More specifically the disclosure relates to means and methods for improving the selection of host cells that express proteins at high levels.
- Proteins can be produced in various host cells for a wide range of applications in biology and biotechnology, for instance as biopharmaceuticals.
- Eukaryotic and particularly mammalian host cells are preferred for this purpose for expression of many proteins, for instance when such proteins have certain posttranslational modifications such as glycosylation.
- Methods for such production are well established, and generally entail the expression in a host cell of a nucleic acid (also referred to as “transgene”) encoding the protein of interest.
- a nucleic acid also referred to as “transgene”
- the transgene together with a selectable marker gene is introduced into a precursor cell, cells are selected for the expression of the selectable marker gene, and one or more clones that express the protein of interest at high levels are identified, and used for the expression of the protein of interest.
- transgenes One problem associated with the expression of transgenes is that it is unpredictable, stemming from the high likelihood that the transgene will become inactive due to gene silencing (McBurney et al., 2002), and therefore many host cell clones have to be tested for high expression of the transgene.
- One method describes the use of selectable marker proteins with mutations in their coding sequence that diminishes, but does not destroy, the function of the marker (e.g., WO 01/32901).
- the rationale is that higher levels of the mutant marker expression are required when selection conditions are employed and therefore selection for high expression of the marker is achieved, therewith concomitantly selecting host cells that also express the gene of interest at high levels.
- Another method makes use of a selection marker gene under control of a promoter sequence that has been mutated such that the promoter has an activity level substantially below that of its corresponding wild type (U.S. Pat. No. 5,627,033).
- Another method describes the use of an impaired dominant selectable marker sequence, such as neomycin phosphotransferase with an impaired consensus Kozak sequence, to decrease the number of colonies to be screened and to increase the expression levels of a gene of interest that is co-linked to the dominant selectable marker (U.S. Pat. Nos. 5,648,267 and 5,733,779).
- the gene of interest is placed within an (artificial) intron in the dominant selectable marker.
- the gene of interest and the dominant selectable marker are in different transcriptional cassettes and each contains its own eukaryotic promoter in this method (U.S. Pat. Nos. 5,648,267 and 5,733,779).
- Another method uses the principle of a selectable marker gene containing an intron that does not naturally occur within the selectable gene, wherein the intron is capable of being spliced in a host cell to provide mRNA encoding a selectable protein and wherein the intron in the selectable gene reduces the level of selectable protein produced from the selectable gene in the host cell (European Patent 0724639 B1).
- DNA constructs comprising a selectable gene positioned within an intron defined by a 5′ splice donor site comprising an efficient splice donor sequence such that the efficiency of splicing an mRNA having the splice donor site is between about 80-99%, and a 3′ splice acceptor site, and a product gene encoding a product of interest downstream of 3′ splice acceptor site, the selectable gene and the product gene being controlled by the same transcriptional regulatory region (U.S. Pat. No. 5,561,053).
- polycistronic expression vector constructs use is made of polycistronic expression vector constructs.
- An early report of use of this principle describes a polycistronic expression vector, containing sequences coding for both the desired protein and a selectable protein, which coding sequences are governed by the same promoter and separated by a translational stop and start signal codons (U.S. Pat. No. 4,965,196).
- the selectable marker is the amplifiable DHFR gene.
- the sequence coding for the selectable marker is downstream from that coding for the desired polypeptide, such that procedures designed to select for the cells transformed by the selectable marker will also select for particularly enhanced production of the desired protein.
- bicistronic vectors have been described for the rapid and efficient creation of stable mammalian cell lines that express recombinant protein.
- These vectors contain an internal ribosome entry site (IRES) between the upstream coding sequence for the protein of interest and the downstream coding sequence of the selection marker (Rees et al., 1996).
- IRS internal ribosome entry site
- Such vectors are commercially available, for instance, the pIRES1 vectors from Clontech (CLONTECHniques, October 1996).
- selection of sufficient expression of the downstream marker protein then automatically selects for high transcription levels of the multicistronic mRNA, and hence a strongly increased probability of high expression of the protein of interest is envisaged using such vectors.
- the IRES used is an IRES which gives a relatively low level of translation of the selection marker gene, to further improve the chances of selecting for host cells with a high expression level of the protein of interest by selecting for expression of the selection marker protein (see, e.g., international publication WO 03/106684).
- the disclosure uses a novel and unique concept for selecting host cells expressing high levels of polypeptides of interest, the concept referred to herein as “reciprocal interdependent translation.” It is unique with respect to the prior art approaches in that an extra level of regulation is used to fine-tune the amount of translation products from a multicistronic transcript, whereby the level of translation of a selectable marker polypeptide is lowered, thereby increasing the level of translation of the polypeptide of interest coded on the same multicistronic transcription unit, i.e., the expression level of the polypeptide of interest is directly and more or less reciprocally dependent from the expression level of the selectable marker polypeptide (see FIG. 13 for a schematic view).
- approaches using an IRES between the coding sequences for the polypeptide of interest and the selectable marker polypeptide involve independent translation of both polypeptides, so that in those approaches there is no direct effect of the translation efficiency of the selectable marker polypeptide on that of the sequence coding for the polypeptide of interest.
- a DNA molecule comprising a multicistronic transcription unit coding for i) a selectable marker polypeptide functional in a eukaryotic host cell, and for ii) a polypeptide of interest, the polypeptide of interest having a translation initiation sequence (and start codon) separate from that of the selectable marker polypeptide, characterized in that the coding sequence for the polypeptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit, and in that the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation that decreases the translation efficiency of the selectable marker in a eukaryotic host cell.
- the translation initiation efficiency of the selectable marker is decreased.
- the decreased translation initiation efficiency is brought about by having a non-optimal translation start sequence of the selectable marker polypeptide.
- the translation start sequence in the coding strand for the selectable marker polypeptide comprises an ATG sequence defining a start codon, the ATG sequence being in a non-optimal context for translation initiation. This results in a decreased use of this ATG as start codon, when compared to an ATG start codon in an optimal context.
- the translation start sequence in the coding strand for the selectable marker polypeptide comprises a start codon different from an ATG start codon, such as one of GTG, TTG, CTG, ATT, or ACG sequence, the first two thereof being the most preferred.
- Such non-ATG start codons preferably are flanked by sequences providing for relatively good recognition of the non-ATG sequences as start codons, such that at least some ribosomes start translation from these start codons, i.e., the translation start sequence preferably comprises the sequence ACC[non-ATG start codon]G or GCC[non-ATG start codon]G.
- the sequence encoding the selectable marker polypeptide has no ATG sequence in the coding strand following the sequence coding for the start codon of the selectable marker polypeptide up to the sequence encoding the stop codon thereof, and not in any sequences in the same strand between the stop codon and the start codon of the polypeptide of interest (which start codon preferably is defined by an ATG in the same coding strand).
- any ATG sequence present in frame and coding for methionine in the sequence coding for the wild-type selectable marker polypeptide has been mutated to code for valine, threonine, isoleucine or leucine.
- the selectable marker protein provides resistance against lethal and/or growth-inhibitory effects of a selection agent, such as an antibiotic.
- the coding sequence of the polypeptide of interest may comprise an optimal translation start sequence.
- expression cassettes comprising a DNA molecule hereof, which expression cassettes further comprise a promoter upstream of the multicistronic expression unit and being functional in a eukaryotic host cell for initiation transcription of the multicistronic expression unit, and the expression cassettes further comprising a transcription termination sequence downstream of the multicistronic expression unit.
- such expression cassettes further comprise at least one chromatin control element chosen from the group consisting of a matrix or scaffold attachment region (MAR/SAR), an insulator sequence, a ubiquitous chromatin opener element (UCOE), and an anti-repressor sequence.
- MAR/SAR matrix or scaffold attachment region
- UCOE ubiquitous chromatin opener element
- Anti-repressor sequences are most preferred in this aspect, and in preferred embodiments, the anti-repressor sequences are chosen from the group consisting of: a) any one SEQ ID NO:1 through SEQ ID NO:66; b) fragments of any one of SEQ ID NO:1 through SEQ ID NO:66, wherein the fragments have anti-repressor activity; c) sequences that are at least 70% identical in nucleotide sequence to a) or b) wherein the sequences have anti-repressor activity; and d) the complement to any one of a) to c).
- the anti-repressor sequences are chosen from the group consisting of: STAR67 (SEQ ID NO:66), STAR7 (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), and functional fragments or derivatives of these STAR sequences.
- the expression cassette comprises STAR67, or a functional fragment or derivative thereof, positioned upstream of the promoter driving expression of the multicistronic gene.
- the multicistronic gene is flanked on both sides by at least one anti-repressor sequence.
- expression cassettes are provided hereof, comprising in 5′ to 3′ order: anti-repressor sequence A-anti-repressor sequence B-[promoter-multicistronic transcription unit hereof (encoding the functional selectable marker protein and downstream thereof the polypeptide of interest)-transcription termination sequence]-anti-repressor sequence C, wherein A, B and C may be the same or different.
- the polypeptide of interest is a part of a multimeric protein, for example a heavy or light chain of an immunoglobulin.
- DNA molecules comprising a sequence encoding a functional selectable marker polypeptide, characterized in that such DNA molecules comprise a mutation that decreases the translation efficiency of the functional selectable marker polypeptide in a eukaryotic host cell.
- a DNA molecule i) has a non-optimal translation start sequence followed by an otherwise functional selectable marker coding sequence, and ii) in the coding strand of the sequence defining the selectable marker polypeptide downstream of the non-optimal start codon is devoid of ATG sequences.
- host cells comprising DNA molecules hereof.
- Methods for generating host cells expressing a polypeptide of interest comprising the steps of: introducing into a plurality of precursor host cells an expression cassette hereof, culturing the cells under conditions selecting for expression of the selectable marker polypeptide, and selecting at least one host cell producing the polypeptide of interest.
- the disclosure aims at providing improved means and methods for selection of host cells expressing high levels of proteins of interest.
- methods for producing a polypeptide of interest comprising culturing a host cell, the host cell comprising an expression cassette hereof, and expressing the polypeptide of interest from the expression cassette.
- the polypeptide of interest is further isolated from the host cells and/or from the host cell culture medium.
- RNA molecules having the sequence of a transcription product of a DNA molecule hereof.
- polypeptides lacking methionine in their amino acid sequence, which polypeptides are obtainable by expression from certain DNA molecules according to the dislcosure.
- FIG. 1 Schematic diagram of transcription units without and with STAR elements.
- the expression unit has the SV40 transcriptional terminator at its 3′ end (t).
- the name of the construct is CMV-d2EGFP-ires-Zeo (CMV Control).
- the name of the construct is STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67).
- FIG. 2 STAR67 improves CMV driven d2EGFP expression in CHO cells. The mean of the d2EGFP signal for 10 independent stable colonies is plotted for the indicated constructs in CHO cells.
- FIG. 3 STAR67 improves EF1 ⁇ driven d2EGFP expression in CHO cells. Same as FIG. 2 , but now with EF1 ⁇ promoter. A) EF1 ⁇ Control; B) STAR67-EF1 ⁇ .
- FIG. 4 STAR67 improves UB6 driven d2EGFP expression in CHO cells. Same as FIGS. 2 and 3 , but now with UB6 promoter. A) UB6 Control; B) STAR67-UB6.
- FIG. 5 STAR67 improves CMV driven d2EGFP expression in PER.C6® cells. Same as FIG. 2 , but now in PER.C6® cells.
- FIG. 6 STAR67 improves EF1 ⁇ driven d2EGFP expression in PER.C6® cells. Same as FIG. 3 , but now in PER.C6® cells. A) EF1 ⁇ Control; B) STAR67-EF1 ⁇ .
- FIG. 7 STAR67 improves UB6 driven d2EGFP expression in PER.C6® cells. Same as FIG. 4 , but now in PER.C6® cells. A) UB6 Control; B) STAR67-UB6.
- FIG. 8 STAR67 improves SV40 driven d2EGFP expression in CHO-K1 cells in combination with another STAR element. Similar as FIG. 2 , but now using the SV40 promoter in CHO cells, and the indicated constructs. The mean of the d2EGFP signal is plotted for 18-20 independent stable colonies 60 days after transfection. A) SV40 Control, SV40-STAR67, SV40-STAR7/7; B) SV40 Control and SV40-STAR67 7/7. See Example 4 for details.
- FIG. 9 STAR67 improves UB6 driven expression levels of the EpCAM antibody in CHO-K1 cells. See Example 5 for details. A) constructs without STAR elements; B) constructs with STAR67 upstream of promoter and flanking STAR7 elements. The anti-EpCAM antibody concentration is presented as pg/cell/day. X(18): average production level of the 18 colonies.
- FIG. 10 STAR67 is not an enhancer blocker, whereas STAR 6 and 7 are. See Example 6 for details.
- FIG. 11 STAR67 enhances UB6 and CMV-driven antibody expression levels in stably transfected CHO cells. See Example 7 for details.
- X(19) average production level of the 19 colonies.
- FIG. 12 Schematic representation of the use of a selection marker gene (Zeocin® antibiotic resistance gene) hereof.
- C same as B, but comprising a mutated translation initiation site, wherein the context of the ATG start codon has been mutated to decrease the translation initiation.
- GTG mutated start codon
- FIG. 13 Schematic representation of a multicistronic transcription unit hereof, with more or less reciprocal interdependent translation efficiency.
- a dEGFP gene here exemplifying a gene of interest
- the Zeocin® antibiotic resistance gene comprises the internal Met ⁇ Leu mutation (see FIG. 12B ). See Example 9 for details.
- FIG. 14 Results of selection systems hereof, with and without STAR elements.
- A. Zeocin® antibiotic resistance gene with ATG start codon in bad context referred to as “ATGmut” in the picture, but including a spacer sequence behind the ATG in the bad context, so in the text generally referred to as “ATGmut/space”).
- FIG. 15 Results of selection system hereof in an up-scaled experiment (A), and comparison with selection system according to prior art using an IRES (B). d2EGFP signal for independent colonies is shown on the vertical axis. See Example 10 for details.
- FIG. 16 Results of selection system with multicistronic transcription unit hereof, using blasticidin as a selectable marker.
- FIG. 17 Stability of expression of several clones with a multicistronic transcription unit hereof (including a Zeocin® antibiotic resistance gene with TTG start codon). Selection pressure (100 ⁇ g/ml Zeocin® antibiotic) was present during the complete experiment. d2EGFP signal for independent colonies is shown on the vertical axis. See Example 12 for details.
- FIG. 18 As FIG. 17 , but Zeocin® antibiotic concentration was lowered to 20 ⁇ g/ml after establishment of clones.
- FIG. 19 As FIG. 17 , but Zeocin® antibiotic was absent from culture medium after establishment of clones.
- FIG. 20 Expression of an antibody (anti-EpCAM) using the selection system with the multicistronic transcription unit hereof.
- the heavy chain (HC) and light chain (LC) are the polypeptide of interest in this example. Each of these is present in a separate transcription unit, which are both on a single nucleic acid molecule in this example.
- the HC is preceded by the Zeocin® antibiotic resistance gene coding for a selectable marker polypeptide, while the LC is preceded by the blasticidin resistance gene coding for a selectable marker polypeptide.
- Both resistance genes have been mutated to comprise an ATG start codon in a non-optimal context (“mutATG” in Figure, but including a spacer sequence, and hence in the text generally referred to as “ATGmut/space”).
- mutATG in Figure, but including a spacer sequence, and hence in the text generally referred to as “ATGmut/space”.
- Each of the multicistronic transcription units is under control of a CMV promoter.
- Constructs with STAR sequences as indicated were compared to constructs without STAR sequences. The antibody levels obtained when these constructs were introduced into host cells are given on the vertical axis in pg/cell/day for various independent clones. See Example 13 for details.
- FIG. 21 As FIG. 20 , but both the selection marker genes have been provided with a GTG start codon. See Example 13 for details.
- FIG. 22 As FIG. 20 , but both the selection marker genes have been provided with a TTG start codon. See Example 13 for details.
- FIG. 23 Stability of expression in sub-clones in the absence of selection pressure (after establishing colonies under selection pressure, some colonies where sub-cloned in medium containing no Zeocin® antibiotic). See Example 12 for details.
- FIG. 24 Copy-number dependency of expression levels of an embodiment of the invention. See Example 12 for details.
- FIG. 25 As FIG. 12 , but for the blasticidin resistance gene. None of the four internal ATGs in this gene are in frame coding for a methionine, and therefore the redundancy of the genetic code was used to mutate these ATGs without mutating the internal amino acid sequence of the encoded protein.
- FIG. 26 Coding sequence of the wild-type Zeocin® antibiotic resistance gene (SEQ ID NO:108). Bold ATGs code for methionine. The first bold ATG is the start codon.
- FIG. 27 Coding sequence of the wild-type blasticidin resistance gene (SEQ ID NO:110).
- Bold ATGs code for methione. The first bold ATG is the start codon.
- Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame.
- FIG. 28 Coding sequence of the wild-type puromycin resistance gene (SEQ ID NO:112).
- Bold ATGs code for methione. The first bold ATG is the start codon.
- FIG. 29 Coding sequence of the wild-type mouse DHFR gene (SEQ ID NO:114). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame.
- FIG. 30 Coding sequence of the wild-type hygromycin resistance gene (SEQ ID NO:116). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame.
- FIG. 31 Coding sequence of the wild-type neomycin resistance gene (SEQ ID NO:118). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame.
- FIG. 32 Coding sequence of the wild-type human glutamine synthase (GS) gene (SEQ ID NO:120). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame.
- FIG. 33 Schematic representation of some further modified Zeocin® antibiotic resistance selection marker genes with a GTG start codon hereof, allowing for further fine-tuning of the selection stringency. See Example 14 for details.
- FIG. 34 Results with expression systems containing the further modified Zeocin® antibiotic resistance selection marker genes. See Example 14 for details. Dots indicate individual data points: lines indicate the average expression levels; used constructs (see also FIG. 33 ) are indicated on the horizontal axis (the addition of 7/67/7 at the end of the construct name indicates the presence of STAR sequences 7 and 67 upstream of the promoter and START downstream of the transcription termination site), and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 35 Schematic representation of some further modified Zeocin® antibiotic resistance selection marker genes with a TTG start codon hereof, allowing for further fine-tuning of the selection stringency. See Example 15 for details.
- FIG. 36 Results with expression systems containing the further modified Zeocin® antibiotic resistance selection marker genes. See Example 15 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 37 As FIG. 12 , but for the puromycin resistance gene. All three internal ATGs code for methione (panel A), and are replaced by CTG sequences coding for leucine (panel B). See Example 16 for details.
- FIG. 38 Results with expression constructs containing the puromycin resistance gene with a TTG start codon and no internal ATG codons. See Example 16 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 39 As FIG. 12 , but for the neomycin resistance gene. See Example 17 for details.
- FIG. 40 As FIG. 12 , but for the dhfr gene. See Example 18 for details.
- FIG. 41 Results with expression constructs (Zeocin® selectable marker) hereof in PER.C6® cells. See Example 20 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 42 Results with expression constructs (blasticidin selectable marker) hereof in PER.C6® cells. See Example 20 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 43 Results with expression constructs hereof, further comprising a transcription pause (TRAP) sequence. See Example 21 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- TREP transcription pause
- FIG. 44 Copy-number dependency of expression of an antibody using transcription units hereof. See Example 22 for details.
- FIG. 45 Antibody expression from colonies containing expression constructs hereof, wherein the copy number of the expression constructs is amplified by methotrexate. See Example 23 for details. White bars: selection with Zeocin® antibiotic and blasticidin; black bars: selection with Zeocin® antibiotic, blasticidin and methotrexate (MTX). Numbers of tested colonies are depicted on the horizontal axis.
- FIG. 46 Results with different promoters. See Example 24 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 47 Results with different STAR elements. See Example 25 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal.
- FIG. 48 Results with other chromatin control elements. See Example 26 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph (black triangles indicate different tested chromatin control elements); vertical axis indicates d2EGFP signal.
- a DNA molecule comprising a multicistronic transcription unit coding for i) a selectable marker polypeptide functional in a eukaryotic host cell, and for ii) a polypeptide of interest, the polypeptide of interest having a translation initiation sequence separate from that of the selectable marker polypeptide, characterized in that the coding sequence for the polypeptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit, and the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation that decreases the translation efficiency of the selectable marker in a eukaryotic host cell.
- Such a DNA molecule can be used hereof for obtaining eukaryotic host cells expressing high levels of the polypeptide of interest, by selecting for the expression of the selectable marker polypeptide. Subsequently or simultaneously, one or more host cell(s) expressing the polypeptide of interest can be identified, and further used for expression of high levels of the polypeptide of interest.
- a “monocistronic gene” is defined as a gene capable of providing a RNA molecule that encodes one polypeptide.
- a “multicistronic transcription unit,” also referred to as multicistronic gene, is defined as a gene capable of providing an RNA molecule that encodes at least two polypeptides.
- the term “bicistronic gene” is defined as a gene capable of providing a RNA molecule that encodes two polypeptides. A bicistronic gene is therefore encompassed within the definition of a multicistronic gene.
- a “polypeptide” as used herein comprises at least five amino acids linked by peptide bonds, and can for instance be a protein or a part, such as a subunit, thereof.
- a “gene” a “transcription unit” as used in the disclosure can comprise chromosomal DNA, cDNA, artificial DNA, combinations thereof, and the like. Transcription units comprising several cistrons are transcribed as a single mRNA.
- translation of the second coding region i.e., that for the protein of interest
- translation of the first coding region i.e., that for the selectable marker protein
- a multicistronic transcription unit may be a bicistronic transcription unit coding from 5′ to 3′ for a selectable marker polypeptide and a polypeptide of interest. Hence, the polypeptide of interest is encoded downstream from the coding sequence for the selectable marker polypeptide.
- coding sequences on the same transcription unit downstream of the (first) polypeptide of interest, e.g., coding for a second selectable marker polypeptide or for a second polypeptide of interest.
- Such an extra coding sequence downstream of the coding sequence for the first polypeptide of interest then preferably is preceded by a sequence encoding a translation reinitiation site or internal ribosome entry site (IRES), such that also the second polypeptide of interest or the second selection marker polypeptide is translated, in this case independently from the earlier two coding sequences.
- IRS internal ribosome entry site
- a multimeric protein e.g., a heavy and a light chain of an immunoglobulin, encoded as a first and second polypeptide of interest (in any order)
- code for subunits of a multimeric protein within a single multicistronic expression unit.
- separate transcription units for the expression of different polypeptides of interest also when these form part of a multimeric protein (see, e.g., Example 13: the heavy and light chain of an antibody each are encoded by a separate transcription unit, each of these expression units being a bicistronic expression unit hereof).
- the DNA molecules can be present in the form of double stranded DNA, having with respect to the selectable marker polypeptide and the polypeptide of interest a coding strand and a non-coding strand, the coding strand being the strand with the same sequence as the translated RNA, except for the presence of T instead of U.
- an AUG start codon is coded for in the coding strand by an ATG sequence, and the strand containing this ATG sequence corresponding to the AUG start codon in the RNA is referred to as the coding strand of the DNA.
- start codons or translation initiation sequences are in fact present in an RNA molecule, but that these can be considered equally embodied in a DNA molecule coding for such an RNA molecule; hence, wherever the disclosure refers to a start codon or translation initiation sequence, the corresponding DNA molecule having the same sequence as the RNA sequence but for the presence of a T instead of a U in the coding strand of the DNA molecule is meant to be included, and vice versa, except where explicitly specified otherwise.
- a start codon is for instance an AUG sequence in RNA, but the corresponding ATG sequence in the coding strand of the DNA is referred to as start codon as well in the disclosure.
- in frame coding sequences meaning triplets (three bases) in the RNA molecule that are translated into an amino acid, but also to be interpreted as the corresponding trinucleotide sequences in the coding strand of the DNA molecule.
- the selectable marker polypeptide and the polypeptide of interest encoded by the multicistronic gene each have their own translation initiation sequence, and therefore each have their own start codon (as well as stop codon), i.e., they are encoded by separate open reading frames.
- selection marker or “selectable marker” is typically used to refer to a gene and/or protein whose presence can be detected directly or indirectly in a cell, for example, a polypeptide that inactivates a selection agent and protects the host cell from the agent's lethal or growth-inhibitory effects (e.g., an antibiotic resistance gene and/or protein).
- the selection marker induces fluorescence or a color deposit (e.g., green fluorescent protein (GFP) and derivatives (e.g., d2EGFP), luciferase, lacZ, alkaline phosphatase, etc.), which can be used for selecting cells expressing the polypeptide inducing the color deposit, e.g., using a fluorescence activated cell sorter (FACS) for selecting cells that express GFP.
- FACS fluorescence activated cell sorter
- the selectable marker polypeptide hereof provides resistance against lethal and/or growth-inhibitory effects of a selection agent.
- the selectable marker polypeptide is encoded by the DNA of the invention.
- the selectable marker polypeptide is functional in a eukaryotic host cell, and hence capable of being selected for in eukaryotic host cells. Any selectable marker polypeptide fulfilling this criterion can in principle be used according to the disclosure. Such selectable marker polypeptides are well known in the art and routinely used when eukaryotic host cell clones are to be obtained, and several examples are provided herein.
- a selection marker used for the invention is Zeocin® antibiotic. In other embodiments, blasticidin is used. A person skilled in the art will know that other selection markers are available and can be used, e.g., neomycin, puromycin, bleomycin, hygromycin, etc.
- kanamycin is used.
- the DHFR gene is used as a selectable marker, which can be selected for by methotrexate, especially by increasing the concentration of methotrexate cells can be selected for increased copy numbers of the DHFR gene.
- the glutamine synthetase (GS) gene can be used, for which selection is possible in cells having insufficient GS (e.g., NS-0 cells) by culturing in media without glutamine, or alternatively in cells having sufficient GS (e.g., CHO cells) by adding an inhibitor of GS, methionine sulphoximine (MSX).
- Other selectable marker genes that could be used, and their selection agents, are for instance described in table 1 of U.S. Pat. No. 5,561,053, incorporated by reference herein; see also Kaufman, Methods in Enzymology, 185:537-566 (1990), for a review of these.
- each one preferably contains the coding sequence for a different selectable marker, to allow selection for both multicistronic transcription units.
- both multicistronic transcription units may be present on a single nucleic acid molecule or alternatively each one may be present on a separate nucleic acid molecule.
- selection is typically defined as the process of using a selection marker/selectable marker and a selection agent to identify host cells with specific genetic properties (e.g., that the host cell contains a transgene integrated into its genome). It is clear to a person skilled in the art that numerous combinations of selection markers are possible.
- One antibiotic that is particularly advantageous is Zeocin®, because the Zeocin® antibiotic-resistance protein (Zeocin®-R) acts by binding the drug and rendering it harmless. Therefore, it is easy to titrate the amount of drug that kills cells with low levels of Zeocin®-R expression, while allowing the high-expressors to survive. All other antibiotic-resistance proteins in common use are enzymes, and thus act catalytically (not 1:1 with the drug). Hence, the antibiotic Zeocin® is a preferred selection marker. However, the invention also works with other selection markers.
- a selectable marker polypeptide is the protein that is encoded by the nucleic acid, which polypeptide can be detected, for instance because it provides resistance to a selection agent such as an antibiotic.
- the DNA encodes a polypeptide that confers resistance to the selection agent, which polypeptide is the selectable marker polypeptide.
- DNA sequences coding for such selectable marker polypeptides are known, and several examples of wild-type sequences of DNA encoding selectable marker proteins are provided herein ( FIGS. 26-32 ). It will be clear that mutants or derivatives of selectable markers can also be suitably used, and are therefore included within the scope of the term “selectable marker polypeptide,” as long as the selectable marker protein is still functional.
- the gene and protein encoding the resistance to a selection agent is referred to as the “selectable agent (resistance) gene” or “selection agent (resistance) protein,” respectively, although the official names may be different, e.g., the gene coding for the protein conferring resistance to neomycin (as well as to G418 and kanamycin) is often referred to as neomycin (resistance) (or neo') gene, while the official name is aminoglycoside 3′-phosphotransferase gene.
- selectable marker polypeptide It is beneficial to have low levels of expression of the selectable marker polypeptide, so that stringent selection is possible. In the disclosure this is brought about by using a selectable marker coding sequence with a non-optimal translation efficiency. Upon selection, only cells that have nevertheless sufficient levels of selectable marker polypeptide will be selected, meaning that such cells must have sufficient transcription of the multicistronic transcription unit and sufficient translation of the selectable marker polypeptide, which provides a selection for cells where the multicistronic transcription unit has been integrated or otherwise present in the host cells at a place where expression levels from this transcription unit are high. The (more or less) reciprocally interdependent nature of the translation of the polypeptide of interest from that of the translation efficiency of the marker, ensures high translation levels of the polypeptide of interest. This high level of translation is superimposed on the high transcription levels selected for by the stringent selection conditions (and resulting from the use of a strong promoter in the host cells to drive transcription of the multicistronic transcription unit).
- the DNA molecules hereof have the coding sequence for the selectable marker polypeptide upstream of the coding sequence for the polypeptide of interest, to provide for a multicistronic transcript with reciprocally interdependent translation of the polypeptide of interest and the selectable marker polypeptide.
- the multicistronic transcription unit comprises in the 5′ to 3′ direction (both in the transcribed strand of the DNA and in the resulting transcribed RNA) the coding sequence for the selectable marker polypeptide and the sequence encoding the polypeptide of interest.
- the coding regions of the multicistronic gene are preferably translated from the cap-dependent ORF, and therefore the selectable marker polypeptide would in principle also be produced in abundance.
- the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation (within the open reading frame (ORF) or in the translation initiation sequences preceding the ORF, or in both) that decreases the translation efficiency of the selectable marker polypeptide in a eukaryotic host cell.
- the translation efficiency should be lower than that of the corresponding wild-type sequence in the same cell, i.e., the mutation should result in less polypeptide per cell per time unit, and hence less selectable marker polypeptide.
- the mutation that decreases the translation efficiency is a mutation that decreases the translation initiation efficiency.
- This can, for example, be established by providing the selectable marker polypeptide coding sequence with a non-optimal translation start sequence.
- the translation initiation efficiency of the selectable marker gene in eukaryotic cells can be suitably decreased hereof by mutating the start codon and/or the nucleotides in positions ⁇ 3 to ⁇ 1 and +4 (where the A of the ATG start codon is nt+1), for instance in the coding strand of the corresponding DNA sequence, to provide a non-optimal translation start sequence.
- a translation start sequence is often referred to in the field as “Kozak sequence,” and an optimal Kozak sequence is RCC ATG G, the start codon underlined, R being a purine, i.e., A or G (see Kozak M, 1986, 1987, 1989, 1990, 1997, 2002).
- an optimal translation start sequence comprises an optimal start codon (i.e., ATG) in an optimal context (i.e., the ATG directly preceded by RCC and directly followed by G).
- a non-optimal translation start sequence is defined herein as any sequence that gives at least some detectable translation in a eukaryotic cell (detectable because the selection marker polypeptide is detectable), and not having the consensus sequence RCC ATG G (start codon underlined).
- Translation by the ribosomes starts preferably at the first start codon it encounters when scanning from the 5′-cap of the mRNA (scanning mechanism), and is most efficient when an optimal Kozak sequence is present (see Kozak M, 1986, 1987, 1989, 1990, 1997, 2002).
- non-optimal translation initiation sequences are recognized and used by the ribosome to start translation.
- the disclosure makes use of this principle, and allows for decreasing and even fine-tuning of the amount of translation and hence expression of the selectable marker polypeptide, which can therefore be used to increase the stringency of the selection system.
- the ATG start codon of the selectable marker polypeptide (in the coding strand of the DNA, coding for the corresponding AUG start codon in the RNA transcription product) is left intact, but the positions at ⁇ 3 to ⁇ 1 and +4 are mutated such that they do not fulfill the optimal Kozak sequence any more, e.g., by providing the sequence TTT ATG T as the translation start site (ATG start codon underlined). It will be clear that other mutations around the start codon at positions ⁇ 3 to ⁇ 1 and/or +4 could be used with similar results using the teaching of the disclosure, as can be routinely and easily tested by the person skilled in the art.
- the idea of this first embodiment is that the ATG start codon is placed in a “non-optimal” context for translation initiation.
- the ATG start codon itself of the selectable marker polypeptide is mutated. This will in general lead to even lower levels of translation initiation than the first embodiment.
- the ATG start codon in the second embodiment is mutated into another codon, which has been reported to provide some translation initiation, for instance to GTG, TTG, CTG, ATT, or ACG (collectively referred to herein as “non-optimal start codons”).
- the ATG start codon is mutated into a GTG start codon. This provides still lower expression levels (lower translation) than with the ATG start codon intact but in a non-optimal context.
- the ATG start codon is mutated to a TTG start codon, which provides even lower expression levels of the selectable marker polypeptide than with the GTG start codon (Kozak M, 1986, 1987, 1989, 1990, 1997, 2002; see also examples 9-13 herein).
- the non-optimal start codons are preferably directly preceded by nucleotides RCC in positions ⁇ 3 to ⁇ 1 and directly followed by a G nucleotide (position +4).
- the sequence TTT GTG G start codon underlined
- some initiation is observed at least in vitro, so although strongly preferred it may not be absolutely required to provide an optimal context for the non-optimal start codons.
- ATG sequences within the coding sequence for a polypeptide, but excluding the ATG start codon, are referred to as “internal ATGs,” and if these are in frame with the ORF and therefore code for methionine, the resulting methionine in the polypeptide is referred to as an “internal methionine.”
- the coding region (following the start codon, not necessarily including the start codon) may code for the selectable marker polypeptide preferably devoid of any ATG sequence in the coding strand of the DNA, up to (but not including) the start codon of the polypeptide of interest (obviously, the start codon of the polypeptide of interest may be, and in fact preferably is, an ATG start codon).
- the ATG can be mutated such that the resulting polypeptide has no mutations in its internal amino acid sequence.
- the ATG is an in-frame codon coding for an internal methionine
- the codon can be mutated, and the resulting mutated polypeptide can be routinely checked for activity of the selectable marker polypeptide.
- the skilled person may test other amino acids as well in place of internal methionines, using routine molecular biology techniques for mutating the coding DNA, and routine testing for functionality of the selectable marker polypeptide.
- routine molecular biology techniques for mutating DNA it is at present also possible to synthesize at will (if required using subcloning steps) DNA sequences that have sufficient length for an ORF of a selectable marker polypeptide, and such synthetic DNA sequences can nowadays be ordered commercially from various companies.
- the person skilled in the art may design appropriate sequences hereof encoding a selectable marker polypeptide (with a mutation decreasing translation initiation, and preferably having no internal ATGs), have this sequence synthesized, and test the DNA molecule for functionality of the encoded selectable marker by introducing the DNA molecule in eukaryotic host cells and test for expression of functional selectable marker polypeptide.
- the commercial availability of such sequences also makes feasible to provide without undue burden for selection marker coding sequences lacking internal ATG sequences, where the wild-type coding sequence of the selection marker polypeptide comprises several such internal ATGs.
- Any sequence present in the coding strand between the stop codon of the selectable marker and the start codon of the polypeptide of interest can easily be designed such that it does not comprise ATG sequences, or other non-optimal potential translation start codons.
- the translation start sequence of the polypeptide of interest may comprise an optimal translation start sequence, i.e., having the consensus sequence RCC ATG G (start codon underlined). This will result in a very efficient translation of the polypeptide of interest.
- the reciprocally interdependent translation mechanism of the invention is provided: ribosomes scan from the cap of the mRNA containing the multicistronic transcription unit, and a small percentage of ribosomes will translate the first open reading frame, i.e., the selectable marker protein, because this is provided with a mutation decreasing the translation efficiency, while the great majority of the ribosomes will pass this open reading frame and start translation at the optimal translation initiation site of the polypeptide of interest, resulting in low levels of selectable marker protein and high levels of polypeptide of interest.
- the stringency of selection is increased and reciprocally the expression levels of the polypeptide of interest go up, so that fine-tuning is possible: for instance, using a TTG start codon for the selection marker polypeptide, only very few ribosomes will translate from this start codon, resulting in very low levels of selectable marker protein, and hence a high stringency of selection, and at the same time almost all ribosomes will translate the polypeptide of interest.
- the selectable marker polypeptide In addition to a decreased translation initiation efficiency, it could be beneficial to also provide for decreased translation elongation efficiency of the selectable marker polypeptide, e.g., by mutating the coding sequence thereof so that it comprises several non-preferred codons of the host cell, in order to further decrease the translation levels of the marker polypeptide and allow still more stringent selection conditions, if desired.
- the selectable marker polypeptide further comprises a mutation that reduces the activity of the selectable marker polypeptide compared to its wild-type counterpart. This may be used to increase the stringency of selection even further.
- proline at position 9 in the Zeocin® antibiotic resistance polypeptide may be mutated, e.g., to Thr or Phe, and for the neomycin resistance polypeptide, amino acid residue 182 or 261 or both may further be mutated (see, e.g., WO 01/32901).
- a so-called spacer sequence is placed downstream of the sequence encoding the start codon of the selectable marker polypeptide, which spacer sequence preferably is a sequence in frame with the start codon and encoding a few amino acids, and that does not contain a secondary structure (Kozak, 1990), and does not contain the sequence ATG.
- Such a spacer sequence can be used to further decrease the translation initiation frequency if a secondary structure is present in the RNA (Kozak, 1990) of the selectable marker polypeptide (e.g., for Zeocin® antibiotic, possibly for blasticidin), and hence increase the stringency of the selection system hereof.
- DNA molecule comprising the sequence encoding a selectable marker protein hereof, which DNA molecule has been provided with a mutation that decreases the translation efficiency of the functional selectable marker polypeptide in a eukaryotic host cell.
- the DNA molecule in the coding strand i) has been mutated compared to the wild-type sequence encoding the selectable marker polypeptide, such that the sequence ATG is absent from the coding region following the start codon of the functional selectable marker protein up till the stop codon of the functional selectable marker protein open reading frame; and ii) has a non-optimal translation start sequence of the functional selectable marker polypeptide.
- Features i) and ii) can again be provided according to the description above.
- DNA molecules encompass a useful intermediate product hereof. These molecules can be prepared first, introduced into eukaryotic host cells and tested for functionality (for some markers this is even possible in prokaryotic host cells), if desired in a (semi-) quantitative manner, of the selectable marker polypeptide. They may then be further used to prepare a DNA molecule hereof, comprising the multicistronic transcription unit.
- an expression cassette comprising the DNA molecule hereof, having the multicistronic transcription unit.
- Such an expression cassette is useful to express sequences of interest, for instance in host cells.
- An “expression cassette” as used herein is a nucleic acid sequence comprising at least a promoter functionally linked to a sequence of which expression is desired.
- an expression cassette further contains transcription termination and polyadenylation sequences. Other regulatory sequences such as enhancers may also be included.
- an expression cassette comprising in the following order: 5′-promoter-multicistronic transcription unit hereof, coding for a selectable marker polypeptide and downstream thereof a polypeptide of interest-transcription termination sequence-3′.
- the promoter must be capable of functioning in a eukaryotic host cell, i.e., it must be capable of driving transcription of the multicistronic transcription unit.
- the expression cassette may optionally further contain other elements known in the art, e.g., splice sites to comprise introns, and the like.
- an intron is present behind the promoter and before the sequence encoding the selectable marker polypeptide.
- the expression cassette comprises a multicistronic transcription unit.
- the promoter sequence is placed upstream of the sequences that should be expressed.
- Much used expression vectors are available in the art, e.g., the pcDNA and pEF vector series of Invitrogen, pMSCV and pTK-Hyg from BD Sciences, pCMV-Script from Stratagene, etc, which can be used to obtain suitable promoters and/or transcription terminator sequences, polyA sequences, and the like.
- Sequences driving expression may include promoters, enhancers and the like, and combinations thereof. These should be capable of functioning in the host cell, thereby driving expression of the nucleic acid sequences that are functionally linked to them.
- promoters can be used to obtain expression of a gene in host cells. Promoters can be constitutive or regulated, and can be obtained from various sources, including viruses, prokaryotic, or eukaryotic sources, or artificially designed.
- Expression of nucleic acids of interest may be from the natural promoter or derivative thereof or from an entirely heterologous promoter (Kaufman, 2000).
- Some well-known and much used promoters for expression in eukaryotic cells comprise promoters derived from viruses, such as adenovirus, e.g., the E1A promoter, promoters derived from cytomegalovirus (CMV), such as the CMV immediate early (1E) promoter (referred to herein as the CMV promoter) (obtainable for instance from pcDNA, Invitrogen), promoters derived from Simian Virus 40 (SV40) (Das et al. 1985), and the like.
- viruses such as adenovirus, e.g., the E1A promoter, promoters derived from cytomegalovirus (CMV), such as the CMV immediate early (1E) promoter (referred to herein as the CMV promoter) (obtainable for instance from pcDNA, Invitrogen), promoters derived from
- Suitable promoters can also be derived from eukaryotic cells, such as methallothionein (MT) promoters, elongation factor 1 ⁇ (EF-1 ⁇ ) promoter (Gill et al., 2001), ubiquitin C or UB6 promoter (Gill et al., 2001; Schorpp et al., 1996), actin promoter, an immunoglobulin promoter, heat shock promoters, and the like.
- MT methallothionein
- EF-1 ⁇ elongation factor 1 ⁇
- UB6 promoter Gill et al., 2001; Schorpp et al., 1996)
- actin promoter an immunoglobulin promoter
- heat shock promoters and the like.
- Some preferred promoters for obtaining expression in eukaryotic cells are the CMV-promoter, a mammalian EF1-alpha promoter, a mammalian ubiquitin promoter such as a ubiquitin C promoter, or a SV40 promoter (e.g., obtainable from pIRES, cat. no. 631605, BD Sciences).
- Testing for promoter function and strength of a promoter is a matter of routine for a person skilled in the art, and in general may for instance encompass cloning a test gene such as lacZ, luciferase, GFP, etc. behind the promoter sequence, and test for expression of the test gene.
- promoters may be altered by deletion, addition, mutation of sequences therein, and tested for functionality, to find new, attenuated, or improved promoter sequences.
- strong promoters that give high transcription levels in the eukaryotic cells of choice are preferred.
- a DNA molecule hereof is part of a vector, e.g., a plasmid.
- a vector e.g., a plasmid.
- Such vectors can easily be manipulated by methods well known to the person skilled in the art, and can for instance be designed for being capable of replication in prokaryotic and/or eukaryotic cells.
- many vectors can directly or in the form of isolated desired fragment therefrom be used for transformation of eukaryotic cells and will integrate in whole or in part into the genome of such cells, resulting in stable host cells comprising the desired nucleic acid in their genome.
- DNA molecules in the form of a recombinant plasmid or a recombinant viral genome are introduced into (eukaryotic host) cells and preferably integrated into their genomes by methods known in the art.
- the disclosure also uses these types of DNA molecules to deliver its improved transgene expression system.
- a preferred embodiment of the invention is the use of plasmid DNA for delivery of the expression system.
- a plasmid contains a number of components: conventional components, known in the art, are an origin of replication and a selectable marker for propagation of the plasmid in bacterial cells; a selectable marker that functions in eukaryotic cells to identify and isolate host cells that carry an integrated transgene expression system; the protein of interest, whose high-level transcription is brought about by a promoter that is functional in eukaryotic cells (e.g., the human cytomegalovirus major immediate early promoter/enhancer, pCMV (Boshart et al., 1985); and viral transcriptional terminators (e.g., the SV40 polyadenylation site (Kaufman & Sharp, 1982) for the transgene of interest and the selectable marker.
- a promoter that is functional in eukaryotic cells
- pCMV the human cytomegalovirus major immediate early promoter/enhancer
- viral transcriptional terminators e.g., the SV40 polyadenylation site (K
- the vector used can be any vector that is suitable for cloning DNA and that can be used for transcription of a nucleic acid of interest.
- the vector is an integrating vector.
- the vector may be an episomally replicating vector.
- an expression cassette hereof further comprises at least one chromatin control element.
- a “chromatin control element” used herein is a collective term for DNA sequences that may somehow have an effect on the chromatin structure and therewith on the expression level and/or stability of expression of transgenes in their vicinity (they function “in cis,” and hence are placed preferably within 5 kb, more preferably within 2 kb, still more preferably within 1 kb from the transgene) within eukaryotic cells.
- Such elements have sometimes been used to increase the number of clones having desired levels of transgene expression. The mechanisms by which these elements work may differ for and even within different classes of such elements, and are not completely known for all types of such elements.
- chromatin control elements are chosen from the group consisting of matrix or scaffold attachment regions (MARs/SARs) (e.g., Phi-Van et al., 1990; WO 02/074969, WO 2005/040377), insulators (West et al., 2002) such as the beta-globin insulator element (5′ HS4 of the chicken beta-globin locus), scs, scs′, and the like (e.g., Chung et al., 1993, 1997; Kellum and Schedl, 1991; WO 94/23046, WO 96/04390, WO 01/02553, WO 2004/027072), a ubiquitous chromatin opening element (UCOE) (WO 00/05393, WO 02/24930, WO 02/099089, WO 02/099070), and anti-repressor sequences (also referred to as “STAR” sequences) (Kwaks e
- MARs/SARs matrix or scaffold attachment regions
- Non-limiting examples of MAR/SAR sequences that could be used in the current invention are the chicken lysosyme 5′ MAR (Phi-Van et al., 1990) or fragments thereof, e.g., the B, K and F regions as described in WO 02/074969); DNA sequences comprising at least one bent DNA element and at least one binding site for a DNA binding protein, preferably containing at least 10% of dinucleotide TA, and/or at least 12% of dinucleotide AT on a stretch of 100 contiguous base pairs, such as a sequence selected from the group of comprising the sequences SEQ ID NOS: 1 to 27 in WO 2005/040377, fragments of any one of SEQ ID NOS: 1 to 27 in WO 2005/040377 being at least 100 nucleotides in length and having MAR activity, sequences that are at least 70% identical in nucleotide sequence to any one of SEQ ID NOS: 1 to 27 in WO 2005/040377 or fragment
- a non-limiting example of insulator sequences that could be used in the disclosure is a sequence that comprises SEQ ID NO:1 of WO 01/02553.
- Non-limiting examples of UCOEs that could be used in the disclosure are sequences depicted in FIGS. 2 and 7 of WO 02/24930, functional fragments thereof and sequences being at least 70% identical thereto while still retaining activity; sequences comprising SEQ ID NO:28 of U.S. 2005/181428, functional fragments thereof and sequences being at least 70% identical thereto while still retaining activity.
- the chromatin control element is an anti-repressor sequence, preferably selected from the group consisting of: a) any one SEQ ID NO:1 through SEQ ID NO:66; b) fragments of any one of SEQ ID NO:1 through SEQ ID NO:66, wherein the fragments have anti-repressor activity (“functional fragments”); c) sequences that are at least 70% identical in nucleotide sequence to a) or b) wherein the sequences have anti-repressor activity (“functional derivatives”); and d) the complement to any one of a) to c).
- an anti-repressor sequence preferably selected from the group consisting of: a) any one SEQ ID NO:1 through SEQ ID NO:66; b) fragments of any one of SEQ ID NO:1 through SEQ ID NO:66, wherein the fragments have anti-repressor activity (“functional fragments”); c) sequences that are at least 70% identical in nucleotide sequence to a) or b) wherein the
- the chromatin control element is chosen from the group consisting of STAR67 (SEQ ID NO:66), STAR7 (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), or a functional fragment or derivative of the STAR sequences.
- the STAR sequence is STAR 67 (SEQ ID NO:66) or a functional fragment or derivative thereof.
- STAR 67 or a functional fragment or derivative thereof is positioned upstream of a promoter driving expression of the multicistronic transcription unit.
- the expression cassettes hereof are flanked on both sides by at least one anti-repressor sequence.
- Sequences having anti-repressor activity as used herein are sequences that are capable of at least in part counteracting the repressive effect of HP1 or HPC2 proteins when these proteins are tethered to DNA. Sequences having anti-repressor activity (sometimes also referred to as anti-repressor sequences or anti-repressor elements herein) suitable for the disclosure, have been disclosed in WO 03/004704, incorporated herein by reference, and were coined “STAR” sequences therein (wherever a sequence is referred to as a STAR sequence herein, this sequence has anti-repressor activity hereof). As a non-limiting example, the sequences of 66 anti-repressor elements, named STAR1-65 (see WO 03/004704) and STAR67, are presented herein as SEQ ID NOS: 1-65 and 66, respectively.
- a functional fragment or derivative of a given anti-repressor element may be considered equivalent to the anti-repressor element, when it still has anti-repressor activity.
- the presence of such anti-repressor activity can easily be checked by the person skilled in the art, for instance by the assay described below.
- Functional fragments or derivatives can easily be obtained by a person skilled in the art of molecular biology, by starting with a given anti-repressor sequence, and making deletions, additions, substitutions, inversions and the like (see, e.g., WO 03/004704).
- a functional fragment or derivative also comprises orthologs from other species, which can be found using the known anti-repressor sequences by methods known by the person skilled in the art (see, e.g., WO 03/004704).
- the disclosure encompasses fragments of the anti-repressor sequences, wherein the fragments still have anti-repressor activity.
- sequences are at least 80% identical, more preferably at least 90% identical and still more preferably at least 95% identical to the reference native sequence or functional fragment thereof.
- percent identity refers to that portion of the reference native sequence that is found in the fragment.
- Sequences having anti-repressor activity hereof can be obtained by various methods, including but not limited to the cloning from the human genome or from the genome of another organism, or by for instance amplifying known anti-repressor sequences directly from such a genome by using the knowledge of the sequences, e.g., by PCR, or can in part or wholly be chemically synthesized.
- Sequences having anti-repressor activity are structurally defined herein by their sequence and in addition are functionally defined as sequences having anti-repressor activity, which can be determined with the assay described below.
- Any sequence having anti-repressor activity according to the disclosure should at least be capable of surviving the following functional assay (see WO 03/004704, Example 1, incorporated herein by reference).
- Human U-2 OS cells (ATCC HTB-96) are stably transfected with the pTet-Off plasmid (Clontech K1620-A) and with nucleic acid encoding a LexA-repressor fusion protein containing the LexA DNA binding domain and the coding region of either HP1 or HPC2 (Drosophila Polycomb group proteins that repress gene expression when tethered to DNA; the assay works with either fusion protein) under control of the Tet-Off transcriptional regulatory system (Gossen and Bujard, 1992). These cells are referred to below as the reporter cells for the anti-repressor activity assay.
- a reporter plasmid which provides hygromycin resistance, contains a polylinker sequence positioned between four LexA operator sites and the SV40 promoter that controls the Zeocin® antibiotic resistance gene. The sequence to be tested for anti-repressor activity can be cloned in the polylinker. Construction of a suitable reporter plasmid, such as pSelect, is described in Example 1 and FIG. 1 of WO 00/004704.
- the reporter plasmid is transfected into the reporter cells, and the cells are cultured under hygromycin selection (25 ⁇ g/ml; selection for presence of the reporter plasmid) and tetracycline repression (doxycycline, 10 ng/ml; prevents expression of the LexA-repressor fusion protein). After one week of growth under these conditions, the doxycycline concentration is reduced to 0.1 ng/ml to induce the LexA-repressor gene, and after two days zeocin is added to 250 ⁇ g/ml.
- the cells are cultured for five weeks, until the control cultures (transfected with empty reporter plasmid, i.e., lacking a cloned anti-repressor sequence in the polylinker) are killed by the Zeocin® antibiotic (in this control plasmid, the SV40 promoter is repressed by the LexA-repressor fusion protein that is tethered to the LexA operating sites, resulting in insufficient Zeocin® antibiotic expression in such cells to survive zeocin selection).
- a sequence has anti-repressor activity according to the disclosure if when the sequence is cloned in the polylinker of the reporter plasmid, the reporter cells survive the 5 weeks selection under Zeocin® antibiotic.
- the anti-repressor sequence or functional fragment or derivative thereof confers a higher proportion of reporter over-expressing clones when flanking a reporter gene (e.g., luciferase, GFP) which is integrated into the genome of U-2 OS or CHO cells, compared to when the reporter gene is not flanked by anti-repressor sequences, or flanked by weaker repression blocking sequences such as Drosophila scs.
- a reporter gene e.g., luciferase, GFP
- This can be verified using for instance the pSDH vector, or similar vectors, as described in Example 1 and FIG. 2 of WO 03/004704.
- Anti-repressor elements can have at least one of three consequences for production of protein: (1) they increase the predictability of identifying host cell lines that express a protein at industrially acceptable levels (they impair the ability of adjacent heterochromatin to silence the transgene, so that the position of integration has a less pronounced effect on expression); (2) they result in host cell lines with increased protein yields; and/or (3) they result in host cell lines that exhibit more stable protein production during prolonged cultivation.
- STAR sequence can be used in the expression cassettes according to the disclosure, but the following STAR sequences are particularly useful: STAR67 (SEQ ID NO:66), START (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), or functional fragments or derivatives of these STAR sequences.
- the anti-repressor sequence preferably STAR67, is placed upstream of the promoter, preferably such that less than 2 kb are present between the 3′ end of the anti-repressor sequence and the start of the promoter sequence. In preferred embodiments, less than 1 kb, more preferably less than 500 nucleotides (nt), still more preferably less than about 200, 100, 50, or 30 nt are present between the 3′ end of the anti-repressor sequence and the start of the promoter sequence. In certain embodiments, the anti-repressor sequence is cloned directly upstream of the promoter, resulting in only about 0-20 nt between the 3′ end of the anti-repressor sequence and the start of the promoter sequence.
- both expression cassettes are multicistronic expression cassettes hereof, each coding for a different selectable marker protein, so that selection for both expression cassettes is possible.
- This embodiment has proven to give good results. e.g., for the expression of the heavy and light chain of antibodies.
- both expression cassettes may be placed on one nucleic acid molecule or both may be present on a separate nucleic acid molecule, before they are introduced into host cells.
- An advantage of placing them on one nucleic acid molecule is that the two expression cassettes are present in a single predetermined ratio (e.g., 1:1) when introduced into host cells.
- the expression cassettes preferably at least one of the expression cassettes, but more preferably each of them, comprises a chromatin control element, more preferably an anti-repressor sequence.
- the different subunits or parts of a multimeric protein are present on a single expression cassette.
- STAR sequence placed upstream of a promoter in an expression cassette, it has proven highly beneficial to provide a STAR sequence on both sides of an expression cassette, such that expression cassette comprising the transgene is flanked by two STAR sequences, which in certain embodiments are essentially identical to each other.
- anti-repressor sequences flanking the expression cassette may beneficially placed in opposite direction with respect to each other, such that the 3′ end of each of these anti-repressor sequences is facing inwards to the expression cassette (and to each other).
- the 5′ side of an anti-repressor element faces the DNA/chromatin of which the influence on the transgene is to be diminished by the anti-repressor element.
- the 3′ end faces the promoter.
- transcription units or expression cassettes hereof are provided, further comprising: a) a transcription pause (TRAP) sequence upstream of the promoter that drives transcription of the multicistronic transcription unit, the TRAP being in a 5′ to 3′ direction; or b) a TRAP sequence downstream of the open reading frame of the polypeptide of interest and preferably downstream of the transcription termination sequence of the multicistronic transcription unit, the TRAP being in a 3′ to 5′ orientation; or c) both a) and b); wherein a TRAP sequence is functionally defined as a sequence which when placed into a transcription unit, results in a reduced level of transcription in the nucleic acid present on the 3′ side of the TRAP when compared to the level of transcription observed in the nucleic acid on the 5′ side of the TRAP.
- TRAP transcription pause
- Non-limiting examples of TRAP sequences are transcription termination and/or polyadenylation signals.
- One non-limiting example of a TRAP sequence is given in SEQ ID NO:142. Examples of other TRAP sequences, methods to find these, and uses thereof have been described in WO 2004/055215.
- DNA molecules comprising multicistronic transcription units and/or expression cassettes according to the disclosure can be used for improving expression of nucleic acid, preferably in host cells.
- the terms “cell”/“host cell” and “cell line”/“host cell line” are respectively typically defined as a cell and homogeneous populations thereof that can be maintained in cell culture by methods known in the art, and that have the ability to express heterologous or homologous proteins.
- Prokaryotic host cells can be used to propagate and/or perform genetic engineering with the DNA molecules of the invention, especially when present on plasmids capable of replicating in prokaryotic host cells such as bacteria.
- a host cell hereof preferably is a eukaryotic cell, more preferably a mammalian cell, such as a rodent cell or a human cell or fusion between different cells.
- the host cell is a U-2 OS osteosarcoma, CHO (Chinese hamster ovary), HEK 293, HuNS-1 myeloma, WER1-Rb-1 retinoblastoma, BHK, Vero, non-secreting mouse myeloma Sp2/0-Ag 14, non-secreting mouse myeloma NSO, NCI-H295R adrenal gland carcinomal or a PER.C6® cell.
- a host cell is a cell expressing at least E1A, and preferably also E1B, of an adenovirus.
- a cell can be derived from for instance human cells, for instance from a kidney (example: HEK 293 cells, see Graham et al., 1977), lung (e.g., A549, see, e.g., WO 98/39411) or retina (example: HER cells marketed under the trade mark PER.C6®, see U.S. Pat. No. 5,994,128), or from amniocytes (e.g., N52.E6, described in U.S. Pat. No. 6,558,948), and similarly from other cells.
- a kidney example: HEK 293 cells, see Graham et al., 1977
- lung e.g., A549, see, e.g., WO 98/39411
- retina example: HER cells marketed under the trade mark PER.C6®,
- PER.C6® cells for the purpose of the disclosure means cells from an upstream or downstream passage or a descendent of an upstream or downstream passage of cells as deposited under ECACC no. 96022940, i.e., having the characteristics of those cells. It has been previously shown that such cells are capable of expression of proteins at high levels (e.g., WO 00/63403, and Jones et al., 2003).
- the host cells are CHO cells, for instance CHO-K1, CHO-S, CHO-DG44, CHO-DUKXB11, and the like.
- the CHO cells have a dhfr ⁇ phenotype.
- Such eukaryotic host cells can express desired polypeptides, and are often used for that purpose. They can be obtained by introduction of a DNA molecule of the invention, preferably in the form of an expression cassette, into the cells.
- the expression cassette is integrated in the genome of the host cells, which can be in different positions in various host cells, and selection will provide for a clone where the transgene is integrated in a suitable position, leading to a host cell clone with desired properties in terms of expression levels, stability, growth characteristics, and the like.
- the multicistronic transcription unit may be targeted or randomly selected for integration into a chromosomal region that is transcriptionally active, e.g., behind a promoter present in the genome.
- Selection for cells containing the DNA of the invention can be performed by selecting for the selectable marker polypeptide, using routine methods known by the person skilled in the art.
- an expression cassette hereof can be generated in situ, i.e., within the genome of the host cells.
- the host cells may be from a stable clone that can be selected and propagated according to standard procedures known to the person skilled in the art.
- a culture of such a clone is capable of producing polypeptide of interest, if the cells comprise the multicistronic transcription unit of the invention.
- Cells hereof preferably are able to grow in suspension culture in serum-free medium.
- the DNA molecule comprising the multicistronic transcription unit hereof preferably in the form of an expression cassette, is integrated into the genome of the eukaryotic host cell hereof. This will provide for stable inheritance of the multicistronic transcription unit.
- Selection for the presence of the selectable marker polypeptide, and hence for expression, can be performed during the initial obtaining of the cells, and could be lowered or stopped altogether after stable clones have been obtained. It is however also possible to apply the selection agent during later stages continuously, or only occasionally, possibly at lower levels than during initial selection of the host cells.
- a polypeptide of interest hereof can be any protein, and may be a monomeric protein or a (part of a) multimeric protein.
- a multimeric protein comprises at least two polypeptide chains.
- Non-limiting examples of a protein of interest hereof are enzymes, hormones, immunoglobulin chains, therapeutic proteins like anti-cancer proteins, blood coagulation proteins such as Factor VIII, multi-functional proteins, such as erythropoietin, diagnostic proteins, or proteins or fragments thereof useful for vaccination purposes, all known to the person skilled in the art.
- an expression cassette encodes an immunoglobulin heavy or light chain or an antigen binding part, derivative and/or analogue thereof.
- a protein expression unit hereof is provided, wherein the protein of interest is an immunoglobulin heavy chain.
- a protein expression unit hereof is provided, wherein the protein of interest is an immunoglobulin light chain.
- an immunoglobulin may be encoded by the heavy and light chains on different expression cassettes, or on a single expression cassette.
- the heavy and light chain are each present on a separate expression cassette, each having its own promoter (which may be the same or different for the two expression cassettes), each comprising a multicistronic transcription unit hereof, the heavy and light chain being the polypeptide of interest, and preferably each coding for a different selectable marker protein, so that selection for both heavy and light chain expression cassette can be performed when the expression cassettes are introduced and/or present in a eukaryotic host cell.
- the polypeptide of interest may be from any source, and in certain embodiments is a mammalian protein, an artificial protein (e.g., a fusion protein or mutated protein), and preferably is a human protein.
- an artificial protein e.g., a fusion protein or mutated protein
- the configurations of the expression cassettes hereof may also be used when the ultimate goal is not the production of a polypeptide of interest, but the RNA itself, for instance for producing increased quantities of RNA from an expression cassette, which may be used for purposes of regulating other genes (e.g., RNAi, antisense RNA), gene therapy, in vitro protein production, etc.
- RNAi e.g., RNAi, antisense RNA
- gene therapy e.g., in vitro protein production, etc.
- a method for generating a host cell expressing a polypeptide of interest comprising the steps of: a) introducing into a plurality of precursor cells an expression cassette hereof, and b) culturing the generated cells under conditions selecting for expression of the selectable marker polypeptide, and c) selecting at least one host cell producing the polypeptide of interest.
- the weakest translation efficiency for the selectable marker polypeptide using the weakest translation start sequence
- far fewer colonies are obtained using the same concentration of selection agent than with known selection systems, and a relatively high percentage of the obtained clones produces the polypeptide of interest at high levels.
- the obtained levels of expression appear higher than those obtained when an even larger number of clones using the known selection systems are used.
- the selection system is swift because it does not require copy number amplification of the transgene.
- cells with low copy numbers of the multicistronic transcription units already provide high expression levels.
- High transgene copy numbers of the transgene may be prone to genetic instability and repeat-induced silencing (e.g., Kim et al., 1998; McBurney et al., 2002).
- an additional advantage of the embodiments of the invention with relatively low transgene copy numbers is that lower copy numbers are anticipated to be less prone to recombination and to repeat-induced silencing, and therefore less problems in this respect are anticipated when using host cells with a limited number of copies of the transgene compared to host cells obtained using an amplification system where hundreds or even thousands of copies of the selectable marker and protein of interest coding sequences may be present in the genome of the cell.
- the disclosure provides examples of high expression levels, using the multicistronic transcription unit selection system, while the copy number of the transgene is relatively low, i.e., less than 30 copies per cell, or even less than 20 copies per cell.
- the disclosure allows the generation of host cells hereof, comprising less than 30 copies of the multicistronic transcription unit in the genome of the host cells, preferably less than 25, more preferably less than 20 copies, while at the same time providing sufficient expression levels of the polypeptide of interest for commercial purposes, e.g., more than 15, preferably more than 20 pg/cell/day of an antibody.
- the selection system of the invention nevertheless can be combined with amplification methods to even further improve expression levels.
- This can for instance be accomplished by amplification of a co-integrated dhfr gene using methotrexate, for instance by placing dhfr on the same nucleic acid molecule as the multicistronic transcription unit of the invention, or by cotransfection when dhfr is on a separate DNA molecule.
- a method for producing a polypeptide of interest comprising culturing a host cell, the host cell comprising a DNA molecule comprising a multicistronic expression unit or an expression cassette hereof, and expressing the polypeptide of interest from the coding sequence for the polypeptide of interest.
- the host cell for this aspect is a eukaryotic host cell, preferably a mammalian cell, such as a CHO cell, further as described above.
- nucleic acid that is to be expressed in a cell can be done by one of several methods, which as such are known to the person skilled in the art, also dependent on the format of the nucleic acid to be introduced.
- the methods include but are not limited to transfection, infection, injection, transformation, and the like.
- Suitable host cells that express the polypeptide of interest can be obtained by selection as described above.
- selection agent is present in the culture medium at least part of the time during the culturing, either in sufficient concentrations to select for cells expressing the selectable marker polypeptide or in lower concentrations. In preferred embodiments, selection agent is no longer present in the culture medium during the production phase when the polypeptide is expressed.
- Culturing a cell is done to enable it to metabolize, and/or grow and/or divide and/or produce recombinant proteins of interest. This can be accomplished by methods well known to persons skilled in the art, and includes but is not limited to providing nutrients for the cell.
- the methods comprise growth adhering to surfaces, growth in suspension, or combinations thereof. Culturing can be done for instance in dishes, roller bottles or in bioreactors, using batch, fed-batch, continuous systems such as perfusion systems, and the like.
- the expressed protein is collected (isolated), either from the cells or from the culture medium or from both. It may then be further purified using known methods, e.g., filtration, column chromatography, etc, by methods generally known to the person skilled in the art.
- the selection method works in the absence of chromatin control elements, but improved results are obtained when the multicistronic expression units are provided with such elements.
- the selection method works particularly well when an expression cassette hereof, comprising at least one anti-repressor sequence is used. Depending on the selection agent and conditions, the selection can in certain cases be made so stringent, that only very few or even no host cells survive the selection, unless anti-repressor sequences are present.
- the combination of the novel selection method and anti-repressor sequences provides a very attractive method to obtain only limited numbers of colonies with a greatly improved chance of high expression of the polypeptide of interest therein, while at the same time the obtained clones comprising the expression cassettes with anti-repressor sequences provide for stable expression of the polypeptide of interest, i.e., they are less prone to silencing or other mechanisms of lowering expression than conventional expression cassettes.
- the novel selection system disclosed herein therefore also provides the possibility to test parts of anti-repressor elements for functionality, by analyzing the effects of such sequences when present in expression cassettes of the invention under selection conditions.
- This easy screen which provides an almost or even complete black and white difference in many cases, therefore can contribute to identifying functional parts or derivatives from anti-repressor sequences.
- this assay can be used to characterize them further.
- fragments of known anti-repressor sequences are tested, the assay will provide functional fragments of such known anti-repressor sequences.
- Examples 1-7 describe the testing of STAR67 and other STAR sequences in various configurations and under various circumstances. Examples 8-25 further describe the novel selection system of the invention.
- STAR anti-repressor
- FIG. 1 Three plasmids were created ( FIG. 1 ):
- construct A Plasmid pd2EGFP (Clontech 6010-1) was modified by insertion of a linker at the BsiWI site to yield pd2EGFP-link.
- the linker made by annealing oligonucleotides GTACGGATATCAGATCTTTAATTAAG (SEQ ID NO:67) and GTACCTTAATTAAAGATCTGATAT (SEQ ID NO:68), introduced sites for the PacI, BglII, and EcoRV restriction endonucleases. This created the multiple cloning site MCSII for insertion of STAR elements.
- pIRES-stuf was digested with BglII and FspI to liberate a DNA fragment composed of the stuffer fragment, the CMV promoter, the IRES element (flanked by multiple cloning sites MCS A and MCS B), and the SV40 polyadenylation signal. This fragment was ligated with the vector backbone of pd2EGFP-link produced by digestion with BamHI and StuI, to yield pIRES-link.
- the open reading frame of the Zeocin® antibiotic-resistance gene was inserted into BamHI/NotI sites downstream of the pIRES as follows: the Zeocin® antibiotic-resistance ORF was amplified by PCR with primers GATCGGATCCTTCGAAATGGCCAAGTTGACCAGTGC (SEQ ID NO:71) and AGGCGCGGCCGCAATTCTCAGTCCTGCTCCTC (SEQ ID NO:72) from plasmid pCMV/zeo (Invitrogen, cat. no. V50120), digested with BamHI and NotI, and ligated with BamHI/NotI-digested pIRES-link to yield pIRES-link-zeo.
- the Zeocin® antibiotic-resistance ORF was amplified by PCR with primers GATCGGATCCTTCGAAATGGCCAAGTTGACCAGTGC (SEQ ID NO:71) and AGGCGCGGCCGCAATTCTCAGTCCTGCTCCTC (
- the d2EGFP reporter ORF was introduced into pIRES-link-zeo by amplification of pd2EGFP (Clontech 6010-1) with primers GATCGAATTCTCGCGAATGGTGAGCAAGCAGATCCTGAAG (SEQ ID NO:73) and AGGCGAATTCACCGGTGTTTAAACTTACACCCACTCGTGCAGGCTGCCCAGG (SEQ ID NO:74), and insertion of the EcoRI-digested d2EGFP cassette into the EcoRI site in the pIRES-link-zeo plasmid.
- This created construct A CMV-d2EGFP-IRES-Zeo (CMV Control).
- STAR67 was cloned upstream of the CMV promoter, in the AscI site (about 15 nt remaining between STAR67 and the promoter). This created construct B, STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67).
- STAR67 was also tested in the context of a cassette that contains also the STAR7, cloned directionally in the 5′ SalI and XbaI sites and 3′ BglII and PacI sites to flank the entire cassette with STAR7. This is construct C (CMV-STAR67 7/7).
- the UB6 and EF1 ⁇ promoters were exchanged for the CMV promoter in the plasmids described in FIG. 1 .
- the UB6 promoter was cloned as follows. A DNA 0.37 kb stuffer from the pd2EGFP plasmid was amplified by PCR, as described in Example 1, using primers identified by SEQ ID NOS: 69 and 70. The resulting DNA stuffer was cloned in the BglII site of pUB6/V5-His [Invitrogen V250-20], creating pUB6-stuf. From pUB6-stuf an AscI-SacI fragment was cloned into CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed.
- the EF1 ⁇ promoter was amplified by PCR with pEF1 ⁇ /V5-His [Invitrogen V920-20] as template, using primers GATCGGCGCGCCATTTAAATCCGAAAAGTGCCACCTGACG (SEQ ID NO:79) and AGGCGGGACCCCCTCACGACACCTGAAATGGAAG (SEQ ID NO:80).
- the PCR fragment was cloned in the AscI and PpuMI sites of CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed.
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO 2 .
- Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence.
- Lipofectamine® reagent was combined with plasmid DNA at a ratio of 6 microliters per microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine) and added to the cells. After overnight incubation the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium. After another overnight incubation Zeocin® antibiotic was added to a concentration of 50 ⁇ g/ml and the cells were cultured further. After another three days the medium was replaced by fresh medium containing Zeocin® antibiotic (100 ⁇ g/ml) and cultured further.
- Zeocin® antibiotic 100 ⁇ g/ml
- FIG. 2 shows that transfection of the construct that contains STAR67 cloned upstream of the CMV promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, CMV Control.
- the average of the d2EGFP signal in the 10 colonies transfected with the CMV Control plasmid was 34, when measured 25 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was reduced to 13, indicating that expression is not stable over time.
- the average of the d2EGFP signal in the ten colonies transfected with the STAR67-CMV plasmid was 42 when measured after 25 days and 32 measured 60 days after transfection.
- a STAR67-encompassing CMV construct conveyed a factor 2.5 higher CMV promoter driven expression level of the reporter protein in stably transfected clones.
- colonies containing the STAR67 construct are more stable over time in the absence of selection pressure than colonies that do not contain a STAR67 construct.
- FIG. 3 shows that transfection of the construct that contains STAR67 cloned upstream of the EF1 ⁇ -promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, EF1 ⁇ Control.
- the average of the d2EGFP signal in the 10 colonies transfected with the EF1 ⁇ Control plasmid was 31, when measured 25 days after transfection. Sixty days after transfection, the average of the d2EGFP signal in these ten colonies was 26.
- the average of the d2EGFP signal in the ten colonies transfected with the EF1 ⁇ -STAR67 plasmid was 60 when measured after 25 days and 76 measured 60 days after transfection.
- a STAR67-encompassing EF1 ⁇ construct conveyed a factor 2.9 higher EF1 ⁇ promoter driven expression level of the reporter protein in stably transfected clones.
- FIG. 4 shows that transfection of the construct that contains STAR67 cloned upstream of the UB6 promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, UB6 Control.
- the average of the d2EGFP signal in the ten colonies transfected with the UB6 Control plasmid was 51, when measured 25 days after transfection. 60 days after transfection the average of the d2EGFP signal in these ten colonies was 29, indicating that the expression was not stable over time.
- the average of the d2EGFP signal in the ten colonies transfected with the UB6-STAR67 plasmid was 218 when measured after 25 days and 224 measured 60 days after transfection.
- a STAR67-encompassing UB6 construct conveyed a factor 4.3 higher UB6 promoter driven expression level of the reporter protein in stably transfected clones. After 60 days, this factor was 7.7, due to instability of expression in the control colonies and stability in the UB6-STAR67 colonies.
- 25 days after transfection, after the first measurement selection pressure was removed by culturing the colonies in medium without Zeocin® antibiotic. Hence, colonies containing the STAR67 construct are more stable over time in the absence of selection pressure than colonies that do not contain a STAR67 construct.
- STAR67 increases expression from three different, unrelated promoters.
- PER.C6® cells were cultured in DMEM medium+pyridoxine+9% Fetal Bovine Serum (Non-Heat Inactivated), 8.9 mM MgCl 2 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./10% CO 2 .
- Cells were transfected with the plasmids using Lipofectamine 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to 6-well plates and grown overnight to 70-90% confluence.
- Lipofectamine reagent was combined with plasmid DNA at a ratio of 15 microliters per 3 microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine) and added after 30 minutes incubation at 25° C. to the cells. After six hours incubation, the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded (1:15, 1:30, 1:60, 1:120 dilutions) into fresh Petri dishes (90 mm) with fresh medium with zeocin added to a concentration of 100 ⁇ g/ml and the cells were cultured further.
- FIG. 5 shows that transfection of the construct that contains STAR67 cloned upstream of the CMV promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, CMV Control.
- the average of the d2EGFP signal in the ten colonies transfected with the CMV Control plasmid was 37, when measured 30 days after transfection. 60 days after transfection the average of the d2EGFP signal in these ten colonies was reduced to 14, indicating that expression was not stable over time.
- the average of the d2EGFP signal in the 10 colonies transfected with the STAR67-CMV plasmid was 101 when measured after 30 days and 45 measured 60 days after transfection.
- a STAR67-encompassing CMV construct conveyed a factor 3.2 higher CMV promoter driven expression level of the reporter protein in stably transfected clones.
- FIG. 6 shows that transfection of the construct that contains STAR67 cloned upstream of the EF1 ⁇ -promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, EF1 ⁇ Control.
- the average of the d2EGFP signal in the ten colonies transfected with the EF1 ⁇ Control plasmid was five, when measured 30 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was six.
- the average of the d2EGFP signal in the ten colonies transfected with the EF1 ⁇ -STAR67 plasmid was 25 when measured after 30 days and 20 measured 60 days after transfection.
- a STAR67-encompassing EF1 ⁇ construct conveyed a factor 4 higher EF1 ⁇ promoter driven expression level of the reporter protein in stably transfected clones.
- FIG. 7 shows that transfection of the construct that contains STAR67 cloned upstream of the UB6 promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, UB6 Control.
- the average of the d2EGFP signal in the ten colonies transfected with the UB6 Control plasmid was four, when measured 30 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was two. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the UB6-STAR67 plasmid was 27 when measured after 30 days and 18 measured 60 days after transfection.
- a STAR67-encompassing UB6 construct conveyed a factor seven- to nine-fold higher UB6 promoter driven expression level of the reporter protein in stably transfected clones.
- STAR67 functions in different, unrelated cell types.
- the SV40 promoter was amplified by PCR with pIRES as template using primers TTGGTTGGGGCGCCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGG (SEQ ID NO:81) and TTGGTTGGGAGCTCAAGCTTTTTGCAAAAGCCTAGGCCTCCAAA AAAGCCTCCTC (SEQ ID NO:82).
- the PCR fragment was cloned in the AscI and SacI sites of CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed.
- CHO cells were transfected, colonies were isolated and propagated and analyzed as in Example 2.
- FIG. 8 shows that transfection of the construct that either contains STAR67 cloned upstream of the SV40 promoter (SV40-STAR67) or STAR 7 cloned to flank the entire construct (SV40-STAR 7/7) did not result in CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STARs (SV40 Control).
- the average of the d2EGFP signal in the 18 colonies transfected with the SV40 Control plasmid was 86, when measured 40 days after transfection.
- the average of the d2EGFP signal in the 18 colonies transfected with the SV40-STAR67 7/7 plasmid was 209 when measured after 40 days colonies, which is a factor 2.4 higher than the average of the 18 control colonies (86).
- the STAR67 element is used in combination with another STAR element, this results in a number of stably transfected CHO colonies that show significantly higher d2EGFP expression levels.
- flanking STAR7 elements were replaced by flanking STAR6 elements or by flanking STAR4 elements, in combination with STAR67 upstream of the SV40 promoter (SV40-STAR67 6/6 and SV40-STAR67 4/4, respectively, using the same nomenclature as above), and observed improved expression also with these combinations.
- Example 4 we showed that the combination of STAR67 and STAR7 enhanced the expression levels of d2EGFP protein in CHO cells.
- the combination of STAR67 and STAR7 could be used for the production of an antibody.
- the heavy chain cDNA (HC-EpCAM) was cloned in a construct encompassing the UB6 promoter.
- the HC-EpCAM was coupled to the Zeocin® antibiotic resistance gene by an IRES sequence.
- the light chain cDNA (LC-EpCAM) was also cloned in a construct encompassing the UB6 promoter.
- the LC-EpCAM was coupled to the puromycin resistance gene by an IRES sequence. Together these two plasmids represent the UB6 (HC+LC) Control ( FIG. 9 ).
- STAR67 was cloned in both HC+LC constructs, upstream of the UB6 promoters.
- STAR7 was cloned to flank the entire cassettes, both at the 5′ and 3′ end ( FIG. 9 ). These two plasmids represent STAR7-STAR67-UB6 (HC+LC) STAR7.
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was transfected and cultured as in Example 2, using Zeocin® antibiotic (100 ⁇ g/ml) and puromycin (2.5 ⁇ g/ml) for selection.
- Zeocin® antibiotic 100 ⁇ g/ml
- puromycin 2.5 ⁇ g/ml
- Zeocin® antibiotic was added to the culture medium.
- the culture medium was removed and replaced with culture medium containing puromycin. After approximately seven days, colonies were isolated and transferred to 24-well plates, in culture medium containing Zeocin® antibiotic only.
- FIG. 9 shows that that transfection of antibody constructs that contain STAR67 cloned upstream of the UB6 promoter and two STAR7 elements cloned to flank the entire cassettes resulted in a number of CHO colonies that express significantly higher levels of EpCAM antibody (measured by ELISA using an anti-human IgG antibody) as compared to the “empty” control without STAR67 and STAR7, UB6 (HC+LC) Control.
- the average of the EpCAM production in the 18 colonies transfected with the UB6 (HC+LC) Control plasmid was 2.7 pg/cell/day, when measured 25 days after transfection. Selection agents Zeocin® antibiotic and puromycin were removed after 25 days.
- the average of the EpCAM production in these 18 colonies was 2.7 pg/cell/day.
- the average of the EpCAM production signal in the 18 colonies transfected with the STAR7-STAR67-UB6 (HC+LC)-STAR7 plasmid was 6.7 when measured after 25 days and 7.7 pg/cell/day, when measured 45 days after transfection.
- a STAR67/STAR7-encompassing UB6 construct conveyed a factor 2.5 to 2.9 fold higher UB6 promoter driven EpCAM expression level in stably transfected CHO clones.
- STAR67 is not an Enhancer Blocker, Whereas STAR6 and STAR7 are
- Enhancer blocker activity is tested by placing a STAR element between a strong enhancer and a promoter. Here we tested whether also STAR67 is an enhancer blocker.
- the d2EGFP gene was PCR-amplified using primers TTGGTTGGTCATGAATGGTGAGCAAGGGCGAGGAGCTGTTC (SEQ ID NO:75) and ATTCTCTAGACTACACATTGATCCTAGCAGAAGCAC (SEQ ID NO:76) and cloned into plasmid pGL3-promoter (Promega) using the NcoI and XbaI restriction sites to replace the Luciferase gene to create plasmid pGL3-promoter-GFP.
- a linker (created by annealing oligos CGATATCTTGGAGATCTACTAGTGGCGCG CCTTGGGCTAGCT (SEQ ID NO:77) and GATCAGCTAGCCCAAGGCGCGCCACTAGTAGATCTCC AAGATATCGAGCT (SEQ ID NO:78), was cloned in the SacI and BglII sites to create multiple cloning sites. The original BglII site was destroyed upon ligation of the linker, creating a new unique BglII site within the linker DNA.
- the SV40-enhancer was cut from plasmid pGL3-basic (Promega) using BsaBI and BamHI and cloned into the pGL3-linker-promoter-GFP using the EcoRV and BglII sites creating plasmid pGL3-enhancer-promoter-GFP.
- the STAR40 element was placed upstream of the SV40 enhancer using KpnI and SacI sites to prevent action of the enhancer on upstream sequences. Finally, STAR elements 6, 7 and 67 were placed in between the SV40 enhancer and the SV40 minimal promoter using the SpeI and AscI restriction sites.
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) is transfected as in Example 2. One day after the transfection, d2EGFP levels are measured on an Epics XL flow cytometer (Beckman Coulter)
- FIG. 10 shows that STAR67 is not an enhancer blocker, whereas STAR6 and STAR7 act enhancer blockers in the same assay.
- STAR6, STAR7 and STAR67 were cloned between the SV40 enhancer and a minimal SV40-promoter upstream of the d2EGFP gene.
- no STAR element was cloned between the enhancer and the promoter strong transcriptional activation occurred (arbitrarily set at 100%).
- STAR6 or STAR7 was placed between the enhancer and the promoter, transcription dropped to background levels, indicating that STAR6 and STAR7 are potent enhancer blockers.
- Example 5 we showed that the combination of STAR67 and STAR7 enhanced the expression levels of EpCAM antibody in CHO cells, in the context of two distinct plasmids, which contained the heavy and light chains.
- STAR67 could be used for the production of EpCAM antibody when both heavy and light chains are placed on one plasmid. We used simultaneous selection for each selectable marker.
- the heavy chain cDNA (HC-EpCAM) is under the control of the UB6 promoter and coupled to the Zeocin resistance gene by an IRES sequence.
- the light chain cDNA (LC-EpCAM) is under control of the CMV promoter and coupled to the puromycin resistance gene by an IRES sequence. Basically these are the constructs used in Example 5. These two expression cassettes were placed on one plasmid, in such a manner that transcription of the two expression units had opposite directions. In the control plasmid the UB6 and CMV promoters were separated by a stuffer of 500 bp (EpCAM Control) ( FIG. 10 ). In another plasmid STAR67 was placed between the UB6 and CMV promoter (EpCAM STAR67) ( FIG. 10 ).
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was transfected and cultured as in Example 2, using Zeocin® antibiotic (100 ⁇ g/ml) and puromycin (2.5 ⁇ g/ml) for selection. In Example 5, consecutive selection for both selection markers was used. In contrast, here both selection agents were present in the culture medium simultaneously.
- Zeocin® antibiotic and puromycin were added to the culture medium. The selection medium was present until colonies were isolated (approximately 14 days after transfection). After colonies were isolated and transferred to 24-well plates, the cells were cultured in the presence of Zeocin® antibiotic and puromycin.
- FIG. 11 shows that that transfection of the antibody construct that contains STAR67 cloned between the UB6 and CMV promoters resulted in a number of CHO colonies that express EpCAM antibody (measured by ELISA using an anti-human IgG antibody).
- EpCAM antibody measured by ELISA using an anti-human IgG antibody.
- the average of the EpCAM production in the 19 colonies transfected with the EpCAM STAR67 plasmid was 9.8 pg/cell/day, when measured 25 days after transfection.
- EpCAM control and EpCAM STAR67 plasmid The difference between the EpCAM control and EpCAM STAR67 plasmid is black-white in the sense that only transfection of EpCAM STAR67 results in the establishment of colonies, when the selection pressure is high. This opens an opportunity to use this plasmid configuration for identifying the region in STAR67 that is responsible for mediating this effect. Smaller, overlapping portions of STAR67 are placed between the UB6 and CMV promoters, driving the EpCAM molecule. When a portion of STAR67 is functional, colonies will survive when both Zeocin® antibiotic and puromycin are simultaneously used as selection agent. When a portion of STAR67 is not functional, no colonies will survive under identical selection conditions.
- the basic idea behind the development of a novel selection system is to place the gene encoding the resistance gene upstream of a gene of interest, and one promoter drives the expression of this bicistronic mRNA.
- the translation of the bicistronic mRNA is such that only in a small percentage of translation events the resistance gene will be translated into protein and that most of the time the downstream gene of interest will be translated into protein.
- the translation efficiency of the upstream resistance gene must be severely hampered in comparison to the translation efficiency of the downstream gene of interest. To achieve this, three steps can be taken hereof:
- the searching ribosome preferably should not meet another AUG, since any downstream AUG may serve as translation start codon, resulting in a lower translation efficiency of the second, downstream gene of interest.
- any AUG in the resistance gene mRNA will have to be replaced.
- this AUG is a functional codon that encodes a methionine, this amino acid will have to be replaced by a different amino acid, for instance by a leucine ( FIGS. 12A and B);
- the start codon of the resistance gene must have a bad context (be part of a non-optimal translation start sequence); i.e., the ribosomes must start translation at this start codon only in a limited number of events, and hence in most events continue to search for a better, more optimal start codon ( FIG. 12C-E ).
- Three different stringencies can be distinguished: a) the normal ATG start codon, but placed in a bad context (TTT ATG T) (called ATGmut) ( FIG. 12C ), b) preferably when placed in an optimal context, GTG can serve as start codon (ACC GTG G) ( FIG.
- TTG can serve as start codon (ACC TTG G) ( FIG. 12E ).
- the most stringent translation condition is the TTG codon, followed by the GTG codon ( FIG. 12 ).
- the Zeo mRNA with a TTG as start codon is expected to produce the least Zeocin® antibiotic resistance protein and will hence convey the lowest functional Zeocin® antibiotic resistance to cells ( FIGS. 12 , 13 ).
- the normal start codon ( ATG ) of the downstream gene of interest should have an optimal translation context (e.g., ACC ATG G) ( FIG. 13A-D ). This warrants that, after steps 1 and 2 have been taken, in most events the start codon of the gene of interest will function as start codon of the bicistronic mRNA.
- step 1 is performed, that is, in the Zeocin® antibiotic resistance gene one existing internal methionine is replaced by another amino acid ( FIG. 12B-E ). It is important that after such a change the Zeo protein still confers Zeocin® antibiotic resistance to the transfected cells. Since it is not known beforehand which amino acid will fulfill this criterion, three different amino acids have been tried: leucine, threonine and valine. The different constructs with distinct amino acids have than been tested for their ability to still confer Zeocin® antibiotic resistance to the transfected cells.
- the original Zeo open reading frame has the following sequence around the start codon: AA ACCATGG CC (start codon in bold; SEQ ID NO:83). This is a start codon with an optimal translational context ( FIG. 12A ).
- the amplified product was cut with NruI-EcoRI, and ligated into pcDNA3, resulting in pZEOATGmut.
- the original Zeo open reading frame contains an in frame ATG, encoding methionine at amino acid position 94 (out of 124).
- This internal ATG, encoding the methionine at position 94 was changed in such a way that the methionine was changed into leucine, threonine or valine respectively:
- part of the Zeo open reading frame was amplified using primer pair ZEOforwardMUT (SEQ ID NO:84) and ZEO-THRreverse (SEQ ID NO:87): AGGCCCCGCCCCCACGGCTGCTCGCCGATCTCGGT GGT GGCCGGC.
- the PCR product was cut with BamHI-BglI and ligated into pZEOATGmut. This resulted in pZEO(thr).
- pZEO(thr) To replace the internal codon for methionine in the Zeo open reading frame with the codon for valine (not shown, but as in FIG.
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO 2 .
- Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence.
- Lipofectamine® reagent was combined with plasmid DNA at a ratio of 6 microliters per microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine® reagent) and added to the cells. After overnight incubation, the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium containing Zeocin® antibiotic (100 ⁇ g/ml). When individual colonies became visible (approximately ten days after transfection), colonies were counted.
- Zeocin® antibiotic 100 ⁇ g/ml
- plasmids were transfected to CHO-K1 cells, 1) pZEO(WT), 2) pZEO(leu), 3) pZEO(thr), and 4) pZEO(val).
- the cells were selected on 100 ⁇ g/ml Zeocin® antibiotic. Transfection of pZEO(leu) resulted in an equal number of Zeocin® antibiotic resistant colonies in comparison with the control pZEO (WT), pZEO(thr) and pZEO(val) gave less colonies, but the differences were not in the order of a magnitude.
- the start codon of the d2EGFP gene was first optimized (step 3 in Example 8). After that, the different versions of the Zeocin® antibiotic resistance gene were created. The differences between these versions are that they have different stall codons, with distinct translational efficiency (step 2 in Example 9, FIG. 12C-E ). These different Zeocin® antibiotic resistance gene versions were cloned upstream of the modified d2EGFP gene ( FIG. 13 ).
- the d2EGFP reporter ORF was introduced into pcDNA3.
- the sequence around the start codon of this d2EGFP cDNA is GAA TTCATG GG (start codon in bold; SEQ ID NO:89), which is not optimal.
- d2EGFP was amplified from pd2EGFP (Clontech 6010-1) with primers d2EGFPforwardBamHI (SEQ ID NO:90): GATCGGATCCTATGAGGAATTCGCC ACCATGG TGAGCAAGGGCGAGGAG and d2EGFPreverseNotI (SEQ ID NO:91): AAGGAAAAAAAAGCGGCCGCCTACACATTGATCCTAGCAGAAG.
- This product contains now a start codon with an optimal translational context ( ACCATGG ).
- the optimization of the translational start sequence of the gene of interest (here: EGFP as a model gene) is not essential but preferred in order to skew the translation initiation frequency towards the gene of interest still further.
- ATG as a start codon in the Zeo resistance gene but in a bad context ( TTTATGT ) (not shown, but as in FIG. 13B ) and followed by spacer sequence, instead of the optimal ATG ( FIG. 13A ).
- the spacer sequence is placed downstream of the ATG sequence.
- a secondary structure is present, causing the ribosome to be temporarily delayed. Because of this, a poor start codon can in some cases be used by the ribosome, despite being a bad start codon or being in a non-optimal context for translation initiation.
- a spacer sequence is introduced that does not contain a secondary structure (Kozak, 1990).
- the term “space” is introduced, and used in the plasmid and primer names to indicate the presence of such a spacer sequence.
- the spacer removes the “ribosome delaying sequence” from the neighborhood of the initiation codon, therewith causing the ribosome to start translating less frequently, and hence increasing the stringency of the selection hereof.
- the spacer introduces some extra amino acids in the coding sequence.
- the name of the selectable marker polypeptide is referred to by abbreviation (e.g., Zeo, Blas, etc.); the start codon is mentioned (e.g., ATG, GTG, TTG); when this start codon is placed in a non-optimal context for translation initiation, the addition “mut” is used (this is usually only done for ATG start codons, as combining a non-optimal context with a non-ATG start codon usually does not result in sufficient translation initiation to allow for selection); when a spacer sequence is used behind the start codon, the addition “space” is used (this is done usually for “ATGmut” start codons for Zeo or Bias selectable markers).
- abbreviation e.g., Zeo, Blas, etc.
- the start codon is mentioned (e.g., ATG, GTG, TTG); when this start codon is placed in a non-optimal context for translation initiation, the addition “mut” is used (this is usually only done for
- the Zeo open reading frame was amplified with primer pair ZEOforwardBamHI- ATG mut/space (SEQ ID NO:93): GATCGGATCCTTGG TTTATGT CGATCCAAAG ACTGCCAAATCTAGATCCGAGATTTTCAGGAGCTAAGGAAGCTAAAGCCAAGTTGACCAGTGAAGTTC (wherein the sequence following the underlined sequence comprises the spacer sequence), and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGF, cut with EcoRI-BamHI, creating pZEO- ATGmut/space -d2EGFP.
- the Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO 2 .
- Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence.
- Lipofectamine® reagent was combined with plasmid DNA at a ratio of 15 microliters per 3 microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine® reagent) and added after 30 minutes incubation at 25° C. to the cells. After overnight incubation the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium. After another overnight incubation, Zeocin® antibiotic was added to a concentration of 50 ⁇ g/ml and the cells were cultured further.
- CHO-K1 cells were transfected with constructs that contain the ATGmut/space Zeo ( FIG. 13B ), GTG Zeo ( FIG. 13C ) and TTG Zeo ( FIG. 13D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene.
- constructs that contain the ATGmut/space Zeo ( FIG. 13B ), GTG Zeo ( FIG. 13C ) and TTG Zeo ( FIG. 13D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene.
- These three constructs were without STAR elements (Control) or with STAR elements 7 and 67 upstream of the CMV promoter and STAR 7 downstream from the d2EGFP gene ( FIG. 14 ).
- FIG. 14 shows that both the control (without STAR elements) constructs with ATGmut/space Zeo (A) and GTG Zeo (B) gave colonies that expressed d2EGFP protein.
- the average d2EGFP expression level of 24 ATGmut/space Zeo colonies was 46 and of GTG Zeo colonies was 75. This higher average expression level in GTG Zeo colonies may reflect the higher stringency of GTG, in comparison with ATGmut/space (Example 8).
- Addition of STAR elements 7 and 67 to the constructs resulted in colonies that had higher average d2EGFP expression levels.
- Transfection of the ATGmut/space Zeo STAR 7/67/7 construct resulted in colonies with an average d2EGFP expression level of 118, which is a factor 2.6 higher than the average in the control cells (46).
- Addition of STAR elements to the GTG Zeo construct resulted in an average d2EGFP expression level of 99, which is a factor 1.3 higher than the average in the control cells (75).
- Example 9 The results in Example 9 indicate that the TTG Zeo has extremely stringent translation efficiency, which might be to high to convey Zeocin® antibiotic resistance to the cells.
- the transfection was scaled up to test whether there would be some colonies that have such high expression levels that they survive. Scaling up the experiment could also address the question whether the high average of TTG Zeo STAR 7/67/7 would become higher when more colonies were analyzed.
- CHO-K1 cells were transfected with the constructs that have the TTG Zeo gene as selection marker, with and without STAR elements 7 and 67 ( FIG. 15 ). Transfections, selection, culturing etc were as in Example 9, except that six times more cells, DNA and Lipofectamine 2000 were used. Transfections and selection were done in Petri dishes.
- FIG. 15A shows that transfection with the TTG Zeo STAR 7/67/7 construct resulted in the generation of many colonies with an average d2EGFP signal of 560. This is as high as in Example 9, except that now 58 colonies were analyzed.
- the average d2EGFP expression level was 61, and when STAR elements 7 and 67 were added to such a construct, the average d2EGFP expression level was 125, a factor 2 above the control ( FIG. 15B ).
- the average of the TTG Zeo STAR 7/67/7 colonies was therefore a factor 9.2 higher than the STAR-less IRES-Zeo colonies and a factor 4.5 higher than the STAR7/67/7 IRES Zeo colonies.
- the novel selection method hereof can be applied with expression cassettes that do not contain chromatin control elements, although it is clearly preferred to use expression cassettes comprising at least one such element, preferably a STAR element.
- FIG. 25A There are four internal ATGs in the blasticidine resistance gene, none of which codes for a methionine ( FIG. 25A ). These ATGs have to be eliminated though ( FIG. 25B ), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble blasticidine resistance protein. More importantly, these ATGs will prevent efficient translation of the gene of interest, as represented by d2EGFP in this example for purposes of illustration. To eliminate the internal ATGs, the blasticidine resistance protein open reading frame was first amplified with four primer pairs, generating 4 blasticidine resistance protein fragments. The primer pairs were:
- BSDBamHIforward (SEQ ID NO: 96): GATCGGATCCACCATGGCCAAGCCTTTGTCTCAAG BSD150reverse (SEQ ID NO: 97): GTAAAATGATATACGTTGACACCAG B) BSD150forward (SEQ ID NO: 98): CTGGTGTCAACGTATATCATTTTAC BSD250reverse (SEQ ID NO: 99): GCCCTGTTCTCGTTTCCGATCGCG C) BSD250forward (SEQ ID NO: 100): CGCGATCGGAAACGAGAACAGGGC BSD350reverse (SEQ ID NO: 101): GCCGTCGGCTGTCCGTCACTGTCC D) BSD350forward (SEQ ID NO: 102): GGACAGTGACGGACAGCCGACGGC BSD399reverse (SEQ ID NO: 103): GATCGAATTCTTAGCCCTCCCACACGTAACCAGAGGGC
- Fragments A to D were isolated from an agarose gel and mixed together. Next, only primers BSDBamHIforward and BSD399reverse were used to create the full length blasticidine resistance protein cDNA, but with all internal ATGs replaced. The reconstituted blasticidine was then cut with EcoRI-BamHI, and cloned into pZEO- GTG -d2EGFP, cut with EcoRI-BamHI (which releases Zeo), resulting in pBSDmut-d2EGFP. The entire blasticidine resistance protein open reading frame was sequenced to verify that all ATGs were replaced.
- the mutated blasticidine resistance protein open reading frame in pBSD-d2EGFP was amplified using primers BSDforwardBamHIAvrII- GTG (SEQ ID NO:106): GATCGGATCCTAGG ACCGTGG CCAAGCCTTTGTCTCAAGAAG and BSD399reverseEcoRIAvrII (SEQ ID NO:105), the PCR product was cut with BamHI-EcoRI, and ligated into pZEO- GTG -d2EGFP, cut with EcoRI-BamHI. This results in pBSD- GTG -d2EGFP.
- CHO-K1 cells were transfected with constructs that contain the GTG Blas ( FIG. 16A ) and TTG Blas ( FIG. 16B ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene. Selection took place in the presence of 20 ⁇ g/ml Blasticidine.
- the two constructs were without STAR elements (Control) or with STAR elements 7 and 67 upstream of the CMV promoter and START downstream from the d2EGFP gene ( FIG. 16 ).
- FIG. 16 shows that both the control (without STAR elements) constructs with GTG Blas (A) and TTG Blas (B) gave colonies that expressed d2EGFP protein.
- the average d2EGFP signal of 24 GTG Blas colonies was 14.0 ( FIG. 16A ) and of TTG Blas colonies was 81 ( FIG. 16B ).
- This higher average expression level in TTG Bias colonies may reflect the higher stringency of TTG, in comparison with GTG (see also Example 9). However, only eight colonies survived under the more stringent TTG conditions.
- Colonies described in Example 10 were further cultured under various conditions to assess the stability of d2EGFP expression over an extended time period.
- the TTG Zeo STAR 7/67/7 containing colonies in FIG. 15A were cultured for an additional 70 days in the presence of 100 ⁇ g/ml Zeocin® antibiotic. As shown in FIG. 17 , the average d2EGFP signal rose from 560.2 after 35 days to 677.2 after 105 days. Except for some rare colonies, the colonies had a higher d2EGFP expression level.
- the anti-EpCAM antibody (see also Example 5) was taken as an example.
- a plasmid was created on which both the heavy chain (HC) and light chain (LC) were placed, each in a separate transcription unit ( FIG. 20-22 ). Expression of both chains was driven by the CMV promoter.
- the Zeocin® antibiotic resistance gene was placed, either with the ATGmut/space ( FIG. 20 ), GTG ( FIG. 21 ) or TTG ( FIG. 22 ) as start codon (see Example 9).
- the blasticidin resistance gene was placed, either with the ATGmut/space ( FIG. 20 ), GTG ( FIG. 21 ) or TTG ( FIG. 22 ) as start codon (see Example 11).
- STAR elements Two types of constructs were made, one construct without STAR elements (Control) and one construct with a combination of STAR 7 and 67 elements.
- the STAR elements were placed as follows: upstream of each CMV promoter (i.e., one for the transcription unit comprising HC and one for the transcription unit comprising LC) STAR 67 was placed and the resulting construct was flanked with a 5′ and 3′ STAR 7 element ( FIGS. 20-22 ). All constructs were transfected to CHO-K1 cells and selected on 100 ⁇ g/ml Zeocin® antibiotic and 20 ⁇ g/ml Blasticidin (at the same time).
- FIG. 20 shows that the STAR 7/67/7 combination had a beneficial effect on EpCAM production.
- the ATGmut/space Zeo and ATGmut/space Blas had no effect on the number of colonies that were formed with plasmids containing STAR elements or not.
- the average EpCAM expression levels of either 24 control versus STAR 7/67/7 colonies ranged from 0.61 pg/cell/day in the control to 3.44 pg/cell/day in the START/67/7 construct ( FIG. 20 ). This is a factor 5.6 increase.
- FIG. 21 also shows that the STAR 7/67/7 combination had a beneficial effect on EpCAM production, using the GTG start codon for the markers.
- the GTG Zeo and GTG Blas STAR 7/67/7 construct approximately 2 times more colonies were formed.
- the average EpCAM expression levels of either 24 control versus STAR 7/67/7 colonies ranged from 2.44 pg/cell/day in the control to 6.51 pg/cell/day in the STAR7/67/7 construct ( FIG. 21 ). This is a factor 2.7 increase. Also the average EpCAM production in the highest five colonies was compared.
- FIG. 22 shows that with the TTG Zeo and TTG Blas control construct no colonies were formed, similar as in Example 9. With the STAR 7/67/7 TTG construct colonies were formed. The average EpCAM expression levels of the STAR 7/67/7 TTG colonies was 10.4 pg/cell/day ( FIG. 22 ). This is again higher than with the ATGmut/space and GTG as start codon (see FIGS. 20 , 21 for comparison). The average EpCAM production in the highest five TTG STAR 7/67/7 colonies was 22.5 pg/cell/day.
- the results show that the selection system can also be applied to two simultaneously produced polypeptides, in this case two polypeptides of a multimeric protein, casu quo an antibody.
- the EpCAM production closely follows the results obtained with d2EGFP.
- the TTG as start codon is more stringent than the GTG start codon, which in turn is more stringent than the ATGmut/space ( FIGS. 12 and 13 ).
- Higher stringency results in a decreasing number of colonies, with no colonies in the case of the TTG control that has no STAR elements, and higher stringency of the selection marker is coupled to higher expression of the protein of interest.
- Example 8 Different versions of the Zeocin® antibiotic resistance gene with mutated start codons were described in Example 8. Besides the described GTG codons (Example 8, FIG. 33A ), additional modified start codons with distinct translational efficiency are possible. These different Zeocin® antibiotic resistance gene versions were created ( FIG. 33 ) and cloned upstream of the modified d2EGFP gene, as in Example 9.
- GTG as a start codon in the Zeo resistance gene ( FIG. 33A ), but followed by a spacer sequence ( FIG. 33B ).
- the mutspace-Zeo open reading frame was amplified with primer pair GTGspaceBamHIF (SEQ ID NO:122): GAATTCGGATCCACC GTG GCGATCCAAAGACTGCCAAATCTAG and (wherein the sequence following the underlined sequence comprises the spacer sequence), and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO- GTGspace -d2EGFP.
- FIG. 33D GTG as a start codon in the Zeo resistance gene, instead of ATG ( FIG. 33A ), but with an additional mutation in the Zeo open reading frame at Pro9, which was replaced with threonine (Thr) ( FIG. 33D ).
- the Thr9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardGTG-Thr9 (SEQ ID NO:124): AATTGGATCCACC GTG GCCAAGTTGACCAGTGCCGTT ACC GTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO- GTG -Thr9-d2EGFP.
- the Phe9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForward GTG-Phe9 (SEQ ID NO:125): AATTGGATCCACC GTG GCCAAGTTGACCAGTGCCGTT TTC GTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO- GTG -Phe9-d2EGFP.
- CHO-K1 cells were transfected with constructs that contain the GTG Zeo ( FIG. 33A ), GTGspace Zeo ( FIG. 33B ), TTT GTG Zeo (also called: GTGmut Zeo) ( FIG. 33C ), GTG Thr9 Zeo(leu) ( FIG. 33D ) and GTG Phe9 Zeo(leu) ( FIG. 33D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene.
- These five constructs were without STAR elements (Control) or with STAR elements 7 and 67 upstream of the CMV promoter and STAR 7 downstream from the d2EGFP gene ( FIG. 33 ).
- the higher stringencies of the novel GTG mutations correlate with higher mean fluorescence signals ( FIG. 34 ).
- the TTT GTG Zeo 7/67/7 gave only two high expressing colonies and a few low expressing colonies. This may indicate that this mutation is at the brink of the stringency that these cells can bear with a fixed concentration of Zeocin® antibiotic added to the culture medium.
- Thr9 and Phe9 mutations do not influence the translation efficiency of the Zeo mutants. Instead they reduce the functionality of the Zeocin® antibiotic resistance protein, by preventing an optimal interaction between the two halves of the Zeocin® antibiotic resistance protein (Dumas et al., 1994). This implies that more of the protein has to be produced to achieve resistance against the Zeocin® antibiotic in the culture medium. As a consequence, the entire cassette has to be transcribed at a higher level, eventually resulting in a higher d2EGFP expression level.
- This example further demonstrates the possibility to provide for fine-tuning of the stringency of the selection system of the invention, to achieve optimal expression levels of a protein of interest.
- the person skilled in the art will be capable of combining these and other possibilities within the concepts disclosed herein (e.g., mutate the Zeocin® antibiotic at position 9 to other amino acids, or mutate it in other positions; use a GTG or other start codon in a non-optimal translation initiation context for Zeocin® antibiotic or other selection markers; or mutate other selection markers to reduce their functionality, for instance use a sequence coding for a neomycin resistance gene having a mutation at amino acid residue 182 or 261 or both, see, e.g., WO 01/32901), and the like, to provide for such fine-tuning, and by simply testing determine a suitable combination of features for the selection marker, leading to enhanced expression of the polypeptide of interest.
- TTG as a start codon in the Zeo resistance gene ( FIG. 35A ), but followed by a spacer sequence ( FIG. 35B ).
- the Zeo open reading frame (with the spacer sequence) was amplified with primer pair TTGspaceBamHIF (SEQ ID NO:126): GAATTCGGATCCACC TTG GCGATCCAAAGACTGCCAAATCTAG and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-Bam HI, creating pZEO- TTGspace -d2EGFP.
- Thr threonine
- FIG. 35C TTG as a start codon in the Zeo resistance gene, instead of ATG ( FIG. 35A ), but with an additional mutation in the Zeo open reading frame at Pro9, with was replaced with threonine (Thr) ( FIG. 35C ).
- the Thr9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardTTG-Thr9 (SEQ ID NO:127): AATTGGATCCACC TTG GCCAAGTTGACCAGTGCCGTT ACC GTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO- TTG -Thr9-d2EGFP.
- the Phe9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardTTG-Phe9 (SEQ ID NO:128): AATTGGATCCACC TTG GCCAAGTTGACCAGTGCCGTT TTC GTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO- TTG -Phe9-d2EGFP.
- CHO-K1 cells were transfected with constructs that contain the TTG Zeo ( FIG. 35A ), TTGspace Zeo ( FIG. 35B ), TTG Thr9 Zeo ( FIG. 35C ) and TTG Phe9 Zeo ( FIG. 35D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene.
- constructs that contain the TTG Zeo ( FIG. 35A ), TTGspace Zeo ( FIG. 35B ), TTG Thr9 Zeo ( FIG. 35C ) and TTG Phe9 Zeo ( FIG. 35D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene.
- These four constructs were without STAR elements (Control) or with STAR elements 7 and 67 upstream of the CMV promoter and STAR 7 downstream from the d2EGFP gene ( FIG. 35 ).
- FIG. 36 shows that of the control constructs without STAR elements only the TTG Zeo construct without STAR elements gave colonies
- the higher stringencies of the novel TTG mutations correlate with higher mean fluorescence signals ( FIG. 36 ).
- the TTG Thr9 Zeo 7/67/7 and TTG Phe9 Zeo 7/67/7 constructs gave only two high expressing colonies each and a few low expressing colonies. This may indicate that these mutations are at the brink of the stringency that the cells can bear with a fixed concentration of Zeocin® antibiotic added to the culture medium.
- ATGs in the puromycin resistance gene there are three internal ATGs in the puromycin resistance gene, each of which codes for a methionine ( FIG. 28 , FIG. 37A ). These ATGs have to be eliminated (FIG. 37 B,C), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble puromycin resistance protein. More importantly, these ATGs will prevent efficient translation of the gene of interest, as represented by d2EGFP in this example for purposes of illustration.
- the methionines were changed into leucine, like in the Zeocin® antibiotic resistance protein (Example 8).
- the puromycin resistance protein open reading frame was first amplified with 4 primer pairs, generating 4 puromycin resistance protein fragments.
- the primer pairs were:
- PURO BamHI F (SEQ ID NO: 129): GATCGGATCCATGGTTACCGAGTACAAGCCCACGGT, PURO300 R LEU (SEQ ID NO: 130): CAGCCGGGAACCGCTCAACTCGGCCAGGCGCGGGC; and PURO300FLEU (SEQ ID NO: 131): CGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGCTGGAAGGCCTC, PURO600RLEU (SEQ ID NO: 132): AAGCTTGAATTCAGGCACCGGGCTTGCGGGTCAGGCACCAGGTC.
- CHO-K1 cells were transfected with the construct that contains the TTG Puro ( FIG. 38 ) gene as selection gene, cloned upstream of the d2EGFP reporter gene. Selection was under 10 ⁇ g/ml puromycin.
- the construct was without STAR elements (Control) or with STAR elements 7 and 67 upstream of the CMV promoter and STAR 7 downstream from the d2EGFP gene ( FIG. 38 ).
- FIG. 38 shows that the average d2EGFP fluorescence signal of 24 TTG Puro Control colonies was 37.9, of 24 TTG Puro colonies with STARs 7/67/7 75.5.
- the neomycin resistance protein open reading frame was entirely synthesized by a commercial provider (GeneArt, Germany), wherein all internal coding ATGs (for Met) where replaced by CTGs (coding for Leu), and non-coding ATGs were replaced such that a degenerated codon was used and hence no mutations in the protein sequence resulted; the synthesized sequence of the neomycin is given in SEQ ID NO:134.
- GeneArt GeneArt, Germany
- the synthesized neomycin gene was amplified with primer pairs NEO-F-HindIII (SEQ ID NO:136): GATCAAGCTTTTGGATCGGCCATTGAAACAAGACGGATTG and NEO EcoRI 800R (SEQ ID NO:137): AAGCTTGAATTCTCAGAAGAACTCGTCAAGAAGGCG.
- E. coli were used to test the functionality of the neomycin resistance protein from which all ATGs were removed.
- E. coli were transformed with the constructs that contain the GTG Neo ( FIG. 39B ) or TTG Neo ( FIG. 39C ) gene as selection gene. Selection took place by growing the bacteria on kanamycin. Only a functional neomycin resistance gene can give resistance against kanamycin. Transformation with either modified Neo gene resulted in the formation of E. coli colonies, from which the plasmid containing the gene could be isolated. This shows that the described, modified translation efficiencies of the Neomycin resistance mRNAs, as well as the removal of all ATGs from the Neo open reading frame result in the production of functional neomycin resistance protein.
- mutated neomycin resistance genes are incorporated in a multicistronic transcription unit of the invention, and used for selection with G418 or neomycin in eukaryotic host cells.
- DHFR-F-HindIII SEQ ID NO:140: GATCAAGCTTTTGTTCGACCATTGAACTGCATCGTC and DHFR-EcoRI-600-R (SEQ ID NO:141): AGCTTGAATTCTTAGTCTTTCTTCTCGTAGACTTC.
- E. coli bacteria were used to test the functionality of the dhfr protein from which all ATGs were removed.
- E. coli was transformed with the constructs that contain the GTG dhfr ( FIG. 40B ) or TTG dhfr ( FIG. 40C ) gene. Selection took place by growing the bacteria on trimethoprim (Sigma T7883-56). Only a functional dhfr gene can give resistance against trimethoprim. Transformation with either modified dhfr gene resulted in the formation of E. coli colonies, from which the plasmid containing the gene could be isolated. This shows that the described, modified translation efficiencies of the dhfr mRNAs, as well as the removal of all ATGs from the dhfr open reading frame result in the production of functional dhfr protein.
- the mutated dhfr genes are incorporated in a multicistronic transcription unit of the invention, and used for selection with methotrexate in eukaryotic host cells.
- the average d2EGFP fluorescence signal of 20 GTG Zeo colonies was 63.8, while the average d2EGFP signal of 20 GTGspace Zeo colonies was 185, demonstrating that also in PER.C6® cells the GTGspace Zeo has a higher translation stringency than the GTG Zeo mRNA.
- transfection with both the GTG Blasticidin and TTG Blasticidin gene resulted in colonies that expressed d2EGFP.
- the average d2EGFP fluorescence signal of 20 GTG Blasticidin colonies was 71.4, while the average d2EGFP fluorescence signal of 20 TTG Blasticidin colonies was 135, demonstrating that also in PER.C6® cells the TTG Blasticidin has a higher translation stringency than the GTG Blasticidin mRNA.
- This example demonstrates that the selection system of the invention can also be used in other cells than CHO cells.
- a TRAnscription Pause (TRAP) sequence is thought to, at least in part, prevent formation of antisense RNA or, to at least in part, prevent transcription to enter the protein expression unit (see WO 2004/055215).
- a TRAP sequence is functionally defined as a sequence which when placed into a transcription unit, results in a reduced level of transcription in the nucleic acid present on the 3′ side of the TRAP when compared to the level of transcription observed in the nucleic acid on the 5′ side of the TRAP, and non-limiting examples of TRAP sequences are transcription termination signals.
- the TRAP In order to function to prevent or decrease transcription to enter the transcription unit, the TRAP is to be placed upstream of a promoter driving expression of the transcription unit and the TRAP should be in a 5′ to 3′ direction. In order to prevent at least in part formation of antisense RNA, the TRAP should be located downstream of the open reading frame in a transcription unit and present in a 3′ to 5′ direction (that is, in an opposite orientation as the normal orientation of a transcriptional termination sequence that is usually present behind the open reading frame in a transcription unit). A combination of a TRAP upstream of the promoter in a 5′ to 3′ orientation and a TRAP downstream of the open reading frame in a 3′ to 5′ orientation is preferred. Adding a TRAP sequence to a STAR element improves the effects of STAR elements on transgene expression (see WO 2004/055215). Here we test the effects of the TRAP sequence in the context of the TTG Zeo resistance gene.
- the TTG Zeocin®-d2EGFP cassette that was flanked with STAR7 elements was modified by the addition of the SPA/pause TRAP sequence (see WO 2004/055215); SEQ ID NO:142), both upstream of the 5′ STAR7 (in 5′ to 3′ direction) and downstream of the 3′ STAR7 (in 3′ to 5′ direction) ( FIG. 43 ). Both STAR 7/7 and TRAP-STAR 7/7-TRAP containing vectors were transfected to CHO-K1. Stable colonies were isolated and the d2EGFP fluorescence intensities were measured. As shown in FIG.
- the average d2EGFP fluorescence signal of 23 TTG Zeo STAR 7/7 colonies was 455.1, while the average d2EGFP fluorescence signal of 23 TTG Zeo TRAP-STAR 7/7-TRAP colonies was 642.3.
- the average d2EGFP fluorescence signal in highest 5 TTG Zeo STAR 7/7 colonies was 705.1, while the average d2EGFP fluorescence signal of 5 TTG Zeo TRAP-STAR 7/7-TRAP colonies was 784.7.
- EpCAM antibody expression levels were analyzed in relation to the number of integrated EpCAM DNA copies.
- the construct that was tested was TTG-Zeo-Light Chain (LC)-TTG-Blas-Heavy Chain (HC), both expression units being under the control of the CMV promoter (see FIG. 44 ).
- This construct contained STAR 7 and 67 (see FIG. 44 ). Selection conditions were such that with 200 ⁇ g/ml Zeocin® antibiotic and 20 ⁇ g/ml Blasticidin in the culture medium no control colonies (no STARs) survived and only STAR 7/67/7 colonies survived.
- MTX methotrexate
- the construct that was tested was TTG-Zeo-Heavy Chain (HC)-TTG-Blas-Light Chain (LC), both expression units being under the control of the CMV promoter. Upstream of each CMV promoter STAR67 was positioned and START was used to flank the entire cassette (see also Example 13, FIG. 22 for such a construct).
- This construct was further modified by placing an SV40-dhfr cassette (a mouse dhfr gene under control of an SV40 promoter) between the HC and LC cassettes, upstream of the second STAR67 ( FIG. 45 ).
- CHO-K1 cells were transfected. Selection was done with 100 ⁇ g/ml Zeocin® antibiotic and 10 ⁇ g/ml Blasticidin in the culture medium.
- Colonies were isolated and propagated before measuring EpCAM expression levels. Six colonies that produced between 20 and 35 pg/cell/day were transferred to medium containing 100 nM MTX. This concentration was raised to 500 nM, 1000 nM and finally to 2000 nM with two weeks periods in between each step. After two weeks on 2000 nM MTX, EpCAM concentrations were measured. As shown in FIG. 45 , four colonies showed enhanced EpCAM production. Colony 13, from 22 to 30; colony 14, from 28 to 42; colony 17, from 20 to 67 and colony 19, from 37 to 67 pg/cell/day. Colonies 4 and 16 showed no enhanced EpCAM expression.
- FIG. 46 we indicate the promoters we tested in the context of the TTG Zeo selection marker.
- the tested plasmids consisted of the indicated control constructs with three different promoters and STAR constructs which were flanked with STAR 7 and STAR 67 at the 5′ end and STAR 7 at the 3′ end.
- the constructs were transfected to CHO-K1 cell and selection was performed with 200 ⁇ g/ml Zeocin® antibiotic in the culture medium. Up to 23 independent colonies were isolated and propagated before analysis of d2EGFP expression levels. As shown in FIG.
- DNA elements such as the HS4 hypersensitive site in the locus control region of the chicken ⁇ -globin locus (Chung et al., 1997), matrix attachment regions (MAR) (Stief et al., 1989) and a ubiquitous chromatin opening element (UCOE) (Williams et al., 2005) have been reported to have beneficial effects on gene expression when these DNA elements are incorporated in a vector.
- MAR matrix attachment regions
- UCOE ubiquitous chromatin opening element
- the 1.25 kb HS4 element was cloned into the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP by a three way ligation step to obtain a construct with a tandem of 2 HS4 elements (Chung et al., 1997). This step was done both for the 5′ and 3′ of the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP.
- the 2959 bp long chicken lysozyme MAR (Stief et al., 1989) was cloned 5′ and 3′ of the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP.
- the 2614 bp long UCOE (Williams et al., 2005) was a NotI-KpnI fragment, excised from a human BAC clone (RP11-93D5), corresponding to nucleotide 29449 to 32063. This fragment was cloned 5′ of the CMV promoter.
- the STAR construct contained STAR7 and STAR67 5′ of the CMV promoter and STAR73′ of the cassette. These four constructs, as well as the control construct without flanking chromatin control DNA elements, were transfected to CHO-K1 cells. Selection was performed by 200 ⁇ g/ml Zeocin® antibiotic in the culture medium. Colonies were isolated, propagated and d2EGFP expression levels were measured.
- constructs with all DNA elements resulted in the formation of d2EGFP expressing colonies.
- incorporation of 2 ⁇ HS4 elements and the UCOE did not result in the formation of colonies that displayed higher d2EGFP expression levels, in comparison with the control colonies.
- incorporation of the lysozyme MAR resulted in the formation of colonies that expressed d2EGFP significantly higher.
- the mean expression level induced by MAR containing constructs was four-fold higher than in the control colonies. Best results were obtained, however, by incorporating STAR 7 and 67 in the construct. An almost ten-fold increase in the mean d2EGFP expression level was observed, as compared to the control colonies.
- chromatin control DNA elements such as MARs can be used in the context of the selection system of the invention. However, the best results were obtained when STAR elements were used as chromatin control elements.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Gastroenterology & Hepatology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Provided is a DNA molecule comprising a multicistronic transcription unit coding for i) a selectable marker polypeptide functional in a eukaryotic host cell, and for ii) a polypeptide of interest, the polypeptide of interest having a translation initiation sequence separate from that of the selectable marker polypeptide, wherein the coding sequence for the polypeptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit, and the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation that decreases the translation efficiency of the selectable marker in a eukaryotic host cell. Also provided are methods for obtaining host cells expressing a polypeptide of interest, the host cells comprising the described DNA molecules. Further provided is the production of polypeptides of interest, comprising culturing host cells comprising the DNA molecules hereof.
Description
- This application is a continuation of co-pending U.S. patent application Ser. No. 12/807,133, filed Aug. 26, 2010, now U.S. Pat. No. ______, which application is a divisional of U.S. patent application Ser. No. 11/269,525, filed Nov. 7, 2005, pending, which application claims the benefit, under 35 U.S.C. §119(e), to U.S.
Provisional Patent Application 60/626,301, filed Nov. 8, 2004, and U.S.Provisional Patent Application 60/696,610, filed Jul. 5, 2005, the disclosure of each of which is hereby incorporated herein in its entirety by this reference. This application also claims the benefit of EP 04105593.0 filed Nov. 8, 2004, which is hereby incorporated herein in its entirety by this reference. - Pursuant to 37 C.F.R. §1.821(c) or (e), a file containing a PDF version of the Sequence Listing has been submitted concomitant with this application, the contents of which are hereby incorporated by reference. The transmittal documents of this application include a Request to Transfer CRF from the parent application.
- The disclosure relates to the field of molecular biology and biotechnology. More specifically the disclosure relates to means and methods for improving the selection of host cells that express proteins at high levels.
- Proteins can be produced in various host cells for a wide range of applications in biology and biotechnology, for instance as biopharmaceuticals. Eukaryotic and particularly mammalian host cells are preferred for this purpose for expression of many proteins, for instance when such proteins have certain posttranslational modifications such as glycosylation. Methods for such production are well established, and generally entail the expression in a host cell of a nucleic acid (also referred to as “transgene”) encoding the protein of interest. In general, the transgene together with a selectable marker gene is introduced into a precursor cell, cells are selected for the expression of the selectable marker gene, and one or more clones that express the protein of interest at high levels are identified, and used for the expression of the protein of interest.
- One problem associated with the expression of transgenes is that it is unpredictable, stemming from the high likelihood that the transgene will become inactive due to gene silencing (McBurney et al., 2002), and therefore many host cell clones have to be tested for high expression of the transgene.
- Methods to select recombinant host cells expressing relatively high levels of desired proteins are known.
- One method describes the use of selectable marker proteins with mutations in their coding sequence that diminishes, but does not destroy, the function of the marker (e.g., WO 01/32901). The rationale is that higher levels of the mutant marker expression are required when selection conditions are employed and therefore selection for high expression of the marker is achieved, therewith concomitantly selecting host cells that also express the gene of interest at high levels.
- Another method makes use of a selection marker gene under control of a promoter sequence that has been mutated such that the promoter has an activity level substantially below that of its corresponding wild type (U.S. Pat. No. 5,627,033).
- Another method describes the use of an impaired dominant selectable marker sequence, such as neomycin phosphotransferase with an impaired consensus Kozak sequence, to decrease the number of colonies to be screened and to increase the expression levels of a gene of interest that is co-linked to the dominant selectable marker (U.S. Pat. Nos. 5,648,267 and 5,733,779). In preferred embodiments therein, the gene of interest is placed within an (artificial) intron in the dominant selectable marker. The gene of interest and the dominant selectable marker are in different transcriptional cassettes and each contains its own eukaryotic promoter in this method (U.S. Pat. Nos. 5,648,267 and 5,733,779).
- Another method uses the principle of a selectable marker gene containing an intron that does not naturally occur within the selectable gene, wherein the intron is capable of being spliced in a host cell to provide mRNA encoding a selectable protein and wherein the intron in the selectable gene reduces the level of selectable protein produced from the selectable gene in the host cell (European Patent 0724639 B1).
- In yet another method, DNA constructs are used comprising a selectable gene positioned within an intron defined by a 5′ splice donor site comprising an efficient splice donor sequence such that the efficiency of splicing an mRNA having the splice donor site is between about 80-99%, and a 3′ splice acceptor site, and a product gene encoding a product of interest downstream of 3′ splice acceptor site, the selectable gene and the product gene being controlled by the same transcriptional regulatory region (U.S. Pat. No. 5,561,053).
- In certain methods, use is made of polycistronic expression vector constructs. An early report of use of this principle describes a polycistronic expression vector, containing sequences coding for both the desired protein and a selectable protein, which coding sequences are governed by the same promoter and separated by a translational stop and start signal codons (U.S. Pat. No. 4,965,196). In preferred embodiments in U.S. Pat. No. 4,965,196, the selectable marker is the amplifiable DHFR gene. In a particularly preferred embodiment of the system described in U.S. Pat. No. 4,965,196, the sequence coding for the selectable marker is downstream from that coding for the desired polypeptide, such that procedures designed to select for the cells transformed by the selectable marker will also select for particularly enhanced production of the desired protein.
- In further improvements based on the concept of multicistronic expression vectors, bicistronic vectors have been described for the rapid and efficient creation of stable mammalian cell lines that express recombinant protein. These vectors contain an internal ribosome entry site (IRES) between the upstream coding sequence for the protein of interest and the downstream coding sequence of the selection marker (Rees et al., 1996). Such vectors are commercially available, for instance, the pIRES1 vectors from Clontech (CLONTECHniques, October 1996). Using such vectors for introduction into host cells, selection of sufficient expression of the downstream marker protein then automatically selects for high transcription levels of the multicistronic mRNA, and hence a strongly increased probability of high expression of the protein of interest is envisaged using such vectors.
- Preferably in such methods, the IRES used is an IRES which gives a relatively low level of translation of the selection marker gene, to further improve the chances of selecting for host cells with a high expression level of the protein of interest by selecting for expression of the selection marker protein (see, e.g., international publication WO 03/106684).
- The disclosure uses a novel and unique concept for selecting host cells expressing high levels of polypeptides of interest, the concept referred to herein as “reciprocal interdependent translation.” It is unique with respect to the prior art approaches in that an extra level of regulation is used to fine-tune the amount of translation products from a multicistronic transcript, whereby the level of translation of a selectable marker polypeptide is lowered, thereby increasing the level of translation of the polypeptide of interest coded on the same multicistronic transcription unit, i.e., the expression level of the polypeptide of interest is directly and more or less reciprocally dependent from the expression level of the selectable marker polypeptide (see
FIG. 13 for a schematic view). In contrast, approaches using an IRES between the coding sequences for the polypeptide of interest and the selectable marker polypeptide (e.g., Rees et al., 1996), involve independent translation of both polypeptides, so that in those approaches there is no direct effect of the translation efficiency of the selectable marker polypeptide on that of the sequence coding for the polypeptide of interest. - In one aspect, provided is a DNA molecule comprising a multicistronic transcription unit coding for i) a selectable marker polypeptide functional in a eukaryotic host cell, and for ii) a polypeptide of interest, the polypeptide of interest having a translation initiation sequence (and start codon) separate from that of the selectable marker polypeptide, characterized in that the coding sequence for the polypeptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit, and in that the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation that decreases the translation efficiency of the selectable marker in a eukaryotic host cell. Preferably the translation initiation efficiency of the selectable marker is decreased. In certain embodiments, the decreased translation initiation efficiency is brought about by having a non-optimal translation start sequence of the selectable marker polypeptide.
- In certain embodiments thereof, the translation start sequence in the coding strand for the selectable marker polypeptide comprises an ATG sequence defining a start codon, the ATG sequence being in a non-optimal context for translation initiation. This results in a decreased use of this ATG as start codon, when compared to an ATG start codon in an optimal context.
- In a more preferred embodiment, the translation start sequence in the coding strand for the selectable marker polypeptide comprises a start codon different from an ATG start codon, such as one of GTG, TTG, CTG, ATT, or ACG sequence, the first two thereof being the most preferred. Such non-ATG start codons preferably are flanked by sequences providing for relatively good recognition of the non-ATG sequences as start codons, such that at least some ribosomes start translation from these start codons, i.e., the translation start sequence preferably comprises the sequence ACC[non-ATG start codon]G or GCC[non-ATG start codon]G.
- In certain embodiments, the sequence encoding the selectable marker polypeptide has no ATG sequence in the coding strand following the sequence coding for the start codon of the selectable marker polypeptide up to the sequence encoding the stop codon thereof, and not in any sequences in the same strand between the stop codon and the start codon of the polypeptide of interest (which start codon preferably is defined by an ATG in the same coding strand).
- In certain embodiments, any ATG sequence present in frame and coding for methionine in the sequence coding for the wild-type selectable marker polypeptide has been mutated to code for valine, threonine, isoleucine or leucine.
- In certain embodiments, the selectable marker protein provides resistance against lethal and/or growth-inhibitory effects of a selection agent, such as an antibiotic.
- The coding sequence of the polypeptide of interest may comprise an optimal translation start sequence.
- Further provided are expression cassettes comprising a DNA molecule hereof, which expression cassettes further comprise a promoter upstream of the multicistronic expression unit and being functional in a eukaryotic host cell for initiation transcription of the multicistronic expression unit, and the expression cassettes further comprising a transcription termination sequence downstream of the multicistronic expression unit.
- In certain embodiments, such expression cassettes further comprise at least one chromatin control element chosen from the group consisting of a matrix or scaffold attachment region (MAR/SAR), an insulator sequence, a ubiquitous chromatin opener element (UCOE), and an anti-repressor sequence. Anti-repressor sequences are most preferred in this aspect, and in preferred embodiments, the anti-repressor sequences are chosen from the group consisting of: a) any one SEQ ID NO:1 through SEQ ID NO:66; b) fragments of any one of SEQ ID NO:1 through SEQ ID NO:66, wherein the fragments have anti-repressor activity; c) sequences that are at least 70% identical in nucleotide sequence to a) or b) wherein the sequences have anti-repressor activity; and d) the complement to any one of a) to c). In certain embodiments, the anti-repressor sequences are chosen from the group consisting of: STAR67 (SEQ ID NO:66), STAR7 (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), and functional fragments or derivatives of these STAR sequences. In certain embodiments, the expression cassette comprises STAR67, or a functional fragment or derivative thereof, positioned upstream of the promoter driving expression of the multicistronic gene. In certain embodiments, the multicistronic gene is flanked on both sides by at least one anti-repressor sequence. In certain embodiments, expression cassettes are provided hereof, comprising in 5′ to 3′ order: anti-repressor sequence A-anti-repressor sequence B-[promoter-multicistronic transcription unit hereof (encoding the functional selectable marker protein and downstream thereof the polypeptide of interest)-transcription termination sequence]-anti-repressor sequence C, wherein A, B and C may be the same or different.
- In certain embodiments, the polypeptide of interest is a part of a multimeric protein, for example a heavy or light chain of an immunoglobulin.
- Also provided are DNA molecules comprising a sequence encoding a functional selectable marker polypeptide, characterized in that such DNA molecules comprise a mutation that decreases the translation efficiency of the functional selectable marker polypeptide in a eukaryotic host cell. Preferably, such a DNA molecule: i) has a non-optimal translation start sequence followed by an otherwise functional selectable marker coding sequence, and ii) in the coding strand of the sequence defining the selectable marker polypeptide downstream of the non-optimal start codon is devoid of ATG sequences.
- Also provided are host cells comprising DNA molecules hereof.
- Further provided are methods for generating host cells expressing a polypeptide of interest, the method comprising the steps of: introducing into a plurality of precursor host cells an expression cassette hereof, culturing the cells under conditions selecting for expression of the selectable marker polypeptide, and selecting at least one host cell producing the polypeptide of interest.
- The disclosure aims at providing improved means and methods for selection of host cells expressing high levels of proteins of interest.
- Provided are methods for producing a polypeptide of interest, the methods comprising culturing a host cell, the host cell comprising an expression cassette hereof, and expressing the polypeptide of interest from the expression cassette. In certain embodiments thereof, the polypeptide of interest is further isolated from the host cells and/or from the host cell culture medium.
- Provided are RNA molecules having the sequence of a transcription product of a DNA molecule hereof.
- Provided are functional selectable marker polypeptides lacking methionine in their amino acid sequence, which polypeptides are obtainable by expression from certain DNA molecules according to the dislcosure.
-
FIG. 1 . Schematic diagram of transcription units without and with STAR elements. A) bicistronic gene containing (from 5′ to 3′) a transgene (encoding for example the d2EGFP gene), an IRES, and a selectable marker (zeo, conferring Zeocin® antibiotic resistance) under control of the CMV promoter. The expression unit has the SV40 transcriptional terminator at its 3′ end (t). The name of the construct is CMV-d2EGFP-ires-Zeo (CMV Control). B) construct as in A, but now STAR 67 is cloned upstream of the CMV promoter. The name of the construct is STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67). C) construct as in B, but upstream and downstream STAR7 elements are cloned to flank the entire construct. The name of the construct is STAR7-STAR67-CMV-d2EGFP-ires-Zeo-STAR7 (CMV-STAR67 7/7) -
FIG. 2 . STAR67 improves CMV driven d2EGFP expression in CHO cells. The mean of the d2EGFP signal for 10 independent stable colonies is plotted for the indicated constructs in CHO cells. A) CMV Control; B) STAR67-CMV. X(10): average d2EGFP expression levels of the ten colonies. See Example 2 for details. -
FIG. 3 . STAR67 improves EF1α driven d2EGFP expression in CHO cells. Same asFIG. 2 , but now with EF1α promoter. A) EF1α Control; B) STAR67-EF1α. -
FIG. 4 . STAR67 improves UB6 driven d2EGFP expression in CHO cells. Same asFIGS. 2 and 3 , but now with UB6 promoter. A) UB6 Control; B) STAR67-UB6. -
FIG. 5 . STAR67 improves CMV driven d2EGFP expression in PER.C6® cells. Same asFIG. 2 , but now in PER.C6® cells. A) CMV Control; B) STAR67-CMV. -
FIG. 6 . STAR67 improves EF1α driven d2EGFP expression in PER.C6® cells. Same asFIG. 3 , but now in PER.C6® cells. A) EF1α Control; B) STAR67-EF1α. -
FIG. 7 . STAR67 improves UB6 driven d2EGFP expression in PER.C6® cells. Same asFIG. 4 , but now in PER.C6® cells. A) UB6 Control; B) STAR67-UB6. -
FIG. 8 . STAR67 improves SV40 driven d2EGFP expression in CHO-K1 cells in combination with another STAR element. Similar asFIG. 2 , but now using the SV40 promoter in CHO cells, and the indicated constructs. The mean of the d2EGFP signal is plotted for 18-20 independentstable colonies 60 days after transfection. A) SV40 Control, SV40-STAR67, SV40-STAR7/7; B) SV40 Control and SV40-STAR67 7/7. See Example 4 for details. -
FIG. 9 . STAR67 improves UB6 driven expression levels of the EpCAM antibody in CHO-K1 cells. See Example 5 for details. A) constructs without STAR elements; B) constructs with STAR67 upstream of promoter and flanking STAR7 elements. The anti-EpCAM antibody concentration is presented as pg/cell/day. X(18): average production level of the 18 colonies. -
FIG. 10 . STAR67 is not an enhancer blocker, whereas 6 and 7 are. See Example 6 for details.STAR -
FIG. 11 . STAR67 enhances UB6 and CMV-driven antibody expression levels in stably transfected CHO cells. See Example 7 for details. A) single DNA molecule containing anti-EpCAM heavy chain (HC) and light chain (LC), each behind a promoter, and each linked to a different selectable marker gene (simultaneous selection was used for both markers): construct without STAR elements. No colonies were found: B) same construct with STAR67 between the two promoters. The anti-EpCAM antibody concentration is presented as pg/cell/day. X(19): average production level of the 19 colonies. -
FIG. 12 . Schematic representation of the use of a selection marker gene (Zeocin® antibiotic resistance gene) hereof. A. wild-type Zeocin® antibiotic resistance gene, having its normal translation initiation site (ATG start codon) and one internal ATG codon, which codes for methionine. B. mutant Zeocin® resistance gene, wherein the internal ATG has been mutated into a codon for leucine; this mutant is a functional Zeocin® resistance gene. C. same as B, but comprising a mutated translation initiation site, wherein the context of the ATG start codon has been mutated to decrease the translation initiation. D. same as B, but comprising a mutated start codon (GTG). E. same as B, but with a TTG start codon. The numbers under the figures C-E schematically indicate a relative amount of initiation frequency (under the start codon) and “scan-through” frequency (under the coding sequence) by the ribosomes, but only in a semi-quantitative manner, i.e., they indicate the efficiency of translation initiation compared to each other, but the qualitative numbers may differ completely: the numbers only serve to explain the invention. See Example 8 for details. -
FIG. 13 . Schematic representation of a multicistronic transcription unit hereof, with more or less reciprocal interdependent translation efficiency. Explanation as forFIG. 12 , but now a dEGFP gene (here exemplifying a gene of interest) has been placed downstream of the selectable marker polypeptide coding sequence. The Zeocin® antibiotic resistance gene comprises the internal Met→Leu mutation (seeFIG. 12B ). See Example 9 for details. -
FIG. 14 . Results of selection systems hereof, with and without STAR elements. A. Zeocin® antibiotic resistance gene with ATG start codon in bad context (referred to as “ATGmut” in the picture, but including a spacer sequence behind the ATG in the bad context, so in the text generally referred to as “ATGmut/space”). B. Zeocin® antibiotic resistance gene with GTG start codon. C. Zeocin® antibiotic resistance gene with TTG start codon. d2EGFP signal for independent colonies is shown on the vertical axis. See Example 9 for details. -
FIG. 15 . Results of selection system hereof in an up-scaled experiment (A), and comparison with selection system according to prior art using an IRES (B). d2EGFP signal for independent colonies is shown on the vertical axis. See Example 10 for details. -
FIG. 16 . Results of selection system with multicistronic transcription unit hereof, using blasticidin as a selectable marker. A. blasticidin resistance gene mutated to comprise a GTG start codon. B. blasticidin resistance gene mutated to comprise a TTG start codon. The blasticidin resistance gene has further been mutated to remove all internal ATG sequences. d2EGFP signal for independent colonies is shown on the vertical axis. See Example 11 for details. -
FIG. 17 . Stability of expression of several clones with a multicistronic transcription unit hereof (including a Zeocin® antibiotic resistance gene with TTG start codon). Selection pressure (100 μg/ml Zeocin® antibiotic) was present during the complete experiment. d2EGFP signal for independent colonies is shown on the vertical axis. See Example 12 for details. -
FIG. 18 . AsFIG. 17 , but Zeocin® antibiotic concentration was lowered to 20 μg/ml after establishment of clones. -
FIG. 19 . AsFIG. 17 , but Zeocin® antibiotic was absent from culture medium after establishment of clones. -
FIG. 20 . Expression of an antibody (anti-EpCAM) using the selection system with the multicistronic transcription unit hereof. The heavy chain (HC) and light chain (LC) are the polypeptide of interest in this example. Each of these is present in a separate transcription unit, which are both on a single nucleic acid molecule in this example. The HC is preceded by the Zeocin® antibiotic resistance gene coding for a selectable marker polypeptide, while the LC is preceded by the blasticidin resistance gene coding for a selectable marker polypeptide. Both resistance genes have been mutated to comprise an ATG start codon in a non-optimal context (“mutATG” in Figure, but including a spacer sequence, and hence in the text generally referred to as “ATGmut/space”). Each of the multicistronic transcription units is under control of a CMV promoter. Constructs with STAR sequences as indicated were compared to constructs without STAR sequences. The antibody levels obtained when these constructs were introduced into host cells are given on the vertical axis in pg/cell/day for various independent clones. See Example 13 for details. -
FIG. 21 . AsFIG. 20 , but both the selection marker genes have been provided with a GTG start codon. See Example 13 for details. -
FIG. 22 . AsFIG. 20 , but both the selection marker genes have been provided with a TTG start codon. See Example 13 for details. -
FIG. 23 . Stability of expression in sub-clones in the absence of selection pressure (after establishing colonies under selection pressure, some colonies where sub-cloned in medium containing no Zeocin® antibiotic). See Example 12 for details. -
FIG. 24 . Copy-number dependency of expression levels of an embodiment of the invention. See Example 12 for details. -
FIG. 25 . AsFIG. 12 , but for the blasticidin resistance gene. None of the four internal ATGs in this gene are in frame coding for a methionine, and therefore the redundancy of the genetic code was used to mutate these ATGs without mutating the internal amino acid sequence of the encoded protein. -
FIG. 26 . Coding sequence of the wild-type Zeocin® antibiotic resistance gene (SEQ ID NO:108). Bold ATGs code for methionine. The first bold ATG is the start codon. -
FIG. 27 . Coding sequence of the wild-type blasticidin resistance gene (SEQ ID NO:110). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame. -
FIG. 28 . Coding sequence of the wild-type puromycin resistance gene (SEQ ID NO:112). Bold ATGs code for methione. The first bold ATG is the start codon. -
FIG. 29 . Coding sequence of the wild-type mouse DHFR gene (SEQ ID NO:114). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame. -
FIG. 30 . Coding sequence of the wild-type hygromycin resistance gene (SEQ ID NO:116). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame. -
FIG. 31 . Coding sequence of the wild-type neomycin resistance gene (SEQ ID NO:118). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame. -
FIG. 32 . Coding sequence of the wild-type human glutamine synthase (GS) gene (SEQ ID NO:120). Bold ATGs code for methione. The first bold ATG is the start codon. Other ATGs in the sequence are underlined: these internal ATGs do not code for methionine, because they are not in frame. -
FIG. 33 . Schematic representation of some further modified Zeocin® antibiotic resistance selection marker genes with a GTG start codon hereof, allowing for further fine-tuning of the selection stringency. See Example 14 for details. -
FIG. 34 . Results with expression systems containing the further modified Zeocin® antibiotic resistance selection marker genes. See Example 14 for details. Dots indicate individual data points: lines indicate the average expression levels; used constructs (see alsoFIG. 33 ) are indicated on the horizontal axis (the addition of 7/67/7 at the end of the construct name indicates the presence of 7 and 67 upstream of the promoter and START downstream of the transcription termination site), and schematically depicted above the graph; vertical axis indicates d2EGFP signal.STAR sequences -
FIG. 35 . Schematic representation of some further modified Zeocin® antibiotic resistance selection marker genes with a TTG start codon hereof, allowing for further fine-tuning of the selection stringency. See Example 15 for details. -
FIG. 36 . Results with expression systems containing the further modified Zeocin® antibiotic resistance selection marker genes. See Example 15 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 37 . AsFIG. 12 , but for the puromycin resistance gene. All three internal ATGs code for methione (panel A), and are replaced by CTG sequences coding for leucine (panel B). See Example 16 for details. -
FIG. 38 . Results with expression constructs containing the puromycin resistance gene with a TTG start codon and no internal ATG codons. See Example 16 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 39 . AsFIG. 12 , but for the neomycin resistance gene. See Example 17 for details. A. wild-type neomycin resistance gene; ATG sequences are indicated, ATGs coding for methionine are indicated by Met above the ATG. B. neomycin resistance gene without ATG sequences, and with a GTG start codon. C. neomycin resistance gene without ATG sequences, and with a TTG start codon. -
FIG. 40 . AsFIG. 12 , but for the dhfr gene. See Example 18 for details. A. wild-type dhfr gene; ATG sequences are indicated, ATGs coding for methionine are indicated by Met above the ATG. B. dhfr gene without ATG sequences, and with a GTG start codon. C. dhfr gene without ATG sequences, and with a TTG start codon. -
FIG. 41 . Results with expression constructs (Zeocin® selectable marker) hereof in PER.C6® cells. See Example 20 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 42 . Results with expression constructs (blasticidin selectable marker) hereof in PER.C6® cells. See Example 20 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 43 . Results with expression constructs hereof, further comprising a transcription pause (TRAP) sequence. See Example 21 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 44 . Copy-number dependency of expression of an antibody using transcription units hereof. See Example 22 for details. -
FIG. 45 . Antibody expression from colonies containing expression constructs hereof, wherein the copy number of the expression constructs is amplified by methotrexate. See Example 23 for details. White bars: selection with Zeocin® antibiotic and blasticidin; black bars: selection with Zeocin® antibiotic, blasticidin and methotrexate (MTX). Numbers of tested colonies are depicted on the horizontal axis. -
FIG. 46 . Results with different promoters. See Example 24 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 47 . Results with different STAR elements. See Example 25 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph; vertical axis indicates d2EGFP signal. -
FIG. 48 . Results with other chromatin control elements. See Example 26 for details. Dots indicate individual data points; lines indicate the average expression levels; used constructs are indicated on the horizontal axis, and schematically depicted above the graph (black triangles indicate different tested chromatin control elements); vertical axis indicates d2EGFP signal. - In one aspect, provided is a DNA molecule comprising a multicistronic transcription unit coding for i) a selectable marker polypeptide functional in a eukaryotic host cell, and for ii) a polypeptide of interest, the polypeptide of interest having a translation initiation sequence separate from that of the selectable marker polypeptide, characterized in that the coding sequence for the polypeptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit, and the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation that decreases the translation efficiency of the selectable marker in a eukaryotic host cell. Such a DNA molecule can be used hereof for obtaining eukaryotic host cells expressing high levels of the polypeptide of interest, by selecting for the expression of the selectable marker polypeptide. Subsequently or simultaneously, one or more host cell(s) expressing the polypeptide of interest can be identified, and further used for expression of high levels of the polypeptide of interest.
- The term “monocistronic gene” is defined as a gene capable of providing a RNA molecule that encodes one polypeptide. A “multicistronic transcription unit,” also referred to as multicistronic gene, is defined as a gene capable of providing an RNA molecule that encodes at least two polypeptides. The term “bicistronic gene” is defined as a gene capable of providing a RNA molecule that encodes two polypeptides. A bicistronic gene is therefore encompassed within the definition of a multicistronic gene. A “polypeptide” as used herein comprises at least five amino acids linked by peptide bonds, and can for instance be a protein or a part, such as a subunit, thereof. Mostly, the terms polypeptide and protein are used interchangeably herein. A “gene” a “transcription unit” as used in the disclosure can comprise chromosomal DNA, cDNA, artificial DNA, combinations thereof, and the like. Transcription units comprising several cistrons are transcribed as a single mRNA. In contrast to approaches used in the prior art, in the multicistronic transcription units of the invention, translation of the second coding region (i.e., that for the protein of interest) present on that RNA is not dependent on the use of translation reinitiation sites or internal ribosome entry sites for translation initiation of the second coding region, but rather it is dependent on the efficiency of translation of the first coding region (i.e., that for the selectable marker protein) in a more or less reciprocal manner: the higher the translation level of the selectable marker, the lower the transcription level of the protein of interest and vice versa.
- A multicistronic transcription unit may be a bicistronic transcription unit coding from 5′ to 3′ for a selectable marker polypeptide and a polypeptide of interest. Hence, the polypeptide of interest is encoded downstream from the coding sequence for the selectable marker polypeptide.
- It is also possible to include more coding sequences on the same transcription unit, downstream of the (first) polypeptide of interest, e.g., coding for a second selectable marker polypeptide or for a second polypeptide of interest. Such an extra coding sequence downstream of the coding sequence for the first polypeptide of interest then preferably is preceded by a sequence encoding a translation reinitiation site or internal ribosome entry site (IRES), such that also the second polypeptide of interest or the second selection marker polypeptide is translated, in this case independently from the earlier two coding sequences. In such an embodiment, it is for instance possible to have the coding sequences for subunits of a multimeric protein, e.g., a heavy and a light chain of an immunoglobulin, encoded as a first and second polypeptide of interest (in any order), to code for subunits of a multimeric protein within a single multicistronic expression unit. It is however preferred to use separate transcription units for the expression of different polypeptides of interest, also when these form part of a multimeric protein (see, e.g., Example 13: the heavy and light chain of an antibody each are encoded by a separate transcription unit, each of these expression units being a bicistronic expression unit hereof).
- The DNA molecules can be present in the form of double stranded DNA, having with respect to the selectable marker polypeptide and the polypeptide of interest a coding strand and a non-coding strand, the coding strand being the strand with the same sequence as the translated RNA, except for the presence of T instead of U. Hence, an AUG start codon is coded for in the coding strand by an ATG sequence, and the strand containing this ATG sequence corresponding to the AUG start codon in the RNA is referred to as the coding strand of the DNA. It will be clear to the skilled person that start codons or translation initiation sequences are in fact present in an RNA molecule, but that these can be considered equally embodied in a DNA molecule coding for such an RNA molecule; hence, wherever the disclosure refers to a start codon or translation initiation sequence, the corresponding DNA molecule having the same sequence as the RNA sequence but for the presence of a T instead of a U in the coding strand of the DNA molecule is meant to be included, and vice versa, except where explicitly specified otherwise. In other words, a start codon is for instance an AUG sequence in RNA, but the corresponding ATG sequence in the coding strand of the DNA is referred to as start codon as well in the disclosure. The same is used for the reference of “in frame” coding sequences, meaning triplets (three bases) in the RNA molecule that are translated into an amino acid, but also to be interpreted as the corresponding trinucleotide sequences in the coding strand of the DNA molecule.
- The selectable marker polypeptide and the polypeptide of interest encoded by the multicistronic gene each have their own translation initiation sequence, and therefore each have their own start codon (as well as stop codon), i.e., they are encoded by separate open reading frames.
- The term “selection marker” or “selectable marker” is typically used to refer to a gene and/or protein whose presence can be detected directly or indirectly in a cell, for example, a polypeptide that inactivates a selection agent and protects the host cell from the agent's lethal or growth-inhibitory effects (e.g., an antibiotic resistance gene and/or protein). Another possibility is that the selection marker induces fluorescence or a color deposit (e.g., green fluorescent protein (GFP) and derivatives (e.g., d2EGFP), luciferase, lacZ, alkaline phosphatase, etc.), which can be used for selecting cells expressing the polypeptide inducing the color deposit, e.g., using a fluorescence activated cell sorter (FACS) for selecting cells that express GFP. Preferably, the selectable marker polypeptide hereof provides resistance against lethal and/or growth-inhibitory effects of a selection agent. The selectable marker polypeptide is encoded by the DNA of the invention. The selectable marker polypeptide is functional in a eukaryotic host cell, and hence capable of being selected for in eukaryotic host cells. Any selectable marker polypeptide fulfilling this criterion can in principle be used according to the disclosure. Such selectable marker polypeptides are well known in the art and routinely used when eukaryotic host cell clones are to be obtained, and several examples are provided herein. In certain embodiments, a selection marker used for the invention is Zeocin® antibiotic. In other embodiments, blasticidin is used. A person skilled in the art will know that other selection markers are available and can be used, e.g., neomycin, puromycin, bleomycin, hygromycin, etc. In other embodiments, kanamycin is used. In yet other embodiments, the DHFR gene is used as a selectable marker, which can be selected for by methotrexate, especially by increasing the concentration of methotrexate cells can be selected for increased copy numbers of the DHFR gene. Similarly, the glutamine synthetase (GS) gene can be used, for which selection is possible in cells having insufficient GS (e.g., NS-0 cells) by culturing in media without glutamine, or alternatively in cells having sufficient GS (e.g., CHO cells) by adding an inhibitor of GS, methionine sulphoximine (MSX). Other selectable marker genes that could be used, and their selection agents, are for instance described in table 1 of U.S. Pat. No. 5,561,053, incorporated by reference herein; see also Kaufman, Methods in Enzymology, 185:537-566 (1990), for a review of these.
- When two multicistronic transcription units are to be selected for hereof in a single host cell, each one preferably contains the coding sequence for a different selectable marker, to allow selection for both multicistronic transcription units. Of course, both multicistronic transcription units may be present on a single nucleic acid molecule or alternatively each one may be present on a separate nucleic acid molecule.
- The term “selection” is typically defined as the process of using a selection marker/selectable marker and a selection agent to identify host cells with specific genetic properties (e.g., that the host cell contains a transgene integrated into its genome). It is clear to a person skilled in the art that numerous combinations of selection markers are possible. One antibiotic that is particularly advantageous is Zeocin®, because the Zeocin® antibiotic-resistance protein (Zeocin®-R) acts by binding the drug and rendering it harmless. Therefore, it is easy to titrate the amount of drug that kills cells with low levels of Zeocin®-R expression, while allowing the high-expressors to survive. All other antibiotic-resistance proteins in common use are enzymes, and thus act catalytically (not 1:1 with the drug). Hence, the antibiotic Zeocin® is a preferred selection marker. However, the invention also works with other selection markers.
- A selectable marker polypeptide is the protein that is encoded by the nucleic acid, which polypeptide can be detected, for instance because it provides resistance to a selection agent such as an antibiotic. Hence, when an antibiotic is used as a selection agent, the DNA encodes a polypeptide that confers resistance to the selection agent, which polypeptide is the selectable marker polypeptide. DNA sequences coding for such selectable marker polypeptides are known, and several examples of wild-type sequences of DNA encoding selectable marker proteins are provided herein (
FIGS. 26-32 ). It will be clear that mutants or derivatives of selectable markers can also be suitably used, and are therefore included within the scope of the term “selectable marker polypeptide,” as long as the selectable marker protein is still functional. - For convenience and as generally accepted by the skilled person, in many publications as well as herein, often the gene and protein encoding the resistance to a selection agent is referred to as the “selectable agent (resistance) gene” or “selection agent (resistance) protein,” respectively, although the official names may be different, e.g., the gene coding for the protein conferring resistance to neomycin (as well as to G418 and kanamycin) is often referred to as neomycin (resistance) (or neo') gene, while the official name is aminoglycoside 3′-phosphotransferase gene.
- It is beneficial to have low levels of expression of the selectable marker polypeptide, so that stringent selection is possible. In the disclosure this is brought about by using a selectable marker coding sequence with a non-optimal translation efficiency. Upon selection, only cells that have nevertheless sufficient levels of selectable marker polypeptide will be selected, meaning that such cells must have sufficient transcription of the multicistronic transcription unit and sufficient translation of the selectable marker polypeptide, which provides a selection for cells where the multicistronic transcription unit has been integrated or otherwise present in the host cells at a place where expression levels from this transcription unit are high. The (more or less) reciprocally interdependent nature of the translation of the polypeptide of interest from that of the translation efficiency of the marker, ensures high translation levels of the polypeptide of interest. This high level of translation is superimposed on the high transcription levels selected for by the stringent selection conditions (and resulting from the use of a strong promoter in the host cells to drive transcription of the multicistronic transcription unit).
- The DNA molecules hereof have the coding sequence for the selectable marker polypeptide upstream of the coding sequence for the polypeptide of interest, to provide for a multicistronic transcript with reciprocally interdependent translation of the polypeptide of interest and the selectable marker polypeptide. Hence, the multicistronic transcription unit comprises in the 5′ to 3′ direction (both in the transcribed strand of the DNA and in the resulting transcribed RNA) the coding sequence for the selectable marker polypeptide and the sequence encoding the polypeptide of interest.
- The coding regions of the multicistronic gene are preferably translated from the cap-dependent ORF, and therefore the selectable marker polypeptide would in principle also be produced in abundance. To prevent such high translation of the selectable marker cistron, hereof the nucleic acid sequence coding for the selectable marker polypeptide comprises a mutation (within the open reading frame (ORF) or in the translation initiation sequences preceding the ORF, or in both) that decreases the translation efficiency of the selectable marker polypeptide in a eukaryotic host cell. The translation efficiency should be lower than that of the corresponding wild-type sequence in the same cell, i.e., the mutation should result in less polypeptide per cell per time unit, and hence less selectable marker polypeptide. This can be detected using routine methods known to the person skilled in the art. For instance in the case of antibiotic selection the mutation will result in less resistance than obtained with the sequence having no such mutation and hence normal translation efficiency, which difference can easily be detected by determining the number of surviving colonies after a normal selection period, which will be lower when a translation efficiency decreasing mutation is present. As is well known to the person skilled in the art there are a number of parameters that indicate the expression level marker polypeptide such as, the maximum concentration of selection agent to which cells are still resistant, number of surviving colonies at a given concentration, growth speed (doubling time) of the cells in the presence of selection agent, combinations of the above, and the like.
- In certain embodiments, the mutation that decreases the translation efficiency, is a mutation that decreases the translation initiation efficiency. This can, for example, be established by providing the selectable marker polypeptide coding sequence with a non-optimal translation start sequence. For example, the translation initiation efficiency of the selectable marker gene in eukaryotic cells can be suitably decreased hereof by mutating the start codon and/or the nucleotides in positions −3 to −1 and +4 (where the A of the ATG start codon is nt+1), for instance in the coding strand of the corresponding DNA sequence, to provide a non-optimal translation start sequence. A translation start sequence is often referred to in the field as “Kozak sequence,” and an optimal Kozak sequence is RCCATGG, the start codon underlined, R being a purine, i.e., A or G (see Kozak M, 1986, 1987, 1989, 1990, 1997, 2002). Hence, besides the start codon itself, the context thereof, in particular nucleotides −3 to −1 and +4, are relevant, and an optimal translation start sequence comprises an optimal start codon (i.e., ATG) in an optimal context (i.e., the ATG directly preceded by RCC and directly followed by G). A non-optimal translation start sequence is defined herein as any sequence that gives at least some detectable translation in a eukaryotic cell (detectable because the selection marker polypeptide is detectable), and not having the consensus sequence RCCATGG (start codon underlined). Translation by the ribosomes starts preferably at the first start codon it encounters when scanning from the 5′-cap of the mRNA (scanning mechanism), and is most efficient when an optimal Kozak sequence is present (see Kozak M, 1986, 1987, 1989, 1990, 1997, 2002). However, in a small percentage of events, non-optimal translation initiation sequences are recognized and used by the ribosome to start translation. The disclosure makes use of this principle, and allows for decreasing and even fine-tuning of the amount of translation and hence expression of the selectable marker polypeptide, which can therefore be used to increase the stringency of the selection system.
- In one embodiment, the ATG start codon of the selectable marker polypeptide (in the coding strand of the DNA, coding for the corresponding AUG start codon in the RNA transcription product) is left intact, but the positions at −3 to −1 and +4 are mutated such that they do not fulfill the optimal Kozak sequence any more, e.g., by providing the sequence TTTATGT as the translation start site (ATG start codon underlined). It will be clear that other mutations around the start codon at positions −3 to −1 and/or +4 could be used with similar results using the teaching of the disclosure, as can be routinely and easily tested by the person skilled in the art. The idea of this first embodiment is that the ATG start codon is placed in a “non-optimal” context for translation initiation.
- In another embodiment, the ATG start codon itself of the selectable marker polypeptide is mutated. This will in general lead to even lower levels of translation initiation than the first embodiment. The ATG start codon in the second embodiment is mutated into another codon, which has been reported to provide some translation initiation, for instance to GTG, TTG, CTG, ATT, or ACG (collectively referred to herein as “non-optimal start codons”). In preferred embodiments, the ATG start codon is mutated into a GTG start codon. This provides still lower expression levels (lower translation) than with the ATG start codon intact but in a non-optimal context. More preferably, the ATG start codon is mutated to a TTG start codon, which provides even lower expression levels of the selectable marker polypeptide than with the GTG start codon (Kozak M, 1986, 1987, 1989, 1990, 1997, 2002; see also examples 9-13 herein).
- For the second embodiment, i.e., where a non-ATG start codon is used, it is preferred to provide an optimal context for such a start codon, i.e., the non-optimal start codons are preferably directly preceded by nucleotides RCC in positions −3 to −1 and directly followed by a G nucleotide (position +4). However, it has been reported that using the sequence TTTGTGG (start codon underlined), some initiation is observed at least in vitro, so although strongly preferred it may not be absolutely required to provide an optimal context for the non-optimal start codons.
- ATG sequences within the coding sequence for a polypeptide, but excluding the ATG start codon, are referred to as “internal ATGs,” and if these are in frame with the ORF and therefore code for methionine, the resulting methionine in the polypeptide is referred to as an “internal methionine.” The coding region (following the start codon, not necessarily including the start codon) may code for the selectable marker polypeptide preferably devoid of any ATG sequence in the coding strand of the DNA, up to (but not including) the start codon of the polypeptide of interest (obviously, the start codon of the polypeptide of interest may be, and in fact preferably is, an ATG start codon). This can be established by mutating any such ATG sequence within the coding sequence of the selectable marker polypeptide, following the start codon thereof (as is clear from the teaching above, the start codon of the selectable marker polypeptide itself may be an ATG sequence, but not necessarily so). To this purpose, the degeneracy of the genetic code is used to avoid mutating amino acids in the selectable marker polypeptide wherever possible. Hence, wherever an ATG is present in the coding strand of the DNA sequence encoding the selectable marker polypeptide, which ATG is not in frame with the selectable marker polypeptide ORF, and therefore does not code for an internal methionine in the selectable marker polypeptide, the ATG can be mutated such that the resulting polypeptide has no mutations in its internal amino acid sequence. Where the ATG is an in-frame codon coding for an internal methionine, the codon can be mutated, and the resulting mutated polypeptide can be routinely checked for activity of the selectable marker polypeptide. In this way a mutation can be chosen which leads to a mutated selectable marker polypeptide that is still active as such (quantitative differences may exist, but those are less relevant, and in fact it could even be beneficial to have less active variants for the purpose of the disclosure; the minimum requirement is that the selectable marker polypeptide can still be selected for in eukaryotic cells). Amino acids valine, threonine, isoleucine and leucine are structurally similar to methionine, and therefore codons that code for one of these amino acids are good starting candidates to be tested in place of methione within the coding sequence after the start codon. Of course, using the teachings of the disclosure, the skilled person may test other amino acids as well in place of internal methionines, using routine molecular biology techniques for mutating the coding DNA, and routine testing for functionality of the selectable marker polypeptide. Besides routine molecular biology techniques for mutating DNA, it is at present also possible to synthesize at will (if required using subcloning steps) DNA sequences that have sufficient length for an ORF of a selectable marker polypeptide, and such synthetic DNA sequences can nowadays be ordered commercially from various companies. Hence, using the teachings of the disclosure, the person skilled in the art may design appropriate sequences hereof encoding a selectable marker polypeptide (with a mutation decreasing translation initiation, and preferably having no internal ATGs), have this sequence synthesized, and test the DNA molecule for functionality of the encoded selectable marker by introducing the DNA molecule in eukaryotic host cells and test for expression of functional selectable marker polypeptide. The commercial availability of such sequences also makes feasible to provide without undue burden for selection marker coding sequences lacking internal ATG sequences, where the wild-type coding sequence of the selection marker polypeptide comprises several such internal ATGs.
- By providing a coding sequence for a selectable marker polypeptide lacking any internal ATG sequence, the chances of inadvertent translation initiation by ribosomes that passed the (first, non-optimal) translation start sequence of the selectable marker polypeptide at a subsequent internal ATG trinucleotide is diminished, so that the ribosomes will continue to scan for the first optimal translation start sequence, i.e., that of the polypeptide of interest.
- It is not necessarily strictly required to remove all internal ATG sequences from the coding sequence of the selectable marker, as long as the vast majority of ribosomes will not use such internal ATG sequences as a translation initiation start site, for instance because such internal ATGs usually are not in an optimal context for translation initiation. However, also in view of the fact that even an ATG sequence in a non-optimal context gives rise to significant translation initiation events (in general much higher than from a non-ATG start codon in an optimal context), it will also be clear that if several internal ATG sequences are present in the coding region of the selectable marker, each of these might subtract a fraction of ribosomes that start translation at those ATGs, preventing them from reaching the downstream translation unit encoding the polypeptide of interest, and hence result in less expression of the polypeptide of interest. This is the reason why it is preferred to have a sequence encoding the selectable marker polypeptide, which sequence is free of internal ATG sequences. Possibly, if desired, the ratio of ribosomes translating the polypeptide of interest might even be further enhanced by mutating any non-optimal potential translation initiation sites within the sequence coding for the selectable marker polypeptide.
- Any sequence present in the coding strand between the stop codon of the selectable marker and the start codon of the polypeptide of interest can easily be designed such that it does not comprise ATG sequences, or other non-optimal potential translation start codons.
- The translation start sequence of the polypeptide of interest may comprise an optimal translation start sequence, i.e., having the consensus sequence RCCATGG (start codon underlined). This will result in a very efficient translation of the polypeptide of interest.
- As a consequence of these measures, the reciprocally interdependent translation mechanism of the invention is provided: ribosomes scan from the cap of the mRNA containing the multicistronic transcription unit, and a small percentage of ribosomes will translate the first open reading frame, i.e., the selectable marker protein, because this is provided with a mutation decreasing the translation efficiency, while the great majority of the ribosomes will pass this open reading frame and start translation at the optimal translation initiation site of the polypeptide of interest, resulting in low levels of selectable marker protein and high levels of polypeptide of interest. By providing the coding sequence of the marker with different mutations leading to several levels of decreased translation efficiency, the stringency of selection is increased and reciprocally the expression levels of the polypeptide of interest go up, so that fine-tuning is possible: for instance, using a TTG start codon for the selection marker polypeptide, only very few ribosomes will translate from this start codon, resulting in very low levels of selectable marker protein, and hence a high stringency of selection, and at the same time almost all ribosomes will translate the polypeptide of interest.
- It is demonstrated herein that this can be used in a very robust selection system, leading to a very large percentage of clones that express the polypeptide of interest at high levels, as desired. In addition, the expression levels obtained for the polypeptide of interest appear to be significantly higher than those obtained when an even larger number of colonies are screened using selection systems hitherto known.
- In addition to a decreased translation initiation efficiency, it could be beneficial to also provide for decreased translation elongation efficiency of the selectable marker polypeptide, e.g., by mutating the coding sequence thereof so that it comprises several non-preferred codons of the host cell, in order to further decrease the translation levels of the marker polypeptide and allow still more stringent selection conditions, if desired. In certain embodiments, besides the mutation(s) that decrease the translation efficiency hereof, the selectable marker polypeptide further comprises a mutation that reduces the activity of the selectable marker polypeptide compared to its wild-type counterpart. This may be used to increase the stringency of selection even further. As non-limiting examples, proline at
position 9 in the Zeocin® antibiotic resistance polypeptide may be mutated, e.g., to Thr or Phe, and for the neomycin resistance polypeptide,amino acid residue 182 or 261 or both may further be mutated (see, e.g., WO 01/32901). - In some embodiments, a so-called spacer sequence is placed downstream of the sequence encoding the start codon of the selectable marker polypeptide, which spacer sequence preferably is a sequence in frame with the start codon and encoding a few amino acids, and that does not contain a secondary structure (Kozak, 1990), and does not contain the sequence ATG. Such a spacer sequence can be used to further decrease the translation initiation frequency if a secondary structure is present in the RNA (Kozak, 1990) of the selectable marker polypeptide (e.g., for Zeocin® antibiotic, possibly for blasticidin), and hence increase the stringency of the selection system hereof.
- Also provided is a DNA molecule comprising the sequence encoding a selectable marker protein hereof, which DNA molecule has been provided with a mutation that decreases the translation efficiency of the functional selectable marker polypeptide in a eukaryotic host cell. In certain embodiments, the DNA molecule in the coding strand: i) has been mutated compared to the wild-type sequence encoding the selectable marker polypeptide, such that the sequence ATG is absent from the coding region following the start codon of the functional selectable marker protein up till the stop codon of the functional selectable marker protein open reading frame; and ii) has a non-optimal translation start sequence of the functional selectable marker polypeptide. Features i) and ii) can again be provided according to the description above. Such DNA molecules encompass a useful intermediate product hereof. These molecules can be prepared first, introduced into eukaryotic host cells and tested for functionality (for some markers this is even possible in prokaryotic host cells), if desired in a (semi-) quantitative manner, of the selectable marker polypeptide. They may then be further used to prepare a DNA molecule hereof, comprising the multicistronic transcription unit.
- It is a preferred aspect to provide an expression cassette comprising the DNA molecule hereof, having the multicistronic transcription unit. Such an expression cassette is useful to express sequences of interest, for instance in host cells. An “expression cassette” as used herein is a nucleic acid sequence comprising at least a promoter functionally linked to a sequence of which expression is desired. Preferably, an expression cassette further contains transcription termination and polyadenylation sequences. Other regulatory sequences such as enhancers may also be included. Hence, provided is an expression cassette comprising in the following order: 5′-promoter-multicistronic transcription unit hereof, coding for a selectable marker polypeptide and downstream thereof a polypeptide of interest-transcription termination sequence-3′. The promoter must be capable of functioning in a eukaryotic host cell, i.e., it must be capable of driving transcription of the multicistronic transcription unit. The expression cassette may optionally further contain other elements known in the art, e.g., splice sites to comprise introns, and the like. In some embodiments, an intron is present behind the promoter and before the sequence encoding the selectable marker polypeptide.
- To obtain expression of nucleic acid sequences encoding protein, it is known to those skilled in the art that sequences capable of driving such expression, can be functionally linked to the nucleic acid sequences encoding the protein, resulting in recombinant nucleic acid molecules encoding a protein in expressible format. In the disclosure, the expression cassette comprises a multicistronic transcription unit. In general, the promoter sequence is placed upstream of the sequences that should be expressed. Much used expression vectors are available in the art, e.g., the pcDNA and pEF vector series of Invitrogen, pMSCV and pTK-Hyg from BD Sciences, pCMV-Script from Stratagene, etc, which can be used to obtain suitable promoters and/or transcription terminator sequences, polyA sequences, and the like.
- Where the sequence encoding the polypeptide of interest is properly inserted with reference to sequences governing the transcription and translation of the encoded polypeptide, the resulting expression cassette is useful to produce the polypeptide of interest, referred to as expression. Sequences driving expression may include promoters, enhancers and the like, and combinations thereof. These should be capable of functioning in the host cell, thereby driving expression of the nucleic acid sequences that are functionally linked to them. The person skilled in the art is aware that various promoters can be used to obtain expression of a gene in host cells. Promoters can be constitutive or regulated, and can be obtained from various sources, including viruses, prokaryotic, or eukaryotic sources, or artificially designed. Expression of nucleic acids of interest may be from the natural promoter or derivative thereof or from an entirely heterologous promoter (Kaufman, 2000). Some well-known and much used promoters for expression in eukaryotic cells comprise promoters derived from viruses, such as adenovirus, e.g., the E1A promoter, promoters derived from cytomegalovirus (CMV), such as the CMV immediate early (1E) promoter (referred to herein as the CMV promoter) (obtainable for instance from pcDNA, Invitrogen), promoters derived from Simian Virus 40 (SV40) (Das et al. 1985), and the like. Suitable promoters can also be derived from eukaryotic cells, such as methallothionein (MT) promoters, elongation factor 1α (EF-1α) promoter (Gill et al., 2001), ubiquitin C or UB6 promoter (Gill et al., 2001; Schorpp et al., 1996), actin promoter, an immunoglobulin promoter, heat shock promoters, and the like. Some preferred promoters for obtaining expression in eukaryotic cells, which are suitable promoters in the disclosure, are the CMV-promoter, a mammalian EF1-alpha promoter, a mammalian ubiquitin promoter such as a ubiquitin C promoter, or a SV40 promoter (e.g., obtainable from pIRES, cat. no. 631605, BD Sciences). Testing for promoter function and strength of a promoter is a matter of routine for a person skilled in the art, and in general may for instance encompass cloning a test gene such as lacZ, luciferase, GFP, etc. behind the promoter sequence, and test for expression of the test gene. Of course, promoters may be altered by deletion, addition, mutation of sequences therein, and tested for functionality, to find new, attenuated, or improved promoter sequences. According to the disclosure, strong promoters that give high transcription levels in the eukaryotic cells of choice are preferred.
- In certain embodiments, a DNA molecule hereof is part of a vector, e.g., a plasmid. Such vectors can easily be manipulated by methods well known to the person skilled in the art, and can for instance be designed for being capable of replication in prokaryotic and/or eukaryotic cells. In addition, many vectors can directly or in the form of isolated desired fragment therefrom be used for transformation of eukaryotic cells and will integrate in whole or in part into the genome of such cells, resulting in stable host cells comprising the desired nucleic acid in their genome.
- Conventional expression systems are DNA molecules in the form of a recombinant plasmid or a recombinant viral genome. The plasmid or the viral genome is introduced into (eukaryotic host) cells and preferably integrated into their genomes by methods known in the art. In preferred embodiments, the disclosure also uses these types of DNA molecules to deliver its improved transgene expression system. A preferred embodiment of the invention is the use of plasmid DNA for delivery of the expression system. A plasmid contains a number of components: conventional components, known in the art, are an origin of replication and a selectable marker for propagation of the plasmid in bacterial cells; a selectable marker that functions in eukaryotic cells to identify and isolate host cells that carry an integrated transgene expression system; the protein of interest, whose high-level transcription is brought about by a promoter that is functional in eukaryotic cells (e.g., the human cytomegalovirus major immediate early promoter/enhancer, pCMV (Boshart et al., 1985); and viral transcriptional terminators (e.g., the SV40 polyadenylation site (Kaufman & Sharp, 1982) for the transgene of interest and the selectable marker.
- The vector used can be any vector that is suitable for cloning DNA and that can be used for transcription of a nucleic acid of interest. When host cells are used it is preferred that the vector is an integrating vector. Alternatively, the vector may be an episomally replicating vector.
- It is widely appreciated that chromatin structure and other epigenetic control mechanisms may influence the expression of transgenes in eukaryotic cells (e.g., Whitelaw et al., 2001). The multicistronic expression units hereof form part of a selection system with a rather rigorous selection regime. This generally requires high transcription levels in the host cells of choice. To increase the chance of finding clones of host cells that survive the rigorous selection regime, and possibly to increase the stability of expression in obtained clones, it will generally be preferable to increase the predictability of transcription. Therefore, in preferred embodiments, an expression cassette hereof further comprises at least one chromatin control element. A “chromatin control element” used herein is a collective term for DNA sequences that may somehow have an effect on the chromatin structure and therewith on the expression level and/or stability of expression of transgenes in their vicinity (they function “in cis,” and hence are placed preferably within 5 kb, more preferably within 2 kb, still more preferably within 1 kb from the transgene) within eukaryotic cells. Such elements have sometimes been used to increase the number of clones having desired levels of transgene expression. The mechanisms by which these elements work may differ for and even within different classes of such elements, and are not completely known for all types of such elements. However, such elements have been described, and for the purpose of the disclosure chromatin control elements are chosen from the group consisting of matrix or scaffold attachment regions (MARs/SARs) (e.g., Phi-Van et al., 1990; WO 02/074969, WO 2005/040377), insulators (West et al., 2002) such as the beta-globin insulator element (5′ HS4 of the chicken beta-globin locus), scs, scs′, and the like (e.g., Chung et al., 1993, 1997; Kellum and Schedl, 1991; WO 94/23046, WO 96/04390, WO 01/02553, WO 2004/027072), a ubiquitous chromatin opening element (UCOE) (WO 00/05393, WO 02/24930, WO 02/099089, WO 02/099070), and anti-repressor sequences (also referred to as “STAR” sequences) (Kwaks et al., 2003; WO 03/004704). Non-limiting examples of MAR/SAR sequences that could be used in the current invention are the chicken lysosyme 5′ MAR (Phi-Van et al., 1990) or fragments thereof, e.g., the B, K and F regions as described in WO 02/074969); DNA sequences comprising at least one bent DNA element and at least one binding site for a DNA binding protein, preferably containing at least 10% of dinucleotide TA, and/or at least 12% of dinucleotide AT on a stretch of 100 contiguous base pairs, such as a sequence selected from the group of comprising the sequences SEQ ID NOS: 1 to 27 in WO 2005/040377, fragments of any one of SEQ ID NOS: 1 to 27 in WO 2005/040377 being at least 100 nucleotides in length and having MAR activity, sequences that are at least 70% identical in nucleotide sequence to any one of SEQ ID NOS: 1 to 27 in WO 2005/040377 or fragments thereof and having MAR activity, wherein MAR activity is defined as being capable of binding to nuclear matrices/scaffolds in vitro and/or of altering the expression of coding sequences operably linked to a promoter; sequences chosen from any one of SEQ ID NOS: 1 to 5 in WO 02/074969, fragments of any one of any one of SEQ ID NOS: 1 to 5 in WO 02/074969 and having MAR activity, sequences that are at least 70% identical in nucleotide sequence to any one of SEQ ID NOS: 1 to 5 in WO 02/074969 or fragments thereof and having MAR activity; sequences chosen from SEQ ID NO:1 and SEQ ID NO:2 in WO 2004/027072, functional fragments thereof and sequences being at least 70% identical thereto. A non-limiting example of insulator sequences that could be used in the disclosure is a sequence that comprises SEQ ID NO:1 of WO 01/02553. Non-limiting examples of UCOEs that could be used in the disclosure are sequences depicted in FIGS. 2 and 7 of WO 02/24930, functional fragments thereof and sequences being at least 70% identical thereto while still retaining activity; sequences comprising SEQ ID NO:28 of U.S. 2005/181428, functional fragments thereof and sequences being at least 70% identical thereto while still retaining activity.
- Preferably, the chromatin control element is an anti-repressor sequence, preferably selected from the group consisting of: a) any one SEQ ID NO:1 through SEQ ID NO:66; b) fragments of any one of SEQ ID NO:1 through SEQ ID NO:66, wherein the fragments have anti-repressor activity (“functional fragments”); c) sequences that are at least 70% identical in nucleotide sequence to a) or b) wherein the sequences have anti-repressor activity (“functional derivatives”); and d) the complement to any one of a) to c). Preferably, the chromatin control element is chosen from the group consisting of STAR67 (SEQ ID NO:66), STAR7 (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), or a functional fragment or derivative of the STAR sequences. In a particularly preferred embodiment, the STAR sequence is STAR 67 (SEQ ID NO:66) or a functional fragment or derivative thereof. In certain embodiments,
STAR 67 or a functional fragment or derivative thereof is positioned upstream of a promoter driving expression of the multicistronic transcription unit. In other preferred embodiments, the expression cassettes hereof are flanked on both sides by at least one anti-repressor sequence. - Sequences having anti-repressor activity as used herein are sequences that are capable of at least in part counteracting the repressive effect of HP1 or HPC2 proteins when these proteins are tethered to DNA. Sequences having anti-repressor activity (sometimes also referred to as anti-repressor sequences or anti-repressor elements herein) suitable for the disclosure, have been disclosed in WO 03/004704, incorporated herein by reference, and were coined “STAR” sequences therein (wherever a sequence is referred to as a STAR sequence herein, this sequence has anti-repressor activity hereof). As a non-limiting example, the sequences of 66 anti-repressor elements, named STAR1-65 (see WO 03/004704) and STAR67, are presented herein as SEQ ID NOS: 1-65 and 66, respectively.
- A functional fragment or derivative of a given anti-repressor element may be considered equivalent to the anti-repressor element, when it still has anti-repressor activity. The presence of such anti-repressor activity can easily be checked by the person skilled in the art, for instance by the assay described below. Functional fragments or derivatives can easily be obtained by a person skilled in the art of molecular biology, by starting with a given anti-repressor sequence, and making deletions, additions, substitutions, inversions and the like (see, e.g., WO 03/004704). A functional fragment or derivative also comprises orthologs from other species, which can be found using the known anti-repressor sequences by methods known by the person skilled in the art (see, e.g., WO 03/004704). Hence, the disclosure encompasses fragments of the anti-repressor sequences, wherein the fragments still have anti-repressor activity. Also encompassed are sequences that are at least 70% identical in nucleotide sequence to the sequences having anti-repressor activity or to functional fragments thereof having anti-repressor activity, as long as these sequences that are at least 70% identical still have the anti-repressor activity hereof. Preferably, the sequences are at least 80% identical, more preferably at least 90% identical and still more preferably at least 95% identical to the reference native sequence or functional fragment thereof. For fragments of a given sequence, percent identity refers to that portion of the reference native sequence that is found in the fragment.
- Sequences having anti-repressor activity hereof can be obtained by various methods, including but not limited to the cloning from the human genome or from the genome of another organism, or by for instance amplifying known anti-repressor sequences directly from such a genome by using the knowledge of the sequences, e.g., by PCR, or can in part or wholly be chemically synthesized.
- Sequences having anti-repressor activity, and functional fragments or derivatives thereof, are structurally defined herein by their sequence and in addition are functionally defined as sequences having anti-repressor activity, which can be determined with the assay described below.
- Any sequence having anti-repressor activity according to the disclosure should at least be capable of surviving the following functional assay (see WO 03/004704, Example 1, incorporated herein by reference).
- Human U-2 OS cells (ATCC HTB-96) are stably transfected with the pTet-Off plasmid (Clontech K1620-A) and with nucleic acid encoding a LexA-repressor fusion protein containing the LexA DNA binding domain and the coding region of either HP1 or HPC2 (Drosophila Polycomb group proteins that repress gene expression when tethered to DNA; the assay works with either fusion protein) under control of the Tet-Off transcriptional regulatory system (Gossen and Bujard, 1992). These cells are referred to below as the reporter cells for the anti-repressor activity assay. A reporter plasmid, which provides hygromycin resistance, contains a polylinker sequence positioned between four LexA operator sites and the SV40 promoter that controls the Zeocin® antibiotic resistance gene. The sequence to be tested for anti-repressor activity can be cloned in the polylinker. Construction of a suitable reporter plasmid, such as pSelect, is described in Example 1 and FIG. 1 of WO 00/004704. The reporter plasmid is transfected into the reporter cells, and the cells are cultured under hygromycin selection (25 μg/ml; selection for presence of the reporter plasmid) and tetracycline repression (doxycycline, 10 ng/ml; prevents expression of the LexA-repressor fusion protein). After one week of growth under these conditions, the doxycycline concentration is reduced to 0.1 ng/ml to induce the LexA-repressor gene, and after two days zeocin is added to 250 μg/ml. The cells are cultured for five weeks, until the control cultures (transfected with empty reporter plasmid, i.e., lacking a cloned anti-repressor sequence in the polylinker) are killed by the Zeocin® antibiotic (in this control plasmid, the SV40 promoter is repressed by the LexA-repressor fusion protein that is tethered to the LexA operating sites, resulting in insufficient Zeocin® antibiotic expression in such cells to survive zeocin selection). A sequence has anti-repressor activity according to the disclosure if when the sequence is cloned in the polylinker of the reporter plasmid, the reporter cells survive the 5 weeks selection under Zeocin® antibiotic. Cells from such colonies can still be propagated onto new medium containing Zeocin® antibiotic after the five weeks Zeocin® antibiotic selection, whereas cells transfected with reporter plasmids lacking anti-repressor sequences cannot be propagated onto new medium containing Zeocin® antibiotic. Any sequence not capable of conferring such growth after 5 weeks on Zeocin® antibiotic in this assay, does not qualify as a sequence having anti-repressor activity, or functional fragment or functional derivative thereof according to the disclosure. As an example, other known chromatin control elements such as those tested by Van der Vlag et al. (2000), including Drosophila scs (Kellum and Schedl, 1991), 5′-HS4 of the chicken β-globin locus (Chung et al., 1993, 1997) or Matrix Attachment Regions (MARS) (Phi-Van et al., 1990), do not survive this assay.
- In addition, it is preferred that the anti-repressor sequence or functional fragment or derivative thereof confers a higher proportion of reporter over-expressing clones when flanking a reporter gene (e.g., luciferase, GFP) which is integrated into the genome of U-2 OS or CHO cells, compared to when the reporter gene is not flanked by anti-repressor sequences, or flanked by weaker repression blocking sequences such as Drosophila scs. This can be verified using for instance the pSDH vector, or similar vectors, as described in Example 1 and FIG. 2 of WO 03/004704.
- Anti-repressor elements can have at least one of three consequences for production of protein: (1) they increase the predictability of identifying host cell lines that express a protein at industrially acceptable levels (they impair the ability of adjacent heterochromatin to silence the transgene, so that the position of integration has a less pronounced effect on expression); (2) they result in host cell lines with increased protein yields; and/or (3) they result in host cell lines that exhibit more stable protein production during prolonged cultivation.
- Any STAR sequence can be used in the expression cassettes according to the disclosure, but the following STAR sequences are particularly useful: STAR67 (SEQ ID NO:66), START (SEQ ID NO:7), STAR9 (SEQ ID NO:9), STAR17 (SEQ ID NO:17), STAR27 (SEQ ID NO:27), STAR29 (SEQ ID NO:29), STAR43 (SEQ ID NO:43), STAR44 (SEQ ID NO:44), STAR45 (SEQ ID NO:45), STAR47 (SEQ ID NO:47), STAR61 (SEQ ID NO:61), or functional fragments or derivatives of these STAR sequences.
- In certain embodiments the anti-repressor sequence, preferably STAR67, is placed upstream of the promoter, preferably such that less than 2 kb are present between the 3′ end of the anti-repressor sequence and the start of the promoter sequence. In preferred embodiments, less than 1 kb, more preferably less than 500 nucleotides (nt), still more preferably less than about 200, 100, 50, or 30 nt are present between the 3′ end of the anti-repressor sequence and the start of the promoter sequence. In certain embodiments, the anti-repressor sequence is cloned directly upstream of the promoter, resulting in only about 0-20 nt between the 3′ end of the anti-repressor sequence and the start of the promoter sequence.
- For the production of multimeric proteins, two or more expression cassettes can be used. Preferably, both expression cassettes are multicistronic expression cassettes hereof, each coding for a different selectable marker protein, so that selection for both expression cassettes is possible. This embodiment has proven to give good results. e.g., for the expression of the heavy and light chain of antibodies. It will be clear that both expression cassettes may be placed on one nucleic acid molecule or both may be present on a separate nucleic acid molecule, before they are introduced into host cells. An advantage of placing them on one nucleic acid molecule is that the two expression cassettes are present in a single predetermined ratio (e.g., 1:1) when introduced into host cells. On the other hand, when present on two different nucleic acid molecules, this allows the possibility to vary the molar ratio of the two expression cassettes when introducing them into host cells, which may be an advantage if the preferred molar ratio is different from 1:1 or when it is unknown beforehand what is the preferred molar ratio, so that variation thereof and empirically finding the optimum can easily be performed by the skilled person. Hereof, preferably at least one of the expression cassettes, but more preferably each of them, comprises a chromatin control element, more preferably an anti-repressor sequence.
- In another embodiment, the different subunits or parts of a multimeric protein are present on a single expression cassette.
- Instead of or in addition to the presence of a STAR sequence placed upstream of a promoter in an expression cassette, it has proven highly beneficial to provide a STAR sequence on both sides of an expression cassette, such that expression cassette comprising the transgene is flanked by two STAR sequences, which in certain embodiments are essentially identical to each other.
- It is shown herein that the combination of a first anti-repressor element upstream of a promoter and flanking the expression cassette by two other anti-repressor sequences provides superior results.
- As at least some anti-repressor sequences can be directional (WO 00/004704), the anti-repressor sequences flanking the expression cassette (anti-repressor A and B) may beneficially placed in opposite direction with respect to each other, such that the 3′ end of each of these anti-repressor sequences is facing inwards to the expression cassette (and to each other). Hence, in preferred embodiments, the 5′ side of an anti-repressor element faces the DNA/chromatin of which the influence on the transgene is to be diminished by the anti-repressor element. For an anti-repressor sequence upstream of a promoter in an expression cassette, the 3′ end faces the promoter. The sequences of the anti-repressor elements in the sequence listing (SEQ ID NOS: 1-66) are given in 5′ to 3′ direction, unless otherwise indicated.
- In certain embodiments, transcription units or expression cassettes hereof are provided, further comprising: a) a transcription pause (TRAP) sequence upstream of the promoter that drives transcription of the multicistronic transcription unit, the TRAP being in a 5′ to 3′ direction; or b) a TRAP sequence downstream of the open reading frame of the polypeptide of interest and preferably downstream of the transcription termination sequence of the multicistronic transcription unit, the TRAP being in a 3′ to 5′ orientation; or c) both a) and b); wherein a TRAP sequence is functionally defined as a sequence which when placed into a transcription unit, results in a reduced level of transcription in the nucleic acid present on the 3′ side of the TRAP when compared to the level of transcription observed in the nucleic acid on the 5′ side of the TRAP. Non-limiting examples of TRAP sequences are transcription termination and/or polyadenylation signals. One non-limiting example of a TRAP sequence is given in SEQ ID NO:142. Examples of other TRAP sequences, methods to find these, and uses thereof have been described in WO 2004/055215.
- DNA molecules comprising multicistronic transcription units and/or expression cassettes according to the disclosure can be used for improving expression of nucleic acid, preferably in host cells. The terms “cell”/“host cell” and “cell line”/“host cell line” are respectively typically defined as a cell and homogeneous populations thereof that can be maintained in cell culture by methods known in the art, and that have the ability to express heterologous or homologous proteins.
- Prokaryotic host cells can be used to propagate and/or perform genetic engineering with the DNA molecules of the invention, especially when present on plasmids capable of replicating in prokaryotic host cells such as bacteria.
- A host cell hereof preferably is a eukaryotic cell, more preferably a mammalian cell, such as a rodent cell or a human cell or fusion between different cells. In certain non-limiting embodiments, the host cell is a U-2 OS osteosarcoma, CHO (Chinese hamster ovary), HEK 293, HuNS-1 myeloma, WER1-Rb-1 retinoblastoma, BHK, Vero, non-secreting mouse myeloma Sp2/0-
Ag 14, non-secreting mouse myeloma NSO, NCI-H295R adrenal gland carcinomal or a PER.C6® cell. - In certain embodiments, a host cell is a cell expressing at least E1A, and preferably also E1B, of an adenovirus. As non-limiting examples, such a cell can be derived from for instance human cells, for instance from a kidney (example: HEK 293 cells, see Graham et al., 1977), lung (e.g., A549, see, e.g., WO 98/39411) or retina (example: HER cells marketed under the trade mark PER.C6®, see U.S. Pat. No. 5,994,128), or from amniocytes (e.g., N52.E6, described in U.S. Pat. No. 6,558,948), and similarly from other cells. Methods for obtaining such cells are described for instance in U.S. Pat. Nos. 5,994,128 and 6,558,948. PER.C6® cells for the purpose of the disclosure means cells from an upstream or downstream passage or a descendent of an upstream or downstream passage of cells as deposited under ECACC no. 96022940, i.e., having the characteristics of those cells. It has been previously shown that such cells are capable of expression of proteins at high levels (e.g., WO 00/63403, and Jones et al., 2003). In other preferred embodiments, the host cells are CHO cells, for instance CHO-K1, CHO-S, CHO-DG44, CHO-DUKXB11, and the like. In certain embodiments, the CHO cells have a dhfr− phenotype.
- Such eukaryotic host cells can express desired polypeptides, and are often used for that purpose. They can be obtained by introduction of a DNA molecule of the invention, preferably in the form of an expression cassette, into the cells. Preferably, the expression cassette is integrated in the genome of the host cells, which can be in different positions in various host cells, and selection will provide for a clone where the transgene is integrated in a suitable position, leading to a host cell clone with desired properties in terms of expression levels, stability, growth characteristics, and the like. Alternatively the multicistronic transcription unit may be targeted or randomly selected for integration into a chromosomal region that is transcriptionally active, e.g., behind a promoter present in the genome. Selection for cells containing the DNA of the invention can be performed by selecting for the selectable marker polypeptide, using routine methods known by the person skilled in the art. When such a multicistronic transcription unit is integrated behind a promoter in the genome, an expression cassette hereof can be generated in situ, i.e., within the genome of the host cells.
- The host cells may be from a stable clone that can be selected and propagated according to standard procedures known to the person skilled in the art. A culture of such a clone is capable of producing polypeptide of interest, if the cells comprise the multicistronic transcription unit of the invention. Cells hereof preferably are able to grow in suspension culture in serum-free medium.
- In preferred embodiments, the DNA molecule comprising the multicistronic transcription unit hereof, preferably in the form of an expression cassette, is integrated into the genome of the eukaryotic host cell hereof. This will provide for stable inheritance of the multicistronic transcription unit.
- Selection for the presence of the selectable marker polypeptide, and hence for expression, can be performed during the initial obtaining of the cells, and could be lowered or stopped altogether after stable clones have been obtained. It is however also possible to apply the selection agent during later stages continuously, or only occasionally, possibly at lower levels than during initial selection of the host cells.
- A polypeptide of interest hereof can be any protein, and may be a monomeric protein or a (part of a) multimeric protein. A multimeric protein comprises at least two polypeptide chains. Non-limiting examples of a protein of interest hereof are enzymes, hormones, immunoglobulin chains, therapeutic proteins like anti-cancer proteins, blood coagulation proteins such as Factor VIII, multi-functional proteins, such as erythropoietin, diagnostic proteins, or proteins or fragments thereof useful for vaccination purposes, all known to the person skilled in the art.
- In certain embodiments, an expression cassette encodes an immunoglobulin heavy or light chain or an antigen binding part, derivative and/or analogue thereof. In a preferred embodiment a protein expression unit hereof is provided, wherein the protein of interest is an immunoglobulin heavy chain. In yet another embodiment, a protein expression unit hereof is provided, wherein the protein of interest is an immunoglobulin light chain. When these two protein expression units are present within the same (host) cell a multimeric protein and more specifically an immunoglobulin, is assembled. Hence, in certain embodiments, the protein of interest is an immunoglobulin, such as an antibody, which is a multimeric protein. Preferably, such an antibody is a human or humanized antibody. In certain embodiments thereof, it is an IgG, IgA, or IgM antibody. An immunoglobulin may be encoded by the heavy and light chains on different expression cassettes, or on a single expression cassette. Preferably, the heavy and light chain are each present on a separate expression cassette, each having its own promoter (which may be the same or different for the two expression cassettes), each comprising a multicistronic transcription unit hereof, the heavy and light chain being the polypeptide of interest, and preferably each coding for a different selectable marker protein, so that selection for both heavy and light chain expression cassette can be performed when the expression cassettes are introduced and/or present in a eukaryotic host cell.
- The polypeptide of interest may be from any source, and in certain embodiments is a mammalian protein, an artificial protein (e.g., a fusion protein or mutated protein), and preferably is a human protein.
- The configurations of the expression cassettes hereof may also be used when the ultimate goal is not the production of a polypeptide of interest, but the RNA itself, for instance for producing increased quantities of RNA from an expression cassette, which may be used for purposes of regulating other genes (e.g., RNAi, antisense RNA), gene therapy, in vitro protein production, etc.
- In one aspect, provided is a method for generating a host cell expressing a polypeptide of interest, the method comprising the steps of: a) introducing into a plurality of precursor cells an expression cassette hereof, and b) culturing the generated cells under conditions selecting for expression of the selectable marker polypeptide, and c) selecting at least one host cell producing the polypeptide of interest. This novel method provides a very good result in terms of the ratio of obtained clones versus clones with high expression of the desired polypeptide. Using the most stringent conditions, i.e., the weakest translation efficiency for the selectable marker polypeptide (using the weakest translation start sequence), far fewer colonies are obtained using the same concentration of selection agent than with known selection systems, and a relatively high percentage of the obtained clones produces the polypeptide of interest at high levels. In addition, the obtained levels of expression appear higher than those obtained when an even larger number of clones using the known selection systems are used.
- It is an additional advantage that the selection system is swift because it does not require copy number amplification of the transgene. Hence, cells with low copy numbers of the multicistronic transcription units already provide high expression levels. High transgene copy numbers of the transgene may be prone to genetic instability and repeat-induced silencing (e.g., Kim et al., 1998; McBurney et al., 2002). Therefore, an additional advantage of the embodiments of the invention with relatively low transgene copy numbers is that lower copy numbers are anticipated to be less prone to recombination and to repeat-induced silencing, and therefore less problems in this respect are anticipated when using host cells with a limited number of copies of the transgene compared to host cells obtained using an amplification system where hundreds or even thousands of copies of the selectable marker and protein of interest coding sequences may be present in the genome of the cell. The disclosure provides examples of high expression levels, using the multicistronic transcription unit selection system, while the copy number of the transgene is relatively low, i.e., less than 30 copies per cell, or even less than 20 copies per cell. Hence, the disclosure allows the generation of host cells hereof, comprising less than 30 copies of the multicistronic transcription unit in the genome of the host cells, preferably less than 25, more preferably less than 20 copies, while at the same time providing sufficient expression levels of the polypeptide of interest for commercial purposes, e.g., more than 15, preferably more than 20 pg/cell/day of an antibody.
- While clones having relatively low copy numbers of the multicistronic transcription units and high expression levels can be obtained, the selection system of the invention nevertheless can be combined with amplification methods to even further improve expression levels. This can for instance be accomplished by amplification of a co-integrated dhfr gene using methotrexate, for instance by placing dhfr on the same nucleic acid molecule as the multicistronic transcription unit of the invention, or by cotransfection when dhfr is on a separate DNA molecule.
- In one aspect, provided is a method for producing a polypeptide of interest, the method comprising culturing a host cell, the host cell comprising a DNA molecule comprising a multicistronic expression unit or an expression cassette hereof, and expressing the polypeptide of interest from the coding sequence for the polypeptide of interest.
- The host cell for this aspect is a eukaryotic host cell, preferably a mammalian cell, such as a CHO cell, further as described above.
- Introduction of nucleic acid that is to be expressed in a cell, can be done by one of several methods, which as such are known to the person skilled in the art, also dependent on the format of the nucleic acid to be introduced. The methods include but are not limited to transfection, infection, injection, transformation, and the like. Suitable host cells that express the polypeptide of interest can be obtained by selection as described above.
- In certain embodiments, selection agent is present in the culture medium at least part of the time during the culturing, either in sufficient concentrations to select for cells expressing the selectable marker polypeptide or in lower concentrations. In preferred embodiments, selection agent is no longer present in the culture medium during the production phase when the polypeptide is expressed.
- Culturing a cell is done to enable it to metabolize, and/or grow and/or divide and/or produce recombinant proteins of interest. This can be accomplished by methods well known to persons skilled in the art, and includes but is not limited to providing nutrients for the cell. The methods comprise growth adhering to surfaces, growth in suspension, or combinations thereof. Culturing can be done for instance in dishes, roller bottles or in bioreactors, using batch, fed-batch, continuous systems such as perfusion systems, and the like. In order to achieve large scale (continuous) production of recombinant proteins through cell culture it is preferred in the art to have cells capable of growing in suspension, and it is preferred to have cells capable of being cultured in the absence of animal- or human-derived serum or animal- or human-derived serum components.
- The conditions for growing or multiplying cells (see, e.g., Tissue Culture, Academic Press, Kruse and Paterson, editors (1973)) and the conditions for expression of the recombinant product are known to the person skilled in the art. In general, principles, protocols, and practical techniques for maximizing the productivity of mammalian cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach (M. Butler, ed., IRL Press, 1991).
- In a preferred embodiment, the expressed protein is collected (isolated), either from the cells or from the culture medium or from both. It may then be further purified using known methods, e.g., filtration, column chromatography, etc, by methods generally known to the person skilled in the art.
- The selection method works in the absence of chromatin control elements, but improved results are obtained when the multicistronic expression units are provided with such elements. The selection method works particularly well when an expression cassette hereof, comprising at least one anti-repressor sequence is used. Depending on the selection agent and conditions, the selection can in certain cases be made so stringent, that only very few or even no host cells survive the selection, unless anti-repressor sequences are present. Hence, the combination of the novel selection method and anti-repressor sequences provides a very attractive method to obtain only limited numbers of colonies with a greatly improved chance of high expression of the polypeptide of interest therein, while at the same time the obtained clones comprising the expression cassettes with anti-repressor sequences provide for stable expression of the polypeptide of interest, i.e., they are less prone to silencing or other mechanisms of lowering expression than conventional expression cassettes.
- In certain embodiments, almost no clones are obtained when no anti-repressor sequence is present in the expression cassette hereof, providing for very stringent selection. The novel selection system disclosed herein therefore also provides the possibility to test parts of anti-repressor elements for functionality, by analyzing the effects of such sequences when present in expression cassettes of the invention under selection conditions. This easy screen, which provides an almost or even complete black and white difference in many cases, therefore can contribute to identifying functional parts or derivatives from anti-repressor sequences. When known anti-repressor sequences are tested, this assay can be used to characterize them further. When fragments of known anti-repressor sequences are tested, the assay will provide functional fragments of such known anti-repressor sequences.
- The practice of the disclosure will employ, unless otherwise indicated, conventional techniques of immunology, molecular biology, microbiology, cell biology, and recombinant DNA, which are within the skill of the art. See, e.g., Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual, 2nd edition, 1989; Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds, 1987; the series Methods in Enzymology (Academic Press, Inc.); PCR2: A Practical Approach, M. J. MacPherson, B. D. Hams, G. R. Taylor, eds, 1995; Antibodies: A Laboratory Manual, Harlow and Lane, eds, 1988.
- The disclosure is further described in the following non-limiting Examples.
- Examples 1-7 describe the testing of STAR67 and other STAR sequences in various configurations and under various circumstances. Examples 8-25 further describe the novel selection system of the invention.
- A novel anti-repressor (STAR) sequence was isolated using a genetic screen as described in WO 03/004704, and this novel sequence was coined STAR67. The effects of STAR67 on expression of transgenes in mammalian cell lines were tested. Here we describe the construction of the various constructs.
- Three plasmids were created (
FIG. 1 ): -
- A) CMV-d2EGFP-ires-Zeo (CMV Control),
- B) STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67),
- C) STAR7-STAR67-CMV-d2EGFP-ires-Zeo-STAR7 (CMV-
STAR67 7/7)
- The construction of construct A is described below. Plasmid pd2EGFP (Clontech 6010-1) was modified by insertion of a linker at the BsiWI site to yield pd2EGFP-link. The linker, made by annealing oligonucleotides GTACGGATATCAGATCTTTAATTAAG (SEQ ID NO:67) and GTACCTTAATTAAAGATCTGATAT (SEQ ID NO:68), introduced sites for the PacI, BglII, and EcoRV restriction endonucleases. This created the multiple cloning site MCSII for insertion of STAR elements. Then primers GATCAGATCTGGCGCGCCATTTAAATCGTCTCGCGCGTTTCGGTGATGACGG (SEQ ID NO:69) and (AGGCGGATCCGAATGTATTTAGAAAAATAAACAAATAGGGG (SEQ ID NO:70) were used to amplify a region of 0.37 kb from pd2EGFP, which was inserted into the BglII site of pIRES (Clontech 6028-1) to yield pIRES-stuf. This introduced sites for the AscI and SwaI restriction endonucleases at MCSI, and acts as a “stuffer fragment” to avoid potential interference between STAR elements and adjacent promoters. pIRES-stuf was digested with BglII and FspI to liberate a DNA fragment composed of the stuffer fragment, the CMV promoter, the IRES element (flanked by multiple cloning sites MCS A and MCS B), and the SV40 polyadenylation signal. This fragment was ligated with the vector backbone of pd2EGFP-link produced by digestion with BamHI and StuI, to yield pIRES-link.
- The open reading frame of the Zeocin® antibiotic-resistance gene was inserted into BamHI/NotI sites downstream of the pIRES as follows: the Zeocin® antibiotic-resistance ORF was amplified by PCR with primers GATCGGATCCTTCGAAATGGCCAAGTTGACCAGTGC (SEQ ID NO:71) and AGGCGCGGCCGCAATTCTCAGTCCTGCTCCTC (SEQ ID NO:72) from plasmid pCMV/zeo (Invitrogen, cat. no. V50120), digested with BamHI and NotI, and ligated with BamHI/NotI-digested pIRES-link to yield pIRES-link-zeo. The d2EGFP reporter ORF was introduced into pIRES-link-zeo by amplification of pd2EGFP (Clontech 6010-1) with primers GATCGAATTCTCGCGAATGGTGAGCAAGCAGATCCTGAAG (SEQ ID NO:73) and AGGCGAATTCACCGGTGTTTAAACTTACACCCACTCGTGCAGGCTGCCCAGG (SEQ ID NO:74), and insertion of the EcoRI-digested d2EGFP cassette into the EcoRI site in the pIRES-link-zeo plasmid. This created construct A, CMV-d2EGFP-IRES-Zeo (CMV Control).
- STAR67 was cloned upstream of the CMV promoter, in the AscI site (about 15 nt remaining between STAR67 and the promoter). This created construct B, STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67).
- STAR67 was also tested in the context of a cassette that contains also the STAR7, cloned directionally in the 5′ SalI and XbaI sites and 3′ BglII and PacI sites to flank the entire cassette with STAR7. This is construct C (CMV-
STAR67 7/7). - We tested whether the presence of STAR67 adjacent to the CMV, EF1α and UB6 promoters influences the expression level of these promoters in CHO cells. The constructs A and B (
FIG. 1 ) described in Example 1 are used for this purpose, modified for the respective promoters -
- 1 CMV-d2EGFP-ires-Zeo (CMV Control)
- 2 STAR67-CMV-d2EGFP-ires-Zeo (CMV-STAR67)
- 3 EF1α-d2EGFP-ires-Zeo (EF1α Control)
- 4 STAR67-EF1α-d2EGFP-ires-Zeo (EF1α-STAR67)
- 5 UB6-d2EGFP-ires-Zeo (UB6 Control)
- 6 STAR67-UB6-d2EGFP-ires-Zeo (UB6-STAR67).
- The UB6 and EF1α promoters were exchanged for the CMV promoter in the plasmids described in
FIG. 1 . The UB6 promoter was cloned as follows. A DNA 0.37 kb stuffer from the pd2EGFP plasmid was amplified by PCR, as described in Example 1, using primers identified by SEQ ID NOS: 69 and 70. The resulting DNA stuffer was cloned in the BglII site of pUB6/V5-His [Invitrogen V250-20], creating pUB6-stuf. From pUB6-stuf an AscI-SacI fragment was cloned into CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed. - The EF1α promoter was amplified by PCR with pEF1α/V5-His [Invitrogen V920-20] as template, using primers GATCGGCGCGCCATTTAAATCCGAAAAGTGCCACCTGACG (SEQ ID NO:79) and AGGCGGGACCCCCTCACGACACCTGAAATGGAAG (SEQ ID NO:80). The PCR fragment was cloned in the AscI and PpuMI sites of CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed.
- The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO2. Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence. Lipofectamine® reagent was combined with plasmid DNA at a ratio of 6 microliters per microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine) and added to the cells. After overnight incubation the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium. After another overnight incubation Zeocin® antibiotic was added to a concentration of 50 μg/ml and the cells were cultured further. After another three days the medium was replaced by fresh medium containing Zeocin® antibiotic (100 μg/ml) and cultured further. When individual colonies became visible (approximately ten days after transfection) medium was removed and replaced with fresh medium without Zeocin® antibiotic. Individual clones were isolated and transferred to 24-well plates in medium without Zeocin® antibiotic. One day after isolation of the colonies Zeocin® antibiotic was added to the medium. Expression of the d2EGFP reporter gene was assessed approximately three weeks after transfection. d2EGFP expression levels in the colonies were measured after periods of two weeks. After the initial two weeks after transfection when the first d2EGFP measurements were performed, the colonies were cultured in medium without Zeocin® antibiotic or other antibiotics. This continued for the remainder of the experiment.
-
FIG. 2 shows that transfection of the construct that contains STAR67 cloned upstream of the CMV promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, CMV Control. The average of the d2EGFP signal in the 10 colonies transfected with the CMV Control plasmid was 34, when measured 25 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was reduced to 13, indicating that expression is not stable over time. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the STAR67-CMV plasmid was 42 when measured after 25 days and 32 measured 60 days after transfection. Hence 60 days after transfection, a STAR67-encompassing CMV construct conveyed a factor 2.5 higher CMV promoter driven expression level of the reporter protein in stably transfected clones. Importantly, 25 days after transfection, after the first measurement, selection pressure was removed by culturing the colonies in medium without zeocin. Hence, colonies containing the STAR67 construct are more stable over time in the absence of selection pressure than colonies that do not contain a STAR67 construct. -
FIG. 3 shows that transfection of the construct that contains STAR67 cloned upstream of the EF1α-promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, EF1α Control. The average of the d2EGFP signal in the 10 colonies transfected with the EF1α Control plasmid was 31, when measured 25 days after transfection. Sixty days after transfection, the average of the d2EGFP signal in these ten colonies was 26. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the EF1α-STAR67 plasmid was 60 when measured after 25 days and 76 measured 60 days after transfection. Hence, both after 25 and 60 days after transfection, a STAR67-encompassing EF1α construct conveyed a factor 2.9 higher EF1α promoter driven expression level of the reporter protein in stably transfected clones. -
FIG. 4 shows that transfection of the construct that contains STAR67 cloned upstream of the UB6 promoter resulted in a number of CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, UB6 Control. The average of the d2EGFP signal in the ten colonies transfected with the UB6 Control plasmid was 51, when measured 25 days after transfection. 60 days after transfection the average of the d2EGFP signal in these ten colonies was 29, indicating that the expression was not stable over time. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the UB6-STAR67 plasmid was 218 when measured after 25 days and 224 measured 60 days after transfection. Hence, 25 days after transfection, a STAR67-encompassing UB6 construct conveyed a factor 4.3 higher UB6 promoter driven expression level of the reporter protein in stably transfected clones. After 60 days, this factor was 7.7, due to instability of expression in the control colonies and stability in the UB6-STAR67 colonies. Importantly, 25 days after transfection, after the first measurement, selection pressure was removed by culturing the colonies in medium without Zeocin® antibiotic. Hence, colonies containing the STAR67 construct are more stable over time in the absence of selection pressure than colonies that do not contain a STAR67 construct. - In conclusion, STAR67 increases expression from three different, unrelated promoters.
- We tested whether the presence of STAR67 adjacent of the CMV, EF1α and UB6 promoters influences the expression level of these promoters in another cell type than CHO cells, namely human PER.C6® cells. The same constructs as in Example 1 were used.
- PER.C6® cells were cultured in DMEM medium+pyridoxine+9% Fetal Bovine Serum (Non-Heat Inactivated), 8.9 mM MgCl2 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./10% CO2. Cells were transfected with the plasmids using Lipofectamine 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to 6-well plates and grown overnight to 70-90% confluence. Lipofectamine reagent was combined with plasmid DNA at a ratio of 15 microliters per 3 microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine) and added after 30 minutes incubation at 25° C. to the cells. After six hours incubation, the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded (1:15, 1:30, 1:60, 1:120 dilutions) into fresh Petri dishes (90 mm) with fresh medium with zeocin added to a concentration of 100 μg/ml and the cells were cultured further. When colonies became visible, individual clones were isolated by scraping and transferred to 24-well plates in medium with Zeocin® antibiotic. When grown to ˜70% confluence, cells were transferred to six-well plates. Stable colonies were expanded for two weeks in six-well plates before the d2EGFP signal was determined on a XL-MCL Beckman Coulter flow cytometer. The mean of the d2EGFP signal was taken as measure for the level of d2EGFP expression. Colonies were measured for a second time after two weeks. Thereafter colonies were further cultured in the absence of Zeocin® antibiotic.
-
FIG. 5 shows that transfection of the construct that contains STAR67 cloned upstream of the CMV promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, CMV Control. The average of the d2EGFP signal in the ten colonies transfected with the CMV Control plasmid was 37, when measured 30 days after transfection. 60 days after transfection the average of the d2EGFP signal in these ten colonies was reduced to 14, indicating that expression was not stable over time. In comparison, the average of the d2EGFP signal in the 10 colonies transfected with the STAR67-CMV plasmid was 101 when measured after 30 days and 45 measured 60 days after transfection. Hence 60 days after transfection, a STAR67-encompassing CMV construct conveyed a factor 3.2 higher CMV promoter driven expression level of the reporter protein in stably transfected clones. -
FIG. 6 shows that transfection of the construct that contains STAR67 cloned upstream of the EF1α-promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, EF1α Control. The average of the d2EGFP signal in the ten colonies transfected with the EF1α Control plasmid was five, when measured 30 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was six. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the EF1α-STAR67 plasmid was 25 when measured after 30 days and 20 measured 60 days after transfection. Hence, both after 30 and 60 days after transfection, a STAR67-encompassing EF1α construct conveyed afactor 4 higher EF1α promoter driven expression level of the reporter protein in stably transfected clones. -
FIG. 7 shows that transfection of the construct that contains STAR67 cloned upstream of the UB6 promoter resulted in a number of PER.C6® colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STAR67, UB6 Control. The average of the d2EGFP signal in the ten colonies transfected with the UB6 Control plasmid was four, when measured 30 days after transfection. Sixty days after transfection the average of the d2EGFP signal in these ten colonies was two. In comparison, the average of the d2EGFP signal in the ten colonies transfected with the UB6-STAR67 plasmid was 27 when measured after 30 days and 18 measured 60 days after transfection. Hence, both after 30 and 60 days after transfection, a STAR67-encompassing UB6 construct conveyed a factor seven- to nine-fold higher UB6 promoter driven expression level of the reporter protein in stably transfected clones. - Hence placing STAR67 upstream of the promoter resulted in significantly higher protein expression levels in comparison with STAR67-less constructs, also in PER.C6® cells. Hence STAR67 functions in different, unrelated cell types.
- We tested whether the presence of STAR67 adjacent of the SV40 promoter influences the expression level of this promoter, either alone or in combination with another STAR element, in this example STAR7. The constructs that were used for this purpose (see
FIG. 8 ), are: -
- SV40-d2EGFP-ices-Zeo (SV40 Control)
- STAR67-SV40-d2EGFP-ires-Zeo (SV40-STAR67)
- STAR7-SV40-d2EGFP-ires-Zeo-STAR7 (SV40-STAR7/7)
- STAR7-STAR67-SV40-d2EGFP-ires-Zeo-STAR7 (SV40-
STAR67 7/7)
- The SV40 promoter was amplified by PCR with pIRES as template using primers TTGGTTGGGGCGCGCCGCAGCACCATGGCCTGAAATAACCTCTGAAAGAGG (SEQ ID NO:81) and TTGGTTGGGAGCTCAAGCTTTTTGCAAAAGCCTAGGCCTCCAAA AAAGCCTCCTC (SEQ ID NO:82). The PCR fragment was cloned in the AscI and SacI sites of CMV-d2EGFP-IRES-Zeo, from which the CMV promoter was removed.
- CHO cells were transfected, colonies were isolated and propagated and analyzed as in Example 2.
-
FIG. 8 shows that transfection of the construct that either contains STAR67 cloned upstream of the SV40 promoter (SV40-STAR67) orSTAR 7 cloned to flank the entire construct (SV40-STAR 7/7) did not result in CHO colonies that express significantly higher levels of d2EGFP protein, as compared to the “empty” control without STARs (SV40 Control). The average of the d2EGFP signal in the 18 colonies transfected with the SV40 Control plasmid was 86, when measured 40 days after transfection. In comparison, the average of the d2EGFP signal in the 18 colonies transfected with the SV40-STAR67 plasmid was 82 when measured after 40 days and the average of the d2EGFP signal in the 18 colonies transfected with the SV40-STAR 7/7 plasmid was 91 when measured after 40 days. Hence, no significant effect of these STAR elements on the SV40 promoter in CHO cells was observed. - It appears that the expression levels from the SV40 promoter in these cells are already quite high, and even much higher than those observed with the CMV promoter, which was considered to be a very strong promoter. This high background expression in the absence of a STAR element using the SV40 promoter in CHO cells, may explain why no significant effect of STAR67 alone, or STAR7 flanking the transgene, was observed.
- However, the average of the d2EGFP signal in the 18 colonies transfected with the SV40-
STAR67 7/7 plasmid was 209 when measured after 40 days colonies, which is a factor 2.4 higher than the average of the 18 control colonies (86). Hence, when the STAR67 element is used in combination with another STAR element, this results in a number of stably transfected CHO colonies that show significantly higher d2EGFP expression levels. - Therefore in this novel configuration (5′-STAR sequence A-STAR sequence C-promoter-nucleic acid encoding a protein of interest-STAR sequence B-3′, wherein in the present example STAR sequences A and B are
STAR 7 and STAR sequence C is STAR67) STAR elements appear to function even better than in hitherto disclosed configurations. - We have done experiments in which the flanking STAR7 elements were replaced by flanking STAR6 elements or by flanking STAR4 elements, in combination with STAR67 upstream of the SV40 promoter (SV40-
STAR67 6/6 and SV40-STAR67 4/4, respectively, using the same nomenclature as above), and observed improved expression also with these combinations. This proves that the flanking STAR7 elements can indeed be exchanged for other STAR sequences, and still the improvement of the novel configuration of the expression cassette with the STAR sequences is observed. - In Example 4, we showed that the combination of STAR67 and STAR7 enhanced the expression levels of d2EGFP protein in CHO cells. Here we tested whether the combination of STAR67 and STAR7 could be used for the production of an antibody. We chose an antibody against the EpCAM molecule (Huls et al., 1999) as test protein and used the UB6 promoter.
- The heavy chain cDNA (HC-EpCAM) was cloned in a construct encompassing the UB6 promoter. The HC-EpCAM was coupled to the Zeocin® antibiotic resistance gene by an IRES sequence. The light chain cDNA (LC-EpCAM) was also cloned in a construct encompassing the UB6 promoter. The LC-EpCAM was coupled to the puromycin resistance gene by an IRES sequence. Together these two plasmids represent the UB6 (HC+LC) Control (
FIG. 9 ). - To test the effects of STAR67 and STAR7, STAR67 was cloned in both HC+LC constructs, upstream of the UB6 promoters. STAR7 was cloned to flank the entire cassettes, both at the 5′ and 3′ end (
FIG. 9 ). These two plasmids represent STAR7-STAR67-UB6 (HC+LC) STAR7. - The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was transfected and cultured as in Example 2, using Zeocin® antibiotic (100 μg/ml) and puromycin (2.5 μg/ml) for selection. One day after transfection, Zeocin® antibiotic was added to the culture medium. When first colonies became visible (approximately seven days after addition of Zeocin® antibiotic) the culture medium was removed and replaced with culture medium containing puromycin. After approximately seven days, colonies were isolated and transferred to 24-well plates, in culture medium containing Zeocin® antibiotic only.
-
FIG. 9 shows that that transfection of antibody constructs that contain STAR67 cloned upstream of the UB6 promoter and two STAR7 elements cloned to flank the entire cassettes resulted in a number of CHO colonies that express significantly higher levels of EpCAM antibody (measured by ELISA using an anti-human IgG antibody) as compared to the “empty” control without STAR67 and STAR7, UB6 (HC+LC) Control. The average of the EpCAM production in the 18 colonies transfected with the UB6 (HC+LC) Control plasmid was 2.7 pg/cell/day, when measured 25 days after transfection. Selection agents Zeocin® antibiotic and puromycin were removed after 25 days. Forty-five days after transfection, the average of the EpCAM production in these 18 colonies was 2.7 pg/cell/day. In comparison, the average of the EpCAM production signal in the 18 colonies transfected with the STAR7-STAR67-UB6 (HC+LC)-STAR7 plasmid was 6.7 when measured after 25 days and 7.7 pg/cell/day, when measured 45 days after transfection. Hence, both after 30 and 45 days after transfection, a STAR67/STAR7-encompassing UB6 construct conveyed a factor 2.5 to 2.9 fold higher UB6 promoter driven EpCAM expression level in stably transfected CHO clones. - Hence placing STAR67 upstream of the promoter and two STAR7 elements to flank the cassettes resulted in significantly higher EpCAM antibody expression levels in CHO, in comparison with STAR67/STAR7-less constructs.
- All hitherto known STAR elements that were tested for that property, including STAR6 and STAR7, are enhancer blockers (WO 03/004704, Kwaks et al., 2003). Enhancer blocker activity is tested by placing a STAR element between a strong enhancer and a promoter. Here we tested whether also STAR67 is an enhancer blocker.
- The d2EGFP gene was PCR-amplified using primers TTGGTTGGTCATGAATGGTGAGCAAGGGCGAGGAGCTGTTC (SEQ ID NO:75) and ATTCTCTAGACTACACATTGATCCTAGCAGAAGCAC (SEQ ID NO:76) and cloned into plasmid pGL3-promoter (Promega) using the NcoI and XbaI restriction sites to replace the Luciferase gene to create plasmid pGL3-promoter-GFP. A linker (created by annealing oligos CGATATCTTGGAGATCTACTAGTGGCGCG CCTTGGGCTAGCT (SEQ ID NO:77) and GATCAGCTAGCCCAAGGCGCGCCACTAGTAGATCTCC AAGATATCGAGCT (SEQ ID NO:78), was cloned in the SacI and BglII sites to create multiple cloning sites. The original BglII site was destroyed upon ligation of the linker, creating a new unique BglII site within the linker DNA. The SV40-enhancer was cut from plasmid pGL3-basic (Promega) using BsaBI and BamHI and cloned into the pGL3-linker-promoter-GFP using the EcoRV and BglII sites creating plasmid pGL3-enhancer-promoter-GFP. The STAR40 element was placed upstream of the SV40 enhancer using KpnI and SacI sites to prevent action of the enhancer on upstream sequences. Finally,
6, 7 and 67 were placed in between the SV40 enhancer and the SV40 minimal promoter using the SpeI and AscI restriction sites.STAR elements - The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) is transfected as in Example 2. One day after the transfection, d2EGFP levels are measured on an Epics XL flow cytometer (Beckman Coulter)
-
FIG. 10 shows that STAR67 is not an enhancer blocker, whereas STAR6 and STAR7 act enhancer blockers in the same assay. STAR6, STAR7 and STAR67 were cloned between the SV40 enhancer and a minimal SV40-promoter upstream of the d2EGFP gene. When no STAR element was cloned between the enhancer and the promoter strong transcriptional activation occurred (arbitrarily set at 100%). When STAR6 or STAR7 was placed between the enhancer and the promoter, transcription dropped to background levels, indicating that STAR6 and STAR7 are potent enhancer blockers. In contrast, when STAR67 was cloned between the enhancer and the promoter relative transcription levels were still 80% of the control, indicating that STAR67 is not a good enhancer blocker, this in contrast with STAR6 andSTAR 7, as well as other STAR elements, as previously described (WO 03/004704, Kwaks et al., 2003). - In Example 5, we showed that the combination of STAR67 and STAR7 enhanced the expression levels of EpCAM antibody in CHO cells, in the context of two distinct plasmids, which contained the heavy and light chains. In this example we tested whether STAR67 could be used for the production of EpCAM antibody when both heavy and light chains are placed on one plasmid. We used simultaneous selection for each selectable marker.
- The heavy chain cDNA (HC-EpCAM) is under the control of the UB6 promoter and coupled to the Zeocin resistance gene by an IRES sequence. The light chain cDNA (LC-EpCAM) is under control of the CMV promoter and coupled to the puromycin resistance gene by an IRES sequence. Basically these are the constructs used in Example 5. These two expression cassettes were placed on one plasmid, in such a manner that transcription of the two expression units had opposite directions. In the control plasmid the UB6 and CMV promoters were separated by a stuffer of 500 bp (EpCAM Control) (
FIG. 10 ). In another plasmid STAR67 was placed between the UB6 and CMV promoter (EpCAM STAR67) (FIG. 10 ). - The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was transfected and cultured as in Example 2, using Zeocin® antibiotic (100 μg/ml) and puromycin (2.5 μg/ml) for selection. In Example 5, consecutive selection for both selection markers was used. In contrast, here both selection agents were present in the culture medium simultaneously. One day after transfection, Zeocin® antibiotic and puromycin were added to the culture medium. The selection medium was present until colonies were isolated (approximately 14 days after transfection). After colonies were isolated and transferred to 24-well plates, the cells were cultured in the presence of Zeocin® antibiotic and puromycin.
-
FIG. 11 shows that that transfection of the antibody construct that contains STAR67 cloned between the UB6 and CMV promoters resulted in a number of CHO colonies that express EpCAM antibody (measured by ELISA using an anti-human IgG antibody). The average of the EpCAM production in the 19 colonies transfected with the EpCAM STAR67 plasmid was 9.8 pg/cell/day, when measured 25 days after transfection. - In contrast, surprisingly no colonies survived of the transfection with the EpCAM Control plasmid. When selection was performed with either Zeocin® antibiotic or puromycin alone, EpCAM Control colonies survived. However, when the selection pressure was increased by placing selection pressure on both the heavy and light chain, these conditions allowed only colonies to survive that have a STAR67 present in the transfected plasmid.
- The results also show that incorporation of STAR67 has a beneficial effect on two promoters, the UB6 and CMV promoters, that are placed upstream and downstream of one STAR67 element. This indicates that STAR67 may operate in a bi-directional fashion.
- The difference between the EpCAM control and EpCAM STAR67 plasmid is black-white in the sense that only transfection of EpCAM STAR67 results in the establishment of colonies, when the selection pressure is high. This opens an opportunity to use this plasmid configuration for identifying the region in STAR67 that is responsible for mediating this effect. Smaller, overlapping portions of STAR67 are placed between the UB6 and CMV promoters, driving the EpCAM molecule. When a portion of STAR67 is functional, colonies will survive when both Zeocin® antibiotic and puromycin are simultaneously used as selection agent. When a portion of STAR67 is not functional, no colonies will survive under identical selection conditions.
- Hence placing STAR67 upstream of the promoter resulted in significantly higher EpCAM antibody expression levels in CHO, in comparison with STAR-less constructs.
- The basic idea behind the development of a novel selection system is to place the gene encoding the resistance gene upstream of a gene of interest, and one promoter drives the expression of this bicistronic mRNA. The translation of the bicistronic mRNA is such that only in a small percentage of translation events the resistance gene will be translated into protein and that most of the time the downstream gene of interest will be translated into protein. Hence the translation efficiency of the upstream resistance gene must be severely hampered in comparison to the translation efficiency of the downstream gene of interest. To achieve this, three steps can be taken hereof:
- 1) within the resistance gene on the mRNA, the searching ribosome preferably should not meet another AUG, since any downstream AUG may serve as translation start codon, resulting in a lower translation efficiency of the second, downstream gene of interest. Hence, preferably any AUG in the resistance gene mRNA will have to be replaced. In case this AUG is a functional codon that encodes a methionine, this amino acid will have to be replaced by a different amino acid, for instance by a leucine (
FIGS. 12A and B); - 2) the start codon of the resistance gene must have a bad context (be part of a non-optimal translation start sequence); i.e., the ribosomes must start translation at this start codon only in a limited number of events, and hence in most events continue to search for a better, more optimal start codon (
FIG. 12C-E ). Three different stringencies can be distinguished: a) the normal ATG start codon, but placed in a bad context (TTTATGT) (called ATGmut) (FIG. 12C ), b) preferably when placed in an optimal context, GTG can serve as start codon (ACCGTGG) (FIG. 12D ) and c) preferably when placed in an optimal context, TTG can serve as start codon (ACCTTGG) (FIG. 12E ). The most stringent translation condition is the TTG codon, followed by the GTG codon (FIG. 12 ). The Zeo mRNA with a TTG as start codon is expected to produce the least Zeocin® antibiotic resistance protein and will hence convey the lowest functional Zeocin® antibiotic resistance to cells (FIGS. 12 , 13). - 3) preferably, the normal start codon (ATG) of the downstream gene of interest should have an optimal translation context (e.g., ACCATGG) (
FIG. 13A-D ). This warrants that, after 1 and 2 have been taken, in most events the start codon of the gene of interest will function as start codon of the bicistronic mRNA.steps - In Example 8,
step 1 is performed, that is, in the Zeocin® antibiotic resistance gene one existing internal methionine is replaced by another amino acid (FIG. 12B-E ). It is important that after such a change the Zeo protein still confers Zeocin® antibiotic resistance to the transfected cells. Since it is not known beforehand which amino acid will fulfill this criterion, three different amino acids have been tried: leucine, threonine and valine. The different constructs with distinct amino acids have than been tested for their ability to still confer Zeocin® antibiotic resistance to the transfected cells. - The original Zeo open reading frame has the following sequence around the start codon: AAACCATGGCC (start codon in bold; SEQ ID NO:83). This is a start codon with an optimal translational context (
FIG. 12A ). First the optimal context of the start codon of the Zeo open reading frame was changed through amplification from plasmid pCMV-zeo [Invitrogen V50120], with primer pair ZEOforwardMUT (SEQ ID NO:84): GATCTCGCGATACAGGATTTATGTTGGCCAAGTTGACCAGTGCCGTTCCG and ZEO-WTreverse (WT=Wild type; SEQ ID NO:85): AGGCGAATTCAGTCCTGCTCCTCGGC, using pCMV-ZEO (Invitrogen: V50120) as a template. The amplified product was cut with NruI-EcoRI, and ligated into pcDNA3, resulting in pZEOATGmut. - The original Zeo open reading frame contains an in frame ATG, encoding methionine at amino acid position 94 (out of 124). This internal ATG, encoding the methionine at
position 94 was changed in such a way that the methionine was changed into leucine, threonine or valine respectively: - To replace the internal codon for methionine in the Zeo open reading frame with the codon for leucine (
FIG. 12B ), part of the Zeo open reading frame was amplified using primer pair ZEOforwardMUT (SEQ ID NO:84) and ZEO-LEUreverse (SEQ ID NO:86): AGGCCCCGCCCCCACGGCTGCTCGCCGATCTCGGTCAA GGCCGGC. The PCR product was cut with BamHI-BglI and ligated into pZEOATGmut. This resulted in pZEO(leu). To replace the internal codon for methionine in the Zeo open reading frame with the codon for threonine (not shown, but as inFIG. 12B ), part of the Zeo open reading frame was amplified using primer pair ZEOforwardMUT (SEQ ID NO:84) and ZEO-THRreverse (SEQ ID NO:87): AGGCCCCGCCCCCACGGCTGCTCGCCGATCTCGGTGGTGGCCGGC. The PCR product was cut with BamHI-BglI and ligated into pZEOATGmut. This resulted in pZEO(thr). To replace the internal codon for methionine in the Zeo open reading frame with the codon for valine (not shown, but as inFIG. 12B ) (GTG), part of the Zeo open reading frame was amplified using primer pair ZEOforwardMUT (SEQ ID NO:84) and ZEO-VALreverse (SEQ ID NO:88): AGGCCCCGCCCCCACGGCTGCTCGCCGATCTCGGTCCACGCCGG. The PCR product was cut with BamHI-BglI and ligated into pZEOATGmut. This resulted in pZEO(val). - The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO2. Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence. Lipofectamine® reagent was combined with plasmid DNA at a ratio of 6 microliters per microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine® reagent) and added to the cells. After overnight incubation, the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium containing Zeocin® antibiotic (100 μg/ml). When individual colonies became visible (approximately ten days after transfection), colonies were counted.
- Four plasmids were transfected to CHO-K1 cells, 1) pZEO(WT), 2) pZEO(leu), 3) pZEO(thr), and 4) pZEO(val). The cells were selected on 100 μg/ml Zeocin® antibiotic. Transfection of pZEO(leu) resulted in an equal number of Zeocin® antibiotic resistant colonies in comparison with the control pZEO (WT), pZEO(thr) and pZEO(val) gave less colonies, but the differences were not in the order of a magnitude. Hence it was concluded that changes of the internal methionine into leucine, threonine or valine all resulted in a Zeocin® antibiotic resistance protein that is still able to confer Zeocin® antibiotic resistance to the transfected cells. Rather arbitrarily, pZEO(leu) was chosen as starting point for creating different start codons on the Zeo open reading frame. Hence, in the examples below, the start, as well as internal methionines, are always replaced by leucine, for Zeocin® antibiotic, but also for other selectable marker genes, as will be clear from further examples.
- To create a bicistronic mRNA encompassing a mutated Zeocin® resistance mRNA with less translational efficiency, and the d2EGFP gene as downstream gene of interest, the start codon of the d2EGFP gene was first optimized (
step 3 in Example 8). After that, the different versions of the Zeocin® antibiotic resistance gene were created. The differences between these versions are that they have different stall codons, with distinct translational efficiency (step 2 in Example 9,FIG. 12C-E ). These different Zeocin® antibiotic resistance gene versions were cloned upstream of the modified d2EGFP gene (FIG. 13 ). - The d2EGFP reporter ORF was introduced into pcDNA3. The sequence around the start codon of this d2EGFP cDNA is GAATTCATGGG (start codon in bold; SEQ ID NO:89), which is not optimal. As a first step, d2EGFP was amplified from pd2EGFP (Clontech 6010-1) with primers d2EGFPforwardBamHI (SEQ ID NO:90): GATCGGATCCTATGAGGAATTCGCCACCATGGTGAGCAAGGGCGAGGAG and d2EGFPreverseNotI (SEQ ID NO:91): AAGGAAAAAAGCGGCCGCCTACACATTGATCCTAGCAGAAG. This product contains now a start codon with an optimal translational context (ACCATGG). This created pd2EGFP and subsequently, the Zeo open reading frame was ligated into pd2EGFP, resulting in pZEO-d2EGFP. It is pointed out here that the optimization of the translational start sequence of the gene of interest (here: EGFP as a model gene) is not essential but preferred in order to skew the translation initiation frequency towards the gene of interest still further.
- Now three classes of constructs were made: ATG as a start codon in the Zeo resistance gene, but in a bad context (TTTATGT) (not shown, but as in
FIG. 13B ) and followed by spacer sequence, instead of the optimal ATG (FIG. 13A ). The spacer sequence is placed downstream of the ATG sequence. In the Zeocin® antibiotic (and possibly in the blasticidin) RNA, a secondary structure is present, causing the ribosome to be temporarily delayed. Because of this, a poor start codon can in some cases be used by the ribosome, despite being a bad start codon or being in a non-optimal context for translation initiation. This causes the chance of translation to increase, and in case of the current invention therefore renders the stringency for selection lower. To decrease this effect, and hence to further decrease the translation initiation efficiency, a spacer sequence is introduced that does not contain a secondary structure (Kozak, 1990). Hence, the term “space” is introduced, and used in the plasmid and primer names to indicate the presence of such a spacer sequence. The spacer removes the “ribosome delaying sequence” from the neighborhood of the initiation codon, therewith causing the ribosome to start translating less frequently, and hence increasing the stringency of the selection hereof. The spacer introduces some extra amino acids in the coding sequence. This has been done in some cases for both Zeocin® antibiotic and for blasticidin, as will be apparent from the examples. The nomenclature of the plasmids and primers in general in the following is along these lines: the name of the selectable marker polypeptide is referred to by abbreviation (e.g., Zeo, Blas, etc.); the start codon is mentioned (e.g., ATG, GTG, TTG); when this start codon is placed in a non-optimal context for translation initiation, the addition “mut” is used (this is usually only done for ATG start codons, as combining a non-optimal context with a non-ATG start codon usually does not result in sufficient translation initiation to allow for selection); when a spacer sequence is used behind the start codon, the addition “space” is used (this is done usually for “ATGmut” start codons for Zeo or Bias selectable markers). The Zeo open reading frame was amplified with primer pair ZEOforwardBamHI-ATGmut/space (SEQ ID NO:93): GATCGGATCCTTGGTTTATGTCGATCCAAAG ACTGCCAAATCTAGATCCGAGATTTTCAGGAGCTAAGGAAGCTAAAGCCAAGTTGACCAGTGAAGTTC (wherein the sequence following the underlined sequence comprises the spacer sequence), and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGF, cut with EcoRI-BamHI, creating pZEO-ATGmut/space-d2EGFP. - 2) GTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 13C ). The Zeo open reading frame was amplified with primer pair ZEOforwardBamHI-GTG (SEQ ID NO:94): GATCGGATCCACCGTGGCCAAGTTGACCAGTGCCGTTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-GTG-d2EGFP. - 3) TTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 13D ). The Zeo open reading frame was amplified with primer pair ZEOforwardBamHI-TTG: GATCGGATCCACCTTGG CCAAGTTGACCAGTGCCGTTC (SEQ ID NO:95) and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-TTG-d2EGFP. - The Chinese Hamster Ovary cell line CHO-K1 (ATCC CCL-61) was cultured in HAMS-F12 medium+10% Fetal Calf Serum containing 2 mM glutamine, 100 U/ml penicillin, and 100 micrograms/ml streptomycin at 37° C./5% CO2. Cells were transfected with the plasmids using Lipofectamine® 2000 (Invitrogen) as described by the manufacturer. Briefly, cells were seeded to culture vessels and grown overnight to 70-90% confluence. Lipofectamine® reagent was combined with plasmid DNA at a ratio of 15 microliters per 3 microgram (e.g., for a 10 cm Petri dish, 20 micrograms DNA and 120 microliters Lipofectamine® reagent) and added after 30 minutes incubation at 25° C. to the cells. After overnight incubation the transfection mixture was replaced with fresh medium, and the transfected cells were incubated further. After overnight cultivation, cells were trypsinized and seeded into fresh culture vessels with fresh medium. After another overnight incubation, Zeocin® antibiotic was added to a concentration of 50 μg/ml and the cells were cultured further. After another three days the medium was replaced by fresh medium containing Zeocin® antibiotic (100 μg/ml) and cultured further. When individual colonies became visible (approximately ten days after transfection) medium was removed and replaced with fresh medium without Zeocin® antibiotic. Individual clones were isolated and transferred to 24-well plates in medium without Zeocin® antibiotic. One day after isolation of the colonies, Zeocin® antibiotic was added to the medium. Expression of the d2EGFP reporter gene was assessed approximately three weeks after transfection. d2EGFP expression levels in the colonies were measured after periods of two weeks.
- CHO-K1 cells were transfected with constructs that contain the ATGmut/space Zeo (
FIG. 13B ), GTG Zeo (FIG. 13C ) and TTG Zeo (FIG. 13D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene. These three constructs were without STAR elements (Control) or with 7 and 67 upstream of the CMV promoter andSTAR elements STAR 7 downstream from the d2EGFP gene (FIG. 14 ).FIG. 14 shows that both the control (without STAR elements) constructs with ATGmut/space Zeo (A) and GTG Zeo (B) gave colonies that expressed d2EGFP protein. The average d2EGFP expression level of 24 ATGmut/space Zeo colonies was 46 and of GTG Zeo colonies was 75. This higher average expression level in GTG Zeo colonies may reflect the higher stringency of GTG, in comparison with ATGmut/space (Example 8). Addition of 7 and 67 to the constructs resulted in colonies that had higher average d2EGFP expression levels. Transfection of the ATGmut/STAR elements space Zeo STAR 7/67/7 construct resulted in colonies with an average d2EGFP expression level of 118, which is a factor 2.6 higher than the average in the control cells (46). Addition of STAR elements to the GTG Zeo construct resulted in an average d2EGFP expression level of 99, which is a factor 1.3 higher than the average in the control cells (75). - Importantly, no colonies were established when the TTG Zeo construct was transfected. However, the construct with TTG Zeo, flanked with
7 and 67 resulted in the establishment of six colonies, with an average d2EGFP expression level of 576 (STARs FIG. 14C ). Thus the highest translation stringency, brought about by the TTG start codon (FIG. 12 ) yields to the highest d2EGFP expression levels, as predicted inFIG. 13 . The results also indicate that the stringency of the TTG Zeo alone (without STAR elements) is at least in some experiments too high for colonies to survive. However, in later independent experiments (see below), some colonies were found with this construct without STAR elements, indicating that the stringency of the selection system with the TTG start codon in the Zeocin® antibiotic selection marker not necessarily precludes the finding of colonies when no STAR elements are present, and that the number of colonies obtained may vary between experiments. - It is concluded that the use of STAR elements in combination with the stringent selection system hereof allows to readily identify high producers of the gene of interest.
- The results in Example 9 indicate that the TTG Zeo has extremely stringent translation efficiency, which might be to high to convey Zeocin® antibiotic resistance to the cells. The transfection was scaled up to test whether there would be some colonies that have such high expression levels that they survive. Scaling up the experiment could also address the question whether the high average of
TTG Zeo STAR 7/67/7 would become higher when more colonies were analyzed. - CHO-K1 cells were transfected with the constructs that have the TTG Zeo gene as selection marker, with and without
STAR elements 7 and 67 (FIG. 15 ). Transfections, selection, culturing etc were as in Example 9, except that six times more cells, DNA and Lipofectamine 2000 were used. Transfections and selection were done in Petri dishes. -
FIG. 15A shows that transfection with theTTG Zeo STAR 7/67/7 construct resulted in the generation of many colonies with an average d2EGFP signal of 560. This is as high as in Example 9, except that now 58 colonies were analyzed. When compared to a construct with the Zeocin® antibiotic resistance gene placed behind an IRES sequence (FIG. 15B ), the average d2EGFP expression level was 61, and when 7 and 67 were added to such a construct, the average d2EGFP expression level was 125, aSTAR elements factor 2 above the control (FIG. 15B ). The average of theTTG Zeo STAR 7/67/7 colonies was therefore a factor 9.2 higher than the STAR-less IRES-Zeo colonies and a factor 4.5 higher than the STAR7/67/7 IRES Zeo colonies. - An observation is that the form of the curve of all expressing colonies differs between the TTG Zeo STAR7/67/7 and IRES-
Zeo STAR 7/67/7. In the first case (TTG Zeo) the curve levels off, whereas in the second case (IRES-Zeo) the curve has a more “exponential” shape. The plateau in the TTG Zeo curve could indicate that the cells have reached a maximum d2EGFP expression level, above which the d2EGFP expression levels become toxic and the cells die. However, it later appeared that the high values were close to the maximum value that could be detected with the settings of the detector of the FACS analyzer. In later experiments, the settings of the FACS analyzer were changed to allow for detection of higher values, and indeed in some instances higher values than obtained here were measured in later independent experiments (see below). - Due to up-scaling of the transfections three colonies with the STAR-less TTG Zeo construct could be picked. The d2EGFP expression levels of these colonies were 475, 158 and 43. The last colony died soon after the first measurement. This result indicates that the TTG Zeo construct can convey Zeocin® antibiotic resistance, resulting in colonies that also can give high expression levels in some instances. Hence, the novel selection method hereof can be applied with expression cassettes that do not contain chromatin control elements, although it is clearly preferred to use expression cassettes comprising at least one such element, preferably a STAR element.
- The results indicate that STAR elements allow a more stringent selection system hereof, such as exemplified in this example, resulting in the picking of colonies that have a very high average protein expression level.
- There are four internal ATGs in the blasticidine resistance gene, none of which codes for a methionine (
FIG. 25A ). These ATGs have to be eliminated though (FIG. 25B ), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble blasticidine resistance protein. More importantly, these ATGs will prevent efficient translation of the gene of interest, as represented by d2EGFP in this example for purposes of illustration. To eliminate the internal ATGs, the blasticidine resistance protein open reading frame was first amplified with four primer pairs, generating 4 blasticidine resistance protein fragments. The primer pairs were: -
A) BSDBamHIforward (SEQ ID NO: 96): GATCGGATCCACCATGGCCAAGCCTTTGTCTCAAG BSD150reverse (SEQ ID NO: 97): GTAAAATGATATACGTTGACACCAG B) BSD150forward (SEQ ID NO: 98): CTGGTGTCAACGTATATCATTTTAC BSD250reverse (SEQ ID NO: 99): GCCCTGTTCTCGTTTCCGATCGCG C) BSD250forward (SEQ ID NO: 100): CGCGATCGGAAACGAGAACAGGGC BSD350reverse (SEQ ID NO: 101): GCCGTCGGCTGTCCGTCACTGTCC D) BSD350forward (SEQ ID NO: 102): GGACAGTGACGGACAGCCGACGGC BSD399reverse (SEQ ID NO: 103): GATCGAATTCTTAGCCCTCCCACACGTAACCAGAGGGC - Fragments A to D were isolated from an agarose gel and mixed together. Next, only primers BSDBamHIforward and BSD399reverse were used to create the full length blasticidine resistance protein cDNA, but with all internal ATGs replaced. The reconstituted blasticidine was then cut with EcoRI-BamHI, and cloned into pZEO-GTG-d2EGFP, cut with EcoRI-BamHI (which releases Zeo), resulting in pBSDmut-d2EGFP. The entire blasticidine resistance protein open reading frame was sequenced to verify that all ATGs were replaced.
- With this mutated gene encoding blasticidine resistance protein (Blas), three classes of constructs are made (
FIG. 25C-E ): - 1) ATG as a start codon, but in a bad context and followed by spacer sequence. The mutated blasticidine resistance protein open reading frame in pBSD-d2EGFP was amplified using primers BSDforwardBamHIAvrII-ATGmut/space (SEQ ID NO:104): GATCGGATCCTAGGTTGGTTTATGTCGATCC AAAGACTGCCAAATCTAGATCCGAGATTTTCAGGAGCTAAGGAAGCTAAAGCCAAGCCTTTGTCTCAA GAAG, and BSD399reverseEcoRIAvrII (SEQ ID NO:105): GATCGAATTCCCTAGGTTAGCCCTCCCACACG TAACCAGAGGGC, the PCR product is cut with BamHI-EcoRI, and ligated into pZEO-GTG-d2EGFP, cut with EcoRI-BamHI. This results in pBSD-ATGmut/space-d2EGFP.
- 2) GTG as a start codon instead of ATG. The mutated blasticidine resistance protein open reading frame in pBSD-d2EGFP was amplified using primers BSDforwardBamHIAvrII-GTG (SEQ ID NO:106): GATCGGATCCTAGGACCGTGGCCAAGCCTTTGTCTCAAGAAG and BSD399reverseEcoRIAvrII (SEQ ID NO:105), the PCR product was cut with BamHI-EcoRI, and ligated into pZEO-GTG-d2EGFP, cut with EcoRI-BamHI. This results in pBSD-GTG-d2EGFP.
- 3) TTG as a start codon instead of ATG. The mutated blasticidine open reading frame in pBSD-d2EGFP was amplified using primers BSDforwardBamHIAvrII-TTG (SEQ ID NO:107): GATCGGATCCTAGGACCTTGGCCAAGCCTTTGTCTCAAGAAG and BSD399reverseEcoRIAvrII (SEQ ID NO:105), the PCR product was cut with BamHI-EcoRI, and ligated into pZEO-GTG-d2EGFP, cut with EcoRI-BamHI. This results in pBSD-TTG-d2EGFP.
- CHO-K1 cells were transfected with constructs that contain the GTG Blas (
FIG. 16A ) and TTG Blas (FIG. 16B ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene. Selection took place in the presence of 20 μg/ml Blasticidine. The two constructs were without STAR elements (Control) or with 7 and 67 upstream of the CMV promoter and START downstream from the d2EGFP gene (STAR elements FIG. 16 ).FIG. 16 shows that both the control (without STAR elements) constructs with GTG Blas (A) and TTG Blas (B) gave colonies that expressed d2EGFP protein. The average d2EGFP signal of 24 GTG Blas colonies was 14.0 (FIG. 16A ) and of TTG Blas colonies was 81 (FIG. 16B ). This higher average expression level in TTG Bias colonies may reflect the higher stringency of TTG, in comparison with GTG (see also Example 9). However, only eight colonies survived under the more stringent TTG conditions. - Addition of
7 and 67 to the constructs resulted in colonies that had higher average d2EGFP expression levels. Transfection of theSTAR elements GTG Blas STAR 7/67/7 construct resulted in colonies with an average d2EGFP expression level of 97.2 (FIG. 16A ), which is a factor 6.9 higher than the average in the control cells (14.0). Addition of STAR elements to the TTG Blas construct resulted in an average d2EGFP signal of 234.2 (FIG. 16B ), which is a factor 2.9 higher than the average in the control cells (81). However, note again that only eight colonies survived the harsh selection conditions of TTG Bias, whereas 48 colonies survived withTTG Blas STAR 7/67/7. When only the five highest values are compared, the average of the five highest TTG Blas was 109.1 and the average of the five highestTTG Blas STAR 7/67/7 was 561.2, which is a factor 5.1 higher. - The results indicate that STAR elements allow a more stringent selection system, resulting in the picking of colonies that have a very high average protein expression level. They also show that this selection is not restricted to the Zeocin® antibiotic resistance protein alone, but that also other selection marker polypeptides, in this case the blasticidin resistance protein, can be used.
- Colonies described in Example 10 were further cultured under various conditions to assess the stability of d2EGFP expression over an extended time period.
- The
TTG Zeo STAR 7/67/7 containing colonies inFIG. 15A were cultured for an additional 70 days in the presence of 100 μg/ml Zeocin® antibiotic. As shown inFIG. 17 , the average d2EGFP signal rose from 560.2 after 35 days to 677.2 after 105 days. Except for some rare colonies, the colonies had a higher d2EGFP expression level. - When the level of Zeocin® antibiotic was lowered to 20 μg/ml Zeocin® antibiotic, there was still an increase in the average d2EGFP expression level, from 560.2 after 35 days to 604.5 after 105 days (
FIG. 18 ). - When no selection pressure was present at all due to removal of the Zeocin® antibiotic from the culture medium, approximately 50% of the colonies became mosaic, that is, within one colony non-d2EGFP expressing cells became apparent. This resulted in lowering of d2EGFP expression levels to less than 50% of the original levels. If the signal became less than 67% (decrease of at least one-third) from the original signal, the colony was considered to be unstable in respect to d2EGFP expression. Of the 57 original colonies, 27 colonies remained stable according to this criterion; the average d2EGFP signal of these colonies after 35 days (while still under selection pressure) was 425.6, whereas the average d2EGFP signal without selection pressure after 65 days was 290.0. When measured after 105 days, the average signal in the 27 colonies was 300.9. Hence, after an initial decrease, the expression levels in the 27 colonies remained stable according to this criterion (
FIG. 19 ). - Six of the colonies were subjected to one round of sub-cloning. Cells were sown in 96-well plates as such that each well contained approximately 0.3 cells. No Zeocin® antibiotic was present in the medium so that from the start the sub clones grew without selection pressure. Of each original colony, six sub clones were randomly isolated and grown in six-well plates till analysis. In
FIG. 23 , we compared the original values of the original clones, as already shown inFIG. 15A , with one of the sub clones. In one of the six clones (clone 25), no sub clone was present with d2EGFP signal in the range of the original clone. However, in five out of six cases at least one the sub clones had equal d2EGFP expression levels as the parent clone. These expression levels were determined after 50 days without selection pressure. We conclude that one round of sub cloning is sufficient to obtain a high number of colonies that remain stable for high expression in the absence of selection pressure. This has been confirmed in a similar experiment (not shown). - We compared the number of copies that integrated in the
TTG Zeo STAR 7/67/7 colonies. DNA was isolated when colonies were 105 days old under Zeocin® antibiotic selection pressure (seeFIG. 17 ). As shown inFIG. 24 , two populations could be distinguished. InFIG. 24 , the cut off was made at 20 copies and the R2 value is calculated and shown. Also the R2 value from data with higher than 20 copies is shown. In the range from 100 to 800 d2EGFP signal, there was a high degree of copy number dependency, as signified by a relatively high R2 of 0.5685 (FIG. 24 ). However, in the population of colonies that fluctuate around a d2EGFP signal of 800 a high variation in copy number was observed (FIG. 24 ), as signified with a low R2 of 0.0328. Together, the data show that in the novel selection system, in colonies that containTTG Zeo STAR 7/67/7 constructs there is copy number dependent d2EGFP expression up to ˜20 copies. Also, although copy number dependency is lost when >20 copies are present, still a substantial proportion of the colonies with high (>800) d2EGFP signal have no more than 30 copies (FIG. 24 ). This combination between high d2EGFP expression and a relatively low copy number (between 10 and 30) may be important for identifying colonies that remain relatively stable without selection pressure. It is an advantage to have clones with relatively low copy numbers (less than about 30, more preferably less than about 20) that give high expression levels, because such clones are believed to be less amenable to genetic instability. The present selection system allows one to generate such clones, including from CHO cells. - To test the selection system on the production of an antibody, the anti-EpCAM antibody (see also Example 5) was taken as an example.
- A plasmid was created on which both the heavy chain (HC) and light chain (LC) were placed, each in a separate transcription unit (
FIG. 20-22 ). Expression of both chains was driven by the CMV promoter. Upstream of the EpCAM heavy chain the Zeocin® antibiotic resistance gene was placed, either with the ATGmut/space (FIG. 20 ), GTG (FIG. 21 ) or TTG (FIG. 22 ) as start codon (see Example 9). Upstream of the EpCAM light chain, the blasticidin resistance gene was placed, either with the ATGmut/space (FIG. 20 ), GTG (FIG. 21 ) or TTG (FIG. 22 ) as start codon (see Example 11). Two types of constructs were made, one construct without STAR elements (Control) and one construct with a combination of 7 and 67 elements. The STAR elements were placed as follows: upstream of each CMV promoter (i.e., one for the transcription unit comprising HC and one for the transcription unit comprising LC)STAR STAR 67 was placed and the resulting construct was flanked with a 5′ and 3′STAR 7 element (FIGS. 20-22 ). All constructs were transfected to CHO-K1 cells and selected on 100 μg/ml Zeocin® antibiotic and 20 μg/ml Blasticidin (at the same time). After selection, independent colonies were isolated and propagated under continuous selection pressure (using 100 μg/ml Zeocin® antibiotic and 20 μg/ml blasticidin).FIG. 20 shows that theSTAR 7/67/7 combination had a beneficial effect on EpCAM production. The ATGmut/space Zeo and ATGmut/space Blas had no effect on the number of colonies that were formed with plasmids containing STAR elements or not. However, the average EpCAM expression levels of either 24 control versusSTAR 7/67/7 colonies ranged from 0.61 pg/cell/day in the control to 3.44 pg/cell/day in the START/67/7 construct (FIG. 20 ). This is a factor 5.6 increase. Since there were many colonies in the ATGmut/space control with 0 pg/cell/day, also the average EpCAM production in the highest five colonies was compared. In the control ATGmut/space this was 3.0 pg/cell/day, versus 7.8 pg/cell/day with the ATGmut/space STAR 7/67/7 construct, an increase of a factor 2.6. -
FIG. 21 also shows that theSTAR 7/67/7 combination had a beneficial effect on EpCAM production, using the GTG start codon for the markers. With the GTG Zeo andGTG Blas STAR 7/67/7 construct approximately 2 times more colonies were formed. Also, the average EpCAM expression levels of either 24 control versusSTAR 7/67/7 colonies ranged from 2.44 pg/cell/day in the control to 6.51 pg/cell/day in the STAR7/67/7 construct (FIG. 21 ). This is a factor 2.7 increase. Also the average EpCAM production in the highest five colonies was compared. In the control GTG this was 5.7 pg/cell/day, versus 13.0 pg/cell/day with theGTG STAR 7/67/7 construct, an increase of a factor 2.3. Also note that the average EpCAM production mediated by the GTG start codon for the selection markers was significantly higher than with the ATGmut/space start codon. -
FIG. 22 shows that with the TTG Zeo and TTG Blas control construct no colonies were formed, similar as in Example 9. With theSTAR 7/67/7 TTG construct colonies were formed. The average EpCAM expression levels of theSTAR 7/67/7 TTG colonies was 10.4 pg/cell/day (FIG. 22 ). This is again higher than with the ATGmut/space and GTG as start codon (seeFIGS. 20 , 21 for comparison). The average EpCAM production in the highest fiveTTG STAR 7/67/7 colonies was 22.5 pg/cell/day. - The results show that the selection system can also be applied to two simultaneously produced polypeptides, in this case two polypeptides of a multimeric protein, casu quo an antibody. The EpCAM production closely follows the results obtained with d2EGFP. The TTG as start codon is more stringent than the GTG start codon, which in turn is more stringent than the ATGmut/space (
FIGS. 12 and 13 ). Higher stringency results in a decreasing number of colonies, with no colonies in the case of the TTG control that has no STAR elements, and higher stringency of the selection marker is coupled to higher expression of the protein of interest. - Different versions of the Zeocin® antibiotic resistance gene with mutated start codons were described in Example 8. Besides the described GTG codons (Example 8,
FIG. 33A ), additional modified start codons with distinct translational efficiency are possible. These different Zeocin® antibiotic resistance gene versions were created (FIG. 33 ) and cloned upstream of the modified d2EGFP gene, as in Example 9. - Four additional GTG constructs were made:
- 1) GTG as a start codon in the Zeo resistance gene (
FIG. 33A ), but followed by a spacer sequence (FIG. 33B ). The mutspace-Zeo open reading frame was amplified with primer pair GTGspaceBamHIF (SEQ ID NO:122): GAATTCGGATCCACCGTGGCGATCCAAAGACTGCCAAATCTAG and (wherein the sequence following the underlined sequence comprises the spacer sequence), and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-GTGspace-d2EGFP. - 2) GTG as a start codon in the Zeo resistance gene, but in a bad context (TTTGTG) (
FIG. 33C ). The Zeo open reading frame was amplified with primer pair ZEOTTTGTGBamHIF (SEQ ID NO:123): GAATTCGGATCCTTTGTGGCCAAGTTGACCAGTGCCGTTCCG and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO(leu)-TTTGTG-d2EGFP. - 3) GTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 33A ), but with an additional mutation in the Zeo open reading frame at Pro9, which was replaced with threonine (Thr) (FIG. 33D ). The Thr9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardGTG-Thr9 (SEQ ID NO:124): AATTGGATCCACCGTGGCCAAGTTGACCAGTGCCGTTACCGTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-GTG-Thr9-d2EGFP. - 4) GTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 33A ), but with an additional mutation in the Zeo open reading frame at Pro9, with was replaced with Phenylalanine (Phe) (FIG. 33E ). The Phe9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForward GTG-Phe9 (SEQ ID NO:125): AATTGGATCCACCGTGGCCAAGTTGACCAGTGCCGTTTTCGTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-GTG-Phe9-d2EGFP. - Transfection, culturing and analysis of CHO-K1 cells was performed as in Example 8.
- CHO-K1 cells were transfected with constructs that contain the GTG Zeo (
FIG. 33A ), GTGspace Zeo (FIG. 33B ), TTT GTG Zeo (also called: GTGmut Zeo) (FIG. 33C ), GTG Thr9 Zeo(leu) (FIG. 33D ) and GTG Phe9 Zeo(leu) (FIG. 33D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene. These five constructs were without STAR elements (Control) or with 7 and 67 upstream of the CMV promoter andSTAR elements STAR 7 downstream from the d2EGFP gene (FIG. 33 ).FIG. 34 shows that of the control constructs without STAR elements only the GTG Zeo construct without STAR elements gave colonies that expressed d2EGFP protein. In contrast, all constructs containing STAR elements gave colonies that expressed d2EGFP protein. The mean d2EGFP fluorescence signal of 11 GTG Zeo Control colonies was 20.3, of 13 GTG Zeo colonies withSTARs 7/67/7 104.9, of 24GTG space Zeo 7/67/7 colonies 201.5, of 6TTT GTG Zeo 7/67/7 colonies 310.5, of 22GTG Thr9 Zeo 7/67/7 colonies 423, and of 16 GTG Phe9 Zeo colonies 550.2 (FIG. 34 ). - The higher stringencies of the novel GTG mutations correlate with higher mean fluorescence signals (
FIG. 34 ). TheTTT GTG Zeo 7/67/7, however, gave only two high expressing colonies and a few low expressing colonies. This may indicate that this mutation is at the brink of the stringency that these cells can bear with a fixed concentration of Zeocin® antibiotic added to the culture medium. - The Thr9 and Phe9 mutations do not influence the translation efficiency of the Zeo mutants. Instead they reduce the functionality of the Zeocin® antibiotic resistance protein, by preventing an optimal interaction between the two halves of the Zeocin® antibiotic resistance protein (Dumas et al., 1994). This implies that more of the protein has to be produced to achieve resistance against the Zeocin® antibiotic in the culture medium. As a consequence, the entire cassette has to be transcribed at a higher level, eventually resulting in a higher d2EGFP expression level.
- It is concluded that the use of the described translation efficiencies of the Zeocin® antibiotic resistance mRNA result in higher expression levels of the d2EGFP protein, this in combination with STAR elements.
- This example further demonstrates the possibility to provide for fine-tuning of the stringency of the selection system of the invention, to achieve optimal expression levels of a protein of interest. Clearly, the person skilled in the art will be capable of combining these and other possibilities within the concepts disclosed herein (e.g., mutate the Zeocin® antibiotic at
position 9 to other amino acids, or mutate it in other positions; use a GTG or other start codon in a non-optimal translation initiation context for Zeocin® antibiotic or other selection markers; or mutate other selection markers to reduce their functionality, for instance use a sequence coding for a neomycin resistance gene having a mutation atamino acid residue 182 or 261 or both, see, e.g., WO 01/32901), and the like, to provide for such fine-tuning, and by simply testing determine a suitable combination of features for the selection marker, leading to enhanced expression of the polypeptide of interest. - Different versions of the Zeocin® antibiotic resistance gene with mutated start codons were described in Example 8. Besides the described TTG codons (
FIG. 35A ) additional modified start codons with distinct translational efficiency are possible. These different Zeocin® antibiotic resistance gene versions were created and cloned upstream of the modified d2EGFP gene (FIG. 35 ). - Three additional TTG constructs were made:
- 1) TTG as a start codon in the Zeo resistance gene (
FIG. 35A ), but followed by a spacer sequence (FIG. 35B ). The Zeo open reading frame (with the spacer sequence) was amplified with primer pair TTGspaceBamHIF (SEQ ID NO:126): GAATTCGGATCCACCTTGGCGATCCAAAGACTGCCAAATCTAG and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-Bam HI, creating pZEO-TTGspace-d2EGFP. - 2) TTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 35A ), but with an additional mutation in the Zeo open reading frame at Pro9, with was replaced with threonine (Thr) (FIG. 35C ). The Thr9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardTTG-Thr9 (SEQ ID NO:127): AATTGGATCCACCTTGGCCAAGTTGACCAGTGCCGTTACCGTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-TTG-Thr9-d2EGFP. - 3) TTG as a start codon in the Zeo resistance gene, instead of ATG (
FIG. 35A ), but with an additional mutation in the Zeo open reading frame at Pro9, with was replaced with Phenylalanine (Phe) (FIG. 35D ). The Phe9 mutation was introduced by amplifying the Zeo open reading with primer pair ZEOForwardTTG-Phe9 (SEQ ID NO:128): AATTGGATCCACCTTGGCCAAGTTGACCAGTGCCGTTTTCGTGCTC and ZEOWTreverse (SEQ ID NO:85), the PCR product was cut with EcoRI-BamHI, and ligated into pd2EGFP, cut with EcoRI-BamHI, creating pZEO-TTG-Phe9-d2EGFP. - CHO-K1 cells were transfected with constructs that contain the TTG Zeo (
FIG. 35A ), TTGspace Zeo (FIG. 35B ), TTG Thr9 Zeo (FIG. 35C ) and TTG Phe9 Zeo (FIG. 35D ) genes as selection gene, all being cloned upstream of the d2EGFP reporter gene. These four constructs were without STAR elements (Control) or with 7 and 67 upstream of the CMV promoter andSTAR elements STAR 7 downstream from the d2EGFP gene (FIG. 35 ).FIG. 36 shows that of the control constructs without STAR elements only the TTG Zeo construct without STAR elements gave colonies that expressed d2EGFP protein. In contrast, all constructs containing STAR elements gave colonies that expressed d2EGFP protein. The mean d2EGFP fluorescence signal of three TTG Zeo Control colonies was 26.8, of 24 TTG Zeo colonies withSTARs 7/67/7 426.8, of 24TTGspace Zeo 7/67/7 colonies 595.7, of twoTTG Thr9 Zeo 7/67/7 colonies 712.1, and of three TTG Phe9 Zeo colonies 677.1 (FIG. 36 ). - The higher stringencies of the novel TTG mutations correlate with higher mean fluorescence signals (
FIG. 36 ). TheTTG Thr9 Zeo 7/67/7 andTTG Phe9 Zeo 7/67/7 constructs, however, gave only two high expressing colonies each and a few low expressing colonies. This may indicate that these mutations are at the brink of the stringency that the cells can bear with a fixed concentration of Zeocin® antibiotic added to the culture medium. - It is concluded that the use of the described translation efficiencies of the Zeocin® antibiotic resistance mRNA result in higher expression levels of the d2EGFP protein, this in combination with STAR elements.
- There are three internal ATGs in the puromycin resistance gene, each of which codes for a methionine (
FIG. 28 ,FIG. 37A ). These ATGs have to be eliminated (FIG. 37B,C), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble puromycin resistance protein. More importantly, these ATGs will prevent efficient translation of the gene of interest, as represented by d2EGFP in this example for purposes of illustration. The methionines were changed into leucine, like in the Zeocin® antibiotic resistance protein (Example 8). However, instead of using the TTG codon for leucine (for instance in Zeocin® antibiotic in Example 8), now the CTG codon for leucine was chosen (in humans, for leucine the CTG codon is used more often than the TTG codon). To eliminate the internal ATGs, the puromycin resistance protein open reading frame was first amplified with 4 primer pairs, generating 4 puromycin resistance protein fragments. The primer pairs were: -
PURO BamHI F (SEQ ID NO: 129): GATCGGATCCATGGTTACCGAGTACAAGCCCACGGT, PURO300 R LEU (SEQ ID NO: 130): CAGCCGGGAACCGCTCAACTCGGCCAGGCGCGGGC; and PURO300FLEU (SEQ ID NO: 131): CGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGCTGGAAGGCCTC, PURO600RLEU (SEQ ID NO: 132): AAGCTTGAATTCAGGCACCGGGCTTGCGGGTCAGGCACCAGGTC. - This generates two PCR products, corresponding to the 5′ and 3′ part of the puromycin resistance gene. The two products were added together and amplified with PURO BamHI F (SEQ ID NO:129)-PURO600RLEU (SEQ ID NO:132). The resulting PCR product was cut with BamHI-EcoRI and ligated, creating pCMV-ATGPURO (leu). Sequencing of this clone verified that all three internal ATGs had been converted. The entire puromycin open reading frame was then amplified with PUROBamHI TTG1F (SEQ ID NO:133): GAATTCGGATCCACCTTGGTTACCGAGTACAAGCCCACGGTG and PURO600RLEU (SEQ ID NO:132). This primer introduces an extra codon (GTT) directly after the TTG start codon, because the at nucleotide +4 is introduced for an optimal context, and hence two more nucleotides are introduced to preserve the reading frame.
- CHO-K1 cells were transfected with the construct that contains the TTG Puro (
FIG. 38 ) gene as selection gene, cloned upstream of the d2EGFP reporter gene. Selection was under 10 μg/ml puromycin. The construct was without STAR elements (Control) or with 7 and 67 upstream of the CMV promoter andSTAR elements STAR 7 downstream from the d2EGFP gene (FIG. 38 ).FIG. 38 shows that the average d2EGFP fluorescence signal of 24 TTG Puro Control colonies was 37.9, of 24 TTG Puro colonies withSTARs 7/67/7 75.5. Moreover, when the average of the five highest values is taken, the d2EGFP fluorescence signal of TTG Puro Control colonies was 69.5, and of TTG Puro colonies withSTARs 7/67/7 186.1, an almost three-fold increase in d2EGFP fluorescence signal. This shows that the described, modified translation efficiency of the Puromycin resistance mRNA result in higher expression levels of the d2EGFP protein, this in combination with STAR elements. - This experiment demonstrates that the puromycin resistance gene can be mutated to remove the ATG sequences therefrom, while remaining functional. Moreover it is concluded that the selection method of the invention also works with yet another selection marker, puromycin.
- There are sixteen internal ATGs in the neomycin resistance gene, five of which code for a methionine in the neomycin open reading frame (
FIG. 31 ,FIG. 39A ). All these sixteen ATGs have to be eliminated (FIG. 39B,C), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble neomycin resistance protein, and this will decrease the translation from the downstream open reading frame coding for the polypeptide of interest in the transcription units of the invention. To eliminate the internal ATGs, the neomycin resistance protein open reading frame was entirely synthesized by a commercial provider (GeneArt, Germany), wherein all internal coding ATGs (for Met) where replaced by CTGs (coding for Leu), and non-coding ATGs were replaced such that a degenerated codon was used and hence no mutations in the protein sequence resulted; the synthesized sequence of the neomycin is given in SEQ ID NO:134. In order to replace the ATG start codon with GTG (FIG. 39B ) or TTGFIG. 39C ), the synthesized neomycin gene was amplified with primer pairs NEO-F-HindIII (SEQ ID NO:136): GATCAAGCTTTTGGATCGGCCATTGAAACAAGACGGATTG and NEO EcoRI 800R (SEQ ID NO:137): AAGCTTGAATTCTCAGAAGAACTCGTCAAGAAGGCG. - E. coli were used to test the functionality of the neomycin resistance protein from which all ATGs were removed. E. coli were transformed with the constructs that contain the GTG Neo (
FIG. 39B ) or TTG Neo (FIG. 39C ) gene as selection gene. Selection took place by growing the bacteria on kanamycin. Only a functional neomycin resistance gene can give resistance against kanamycin. Transformation with either modified Neo gene resulted in the formation of E. coli colonies, from which the plasmid containing the gene could be isolated. This shows that the described, modified translation efficiencies of the Neomycin resistance mRNAs, as well as the removal of all ATGs from the Neo open reading frame result in the production of functional neomycin resistance protein. - The mutated neomycin resistance genes are incorporated in a multicistronic transcription unit of the invention, and used for selection with G418 or neomycin in eukaryotic host cells.
- There are eight internal ATGs in the dhfr gene, six of which code for a methionine in the dhfr open reading frame (
FIG. 29 ,FIG. 40A ). All these ATGs have to be eliminated (FIGS. 40B , C), since they will serve as start codon when the ATG start codon (or the context thereof) has been modified, and this will result in peptides that do not resemble dhfr protein, and will decrease the translation from the downstream open reading frame coding for the polypeptide of interest in the transcription units of the invention. To eliminate the internal ATGs, the dhfr protein open reading frame was entirely synthesized (SEQ ID NO:138), as described above for neomycin. In order to replace the ATG start codon with GTG (FIG. 40B ) or TTG (FIG. 40C ), the synthesized DHFR gene was amplified with primers DHFR-F-HindIII (SEQ ID NO:140): GATCAAGCTTTTGTTCGACCATTGAACTGCATCGTC and DHFR-EcoRI-600-R (SEQ ID NO:141): AGCTTGAATTCTTAGTCTTTCTTCTCGTAGACTTC. - E. coli bacteria were used to test the functionality of the dhfr protein from which all ATGs were removed. E. coli was transformed with the constructs that contain the GTG dhfr (
FIG. 40B ) or TTG dhfr (FIG. 40C ) gene. Selection took place by growing the bacteria on trimethoprim (Sigma T7883-56). Only a functional dhfr gene can give resistance against trimethoprim. Transformation with either modified dhfr gene resulted in the formation of E. coli colonies, from which the plasmid containing the gene could be isolated. This shows that the described, modified translation efficiencies of the dhfr mRNAs, as well as the removal of all ATGs from the dhfr open reading frame result in the production of functional dhfr protein. - The mutated dhfr genes are incorporated in a multicistronic transcription unit of the invention, and used for selection with methotrexate in eukaryotic host cells.
- Various Zeocin® and blasticidin genes with mutated start codons, all cloned upstream of the d2EGFP gene were tested in the PER.C6® cell line.
- The GTG Zeocin® antibiotic and GTGspace Zeocin® antibiotic resistance gene modifications (see also Example 14;
FIG. 41 ) and the GTG blasticidin and TTG blasticidin resistance gene modifications (see also Example 11;FIG. 42 ), all cloned upstream of the d2EGFP gene were transfected to PER.C6® cells. As shown inFIG. 41 , transfection with both the GTG Zeocin® antibiotic and GTGspace Zeocin® antibiotic gene resulted in colonies that expressed d2EGFP. The average d2EGFP fluorescence signal of 20 GTG Zeo colonies was 63.8, while the average d2EGFP signal of 20 GTGspace Zeo colonies was 185, demonstrating that also in PER.C6® cells the GTGspace Zeo has a higher translation stringency than the GTG Zeo mRNA. - As shown in
FIG. 42 , transfection with both the GTG Blasticidin and TTG Blasticidin gene resulted in colonies that expressed d2EGFP. The average d2EGFP fluorescence signal of 20 GTG Blasticidin colonies was 71.4, while the average d2EGFP fluorescence signal of 20 TTG Blasticidin colonies was 135, demonstrating that also in PER.C6® cells the TTG Blasticidin has a higher translation stringency than the GTG Blasticidin mRNA. - This example demonstrates that the selection system of the invention can also be used in other cells than CHO cells.
- A TRAnscription Pause (TRAP) sequence is thought to, at least in part, prevent formation of antisense RNA or, to at least in part, prevent transcription to enter the protein expression unit (see WO 2004/055215). A TRAP sequence is functionally defined as a sequence which when placed into a transcription unit, results in a reduced level of transcription in the nucleic acid present on the 3′ side of the TRAP when compared to the level of transcription observed in the nucleic acid on the 5′ side of the TRAP, and non-limiting examples of TRAP sequences are transcription termination signals. In order to function to prevent or decrease transcription to enter the transcription unit, the TRAP is to be placed upstream of a promoter driving expression of the transcription unit and the TRAP should be in a 5′ to 3′ direction. In order to prevent at least in part formation of antisense RNA, the TRAP should be located downstream of the open reading frame in a transcription unit and present in a 3′ to 5′ direction (that is, in an opposite orientation as the normal orientation of a transcriptional termination sequence that is usually present behind the open reading frame in a transcription unit). A combination of a TRAP upstream of the promoter in a 5′ to 3′ orientation and a TRAP downstream of the open reading frame in a 3′ to 5′ orientation is preferred. Adding a TRAP sequence to a STAR element improves the effects of STAR elements on transgene expression (see WO 2004/055215). Here we test the effects of the TRAP sequence in the context of the TTG Zeo resistance gene.
- The TTG Zeocin®-d2EGFP cassette that was flanked with STAR7 elements (
FIG. 43 ) was modified by the addition of the SPA/pause TRAP sequence (see WO 2004/055215); SEQ ID NO:142), both upstream of the 5′ STAR7 (in 5′ to 3′ direction) and downstream of the 3′ STAR7 (in 3′ to 5′ direction) (FIG. 43 ). BothSTAR 7/7 and TRAP-STAR 7/7-TRAP containing vectors were transfected to CHO-K1. Stable colonies were isolated and the d2EGFP fluorescence intensities were measured. As shown inFIG. 43 the average d2EGFP fluorescence signal of 23TTG Zeo STAR 7/7 colonies was 455.1, while the average d2EGFP fluorescence signal of 23 TTG Zeo TRAP-STAR 7/7-TRAP colonies was 642.3. The average d2EGFP fluorescence signal in highest 5TTG Zeo STAR 7/7 colonies was 705.1, while the average d2EGFP fluorescence signal of 5 TTG Zeo TRAP-STAR 7/7-TRAP colonies was 784.7. - This result indicates that the addition of TRAPs does not enhance the d2EGFP fluorescence signal in the highest colonies, but that there is a significant raise in the number of high expressing colonies. Whereas only 5
TTG Zeo STAR 7/7 colonies had d2EGFP signal above 600, 17 TTG Zeo TRAP-STAR 7/7-TRAP colonies had a d2EGFP fluorescence signal above 600. - In the
experiment 3 μg DNA of each plasmid was transfected. However, whereas the transfection efficiency was similar, the total number of colonies with theTTG Zeo STAR 7/7 plasmid was 62, while the total number of colonies with the TTG Zeo TRAP-STAR 7/7-TRAP plasmid was 116, almost a doubling. - We conclude that addition of TRAP elements to the STAR containing plasmids with modified Zeocin® antibiotic resistance gene translation codons results in a significantly higher overall number of colonies and that more colonies are present with the highest expression levels.
- We analyzed the EpCAM antibody expression levels in relation to the number of integrated EpCAM DNA copies.
- The construct that was tested was TTG-Zeo-Light Chain (LC)-TTG-Blas-Heavy Chain (HC), both expression units being under the control of the CMV promoter (see
FIG. 44 ). This construct containedSTAR 7 and 67 (seeFIG. 44 ). Selection conditions were such that with 200 μg/ml Zeocin® antibiotic and 20 μg/ml Blasticidin in the culture medium no control colonies (no STARs) survived and only STAR 7/67/7 colonies survived. - DNA was isolated when colonies were 60 days under Zeocin® antibiotic and Blasticidin selection pressure (see
FIG. 44 ). The R2 value is calculated and shown. In the entire range from 5 to 40 pg/cell/day EpCAM there was a high degree of copy number dependency, as signified by a relatively high R2 of 0.5978 (FIG. 44 ). The data show that in the novel selection system, in colonies that contain TTG Zeo-TTGBlas EpCAM STAR 7/67/7 constructs there is copy number dependent EpCAM expression. - We analyzed EpCAM antibody expression levels after incubation of clones with methotrexate (MTX). The purpose of this experiment was to determine whether amplification of a STAR-containing construct would result in higher EpCAM expression. MTX acts through inhibition of the dhfr gene product. While some CHO strains that are dhfr-deficient have been described, CHO-K1 is dhfr+. Therefore relatively high concentrations of MTX in the culture medium have to be present to select for amplification by increased MTX concentrations in CHO-K1 cells.
- The construct that was tested was TTG-Zeo-Heavy Chain (HC)-TTG-Blas-Light Chain (LC), both expression units being under the control of the CMV promoter. Upstream of each CMV promoter STAR67 was positioned and START was used to flank the entire cassette (see also Example 13,
FIG. 22 for such a construct). This construct was further modified by placing an SV40-dhfr cassette (a mouse dhfr gene under control of an SV40 promoter) between the HC and LC cassettes, upstream of the second STAR67 (FIG. 45 ). CHO-K1 cells were transfected. Selection was done with 100 μg/ml Zeocin® antibiotic and 10 μg/ml Blasticidin in the culture medium. No control colonies (without STAR elements) survived and only colonies with constructs containing the STAR elements survived. Colonies were isolated and propagated before measuring EpCAM expression levels. Six colonies that produced between 20 and 35 pg/cell/day were transferred to medium containing 100 nM MTX. This concentration was raised to 500 nM, 1000 nM and finally to 2000 nM with two weeks periods in between each step. After two weeks on 2000 nM MTX, EpCAM concentrations were measured. As shown inFIG. 45 , four colonies showed enhanced EpCAM production.Colony 13, from 22 to 30;colony 14, from 28 to 42;colony 17, from 20 to 67 andcolony 19, from 37 to 67 pg/cell/day. 4 and 16 showed no enhanced EpCAM expression. We conclude that addition of methotrexate to the culture medium of CHO-K1 colonies created with the selection system of the invention can result in enhanced protein expression. Hence, STAR elements and the selection method of the invention can be combined with and are compatible with MTX-induced enhancement of protein expression levels.Colonies - We analyzed d2EGFP expression levels in the context of the TTG Zeo selection marker and different promoters. We compared the action of STAR elements in the context of the CMV enhancer/promoter, the SV40 enhancer/promoter and the CMV enhancer/β-actin promoter.
- In
FIG. 46 , we indicate the promoters we tested in the context of the TTG Zeo selection marker. The tested plasmids consisted of the indicated control constructs with three different promoters and STAR constructs which were flanked withSTAR 7 andSTAR 67 at the 5′ end andSTAR 7 at the 3′ end. The constructs were transfected to CHO-K1 cell and selection was performed with 200 μg/ml Zeocin® antibiotic in the culture medium. Up to 23 independent colonies were isolated and propagated before analysis of d2EGFP expression levels. As shown inFIG. 46 , incorporation of STAR elements in constructs with the CMV enhancer/promoter, the SV40 enhancer/promoter or the CMV enhancer/β-actin promoter all resulted in the formation of colonies with higher d2EGFP expression levels than with the corresponding control constructs. This shows that the selection system of the invention, in combination with STAR elements, operates well in the context of different promoters. Further analysis showed that the mean of CMV-driven d2EGFP values was significantly higher than the mean of SV40-driven d2EGFP values (p<0.05). In contrast, the mean of CMV-driven d2EGFP values did not significantly differ from CMV/β actin-driven d2EGFP values (p=0.2). - We analyzed d2EGFP expression levels in the context of the CMV promoter-TTG Zeo selection marker and 53 different STAR elements, to obtain more insight in which STAR elements give the best results in this context.
- We cloned 53 STAR elements up-and downstream of the CMV promoter-TTG Zeo-d2EGFP cassette. The following STAR elements were tested in such constructs: STAR2-12, 14, 15, 17-20, 26-34, 36, 37, 39, 40, 42-49, 51, 52, 54, 55, 57-62, 64, 65, 67. The constructs were transfected to CHO-K1 cells and selection was performed with 200 μg/ml Zeocin® antibiotic in the culture medium. Up to 24 independent colonies were isolated and propagated before analysis of d2EGFP expression levels. Incorporation of STAR elements in the constructs resulted in different degrees of enhanced d2EGFP expression, as compared to the control. Incorporation of
14, 18 and 55 in this experiment did not result in an increase of average d2EGFP expression over the control (no STAR element). Although some constructs (withSTAR elements 2, 3, 10, 42, 48 and 49) in this experiment gave rise to only a few colonies, all tested STAR elements except 14, 18 and 55 resulted in average d2EGFP expression levels higher than for the control. It should be noted that some STAR elements may act in a more cell type specific manner and that it is well possible thatSTAR elements 14, 18 and 55 work better in other cell types, with other promoters, other selection markers, or in different context or configuration than in the particular set of conditions tested here. Addition of ten STAR elements, namely STARSTAR 7, 9, 17, 27, 29, 43, 44, 45, 47 and 61, induced average d2EGFP expression levels higher than five times the average d2EGFP expression level of the control. We retransformed the control and seven constructs with STAR elements and repeated the experiment. The results are shown inelements FIG. 47 . Incorporation of STAR elements in the constructs resulted in different degrees of enhanced d2EGFP expression, as compared to the control (FIG. 47 ). The average d2EGFP expression level in colonies transfected with the control construct was 29. The averages from d2EGFP expression levels in colonies with the seven different STAR constructs ranged between 151 (STAR 67) and 297 (STAR 29). This is a factor of five- to ten-fold higher than the average in the control colonies. - We conclude that a) the vast majority of STAR elements have a positive effect on gene expression levels, b) there is variation in the degree of positive effects induced by the different STAR elements, and c) ten out of 53 tested STAR elements induce more than five-fold average d2EGFP expression levels, as compared to the control, and that STAR elements can induce a ten-fold higher average d2EGFP expression level, as compared to the control.
- DNA elements such as the HS4 hypersensitive site in the locus control region of the chicken β-globin locus (Chung et al., 1997), matrix attachment regions (MAR) (Stief et al., 1989) and a ubiquitous chromatin opening element (UCOE) (Williams et al., 2005) have been reported to have beneficial effects on gene expression when these DNA elements are incorporated in a vector. We combined these DNA elements with the selection system of the invention.
- The 1.25 kb HS4 element was cloned into the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP by a three way ligation step to obtain a construct with a tandem of 2 HS4 elements (Chung et al., 1997). This step was done both for the 5′ and 3′ of the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP. The 2959 bp long chicken lysozyme MAR (Stief et al., 1989) was cloned 5′ and 3′ of the cassette encompassing the CMV promoter, TTG Zeo and d2EGFP. The 2614 bp long UCOE (Williams et al., 2005) was a NotI-KpnI fragment, excised from a human BAC clone (RP11-93D5), corresponding to nucleotide 29449 to 32063. This fragment was cloned 5′ of the CMV promoter. The STAR construct contained STAR7 and
STAR67 5′ of the CMV promoter and STAR73′ of the cassette. These four constructs, as well as the control construct without flanking chromatin control DNA elements, were transfected to CHO-K1 cells. Selection was performed by 200 μg/ml Zeocin® antibiotic in the culture medium. Colonies were isolated, propagated and d2EGFP expression levels were measured. As shown inFIG. 48 constructs with all DNA elements resulted in the formation of d2EGFP expressing colonies. However, incorporation of 2×HS4 elements and the UCOE did not result in the formation of colonies that displayed higher d2EGFP expression levels, in comparison with the control colonies. In contrast, incorporation of the lysozyme MAR resulted in the formation of colonies that expressed d2EGFP significantly higher. The mean expression level induced by MAR containing constructs was four-fold higher than in the control colonies. Best results were obtained, however, by incorporating 7 and 67 in the construct. An almost ten-fold increase in the mean d2EGFP expression level was observed, as compared to the control colonies. We conclude that other chromatin control DNA elements such as MARs can be used in the context of the selection system of the invention. However, the best results were obtained when STAR elements were used as chromatin control elements.STAR -
- Boshart M., F. Weber, G. Jahn, K. Dorsch-Hasler, B. Fleckenstein, and W. Schaffner (1985). A very strong enhancer is located upstream of an immediate early gene of human cytomegalovirus. Cell 41:521-530.
- Chung J. H., M. Whiteley and G. Felsenfeld (1993). A 5′ element of the chicken beta-globin domain serves as an insulator in human erythroid cells and protects against position effect in Drosophila. Cell 74:505-514.
- Chung J. H., A.C. Bell, and G. Felsenfeld (1997). Characterization of the chicken beta-globin insulator. Proc. Natl. Acad. Sci. U.S.A. 94:575-580.
- Das G.C., S. K. Niyogi, and N. P. Salzman (1985). SV40 promoters and their regulation. Prog. Nucleic Acid Res. Mol. Biol. 32:217-236.
- Dumas P., M. Bergdoll, C. Cagnon and J. M. Masson (1994). Crystal structure and site-directed mutagenesis of a bleomycin resistance protein and their significance for drug sequestering. EMBO. J. 13:2483-2492.
- Gill D. R., S. E. Smyth, C. A. Goddard, I. A. Pringle, C. F. Higgins, W. H. Colledge and S.C. Hyde (2001). Increased persistence of lung gene expression using plasmids containing the ubiquitin C or elongation factor 1α promoter. Gene Therapy 8:1539-1546.
- Gossen M. and H. Bujard (1992). Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. Proc. Natl. Acad. Sci. U.S.A. 89:5547-5551.
- Graham F. O., J. Smiley, W. Russell and R. Nairn (1977). Characteristics of a human cell line transformed by DNA from
human adenovirus type 5. J. Gen. Virol. 36:59-72. - Huls G. A., I. A. F. M. Heijnen, M. E. Cuomo, J.C. Koningsberger, L. Wiegman, E. Boel, A.-R. van der Vuurst-de Vries, S. A. J. Loyson, W. Helfrich, G. P. van Berge Henegouwen, M. van Meijer, J. de Kruif, and T. Logtenberg (1999). A recombinant, fully human monoclonal antibody with antitumor activity constructed from phage-displayed antibody fragments. Nat. Biotechnol. 17:276-281.
- Jones D., N. Kroos, R. Anema, B. Van Montfort, A. Vooys, S. Van Der Kraats, E. Van Der Helm, S. Smits, J. Schouten, K. Brouwer, F. Lagerwerf, P. Van Berkel, D.-J. Opstelten, T. Logtenberg, and A. Bout (2003). High-level expression of recombinant IgG in the human cell line PER.C6. Biotechnol. Prog. 19:163-168.
- Kaufman R. J. (2000). Overview of vector design for mammalian gene expression. Mol. Biotechnol. 16:151-160.
- Kaufman R. J. and P. A. Sharp (1982). Construction of a modular dihydrofolate reductase cDNA gene: analysis of signals utilized for efficient expression. Mol. Cell. Biol. 2:1304-1319.
- Kellum R. and P. Schedl (1991). A position-effect assay for boundaries of higher order chromosomal domains. Cell 64:941-950.
- Kim S. J., N. S. Kim, C. J. Ryu, H. J. Hong, and G. M. Lee (1998). Characterization of chimeric antibody producing CHO cells in the course of dihydrofolate reductase-mediated gene amplification and their stability in the absence of selective pressure. Biotechnol. Bioeng. 58:73-84.
- Kozak M. (1986). Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell 44:283-292.
- Kozak M. (1987). An analysis of 5′-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 15:8125-8148.
- Kozak M. (1989). Context effects and inefficient initiation at non-AUG codons in eucaryotic cell-free translation systems. Mol. Cell. Biol. 9:5073-5080.
- Kozak M. (1990). Downstream secondary structure facilitates recognition of initiator codons by eukaryotic ribosomes. Proc. Natl. Acad. Sci. U.S.A. 87:8301-8305.
- Kozak M. (1997). Recognition of AUG and alternative initiator codons is augmented by G in position +4 but is not generally affected by the nucleotides in positions +5 and +6. EMBO. J. 16:2482-2492.
- Kozak M. (2002). Pushing the limits of the scanning mechanism for initiation of translation. Gene 299:1-34.
- Kwaks T. H., P. Barnett, W. Hemrika, T. Siersma, R. G. Sewalt, D. P. Satijn, J. F. Brons, R. van Blokland, P. Kwakman, A. L. Kruckeberg, A. Kelder, A. P. Otte (2003). Identification of anti-repressor elements that confer high and stable protein production in mammalian cells. Nat. Biotechnol. 21, 553-558. Erratum in: Nat. Biotechnol. 21, 822 (2003).
- Phi-Van L., J. P. Von Kreis, W. Ostertag and W. H. Stratling (1990). The
chicken lysozyme 5′ matrix attachment region increases transcription from a heterologous promoter in heterologous cells and dampens position effects on the expression of transfected genes. Mol. Cell. Biol. 10:2302-2307. - McBurney M. W., T. Mai, X. Yang, and K. Jardine (2002). Evidence for repeat-induced gene silencing in cultured Mammalian cells: inactivation of tandem repeats of transfected genes. Exp. Cell. Res. 274:1-8.
- Rees S., J. Coote, J. Stables, S. Goodson, S. Harris, and M. G. Lee (1996). Bicistronic vector for the creation of stable mammalian cell lines that predisposes all antibiotic-resistant cells to express recombinant protein. Biotechniques 20:102-104, 106, 108-110.
- Schorpp M., R. Jager, K. Schellander, J. Schenkel, E. F. Wagner, H. Weiher, and P. Angel (1996). The human ubiquitin C promoter directs high ubiquitous expression of transgenes in mice. Nucleic Acids Res. 24:1787-8.
- Stief A., D. M. Winter, W. H. Stratling, and A. E. Sippel (1989). A nuclear DNA attachment element mediates elevated and position-independent gene activity. Nature 341:343-345.
- Van der Vlag J., J. L. den Blaauwen, R. G. Sewalt, R. van Driel, and A. P. Otte (2000). Transcriptional repression mediated by polycomb group proteins and other chromatin-associated repressors is selectively blocked by insulators. J. Biol. Chem. 275:697-704.
- West A. G., M. Gaszner, and G. Felsenfeld (2002). Insulators: many functions, many mechanisms. Genes Dev. 16:271-288.
- Whitelaw E., H. Sutherland, M. Kearns, H. Morgan, L. Weaving, and D. Garrick (2001). Epigenetic effects on transgene expression. Methods Mol. Biol. 158:351-68.
- Williams S., T. Mustoe, T. Mulcahy, M. Griffiths, D. Simpson, M. Antoniou, A. Ivine, A. Mountain, and R. Crombie (2005). CpG-island fragments from the HNRPA2B1/CBX3 genomic locus reduce silencing and enhance transgene expression from the hCMV promoter/enhancer in mammalian cells. BMC Biotechnol. 5:17.
Claims (19)
1.-60. (canceled)
61. A method for producing a peptide of interest, the method comprising:
culturing a eukaryotic host cell comprising a DNA molecule comprising, in 5′-3′ direction, a promoter-a multicistronic transcription unit-a transcription termination sequence, wherein the multicistronic transcription unit encodes:
i) a selectable marker polypeptide functional in a eukaryotic host cell; and
ii) a peptide of interest;
wherein the peptide of interest has a translation initiation sequence separate from the selectable marker polypeptide's translation initiation sequence separate; and
wherein the coding sequence for the peptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit; and
wherein the sequence encoding the selectable marker polypeptide has no ATG sequence in a coding strand following the start codon of the selectable marker polypeptide up to the start codon of the peptide of interest; and
wherein the polynucleotide encoding the selectable marker polypeptide in the coding strand comprises a translation start sequence selected from the group consisting of a) an ATG start codon in a non-optimal context for translation initiation, comprising the sequence (C/T)(A/T/G)(A/T/G)ATG(A/T/C) wherein the start codon is italicized, b) a GTG start codon, c) a TTG start codon, d) a CTG start codon, e) an ATT start codon, and f) an ACG start codon; and
expressing the peptide of interest.
62. The method according to claim 61 , further comprising isolating the peptide of interest.
63. The method according to claim 61 , wherein the translation start sequence in the polynucleotide encoding the selectable marker polypeptide comprises a GTG start codon.
64. The method according to claim 61 , wherein the translation start sequence in the polynucleotide encoding the selectable marker polypeptide comprises a TTG start codon.
65. The method according to claim 61 , wherein any ATG, when present in frame and encoding methionine in the sequence encoding wild-type marker polypeptide, is mutated to encode an amino acid selected from the group consisting of valine, threonine, isoleucine, and leucine.
66. The method according to claim 61 , wherein the selectable marker protein provides resistance against lethal or growth-inhibitory effects of a selection agent.
67. The method according to claim 66 , wherein the selection agent is an antibiotic.
68. The method according to claim 66 , wherein the selection agent is selected from the group consisting of a bleocin, puromycin, blasticidin, hygromycin, neomycin, methotrexate, methionine sulphoximine and kanamycin.
69. The method according to claim 66 , wherein the selection agent is a bleocin.
70. The method according to claim 61 , wherein the DNA molecule further comprises an element selected from the group consisting of a matrix or scaffold attachment region (MAR/SAR), an insulator sequence, a universal chromatin opening element (UCOE), and an anti-repressor (STAR) sequence.
71. The method according to claim 70 , further comprising isolating the peptide of interest.
72. The method according to claim 70 , wherein the element is an anti-repressor sequence.
73. The method according to claim 70 , wherein the element is a matrix attachment region.
74. A method for producing an antibody, the method comprising:
culturing a eukaryotic cell comprising a first multicistronic transcription unit and a second multicistronic transcription unit each under control of an independent promoter, wherein the first multicistronic transcription unit encodes a first functional selectable marker polypeptide and heavy chain of the antibody, and wherein the second multicistronic transcription unit encodes a second functional selectable marker polypeptide and light chain of the antibody, and
harvesting antibody from the eukaryotic cell in a culture or from the culture supernatant or from both the eukaryotic cell and the culture supernatant;
wherein the coding sequences for the heavy and light chains have translation initiation sequences separate from the translation initiation sequences for the first and second selectable marker polypeptides respectively,
wherein the coding sequences for the heavy and light chains of the antibody are downstream from the coding sequences of the first and second functional selectable markers in the first and second multicistronic transcription units respectively;
wherein the sequences encoding the first and second selectable marker polypeptides have no ATG sequence in the coding strands following the start codons of the selectable marker polypeptides up to the start codons of the heavy and light chain, respectively; and
wherein the sequences encoding the first and second selectable marker polypeptide in the coding strand comprise a translation start sequence each independently selected from the group consisting of a) an ATG start codon in a non-optimal context for translation initiation, comprising the sequence (C/T)(A/T/G)(A/T/G)ATG(A/T/C) wherein the start codon is italicized, b) a GTG start codon, c) a TTG start codon, d) a CTG start codon, e) an ATT start codon, and f) an ACG start codon.
75. The method according to claim 74 , wherein any ATG, when present in frame and encoding methionine in the sequence encoding wild-type marker polypeptide, is mutated to encode an amino acid residue selected from the group consisting of valine, threonine, isoleucine, and leucine.
76. A method for generating a host cell expressing a peptide of interest, the method comprising:
introducing into a plurality of eukaryotic precursor cells a DNA molecule comprising a multicistronic transcription unit encoding:
i) a selectable marker polypeptide functional in a eukaryotic host cell; and
ii) a peptide of interest;
wherein the peptide of interest has a translation initiation sequence separate from that of the selectable marker polypeptide;
wherein the coding sequence for the peptide of interest is downstream from the coding sequence for the selectable marker in the multicistronic transcription unit;
wherein the sequence encoding the selectable marker polypeptide has no ATG sequence in a coding strand following the start codon of the selectable marker polypeptide up to the start codon of the peptide of interest; and
wherein the polynucleotide encoding the selectable marker polypeptide in the coding strand comprises a translation start sequence selected from the group consisting of a) an ATG start codon in a non-optimal context for translation initiation, comprising the sequence (C/T)(A/T/G)(A/T/G)ATG(A/T/C) wherein the start codon is italicized, b) a GTG start codon, c) a TTG start codon, d) a CTG start codon, e) an ATT start codon; and f) an ACG start codon;
culturing the generated cells under conditions selecting for expression of the selectable marker polypeptide; and
selecting at least one host cell producing the peptide of interest.
77. The method according to claim 76 , wherein any ATG, when present in frame and encoding methionine in the sequence encoding wild-type marker polypeptide, is mutated to encode an amino acid selected from the group consisting of valine, threonine, isoleucine, and leucine.
78. The method according to claim 76 , wherein the DNA molecule further comprises a promoter upstream of the multicistronic transcription unit and a transcription termination sequence downstream of the multicistronic transcription unit.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/165,374 US20140186890A1 (en) | 2004-11-08 | 2014-01-27 | Selection of host cells expressing protein at high levels |
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US62630104P | 2004-11-08 | 2004-11-08 | |
| EP04105593 | 2004-11-08 | ||
| EP04105593.1 | 2004-11-08 | ||
| US69661005P | 2005-07-05 | 2005-07-05 | |
| US11/269,525 US20060172382A1 (en) | 2004-11-08 | 2005-11-07 | Selection of host cells expressing protein at high levels |
| US12/807,133 US8771984B2 (en) | 2004-11-08 | 2010-08-26 | Selection of host cells expressing protein at high levels |
| US14/165,374 US20140186890A1 (en) | 2004-11-08 | 2014-01-27 | Selection of host cells expressing protein at high levels |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/807,133 Continuation US8771984B2 (en) | 2004-11-08 | 2010-08-26 | Selection of host cells expressing protein at high levels |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20140186890A1 true US20140186890A1 (en) | 2014-07-03 |
Family
ID=34929819
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/269,525 Abandoned US20060172382A1 (en) | 2004-11-08 | 2005-11-07 | Selection of host cells expressing protein at high levels |
| US12/807,133 Expired - Fee Related US8771984B2 (en) | 2004-11-08 | 2010-08-26 | Selection of host cells expressing protein at high levels |
| US14/165,374 Abandoned US20140186890A1 (en) | 2004-11-08 | 2014-01-27 | Selection of host cells expressing protein at high levels |
Family Applications Before (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US11/269,525 Abandoned US20060172382A1 (en) | 2004-11-08 | 2005-11-07 | Selection of host cells expressing protein at high levels |
| US12/807,133 Expired - Fee Related US8771984B2 (en) | 2004-11-08 | 2010-08-26 | Selection of host cells expressing protein at high levels |
Country Status (8)
| Country | Link |
|---|---|
| US (3) | US20060172382A1 (en) |
| CN (1) | CN101090968B (en) |
| AT (1) | ATE550432T1 (en) |
| DK (1) | DK1809750T3 (en) |
| ES (1) | ES2384391T3 (en) |
| NZ (1) | NZ553700A (en) |
| SI (1) | SI1809750T1 (en) |
| ZA (1) | ZA200702046B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020223160A1 (en) * | 2019-04-30 | 2020-11-05 | Chan Zuckerberg Biohub, Inc. | A novel chromatin-opening element for stable long term gene expression |
Families Citing this family (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1573058B1 (en) * | 2002-12-18 | 2011-09-14 | ChromaGenics B.V. | A method for improving protein production |
| WO2004056986A2 (en) * | 2002-12-20 | 2004-07-08 | Chromagenics B.V. | Means and methods for producing a protein through chromatin openers that are capable of rendering chromatin more accessible to transcription factors |
| US8999667B2 (en) * | 2004-11-08 | 2015-04-07 | Chromagenics B.V. | Selection of host cells expressing protein at high levels |
| US20100136616A1 (en) * | 2004-11-08 | 2010-06-03 | Chromagenics B.V. | Selection of Host Cells Expressing Protein at High Levels |
| US20060195935A1 (en) | 2004-11-08 | 2006-08-31 | Chromagenics B.V. | Selection of host cells expressing protein at high levels |
| PL1809750T3 (en) * | 2004-11-08 | 2012-08-31 | Chromagenics Bv | Selection of host cells expressing protein at high levels |
| SI1809750T1 (en) | 2004-11-08 | 2012-08-31 | Chromagenics Bv | Selection of host cells expressing protein at high levels |
| US20070298458A1 (en) * | 2004-11-18 | 2007-12-27 | National University Of Corporation Hiroshima University | Method And Kit For Expressing Protein Under Regulation Of The Expression From Repeated Sequence Formed By Gene Amplification, And Transformant |
| ATE511544T1 (en) * | 2006-02-21 | 2011-06-15 | Chromagenics Bv | SELECTION OF HOST CELLS WITH HIGH LEVEL PROTEIN EXPRESSION |
| ES2388003T3 (en) * | 2006-09-20 | 2012-10-05 | Genzyme Corporation | A system based on FACS and the indicator protein for high-performance development of therapeutic proteins |
| US8653249B2 (en) * | 2008-02-27 | 2014-02-18 | Fuso Pharmaceutical Industries, Ltd. | Expression vector for mass production of foreign gene-derived protein using animal cell and use thereof |
| CN102459607B (en) | 2009-06-15 | 2014-08-06 | 萨拉基尼克有限公司 | Novel stringent selectable markers |
| WO2011017606A1 (en) * | 2009-08-06 | 2011-02-10 | Cmc Icos Biologics, Inc. | Methods for improving recombinant protein expression |
| DK2611915T3 (en) | 2010-09-01 | 2015-06-22 | Cellagenics B V | Nucleic acid fragments from a ribosomal protein promoter to enhance gene expression |
| EP2739733B1 (en) * | 2011-08-03 | 2017-11-22 | Texas Biomedical Research Insitute | Nucleic acid compositions, methods and kits for rapid pairing of affinity agents |
| TWI698240B (en) | 2012-05-15 | 2020-07-11 | 澳大利亞商艾佛蘭屈澳洲私營有限公司 | Treatment of amd using aav sflt-1 |
| CN106414474B (en) | 2014-03-17 | 2021-01-15 | 阿德夫拉姆生物技术股份有限公司 | Compositions and methods for enhanced gene expression in cone cells |
| CN104531699B (en) * | 2014-10-09 | 2017-03-15 | 河南农业大学 | A kind of UCOE controlling element fragments of the pig for strengthening exogenous gene expression |
| BR112017018846A2 (en) | 2015-03-02 | 2018-07-31 | Adverum Biotechnologies, Inc. | compositions and methods for intravitreal delivery of polynucleotides to retinal cones. |
| AU2016335699A1 (en) | 2015-10-09 | 2018-05-24 | Genzyme Corporation | Early post-transfection isolation of cells (EPIC) for biologics production |
| GB2545763A (en) | 2015-12-23 | 2017-06-28 | Adverum Biotechnologies Inc | Mutant viral capsid libraries and related systems and methods |
| SG10202103545PA (en) | 2016-10-07 | 2021-05-28 | Genzyme Corp | Early post-transfection isolation of cells (epic) for biologics production |
| CN110484563B (en) * | 2019-07-25 | 2023-04-07 | 新乡医学院 | Mammal cell combined expression vector, expression system, preparation method and application |
| CN116615655A (en) | 2020-11-19 | 2023-08-18 | 建新公司 | Flow cytometry attenuation reporter expression (FLARE) multiple reporter systems and methods of use thereof |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1991001374A1 (en) * | 1989-07-24 | 1991-02-07 | Seragen, Inc. | Prevention of internal initiation |
| US20030199468A1 (en) * | 2001-07-04 | 2003-10-23 | Otte Arie Peter | DNA sequences comprising gene transcription regulatory qualities and methods for detecting and using such DNA sequences |
Family Cites Families (59)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4965196A (en) * | 1983-01-19 | 1990-10-23 | Genentech, Inc. | Polycistronic expression vector construction |
| FR2569723B1 (en) * | 1984-08-31 | 1986-09-19 | Centre Nat Rech Scient | APPLICATION OF ANTIBIOTICS FROM THE PHLEOMYCIN FAMILY AS A SELECTION AGENT IN THE FIELD OF GENETIC ENGINEERING |
| JP3438889B2 (en) | 1991-03-29 | 2003-08-18 | ジェネンテク,インコーポレイテッド | Method for selecting a recombinant host cell that expresses a desired protein at a high level |
| US6319707B1 (en) * | 1991-08-12 | 2001-11-20 | Fred Hutchinson Cancer Research Center Board Regents Of The University Of Washington | Cap-independent multicistronic retroviral vectors |
| JP2686025B2 (en) * | 1992-08-25 | 1997-12-08 | 理化学研究所 | Blasticidin S deaminase gene |
| DE4228458A1 (en) * | 1992-08-27 | 1994-06-01 | Beiersdorf Ag | Multicistronic expression units and their use |
| US5648267A (en) * | 1992-11-13 | 1997-07-15 | Idec Pharmaceuticals Corporation | Impaired dominant selectable marker sequence and intronic insertion strategies for enhancement of expression of gene product and expression vector systems comprising same |
| DE669986T1 (en) * | 1992-11-13 | 1996-10-10 | Idec Pharma Corp | COMPLETELY FUNCTIONAL CONSENSUS-KOZAK SEQUENCES FOR MAMMAL EXPRESSION. |
| WO1994023046A1 (en) | 1993-04-07 | 1994-10-13 | The Government Of The United States Of America As Represented By The Secretary, Department Of Health And Human Services | Dna sequence which acts as a chromatin insulator element to protect expressed genes from cis-acting regulatory sequences in mammalian cells |
| US5610053A (en) | 1993-04-07 | 1997-03-11 | The United States Of America As Represented By The Department Of Health And Human Services | DNA sequence which acts as a chromatin insulator element to protect expressed genes from cis-acting regulatory sequences in mammalian cells |
| US5733782A (en) * | 1993-10-25 | 1998-03-31 | Creative Biomolecules, Inc. | Methods and compositions for high protein production from non-native DNA |
| US5972605A (en) | 1994-07-07 | 1999-10-26 | Geron Corporation | Assays for regulators of mammalian telomerase expression |
| US5561053A (en) * | 1994-08-05 | 1996-10-01 | Genentech, Inc. | Method for selecting high-expressing host cells |
| WO1996012008A1 (en) | 1994-10-13 | 1996-04-25 | Merck & Co., Inc. | Synthesis of methylase-resistant genes |
| US5627033A (en) | 1995-03-28 | 1997-05-06 | Research & Diagnostics Systems, Inc. | Mammalian expression vectors |
| IL122614A0 (en) * | 1995-06-15 | 1998-08-16 | Introgene Bv | Packaging systems for human recombinant adenovirus to be used in gene therapy |
| US5773695A (en) | 1996-01-26 | 1998-06-30 | North Carolina State University | Plant nuclear scaffold attachment region and method for increasing gene expression in transgenic cells |
| US6107477A (en) * | 1996-09-26 | 2000-08-22 | Aurora Biosciences Corporation | Non-optimal Kozaks sequences |
| US5773779A (en) * | 1997-02-21 | 1998-06-30 | The Lincoln Electric Company | Method and system for welding railroad rails |
| JP2000509614A (en) | 1997-03-04 | 2000-08-02 | バクスター インターナショナル インコーポレイテッド | Adenovirus E1-complementation cell line |
| US5888809A (en) * | 1997-05-01 | 1999-03-30 | Icos Corporation | Hamster EF-1α transcriptional regulatory DNA |
| US6689606B2 (en) * | 1998-07-21 | 2004-02-10 | M.L. Laboratories Plc | Polynucleotide |
| EP1098986B1 (en) | 1998-07-21 | 2011-12-28 | Millipore Corporation | A polynucleotide comprising a ubiquitous chromatin opening element (ucoe) |
| EP1104490B8 (en) | 1998-08-14 | 2008-02-13 | Chromagenics B.V. | Method of detecting a dna sequence, a dna sequence, a method of making a dna construct and the use thereof |
| US6800457B2 (en) * | 1998-09-22 | 2004-10-05 | Bristol-Myers Squibb Company | Expression vectors containing hot spot for increased recombinant protein expression in transfected cells |
| US6521419B1 (en) | 1998-09-22 | 2003-02-18 | Kanakaraju Koduri | Expression vectors containing hot spot for increased recombinant protein expression in transfected cells |
| WO2000023606A1 (en) | 1998-10-22 | 2000-04-27 | Medical College Of Georgia Institute, Inc. | Long terminal repeat, enhancer, and insulator sequences for use in recombinant vectors |
| WO2001002553A2 (en) | 1999-06-30 | 2001-01-11 | The Government Of The United States Of America As Represented By The Secretary, Department Of Health And Human Services | Dna binding protein and sequence as insulators having specific enhancer blocking activity for regulation of gene expression |
| CA2383073A1 (en) * | 1999-08-25 | 2001-03-01 | Immunex Corporation | Compositions and methods for improved cell culture |
| WO2001032901A1 (en) | 1999-11-01 | 2001-05-10 | Chiron Corporation | Expression vectors, transfection systems, and method of use thereof |
| US6558948B1 (en) * | 1999-11-23 | 2003-05-06 | Stefan Kochanek | Permanent amniocytic cell line, its production and use for the production of gene transfer vectors |
| AU2001243142A1 (en) | 2000-02-03 | 2001-08-14 | Hyseq, Inc. | Novel nucleic acids and polypeptides |
| US6586205B1 (en) * | 2000-02-11 | 2003-07-01 | Millennium Pharmaceuticals, Inc. | 43239 a novel GPCR-like molecule and uses thereof |
| EP1263950A2 (en) | 2000-02-11 | 2002-12-11 | Millennium Pharmaceuticals, Inc. | Seven-transmembrane proteins/g-protein coupled receptors |
| AU9006701A (en) | 2000-09-20 | 2002-04-02 | Cobra Therapeutics Ltd | Polynucleotide |
| CA2435972C (en) | 2001-01-26 | 2011-09-13 | University Of Lausanne | Matrix attachment regions and methods for use thereof |
| US20020155540A1 (en) * | 2001-02-14 | 2002-10-24 | Malla Padidam | Method to reduce transcriptional interference between tandem genes |
| CA2439185A1 (en) * | 2001-02-23 | 2002-09-06 | Novartis Ag | Vector constructs |
| FR2821855B1 (en) * | 2001-03-09 | 2004-04-02 | Cayla | SYNTHETIC GENES AND BACTERIAL PLASMIDS WITHOUT GIC |
| TWI324181B (en) * | 2001-04-16 | 2010-05-01 | Martek Biosciences Corp | Product and process for transformation of thraustochytriales microorganisms |
| AU2002310321A1 (en) | 2001-06-04 | 2002-12-16 | Corixa Corporation | Compositions and methods for high-level, large-scale production of recombinant proteins |
| EP1273666A1 (en) | 2001-07-04 | 2003-01-08 | Chromagenics B.V. | Method of selecting a DNA sequence with transcription modulating activity using a vector comprising an element with a gene transcription repressing activity |
| ITMI20012110A1 (en) * | 2001-10-12 | 2003-04-12 | Keryos Spa | MULTI-CISTRONIC VECTORS USABLE IN GENE TRANSFER PROTOCOLS |
| US20040038304A1 (en) | 2002-03-28 | 2004-02-26 | Gala Design, Inc. | Antibody libraries |
| EP1513937B1 (en) * | 2002-06-14 | 2011-03-02 | ChromaGenics B.V. | A method for the stimultaneaous production of multiple proteins; vectors and cells for use therein |
| WO2003106674A2 (en) * | 2002-06-14 | 2003-12-24 | Chromagenics B.V. | Means and methods for regulating gene expression |
| US7344886B2 (en) * | 2002-11-29 | 2008-03-18 | Boehringer Ingelheim Pharma Gmbh & Co., Kg | Neomycin-phosphotransferase-genes and methods for the selection of recombinant cells producing high levels of a desired gene product |
| EP1573058B1 (en) * | 2002-12-18 | 2011-09-14 | ChromaGenics B.V. | A method for improving protein production |
| WO2004056986A2 (en) | 2002-12-20 | 2004-07-08 | Chromagenics B.V. | Means and methods for producing a protein through chromatin openers that are capable of rendering chromatin more accessible to transcription factors |
| CN1846000B (en) * | 2003-09-05 | 2012-10-03 | 金克克国际有限公司 | HPV CD8+ T cell epitope |
| ES2296201T3 (en) * | 2004-07-08 | 2008-04-16 | Chromagenics B.V. | NOVELTY SEQUENCE TO IMPROVE NUCLEIC ACID EXPRESSION. |
| US8999667B2 (en) * | 2004-11-08 | 2015-04-07 | Chromagenics B.V. | Selection of host cells expressing protein at high levels |
| US20100136616A1 (en) * | 2004-11-08 | 2010-06-03 | Chromagenics B.V. | Selection of Host Cells Expressing Protein at High Levels |
| US8039230B2 (en) * | 2004-11-08 | 2011-10-18 | Chromagenics B.V. | Selection of host cells expressing protein at high levels |
| PL1809750T3 (en) * | 2004-11-08 | 2012-08-31 | Chromagenics Bv | Selection of host cells expressing protein at high levels |
| SI1809750T1 (en) * | 2004-11-08 | 2012-08-31 | Chromagenics Bv | Selection of host cells expressing protein at high levels |
| US20060195935A1 (en) * | 2004-11-08 | 2006-08-31 | Chromagenics B.V. | Selection of host cells expressing protein at high levels |
| US7968700B2 (en) * | 2006-03-20 | 2011-06-28 | Chromagenics B.V. | Expression augmenting DNA fragments, use thereof, and methods for finding thereof |
| JP5501054B2 (en) | 2010-03-25 | 2014-05-21 | 株式会社トクヤマ | Method for producing 3,3-diaminoacrylic acid (1-diphenylmethylazetidin-3-yl) ester acetate |
-
2005
- 2005-11-07 SI SI200531538T patent/SI1809750T1/en unknown
- 2005-11-07 NZ NZ553700A patent/NZ553700A/en not_active IP Right Cessation
- 2005-11-07 US US11/269,525 patent/US20060172382A1/en not_active Abandoned
- 2005-11-07 DK DK05810935.6T patent/DK1809750T3/en active
- 2005-11-07 ES ES05810935T patent/ES2384391T3/en not_active Expired - Lifetime
- 2005-11-07 CN CN200580037896XA patent/CN101090968B/en not_active Expired - Fee Related
- 2005-11-07 AT AT05810935T patent/ATE550432T1/en active
-
2007
- 2007-03-08 ZA ZA200702046A patent/ZA200702046B/en unknown
-
2010
- 2010-08-26 US US12/807,133 patent/US8771984B2/en not_active Expired - Fee Related
-
2014
- 2014-01-27 US US14/165,374 patent/US20140186890A1/en not_active Abandoned
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1991001374A1 (en) * | 1989-07-24 | 1991-02-07 | Seragen, Inc. | Prevention of internal initiation |
| US20030199468A1 (en) * | 2001-07-04 | 2003-10-23 | Otte Arie Peter | DNA sequences comprising gene transcription regulatory qualities and methods for detecting and using such DNA sequences |
Non-Patent Citations (4)
| Title |
|---|
| Carroll et al, J. Virol. 67(3):1433-1440, 1993 * |
| Kim et al, Biotechnol. Bioeng. 58:73-84, 1997 * |
| Kim et al, J. Biol. Chem. 278(43):41691-41701, 2003 * |
| Moser et al, Biotechnol. Prog. 16:724-735, 2000 * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020223160A1 (en) * | 2019-04-30 | 2020-11-05 | Chan Zuckerberg Biohub, Inc. | A novel chromatin-opening element for stable long term gene expression |
| US20220193266A1 (en) * | 2019-04-30 | 2022-06-23 | Chan Zuckerberg Biohub, Inc. | A novel chromatin-opening element for stable long term gene expression |
Also Published As
| Publication number | Publication date |
|---|---|
| SI1809750T1 (en) | 2012-08-31 |
| US20110014655A1 (en) | 2011-01-20 |
| DK1809750T3 (en) | 2012-06-25 |
| US8771984B2 (en) | 2014-07-08 |
| ATE550432T1 (en) | 2012-04-15 |
| CN101090968B (en) | 2011-10-26 |
| US20060172382A1 (en) | 2006-08-03 |
| ZA200702046B (en) | 2008-09-25 |
| ES2384391T3 (en) | 2012-07-04 |
| NZ553700A (en) | 2010-09-30 |
| CN101090968A (en) | 2007-12-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8771984B2 (en) | Selection of host cells expressing protein at high levels | |
| EP1809750B1 (en) | Selection of host cells expressing protein at high levels | |
| US9228004B2 (en) | Selection of host cells expressing protein at high levels | |
| EP1987150B1 (en) | Selection of host cells expressing protein at high levels | |
| NO341862B1 (en) | New sequence for enhancing expression of nucleic acid (s) | |
| US7968700B2 (en) | Expression augmenting DNA fragments, use thereof, and methods for finding thereof | |
| EP1572994B1 (en) | Means and methods for producing a protein through chromatin openers that are capable of rendering chromatin more accessible to transcription factors | |
| US20100136616A1 (en) | Selection of Host Cells Expressing Protein at High Levels | |
| HK1101264B (en) | Selection of host cells expressing protein at high levels | |
| EP1763586B1 (en) | Novel sequence for improving expression of nucleic acid | |
| AU2007217431B2 (en) | Selection of host cells expressing protein at high levels | |
| HK1097004B (en) | Novel sequence for improving expression of nucleic acid |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: CHROMAGENICS B.V., NETHERLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:OTTE, ARIE P.;VAN BLOKLAND, HENRICUS J.M.;KWAKS, THEODORUS H.J.;AND OTHERS;REEL/FRAME:032067/0822 Effective date: 20051101 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |