US20080085535A1 - Stable Genomic Integration of Multiple Polynucleotide Copies - Google Patents
- Publication number
- US20080085535A1 (application US 11/576,896)
- Authority
- US
- United States
- Prior art keywords
- cell
- seq
- promoter
- orf
- operon
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 108010089934 carbohydrase Proteins 0.000 description 1
- 108020001778 catalytic domains Proteins 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002361 compost Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010005400 cutinase Proteins 0.000 description 1
- 230000001461 cytolytic effect Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 108010091371 endoglucanase 1 Proteins 0.000 description 1
- 108010092413 endoglucanase V Proteins 0.000 description 1
- 229940032049 enterococcus faecalis Drugs 0.000 description 1
- 238000001952 enzyme assay Methods 0.000 description 1
- 101150076810 erm gene Proteins 0.000 description 1
- 238000001704 evaporation Methods 0.000 description 1
- 230000008020 evaporation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 108010038658 exo-1,4-beta-D-xylosidase Proteins 0.000 description 1
- 108010093305 exopolygalacturonase Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 239000010437 gem Substances 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108010002430 hemicellulase Proteins 0.000 description 1
- 229940059442 hemicellulase Drugs 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 230000002366 lipolytic effect Effects 0.000 description 1
- 101150039489 lysZ gene Proteins 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 101150095344 niaD gene Proteins 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 101150017837 nprM gene Proteins 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 108090000021 oryzin Proteins 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- JTJMJGYZQZDUJJ-UHFFFAOYSA-N phencyclidine Chemical compound C1CCCCN1C1(C=2C=CC=CC=2)CCCCC1 JTJMJGYZQZDUJJ-UHFFFAOYSA-N 0.000 description 1
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 229940085127 phytase Drugs 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000026897 pro-virus excision Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 101150108007 prs gene Proteins 0.000 description 1
- 101150086435 prs1 gene Proteins 0.000 description 1
- 101150070305 prsA gene Proteins 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 238000010563 solid-state fermentation Methods 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 101150118377 tet gene Proteins 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 102000003601 transglutaminase Human genes 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/75—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1082—Preparation or screening gene libraries by chromosomal integration of polynucleotide sequences, HR-, site-specific-recombination, transposons, viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1086—Preparation or screening of expression libraries, e.g. reporter assays
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/30—Vector systems comprising sequences for excision in presence of a recombinase, e.g. loxP or FRT
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/50—Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
Definitions
- A large number of naturally occurring organisms have been found to produce useful polypeptide products, e.g., enzymes, the large-scale production of which is desirable for research and commercial purposes. Once such a product has been identified, efforts are made to develop production methods that give a high yield of the product.
- One widely used method, which is based on recombinant DNA techniques, is to clone a gene encoding the product, insert the gene into a suitable expression system permitting the expression of the product, and culture a suitable host cell comprising the expression system, either integrated in the chromosome or as an extrachromosomal entity, under conditions conducive to the expression of the product.
- EP 0 284 126 and EP 166 628 disclose methods for stably integrating one or more copies of a gene into the chromosome of a prokaryotic cell already harbouring at least one copy of the gene in question in its chromosome.
- a host cell comprising said gene is transformed with a DNA construct comprising another copy of the gene, whereby, after a suitable selection procedure, a cell is obtained which in its chromosome comprises two copies of the gene separated by an endogenous chromosomal sequence which is vital to the host cell and thereby ensures stable maintenance of the integrated gene.
- This procedure may be repeated so as to produce cells harbouring multiple copies of the gene in their chromosomes.
- WO 2002/000907 describes methods to site-specifically integrate polynucleotides into inactivated chromosomal loci that are conditionally essential to the cell, where these loci are restored by the integration process.
- multiple copies of a gene can be introduced into a cell comprising multiple attachment sites recognized by the Mx9 integrase using the Mx9 phage transformation system (WO 2004/018635 A2).
- the present invention provides a combined solution to these problems; it allows the simultaneous chromosomal site-specific integration of multiple copies of a gene (or operon) encoding a polypeptide(s) of interest, while also providing the means for initiating transcription of said gene after the proper integration of each copy via a heterologous promoter, which becomes operably linked with the gene only after the successful integration.
- the present invention relates to a method of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter, said method comprising the steps of:
- One of the means for carrying out the invention is a host cell specifically designed for this purpose; the cell has been engineered to comprise one or more copies of a recognition sequence (RS) of a site specific recombinase, as exemplified below, wherein each copy of the RS is located downstream of a copy of a heterologous promoter.
- RS recognition sequence
- This arrangement ensures that when a polynucleotide construct of the invention recombines into the chromosome by the action of the site specific recombinase, the ORF or operon comprised in the construct will be operably linked with the heterologous promoter already present in the chromosome.
- the invention relates to a cell comprising in its chromosome one or more copies of a recognition sequence (RS) of a site specific recombinase, wherein each copy of the RS is located downstream of a copy of a heterologous promoter.
- in a third aspect, the invention relates to a cell produced by a method of the first aspect, or a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase.
- ORF open reading frame
- Another means for carrying out the invention is of course the polynucleotide construct mentioned in the method of the first aspect.
- a fourth aspect of the invention relates to a polynucleotide construct comprising a promoterless open reading frame (ORF) or operon encoding at least one polypeptide of interest, the construct also comprising a recognition sequence (RS) of a site specific recombinase located upstream or downstream of said ORF or operon.
- the invention relates to a method of producing a polypeptide of interest, said method comprising:
- FIG. 1 A schematic overview of a preferred embodiment of the invention:
- a circular polynucleotide construct comprising the recognition sequence of the TP901-1 phage integrase, attP, located upstream of an open reading frame, genX.
- the construct further comprises an optional marker, a temperature sensitive origin of replication, ori TS , as well as a region located downstream of the open reading frame in the construct, which is indicated with a small arrow denoted “repeat”.
- a chromosome of a host cell is also shown comprising a heterologous promoter and the TP901-1 phage integrase recognition sequence, attB, corresponding to the recognition sequence in the construct, which is located downstream of the promoter.
- a region is indicated in the chromosome with a small arrow denoted “repeat”.
- the “repeat” regions of the chromosome and the polynucleotide construct should be sufficiently homologous to effectuate in vivo homologous recombination between the two homologous regions when both regions are present in the cell.
- the attP and attB sites are recombined, whereby the construct is integrated into the chromosome, placing the open reading frame, genX, under the transcriptional control of the heterologous promoter, creating the resulting attL and attR sites in the process.
- a suitable integrase, e.g., the TP901-1 phage integrase
- the two homologous “repeat” regions recombine, whereby the DNA in between the two regions is excised from the chromosome, leaving just the open reading frame, genX, in the chromosome along with the newly created attL site.
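- The two steps described for FIG. 1 (attP x attB integration, then excision between the two "repeat" regions) can be followed with a toy model that treats the chromosome and the construct as ordered lists of named features. This is only an illustrative sketch: the function names are hypothetical, the order of the marker and ori TS within the construct is an assumption, and the chromosomal "repeat" is assumed to lie downstream of attB.

```python
# Toy feature-list model of the FIG. 1 scheme. Names and feature order are
# illustrative assumptions, not taken from the patent's sequences.

def integrate(chromosome, construct, chrom_site="attB", construct_site="attP"):
    """Integrate a circular construct at the chromosomal attB site.

    The circle is opened at attP; attB is replaced by attL + insert + attR.
    """
    i = chromosome.index(chrom_site)
    j = construct.index(construct_site)
    # Linearise the circular construct so it starts just after attP.
    linear = construct[j + 1:] + construct[:j]
    return chromosome[:i] + ["attL"] + linear + ["attR"] + chromosome[i + 1:]


def excise_between_repeats(chromosome, repeat="repeat"):
    """Homologous recombination between the first two repeat copies.

    One repeat copy is kept; everything between the copies is lost.
    """
    positions = [k for k, feature in enumerate(chromosome) if feature == repeat]
    if len(positions) < 2:
        return list(chromosome)  # nothing to recombine
    first, second = positions[0], positions[1]
    return chromosome[:first + 1] + chromosome[second + 1:]


chromosome = ["promoter", "attB", "repeat"]
construct = ["attP", "genX", "repeat", "marker", "oriTS"]  # circular

integrated = integrate(chromosome, construct)
final = excise_between_repeats(integrated)
print(integrated)
print(final)
```

- In this model the final list keeps the promoter, attL, genX and a single "repeat" copy, while the marker, ori TS and attR are lost, which matches the outcome described above.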
- alignments of sequences and calculation of homology scores may be done using a full Smith-Waterman alignment, useful for both protein and DNA alignments.
- the default scoring matrices BLOSUM50 and the identity matrix are used for protein and DNA alignments respectively.
- the penalty for the first residue in a gap is -12 for proteins and -16 for DNA, while the penalty for additional residues in a gap is -2 for proteins and -4 for DNA.
- Alignment may be made with the FASTA package version v20u6 (W. R. Pearson and D. J. Lipman (1988), “Improved Tools for Biological Sequence Analysis”, PNAS 85:2444-2448, and W. R. Pearson (1990) “Rapid and Sensitive Sequence Comparison with FASTP and FASTA”, Methods in Enzymology, 183:63-98).
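- As a rough illustration of the scoring scheme just described, the sketch below computes a Smith-Waterman local alignment score for DNA with an affine gap penalty (first gap residue -16, each additional residue -4). It is not the FASTA package implementation. The match and mismatch values (+5 and -4) merely stand in for an identity matrix and are assumptions, as are the function names.

```python
# Minimal Smith-Waterman score with affine gaps (Gotoh-style recurrences).
# match/mismatch values are illustrative assumptions; the gap penalties
# follow the DNA values quoted in the text.

def gap_cost(length, first=-16, extra=-4):
    """Total penalty for a gap of `length` residues."""
    return first + (length - 1) * extra


def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_first=-16, gap_next=-4):
    """Return the best local alignment score between sequences a and b."""
    n, m = len(a), len(b)
    neg_inf = float("-inf")
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap or extend an existing one.
            E[i][j] = max(H[i][j - 1] + gap_first, E[i][j - 1] + gap_next)
            F[i][j] = max(H[i - 1][j] + gap_first, F[i - 1][j] + gap_next)
            s = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0, H[i - 1][j - 1] + s, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


print(gap_cost(3))                               # -24 for a 3-residue DNA gap
print(smith_waterman_affine("ACGT", "ACGT"))     # 20 (four matches at +5)
```

- For proteins the same recurrences would apply with a substitution matrix such as BLOSUM50 in place of the match/mismatch values, and with the penalties -12 and -2.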
- Promoter is defined herein as a nucleic acid sequence involved in the binding of RNA polymerase to initiate transcription of a gene.
- Tandem promoter is defined herein as two or more promoter sequences each of which is operably linked to a coding sequence and mediates the transcription of the coding sequence into mRNA.
- operably linked is defined herein as a configuration in which a control sequence, e.g., a promoter sequence, is appropriately placed at a position relative to a coding sequence such that the control sequence directs the production of a polypeptide encoded by the coding sequence.
- Coding sequence is defined herein as a nucleic acid sequence which is transcribed into mRNA and translated into a polypeptide when placed under the control of the appropriate control sequences.
- the boundaries of the coding sequence are generally determined by a ribosome binding site located just upstream of the open reading frame at the 5′ end of the mRNA and a transcription terminator sequence located just downstream of the open reading frame at the 3′ end of the mRNA.
- a coding sequence can include, but is not limited to, genomic DNA, cDNA, semisynthetic, synthetic, and recombinant nucleic acid sequences.
- Heterologous DNA in a host cell in the present context refers to exogenous DNA not originating from the cell.
- Nucleic acid construct is defined herein as a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acid which are combined and juxtaposed in a manner which would not otherwise exist in nature.
- the term nucleic acid construct is synonymous with the term expression cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence.
- control sequences is defined herein to include all components, which are necessary or advantageous for the expression of a polynucleotide encoding a polypeptide of the present invention.
- Each control sequence may be native or foreign to the nucleotide sequence encoding the polypeptide.
- control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator.
- the control sequences include a promoter, and transcriptional and translational stop signals.
- the control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleotide sequence encoding a polypeptide.
- operably linked denotes herein a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence such that the control sequence directs the expression of the coding sequence of a polypeptide.
- coding sequence means a nucleotide sequence, which directly specifies the amino acid sequence of its protein product.
- the boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon or alternative start codons such as GTG and TTG.
- the coding sequence may be a DNA, cDNA, or recombinant nucleotide sequence.
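- The open-reading-frame definition above (a start at ATG or an alternative start codon such as GTG or TTG) can be illustrated with a small forward-strand ORF scan. This is a minimal sketch under stated assumptions: the stop codons TAA, TAG and TGA are those of the standard genetic code, only the given strand is scanned, and the function name and the `min_codons` parameter are hypothetical.

```python
# Forward-strand ORF scan using the start codons named in the text.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def find_orfs(dna, min_codons=2):
    """Return sorted (start, end) index pairs of ORFs, stop codon included.

    In each of the three frames an ORF runs from the first start codon to
    the next in-frame stop codon; min_codons counts codons before the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for pos in range(frame, len(dna) - 2, 3):
            codon = dna[pos:pos + 3]
            if start is None and codon in START_CODONS:
                start = pos
            elif start is not None and codon in STOP_CODONS:
                if (pos - start) // 3 >= min_codons:
                    orfs.append((start, pos + 3))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


for seq in ("CCATGAAATAGCC", "GTGCCCTGA", "TTGAAATAA"):
    for start, end in find_orfs(seq):
        print(seq, start, end, seq[start:end])
```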
- expression includes any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- expression vector is defined herein as a linear or circular DNA molecule that comprises a polynucleotide encoding a polypeptide of the invention, and which is operably linked to additional nucleotides that provide for its expression.
- host cell includes any cell type which is susceptible to transformation, transfection, transduction, and the like with a nucleic acid construct comprising a polynucleotide of the present invention.
- a polypeptide of the present invention may be obtained from microorganisms of any genus.
- the term “obtained from” as used herein in connection with a given source shall mean that the polypeptide encoded by a nucleotide sequence is produced by the source or by a strain in which the nucleotide sequence from the source has been inserted.
- the polypeptide obtained from a given source is secreted extracellularly.
- a polypeptide of the present invention may be a bacterial polypeptide.
- the polypeptide may be a gram positive bacterial polypeptide such as a Bacillus polypeptide, e.g., a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis polypeptide; or a Streptomyces polypeptide, e.g., a Streptomyces lividans or Streptomyces murinus polypeptide; or a gram negative bacterial polypeptide, e.g., an E. coli or a Pseudomonas sp. polypeptide.
- a polypeptide of the present invention may also be a fungal polypeptide, and more preferably a yeast polypeptide such as a Candida, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide; or more preferably a filamentous fungal polypeptide such as an Acremonium, Aspergillus, Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, or Trichoderma polypeptide.
- the polypeptide is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, or Saccharomyces oviformis polypeptide.
- the polypeptide is an Aspergillus aculeatus, Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides
- the invention encompasses both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs, regardless of the species name by which they are known. Those skilled in the art will readily recognize the identity of appropriate equivalents.
- ATCC American Type Culture Collection
- DSM Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH
- CBS Centraalbureau Voor Schimmelcultures
- NRRL Northern Regional Research Center
- polypeptides may be identified and obtained from other sources including microorganisms isolated from nature (e.g., soil, composts, water, etc.) using the above-mentioned probes. Techniques for isolating microorganisms from natural habitats are well known in the art.
- the polynucleotide may then be obtained by similarly screening a genomic or cDNA library of another microorganism. Once a polynucleotide sequence encoding a polypeptide has been detected with the probe(s), the polynucleotide can be isolated or cloned by utilizing techniques which are well known to those of ordinary skill in the art (see, e.g., Sambrook et al., 1989, supra).
- Polypeptides of the present invention also include fused polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide or fragment thereof.
- a fused polypeptide is produced by fusing a nucleotide sequence (or a portion thereof) encoding another polypeptide to a nucleotide sequence (or a portion thereof) of the present invention.
- Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fused polypeptide is under control of the same promoter(s) and terminator.
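- The requirement that the ligated coding sequences be "in frame" can be checked mechanically. The sketch below is a simplified illustration, not a cloning protocol: it assumes both inputs are complete coding sequences given 5' to 3', drops the stop codon of the N-terminal partner, and rejects fusions whose length is not a multiple of three or that contain an internal stop codon. All names are hypothetical.

```python
# Simplified in-frame fusion check for two coding sequences.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def codons(seq):
    """Split a sequence into consecutive three-letter codons."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def fuse_in_frame(n_terminal_cds, c_terminal_cds):
    """Join two coding sequences so the second is read in the same frame."""
    first, second = n_terminal_cds.upper(), c_terminal_cds.upper()
    for name, cds in (("N-terminal", first), ("C-terminal", second)):
        if len(cds) % 3 != 0:
            raise ValueError(f"{name} sequence length is not a multiple of 3")
    first_codons = codons(first)
    # Remove the stop codon of the upstream partner so translation continues.
    if first_codons and first_codons[-1] in STOP_CODONS:
        first_codons = first_codons[:-1]
    fused = first_codons + codons(second)
    # A stop codon anywhere before the last codon would truncate the fusion.
    if any(codon in STOP_CODONS for codon in fused[:-1]):
        raise ValueError("fusion contains an internal stop codon")
    return "".join(fused)


print(fuse_in_frame("ATGAAATAA", "ATGCCCTGA"))
```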
- the present invention also relates to nucleic acid constructs comprising an isolated polynucleotide of the present invention operably linked to one or more control sequences which direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
- An isolated polynucleotide encoding a polypeptide of the present invention may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotide's sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector. The techniques for modifying polynucleotide sequences utilizing recombinant DNA methods are well known in the art.
- the control sequence may be an appropriate promoter sequence, a nucleotide sequence which is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention.
- the promoter sequence contains transcriptional control sequences which mediate the expression of the polypeptide.
- the promoter may be any nucleotide sequence which shows transcriptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
- suitable promoters for directing the transcription of the nucleic acid constructs of the present invention are the promoters obtained from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA and xylB genes, and prokaryotic beta-lactamase gene (Villa-Komaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proceedings of
- promoters for directing the transcription of the nucleic acid constructs of the present invention in a filamentous fungal host cell are promoters obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans acetamidase, Fusarium venenatum amyloglucosidase (WO 00/56900), Fusarium venenatum Daria (WO 00/56900), Fusarium venen
- useful promoters are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH1, ADH2/GAP), Saccharomyces cerevisiae triose phosphate isomerase (TPI), Saccharomyces cerevisiae metallothionein (CUP1), and Saccharomyces cerevisiae 3-phosphoglycerate kinase.
- the control sequence may also be a suitable transcription terminator sequence, a sequence recognized by a host cell to terminate transcription.
- the terminator sequence is operably linked to the 3′ terminus of the nucleotide sequence encoding the polypeptide. Any terminator which is functional in the host cell of choice may be used in the present invention.
- Preferred terminators for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like protease.
- Preferred terminators for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase.
- Other useful terminators for yeast host cells are described by Romanos et al., 1992, supra.
- the control sequence may also be a suitable leader sequence, a nontranslated region of an mRNA which is important for translation by the host cell.
- the leader sequence is operably linked to the 5′ terminus of the nucleotide sequence encoding the polypeptide. Any leader sequence that is functional in the host cell of choice may be used in the present invention.
- Preferred leaders for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
- Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
- the control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3′ terminus of the nucleotide sequence and which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence which is functional in the host cell of choice may be used in the present invention.
- Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease, and Aspergillus niger alpha-glucosidase.
- the control sequence may also be a signal peptide coding region that codes for an amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded polypeptide into the cell's secretory pathway.
- the 5′ end of the coding sequence of the nucleotide sequence may inherently contain a signal peptide coding region naturally linked in translation reading frame with the segment of the coding region which encodes the secreted polypeptide.
- the 5′ end of the coding sequence may contain a signal peptide coding region which is foreign to the coding sequence.
- the foreign signal peptide coding region may be required where the coding sequence does not naturally contain a signal peptide coding region.
- the foreign signal peptide coding region may simply replace the natural signal peptide coding region in order to enhance secretion of the polypeptide.
- any signal peptide coding region which directs the expressed polypeptide into the secretory pathway of a host cell of choice may be used in the present invention.
- Effective signal peptide coding regions for bacterial host cells are the signal peptide coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described by Simonen and Palva, 1993, Microbiological Reviews 57: 109-137.
- Effective signal peptide coding regions for filamentous fungal host cells are the signal peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola lanuginosa lipase.
- Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding regions are described by Romanos et al., 1992, supra.
- the control sequence may also be a propeptide coding region that codes for an amino acid sequence positioned at the amino terminus of a polypeptide.
- the resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases).
- a propolypeptide is generally inactive and can be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide.
- the propeptide coding region may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and Myceliophthora thermophila laccase (WO 95/33836).
- the propeptide region is positioned next to the amino terminus of a polypeptide and the signal peptide region is positioned next to the amino terminus of the propeptide region.
- It may also be desirable to add regulatory sequences which allow the regulation of the expression of the polypeptide relative to the growth of the host cell.
- Examples of regulatory systems are those which cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound.
- Regulatory systems in prokaryotic systems include the lac, tac, and trp operator systems.
- In yeast, the ADH2 system or GAL1 system may be used.
- In filamentous fungi, the TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus oryzae glucoamylase promoter may be used as regulatory sequences.
- Other examples of regulatory sequences are those which allow for gene amplification.
- these include the dihydrofolate reductase gene which is amplified in the presence of methotrexate, and the metallothionein genes which are amplified with heavy metals.
- the nucleotide sequence encoding the polypeptide would be operably linked with the regulatory sequence.
- the present invention also relates to recombinant expression vectors comprising a polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals.
- the various nucleic acids and control sequences described above may be joined together to produce a recombinant expression vector which may include one or more convenient restriction sites to allow for insertion or substitution of the nucleotide sequence encoding the polypeptide at such sites.
- a nucleotide sequence of the present invention may be expressed by inserting the nucleotide sequence or a nucleic acid construct comprising the sequence into an appropriate vector for expression.
- the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
- the recombinant expression vector may be any vector (e.g., a plasmid or virus) which can be conveniently subjected to recombinant DNA procedures and can bring about expression of the nucleotide sequence.
- the choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced.
- the vectors may be linear or closed circular plasmids.
- the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
- the vectors of the present invention preferably contain one or more selectable markers which permit easy selection of transformed cells.
- a selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
- Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kanamycin, chloramphenicol, or tetracycline resistance.
- Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3.
- Selectable markers for use in a filamentous fungal host cell include, but are not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5′-phosphate decarboxylase), sC (sulfate adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof.
- Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus.
- the vectors of the present invention preferably contain an element(s) that permits integration of the vector into the host cell's genome.
- the vector may rely on the polynucleotide's sequence encoding the polypeptide or any other element of the vector for integration into the genome by homologous or nonhomologous recombination.
- the vector may contain additional nucleotide sequences for directing integration or excision by homologous recombination into or from the genome of the host cell at a precise location(s) in the chromosome(s).
- the integrational elements should preferably contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, preferably 400 to 10,000 base pairs, and most preferably 800 to 10,000 base pairs, which have a high degree of identity with the corresponding target sequence to enhance the probability of homologous recombination.
- the integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell.
- the integrational elements may be non-encoding or encoding nucleotide sequences.
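- The length ranges stated above for integrational elements, together with the requirement for a high degree of identity with the target sequence, can be sketched as a simple check. The function names are hypothetical, and the ungapped, position-by-position identity calculation is an assumption made for illustration only; the present disclosure does not prescribe how identity is computed or what threshold counts as high.

```python
# Hypothetical helpers for screening candidate integrational elements.

def percent_identity(element, target):
    """Ungapped identity between two equal-length sequences, in percent.

    Assumption: the sequences are already aligned position by position.
    """
    if len(element) != len(target) or not element:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(element.upper(), target.upper()) if a == b)
    return 100.0 * matches / len(element)


def length_tier(length_bp):
    """Map an element length onto the ranges stated in the text."""
    if 800 <= length_bp <= 10000:
        return "most preferred"   # 800 to 10,000 base pairs
    if 400 <= length_bp <= 10000:
        return "preferred"        # 400 to 10,000 base pairs
    if 100 <= length_bp <= 10000:
        return "stated range"     # 100 to 10,000 base pairs
    return "outside stated range"
```

- For example, length_tier(400) returns "preferred", and percent_identity("ACGT", "ACGA") returns 75.0.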
- the vector may be integrated into the genome of the host cell by non-homologous recombination.
- Examples of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB110, pE194, pTA1060, and pAMβ1 permitting replication in Bacillus.
- Examples of origins of replication for use in a yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6.
- Examples of origins of replication useful in a filamentous fungal cell are AMA1 and ANS1 (Gems et al., 1991, Gene 98: 61-67; Cullen et al., 1987, Nucleic Acids Research 15: 9163-9175; WO 00/24883). Isolation of the AMA1 gene and construction of plasmids or vectors comprising the gene can be accomplished according to the methods disclosed in WO 00/24883.
- More than one copy of a polynucleotide of the present invention may be inserted into the host cell to increase production of the gene product.
- An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
- the present invention also relates to recombinant host cells, comprising a polynucleotide of the present invention, which are advantageously used in the recombinant production of the polypeptides.
- a vector comprising a polynucleotide of the present invention is introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier.
- the term “host cell” encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- the host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., a eukaryote.
- Useful unicellular microorganisms are bacterial cells such as gram positive bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., Streptomyces lividans and Streptomyces murinus, or gram negative bacteria such as E.
- the bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus, or Bacillus subtilis cell.
- the Bacillus cell is an alkalophilic Bacillus.
- the introduction of a vector into a bacterial host cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular General Genetics 168: 111-115), using competent cells (see, e.g., Young and Spizizen, 1961, Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5271-5278).
- the host cell may also be a eukaryote, such as a mammalian, insect, plant, or fungal cell.
- the host cell is a fungal cell.
- “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined by Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et al., 1995, supra).
- the fungal host cell is a yeast cell.
- “Yeast” as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may change in the future, for the purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, F. A., Passmore, S. M., and Davenport, R. R., eds, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
- the yeast host cell is a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell.
- the yeast host cell is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell.
- the yeast host cell is a Kluyveromyces lactis cell.
- the yeast host cell is a Yarrowia lipolytica cell.
- the fungal host cell is a filamentous fungal cell.
- “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra).
- the filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
- the filamentous fungal host cell is an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell.
- the filamentous fungal host cell is an Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger or Aspergillus oryzae cell.
- the filamentous fungal host cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, or Fusarium venenatum cell.
- the filamentous fungal host cell is a Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, or Ceriporiopsis subvermispora, Coprinus cinereus, Coriolus hirsutus, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koning
- Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se.
- Suitable procedures for transformation of Aspergillus and Trichoderma host cells are described in EP 238 023 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81: 1470-1474.
- Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156, and WO 96/00787.
- Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J. N. and Simon, M.
- the present invention also relates to methods for producing a polypeptide of the present invention, comprising (a) cultivating a cell, which in its wild-type form is capable of producing the polypeptide, under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
- the present invention also relates to methods for producing a polypeptide of the present invention, comprising (a) cultivating a host cell under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
- the cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods well known in the art.
- the cell may be cultivated by shake flask cultivation, and small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated.
- the cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
- the polypeptides may be detected using methods known in the art that are specific for the polypeptides. These detection methods may include use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme assay may be used to determine the activity of the polypeptide as described herein.
- the resulting polypeptide may be recovered using methods known in the art.
- the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
- polypeptides of the present invention may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers, New York, 1989).
- the first aspect of the invention relates to a method of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter, said method comprising the steps of:
- each copy of RS1 is located downstream of a copy of the heterologous promoter. How far downstream of the promoter RS1 may be located in the cell is a matter of trial and error; the only limiting factor is that the promoter must be operably linked with the ORF or operon after the construct has been integrated into the chromosome.
- RS1 is located up to 10,000 bp downstream of the promoter, even more preferably up to 5,000 bp downstream of the promoter, and most preferably no more than 500 bp downstream of the promoter.
- RS2 is located and oriented with respect to the ORF or operon so that an in vivo recombination of RS2 with a copy of RS1 in the chromosome of the cell will integrate the construct into the chromosome and bring the ORF or operon under the transcriptional control of the heterologous promoter.
- This is to ensure that the ORF or operon and RS2 have the correct orientation with respect to each other and with respect to the polarity of RS1 in the chromosome, so that the recombinase mediated recombination between RS1 and RS2 will place the ORF or operon under the transcriptional control of the promoter.
- RS2 is preferably located up to 10,000 bp upstream of the ORF or operon, even more preferably up to 5,000 bp upstream of the ORF or operon, and most preferably no more than 500 bp upstream of the ORF or operon.
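- The distance preferences stated for RS1 (downstream of the promoter) and for RS2 (upstream of the ORF or operon) use the same three limits of 10,000 bp, 5,000 bp, and 500 bp. A minimal sketch of that tiering follows; the function name and the tier labels are hypothetical and serve only to restate the ranges.

```python
# Hypothetical classifier restating the stated distance preferences.

def preference_tier(distance_bp):
    """Classify a promoter-to-RS1 or RS2-to-ORF distance, in base pairs."""
    if distance_bp < 0:
        raise ValueError("distance must be non-negative")
    if distance_bp <= 500:
        return "most preferred"      # no more than 500 bp
    if distance_bp <= 5000:
        return "more preferred"      # up to 5,000 bp
    if distance_bp <= 10000:
        return "preferred"           # up to 10,000 bp
    return "outside stated ranges"
```

- A distance outside these ranges is not excluded by the text, since the limiting factor is that the promoter remains operably linked with the ORF or operon after integration.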
- the choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- the host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., a eukaryote.
- Useful unicellular cells are bacterial cells such as Gram positive bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., Streptomyces lividans or Streptomyces murinus, or Gram negative bacteria such as E. coli and Pseudomonas sp.
- the bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus or Bacillus subtilis cell.
- the cell is a prokaryotic cell, preferably a Bacillus cell, and more preferably a Bacillus subtilis or a Bacillus licheniformis cell.
- the ORF or operon in any aspects of the invention preferably encodes at least one enzyme; preferably an oxidoreductase, a transferase, a hydrolase, a lyase, an isomerase, or a ligase; more preferably an amylolytic enzyme, a lipolytic enzyme, a proteolytic enzyme, a cellulolytic enzyme, an oxidoreductase or a plant cell-wall degrading enzyme, and most preferably an enzyme with an activity selected from the group consisting of aminopeptidase, amylase, amyloglucosidase, carbohydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, galactosidase, beta-galactosidase, glucoamylase, glucose oxidase
- WO 1993/010249 discloses various promoter variants, and WO 1999/043835 discloses tandem and triple promoter constructions with improved properties.
- Each promoter sequence of the tandem promoter may be any nucleic acid sequence which shows transcriptional activity in the Bacillus cell of choice including a mutant, truncated, and hybrid promoter, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the Bacillus cell.
- Each promoter sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide and native or foreign to the Bacillus cell.
- the promoter sequences may be the same promoter sequence or different promoter sequences.
- the promoter sequences may be obtained from a bacterial source.
- the promoter sequences may be obtained from a gram positive bacterium such as a Bacillus strain, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis; or a Streptomyces strain, e.g., Streptomyces lividans or Streptomyces murinus; or from a gram negative bacterium, e.g., E. coli or Pseudomonas sp.
- a suitable promoter for directing the transcription of a nucleic acid sequence in the methods of the present invention is the promoter obtained from the E. coli lac operon.
- Another example is the promoter of the Streptomyces coelicolor agarase gene (dagA).
- Another example is the promoter of the Bacillus lentus alkaline protease gene (aprH).
- Another example is the promoter of the Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene).
- Another example is the promoter of the Bacillus subtilis levansucrase gene (sacB).
- Another example is the promoter of the Bacillus subtilis alpha-amylase gene (amyE).
- Another example is the promoter of the Bacillus licheniformis alpha-amylase gene (amyL).
- Another example is the promoter of the Bacillus stearothermophilus maltogenic amylase gene (amyM).
- Another example is the promoter of the Bacillus amyloliquefaciens alpha-amylase gene (amyQ).
- Another example is a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region.
- Another example is the promoter of the Bacillus licheniformis penicillinase gene (penP).
- Other examples are the promoters of the Bacillus subtilis xylA and xylB genes.
- Another example is the promoter of the Bacillus thuringiensis subsp. tenebrionis CryIIIA gene (cryIIIA, SEQ ID NO. 1) or portions thereof.
- Another example is the promoter of the prokaryotic beta-lactamase gene (Villa-Komaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 3727-3731).
- Another example is the SPO1 bacterial phage promoter.
- Another example is the tac promoter (DeBoer et al., 1983, Proceedings of the National Academy of Sciences USA 80:21-25).
- the two or more promoter sequences of the tandem promoter may simultaneously promote the transcription of the nucleic acid sequence.
- one or more of the promoter sequences of the tandem promoter may promote the transcription of the nucleic acid sequence at different stages of growth of the Bacillus cell.
- the tandem promoter contains at least the amyQ promoter of the Bacillus amyloliquefaciens alpha-amylase gene. In another preferred embodiment, the tandem promoter contains at least a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region. In another preferred embodiment, the tandem promoter contains at least the amyL promoter of the Bacillus licheniformis alpha-amylase gene. In another preferred embodiment, the tandem promoter contains at least the cryIIIA promoter or portions thereof (Agaisse and Lereclus, 1994, supra).
- the tandem promoter contains at least the amyL promoter and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least the amyQ promoter and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least two copies of the amyL promoter. In another more preferred embodiment, the tandem promoter contains at least two copies of the amyQ promoter.
- tandem promoter contains at least two copies of a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region. In another more preferred embodiment, the tandem promoter contains at least two copies of the cryIIIA promoter.
- the construction of a “consensus” promoter may be accomplished by site-directed mutagenesis to create a promoter which conforms more perfectly to the established consensus sequences for the “-10” and “-35” regions of the vegetative “sigma A-type” promoters for Bacillus subtilis (Voskuil et al., 1995, Molecular Microbiology 17: 271-279).
- the consensus sequence for the “-35” region is TTGACA and for the “-10” region is TATAAT.
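The comparison against the consensus hexamers can be sketched in a few lines of Python. This is only an illustration of the idea of listing which bases would have to be changed by site-directed mutagenesis; the function names and the example hexamer are hypothetical and are not taken from the amyQ sequence or from the patent.

```python
# Sketch: list the base changes needed to bring a promoter's "-35" and "-10"
# hexamers into line with the sigma A-type consensus quoted in the text.
CONSENSUS_35 = "TTGACA"  # "-35" consensus from the text
CONSENSUS_10 = "TATAAT"  # "-10" consensus from the text


def mismatches(hexamer: str, consensus: str) -> int:
    """Return the number of positions at which hexamer differs from consensus."""
    if len(hexamer) != len(consensus):
        raise ValueError("hexamer and consensus must have equal length")
    return sum(a != b for a, b in zip(hexamer.upper(), consensus))


def changes_needed(minus35: str, minus10: str) -> list[tuple[str, int, str, str]]:
    """Return (region, 1-based position, current base, consensus base) per mismatch."""
    out = []
    for region, seq, cons in (("-35", minus35, CONSENSUS_35),
                              ("-10", minus10, CONSENSUS_10)):
        if len(seq) != len(cons):
            raise ValueError(f"{region} hexamer must be {len(cons)} bases long")
        for pos, (a, b) in enumerate(zip(seq.upper(), cons), start=1):
            if a != b:
                out.append((region, pos, a, b))
    return out


# Hypothetical hexamers invented for illustration (NOT the wild-type amyQ promoter).
example = changes_needed("TTGTTA", "TATAAT")
print(example)
```

A promoter that already matches both hexamers yields an empty list, which corresponds to a promoter needing no mutagenesis in these two regions.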
- the consensus promoter may be obtained from any promoter which can function in a Bacillus host cell.
- the “consensus” promoter is obtained from a promoter obtained from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus lentus alkaline protease gene (aprH), Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene), Bacillus subtilis levansucrase gene (sacB), Bacillus subtilis alpha-amylase gene (amyE), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA and xylB genes, Bacillus thuringiensis subsp. tenebrionis Cry
- the “consensus” promoter is obtained from Bacillus amyloliquefaciens alpha-amylase gene (amyQ).
- the consensus promoter is the “consensus” amyQ promoter contained in nucleotides 1 to 185 of SEQ ID NO. 3 or SEQ ID NO. 4.
- the consensus promoter is the short “consensus” amyQ promoter contained in nucleotides 86 to 185 of SEQ ID NO. 3 or SEQ ID NO. 4.
- the “consensus” amyQ promoter of SEQ ID NO. 3 contains the following mutations of the nucleic acid sequence containing the wild-type amyQ promoter (SEQ ID NO.
- the “consensus” amyQ promoter (SEQ ID NO. 2) further contains a T to A change at position 116 approximately 20 base pairs upstream of the -35 region as shown in FIG. 21 (SEQ ID NO. 4). This change apparently had no detrimental effect on promoter function since it is well removed from the critical -10 and -35 regions.
- the heterologous promoter comprises two or more promoters; preferably the two or more promoters comprise one or more promoter derived from one or more Bacillus genes; more preferably the two or more promoters comprise one or more of the following: the amyQ promoter, the amyL promoter, the cryIIIA promoter, and a consensus promoter comprising the nucleotide sequence TTGACA for the -35 region and the nucleotide sequence TATAAT for the -10 region.
- Site specific recombinases including phage integrases, are well-known in the art, where they are usually grouped into tyrosine recombinases or serine recombinases.
- a sub-group of the serine recombinases are the large serine recombinases, which contains all the known serine recombinase-type phage integrases.
- the large serine recombinases contain the resolvase/invertase-like N-terminal catalytic domains of all serine recombinases, but their C-terminal regions are much larger and very diverse. (Smith and Thorpe, 2002. Diversity in the serine recombinases. Mol Microbiol 44:299-307). A review of phage integrases is given by Groth and Calos (J. Mol. Biol. 2004, 335: 667-678).
- the site specific recombinase comprises a phage integrase, preferably a tyrosine recombinase or a serine recombinase, more preferably a large serine recombinase, and most preferably the TP901-1 integrase.
- the TP901-1 integrase is well-characterized, e.g. in Breüner et al. 2001. Resolvase-like recombination performed by the TP901-1 integrase. Microbiology 147: 2051-2063.
- the recognition sequences of TP901-1 integrase (attP, attB, attL and attR) are well-known.
- a preferred embodiment relates to the method of the first aspect, wherein RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB 161 (SEQ ID NO: 21) or attB min (SEQ ID NO: 22), RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase.
- RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attPmin (SEQ ID NO: 23)
- RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB 161 (SEQ ID NO: 21) or attB min (SEQ ID NO: 22)
- the site specific recombinase comprises the phage TP901-1 integrase.
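The tiered identity thresholds (70% up to 98%) can be illustrated with a small Python sketch. This section does not state how identity is to be computed, so the ungapped, equal-length comparison below is an assumption made for illustration; the two sequences are hypothetical toy strings and not the attB or attP sequences of SEQ ID NO: 21 to 23.

```python
# Sketch: ungapped percent identity and the highest threshold tier it satisfies.
# Assumption: both sequences are already aligned and of equal length.
IDENTITY_TIERS = (70, 75, 80, 85, 90, 95, 98)  # thresholds named in the text


def percent_identity(a: str, b: str) -> float:
    """Percent of positions that carry the same base in both sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def highest_tier_met(identity: float):
    """Return the largest threshold that identity reaches, or None if below 70%."""
    met = [t for t in IDENTITY_TIERS if identity >= t]
    return max(met) if met else None


# Hypothetical 20-nt toy sequences differing at the last position only.
reference = "ACGTACGTACGTACGTACGT"
candidate = "ACGTACGTACGTACGTACGA"
identity = percent_identity(reference, candidate)
print(identity, highest_tier_met(identity))
```

A real comparison of recognition sequences of different lengths would need an alignment step first, which this sketch deliberately leaves out.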
- the attP and attB sequences may also be substituted with the corresponding attL and attR sequences in the method of the invention, which in turn may also be switched around, provided that the integrase is supplemented with the excisionase, Xis.
- RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24)
- RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attRmin (SEQ ID NO: 25)
- the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
- RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attRmin (SEQ ID NO: 25)
- RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24)
- the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
- WO 94/25612 discloses an mRNA stabilizer region downstream of the promoter and upstream of the coding sequence of the cryIIIA gene which increases expression of the gene.
- Hue et al. (1995, Journal of Bacteriology 177: 3465-3471) disclose a 5′ mRNA stabilizer sequence which stabilized several heterologous RNA sequences when present at the 5′ end and increased expression of downstream coding sequences several-fold in Bacillus subtilis.
- mRNA processing/stabilizing sequence is defined herein as a sequence located downstream of one or more promoter sequences and upstream of a coding sequence to which each of the one or more promoter sequences are operably linked such that all mRNAs synthesized from each promoter sequence may be processed to generate mRNA transcripts with a stabilizer sequence at the 5′ end of the transcripts.
- the presence of such a stabilizer sequence at the 5′ end of the mRNA transcripts increases their half-life (Agaisse and Lereclus, 1994, supra, Hue et al., 1995, supra).
- the mRNA processing/stabilizing sequence is complementary to the 3′ extremity of a bacterial 16S ribosomal RNA.
- the mRNA processing/stabilizing sequence generates essentially single-size transcripts with a stabilizing sequence at the 5′ end of the transcripts.
- the mRNA processing/stabilizing sequence is the Bacillus thuringiensis cryIIIA mRNA processing/stabilizing sequence disclosed in WO 94/25612 and Agaisse and Lereclus, 1994, supra, or portions thereof which retain the mRNA processing/stabilizing function.
- the mRNA processing/stabilizing sequence is the Bacillus subtilis SP82 mRNA processing/stabilizing sequence disclosed in Hue et al., 1995, supra, or portions thereof which retain the mRNA processing/stabilizing function.
- When the cryIIIA promoter and its mRNA processing/stabilizing sequence are employed in the methods of the present invention, a DNA fragment containing the sequence disclosed in WO 94/25612 and Agaisse and Lereclus, 1994, supra, or portions thereof which retain the promoter and mRNA processing/stabilizing functions, may be used. Furthermore, DNA fragments containing only the cryIIIA promoter or only the cryIIIA mRNA processing/stabilizing sequence may be prepared using methods well known in the art to construct various tandem promoter and mRNA processing/stabilizing sequence combinations. In this embodiment, the cryIIIA promoter and its mRNA processing/stabilizing sequence are preferably placed downstream of the other promoter sequence(s) constituting the tandem promoter and upstream of the coding sequence of the gene of interest.
- At least one mRNA stabilizing region is located between the heterologous promoter and RS1 in the chromosome of the cell in step (a); preferably the at least one mRNA stabilizing region comprises a mRNA stabilizing region derived from cryIIIA; more preferably the at least one mRNA stabilizing region comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to the sequence shown in positions 35-580 of SEQ ID NO: 26.
- polynucleotide construct further comprises at least one mRNA stabilizing region located upstream of the ORF or operon between the ORF or operon and RS2.
- Another preferred embodiment relates to the cell of the second aspect, wherein at least one mRNA stabilizing region is located between the heterologous promoter and RS in the chromosome of the cell.
- Yet another preferred embodiment relates to the cell of the third aspect, wherein at least one mRNA stabilizing region is located between the heterologous promoter and the one or more copies of the ORF or operon in the chromosome of the cell.
- At least one mRNA stabilizing region is located upstream of the ORF or operon.
- regions of homology can be designed at the proper positions in the construct and next to the recognition sequence in chromosome of the host cell prior to integration. This is illustrated by the regions designated “repeat” in FIG. 1 .
- the regions may either be inserted heterologous polynucleotide regions, or one region may be designed on the basis of a corresponding region, which may be naturally found in the other sequence.
- the polynucleotide construct further comprises a region located upstream or downstream of the ORF or operon in the construct, said region being sufficiently homologous with a corresponding region located upstream or downstream, correspondingly, of RS1 in the chromosome of the cell to effectuate in vivo homologous recombination between the two homologous regions when both regions are present in the cell.
- two regions that are recognition sites of a site specific recombinase can be inserted at the same positions as the before mentioned repeats.
- site specific recombinase recombination between the two sites will then lead to excision of the region between the two sites, leaving only the gene of interest on the chromosome.
- Non-limiting examples are the well-known resolvase systems, with two res sites and a specific resolvase, which performs the recombination between the two sites.
- the concept of using site specific recombination systems for excision of sequences from the bacterial chromosome was described, e.g.
- WO 95/02058 describes a new transposon (tn5401) from B. thuringiensis containing transposase, resolvase, and res site.
- the transposon is used in a plasmid which contains B. thuringiensis DNA (e.g. origin and toxin gene) and, flanked by res sites, non-B. thuringiensis DNA (e.g. E. coli origin, selectable marker genes).
- the plasmid is introduced into B. thuringiensis.
- a plasmid expressing the resolvase is introduced (e.g. a thermosensitive plasmid containing the entire transposon, but only used as resolvase donor) whereby the non-B. thuringiensis DNA
- a counterselectable marker such as the ysbC gene (Danish patent application PA 2004 00227; filed 13 Feb. 2004; Novozymes A/S), can be present on the vector-part of the polynucleotide construct.
- the marker will no longer be present in the cell, which then becomes resistant to the selection.
- a gene that gives a screenable phenotype can be used, such as an antibiotic selection marker, GFP, or an amylase. Loss of all integrated constructs by excision will then lead to loss of resistance to the antibiotic, loss of green fluorescence, or loss of the amylase phenotype.
- the polynucleotide construct further comprises at least one selectable marker, at least one counterselectable marker, or at least one screenable marker; preferably the at least one selectable marker, counterselectable marker, or the screenable marker is flanked on both sides by a recognition sequence(s) of a second site specific recombinase, preferably a resolvase.
- the second aspect of the invention relates to a cell comprising in its chromosome one or more copies of a recognition sequence (RS) of a site specific recombinase, wherein each copy of the RS is located downstream of a copy of a heterologous promoter.
- the third aspect of the invention relates to a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase.
- the fourth aspect of the invention relates to a polynucleotide construct comprising a promoterless open reading frame (ORF) or operon encoding at least one polypeptide of interest, the construct also comprising a recognition sequence (RS) of a site specific recombinase located upstream or downstream of said ORF or operon.
- the polynucleotide of the fourth aspect, or the method of the final aspect comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB 161 (SEQ ID NO: 21), attB min (SEQ ID NO: 22), or attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase; or RS comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24) or attRmin (SEQ ID NO: 25), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
- Bacillus subtilis DN1885 is described in Diderichsen, B., Wedsted, U., Hedegaard, L., Jensen, B. R., Sjøholm, C. (1990). Cloning of aldB, which encodes acetolactate decarboxylase, an exoenzyme from Bacillus brevis. Journal of Bacteriology 172, 4315-4321.
- B. subtilis PL1801 is the B. subtilis DN1885 strain with disrupted apr and npr genes.
- AEB43 B. subtilis PL1801 with a 161 bp attB fragment integrated in the xyl locus.
- AEB165 AEB43 with the 43 bp minimal attB integrated in the amyE locus.
- pattB19DraIII (SEQ ID NO:1): CCCCCACTAAGTGCCTGACTTTCAACTAC
- pattB179NotI (SEQ ID NO:2): CCCCGCGGCCGCAAAAAAAGCAAAAAGC
- PEP140 (SEQ ID NO:3): AATATTGGCCGGGGAAGCGGAAGAATGAAG
- PEP218 (SEQ ID NO:4): CTATACTAGTCATCCTTGCAGGGTATGTTTC
- pamyE-EI (SEQ ID NO:5): GGGGGAATTCAACGGCCTCAACCTACTACTG
- M13-forward (SEQ ID NO:6): GTTTTCCCAGTCACGAC
- M13-revers (SEQ ID NO:7): CAGCTATGACCATGATTACGC
- pCI-5 (SEQ ID NO:8): CTTCTACCCATTATTACAGCAGGA
- pCI-9 (SEQ ID NO:9): AGTAGTTCGCCAGTTAATAGTTTG
- p1224seq-2 (SEQ ID NO:10): GCCATACAGCTACT
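Several of the primer names hint at restriction sites built into their 5′ ends. The Python sketch below scans a few of the listed primers for such sites and reports GC content. The primer sequences are copied from the list; the enzyme table (DraIII, NotI, SpeI, EcoRI and their recognition sequences) is supplied here as an assumption for illustration and is not stated in this section of the text.

```python
import re

# Primer sequences copied from the primer list in the text.
PRIMERS = {
    "pattB19DraIII": "CCCCCACTAAGTGCCTGACTTTCAACTAC",
    "pattB179NotI": "CCCCGCGGCCGCAAAAAAAGCAAAAAGC",
    "PEP218": "CTATACTAGTCATCCTTGCAGGGTATGTTTC",
    "pamyE-EI": "GGGGGAATTCAACGGCCTCAACCTACTACTG",
    "M13-forward": "GTTTTCCCAGTCACGAC",
}

# Recognition sequences written as regular expressions (N = any base).
# Assumption: this enzyme table is added for illustration only.
SITES = {
    "DraIII": "CAC[ACGT]{3}GTG",
    "NotI": "GCGGCCGC",
    "SpeI": "ACTAGT",
    "EcoRI": "GAATTC",
}


def find_sites(seq: str) -> dict[str, list[int]]:
    """Map each enzyme to the 0-based start positions of its site in seq."""
    return {
        enzyme: [m.start() for m in re.finditer(pattern, seq.upper())]
        for enzyme, pattern in SITES.items()
    }


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, seq in PRIMERS.items():
    hits = {e: pos for e, pos in find_sites(seq).items() if pos}
    print(f"{name}: length={len(seq)}, GC={gc_fraction(seq):.2f}, sites={hits}")
```

In each of the four primers that carries a site, the site follows a short 5′ clamp of four bases, a common design that helps the enzyme cut close to the end of a PCR product.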
- pLB44 E. coli plasmid containing a 2 kb region of the phage TP901-1 genome, including the int gene and attP site (Christiansen et al. (1996). J. Bacteriol. 178(17): 5164-5173).
- pBC16 is commercially available from DSMZ (DSM 4424); (Kreft, J. et al. (1978) Recombinant plasmids capable of replication in B. subtilis and E. coli. Mol. Gen. Genet 162: 59-67).
- pSJ2739 (described in U.S. Pat. No. 6,100,063) is derived from pE194, which is naturally temperature-sensitive for replication.
- the part of pSJ2739 which is relevant for this invention consists of the pE194 replicon, as well as a fragment derived from plasmid pUB110, enabling conjugation into B. licheniformis.
- pAEB142 The int gene of TP901-1, encoding the phage integrase, is inserted after the xylose-inducible P xyl promoter in the pCR®-BluntII-TOPO® (Invitrogen) vector.
- the P xyl and int fragments were first amplified by two separate PCR-reactions.
- the P xyl fragment was obtained with chromosomal DNA from B. subtilis PL1801 as template and the primers were pPxyl-up & pPxyl-down, giving a fragment of 1.5 kb.
- the plasmid pLB44 was used as template and the primers were pint-up & pint-down, again giving a fragment of 1.5 kb.
- the two fragments were digested with BspHI, joined by ligation, and used as template in a third PCR-reaction with primers pPxyl-up & pint-down, resulting in a fragment of 2.9 kb.
- This fragment was then inserted in the pCR®-BluntII-TOPO® vector in the Zero Blunt® TOPO® PCR cloning kit (Invitrogen).
- pAEB146 The P xyl -int fragment of pAEB142 was inserted in pSJ2739. This fragment was obtained by a two-step process, where the P xyl fragment and the upstream part of int were obtained from pAEB142 on a 1.7 kb EcoRI-HindIII fragment and ligated to the 4.3 kb EcoRI-HindIII fragment of pSJ2739, and subsequently the downstream part of int was obtained on a HindIII fragment from pAEB142 and inserted in the HindIII site of the first plasmid.
- pAEB146 contains the erm gene, providing resistance to erythromycin (Em), and a temperature-sensitive replicon, as well as the factors required for conjugation, making it possible to use it in both B. subtilis and B. licheniformis.
- pAEB148 A PCR-fragment with the minimal attP (attP min ) and the cryIIIA region inserted in the pCR®-BluntII-TOPO® vector.
- the PCR fragment was obtained using primers pattPcry3A and pcry3AClaI, and the template was a plasmid containing the cryIIIA region. This gave a fragment of approximately 650 bp, which was cloned in the vector by using the Zero Blunt® TOPO® PCR cloning kit (Invitrogen).
- pAEB153 A 636 bp attPmin-cryIIIA-fragment (SEQ ID NO: 26) was obtained from pAEB148 by digestion with XhoI and ClaI, and was inserted in the 2.1 kb SalI-ClaI fragment of pMOL1632. This plasmid contains the same replication origin as the integrase donor plasmid pAEB146 but does not encode the replication protein. Thus, replication of pAEB153 is dependent on donation of the replication protein from another vector, such as pAEB146, i.e. pAEB153 is a so-called “slave” of pAEB146.
- pAEB267 A 360 bp fragment containing the minimal attP site of TP901-1 and a region of the B. licheniformis chromosome was obtained by PCR using primers pattP-ExtTerm and pTermBI, and chromosomal DNA from B. licheniformis as template. The fragment was digested with BamHI and KpnI and inserted in BamHI-KpnI digested pAEB146.
- pAEB288 pAEB267 with an amylase encoding gene, amyL, which is inserted into an NcoI-NheI digested pAEB267.
- a 161 bp attB site (attB 161 , SEQ ID NO: 21) was integrated in the xyl locus in B. subtilis strain PL1801, resulting in the strain AEB43. Integration was obtained by double cross-over of a DNA fragment which contains attB 161 adjacent to the cat gene, surrounded by an upstream and a downstream region of the xyl locus.
- the upstream and downstream xyl fragments were obtained with PCR on chromosomal DNA from B. subtilis using primers that are suitable for amplifying regions of sufficient size for an efficient integration by homologous recombination (0.5 kb or more).
- these fragments were joined with the attB 161 fragment (obtained from Lactococcus lactis subsp. cremoris 3-107) and with the cat gene, yielding chloramphenicol (Cm) resistance.
- This xylup-attB-cat-xyldown fragment was introduced into PL1801 by transformation and the transformants were plated on Cm containing plates. Cells in which recombination between the DNA-fragment and the chromosome had occurred in both xyl regions would have retained the cat gene and would thus be Cm R . Transformants with this phenotype were isolated and by PCR and sequencing they were found to have the attB 161 site integrated in the xyl locus.
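The double cross-over described above can be pictured with a toy model in Python, in which the chromosome and the PCR fragment are lists of named elements. This is a sketch only: the element names (`xyl_up`, `xyl_middle`, `xyl_down`) and the function name are hypothetical labels, and real homologous recombination acts on sequence homology rather than on labels.

```python
# Sketch: double cross-over modelled as replacement of the chromosomal stretch
# between two homology arms by the insert carried on the linear fragment.
def double_crossover(chromosome: list[str], fragment: list[str]) -> list[str]:
    """Return the chromosome after recombination in both flanking arms."""
    up, down = fragment[0], fragment[-1]      # homology arms of the fragment
    i = chromosome.index(up)                  # upstream cross-over point
    j = chromosome.index(down, i + 1)         # downstream cross-over point
    return chromosome[:i] + fragment + chromosome[j + 1:]


# Hypothetical layout of the xyl locus in the recipient strain.
chromosome = ["...", "xyl_up", "xyl_middle", "xyl_down", "..."]
fragment = ["xyl_up", "attB161", "cat", "xyl_down"]

strain = double_crossover(chromosome, fragment)
is_cm_resistant = "cat" in strain   # cat retained, so the cell is Cm resistant
print(strain, is_cm_resistant)
```

The same picture applies to the amyE locus in the next step, with the tet gene in place of cat.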
- the minimal attB site of 43 bp (attB min , Breüner et al. (2001) Microbiology 147 2051-2063; SEQ ID NO: 22) was integrated in the amyE locus in AEB43, resulting in strain AEB165, which had two functional versions or copies of the TP901-1 attB site integrated in the chromosome, attB 161 and attB min .
- AttB min was obtained by transformation and subsequent double cross-over into the chromosome of AEB43 of the amyup-tet-attB-amydown PCR fragment, which was obtained much as described for integration of attB 161 in the xyl locus, except that upstream and downstream regions of the amyE locus were flanking the tet-attB-fragment, and these regions were obtained by PCR from pBC16 with the primers pattB-tet & ptet-down.
- When this fragment was transformed into AEB43, Tc R transformants could only arise if double crossover took place between the PCR-fragment and the bacterial chromosome at both ends of the PCR-fragment, leaving the tet gene and attB min in the chromosome. A number of Tc R transformants were isolated and found by PCR to contain the attB min site integrated at the intended position in the amyE locus.
- the TP901-1 integrase is needed to perform the recombination between the attB and attP sites.
- the expression of the integrase can be placed under the control of a constitutive or an inducible promoter.
- expression of the integrase is under the control of the P xyl -promoter, which is induced from a low to a high level of activity upon the addition of xylose.
- pAEB146 has a temperature sensitive replicon functional in Bacillus and the oriT region from plasmid pUB110 which enables conjugation, and thus can be used in both B. subtilis and B. licheniformis.
- the integrase can be expressed from a plasmid which has a different kind of replicon, or it could be integrated into the chromosome.
- pAEB153 contains the minimal attP site of TP901-1 (SEQ ID NO: 23) determined in Brøndsted and Hammer (1999) Appl. Environ. Microbiol. 65 752-758, but a larger attP region can also be used, or a smaller one, if it is still active in recombination.
- Replication of pAEB153 is dependent on donation of replication protein from another plasmid with the pE194 replicon such as pAEB146.
- the attP site can be cloned on a different plasmid vector, e.g. one with an origin which is not dependent on other plasmids for replication, and/or a thermosensitive origin.
- the attP site can also be included on the plasmid from which integrase is expressed.
- the plasmid containing attP can be used as a vector for cloning genes in such a way, that integration of the plasmid in attB in the chromosome will lead to expression of the gene from a promoter present in the chromosome next to the attB site.
- To make the distance between the promoter and the cryIIIA region as short as possible attP min and cryIIIA are overlapping.
- a single base in the attP region was changed. The mutation in attP did not interfere with the ability of the region to participate in recombination with attB, as is shown in example 5.
- AEB165 (2×attB) was transformed with pAEB146 (Int-donor) resulting in strain AEB182.
- AEB182 was in turn transformed with pAEB153 (attP).
- Transformants were grown and streaked at 33° C. (permissive temperature) and with selection for both plasmids to allow recombination between the attP and attB sites to take place. Then, a number of colonies were streaked on plates with selection only for pAEB153 and the incubation temperature was increased to 50° C., which disables replication of pAEB146 and thereby also of pAEB153.
- the only cells that can grow under these conditions are the ones where pAEB153 has integrated into the chromosome.
- the isolates were also checked for the presence of the Int-donor plasmid pAEB146 by streaking on selective plates (Em).
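The selection logic of this step can be written down as a small truth-table model in Python. It encodes only what the text states: pAEB153 replicates as a "slave" of pAEB146, neither plasmid replicates at 50° C., and under selection for pAEB153 at that temperature only integrants grow. The class and function names are hypothetical, and intermediate temperatures are not modelled.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    has_pAEB146: bool          # integrase donor with temperature-sensitive replicon
    free_pAEB153: bool         # attP plasmid replicating as a "slave" of pAEB146
    integrated_pAEB153: bool   # attP plasmid recombined into a chromosomal attB


def grows_on_pAEB153_selection(cell: Cell, permissive: bool) -> bool:
    """True if the cell keeps the pAEB153 marker under the given temperature regime.

    permissive=True stands for 33 deg C, permissive=False for 50 deg C.
    """
    replicating_copy = permissive and cell.has_pAEB146 and cell.free_pAEB153
    return cell.integrated_pAEB153 or replicating_copy


# Two hypothetical cells: one with free plasmids only, one true integrant.
unintegrated = Cell(has_pAEB146=True, free_pAEB153=True, integrated_pAEB153=False)
integrant = Cell(has_pAEB146=False, free_pAEB153=False, integrated_pAEB153=True)

for label, cell in (("unintegrated", unintegrated), ("integrant", integrant)):
    print(label,
          "33C:", grows_on_pAEB153_selection(cell, permissive=True),
          "50C:", grows_on_pAEB153_selection(cell, permissive=False))
```

The separate streak on Em plates corresponds to reading the `has_pAEB146` field: an integrant that has lost the donor plasmid no longer grows on Em.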
- Expression of the integrase was performed without the addition of xylose to the medium.
- the production of integrase could be increased by adding xylose and thereby activating the P xyl promoter, leading to a higher expression of the integrase.
- the xylose concentration could, e.g., be between 0.05 and 5%; at high concentrations of xylose the integrase is overexpressed and becomes toxic to the cell.
- a counterselectable marker such as the ysbC gene (Danish patent application PA 2004 00227, filed 13 Feb. 2004) can be positioned downstream of a promoter next to all the attB sites in the chromosome, but separated from the promoter by the attB site. Expression of the counterselectable marker from the promoter will lead to the cell being sensitive to the selective pressure (with ysbC: fluoro-orotate).
- the counterselectable marker is separated from the promoter, and the marker will no longer be expressed from this locus. However, only when integration has occurred in all attB sites will no marker be produced, and the cell will become resistant to the selection.
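The "all sites or nothing" behaviour of this counterselection can be shown with a short Python enumeration. The sketch assumes, as the text describes, that each unoccupied attB site still expresses the marker and that a single expressing locus is enough to make the cell sensitive; the function names and the choice of three sites are illustrative.

```python
from itertools import product


def expresses_marker(site_occupied: tuple[bool, ...]) -> bool:
    """The marker is still produced if at least one attB site is unoccupied."""
    return not all(site_occupied)


def survives_counterselection(site_occupied: tuple[bool, ...]) -> bool:
    """A cell is resistant only when no locus expresses the marker any more."""
    return not expresses_marker(site_occupied)


# Enumerate every occupancy pattern for a hypothetical strain with three attB sites.
patterns = list(product([False, True], repeat=3))
survivors = [p for p in patterns if survives_counterselection(p)]
print(len(patterns), survivors)
```

Of the eight possible patterns only the fully occupied one survives, which is why the scheme selects directly for cells with integration in every site.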
- a gene that gives a screenable phenotype can be used, such as an antibiotic selection marker, green fluorescence protein (GFP), beta-galactosidase, an amylase, or others. Integration of the attP plasmid in all of the attB sites will then lead to loss of resistance to the antibiotic, of green fluorescence, of colour on X-gal plates, of the amylase phenotype, or of what other phenotype was expressed from the marker.
- The attB and attP sites are interchangeable
- the sites can be interchanged, so that one or more attP sites are inserted in the host genome, and the attB site is present in a vector to be integrated into the attP sites on the chromosome.
- AttP and attB can be exchanged with copies of attL (SEQ ID NO: 24) in the chromosome of the host and attR (SEQ ID NO: 25) on the plasmid; or vice versa.
- Recombination between the attL and attR sites will result in the creation of attP and attB sites after recombination.
- effective recombination of the TP901-1's attL and attR sites requires the presence of the excisionase, Xis, in addition to the integrase (Breüner et al. (1999) Novel Organization of Genes Involved in Prophage Excision Identified in the Temperate Lactococcal Bacteriophage TP901-1. J Bacteriol 181(23): 7291-7297).
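The relationship between the four sites can be sketched in Python by treating each site as a pair of arms around a shared core. The arm notation (attB = B/B′, attP = P/P′, attL = B/P′, attR = P/B′) is the conventional textbook description of phage attachment sites and is assumed here rather than taken from this document; the Xis requirement for the attL × attR reaction follows the text.

```python
from typing import NamedTuple


class Site(NamedTuple):
    left: str    # arm on the left of the shared core
    right: str   # arm on the right of the shared core


NAMES = {
    ("B", "B'"): "attB",
    ("P", "P'"): "attP",
    ("B", "P'"): "attL",
    ("P", "B'"): "attR",
}

ATTB = Site("B", "B'")
ATTP = Site("P", "P'")


def name(site: Site) -> str:
    """Return the conventional name of a site from its two arms."""
    return NAMES[(site.left, site.right)]


def recombine(s1: Site, s2: Site, xis_present: bool = False) -> tuple[Site, Site]:
    """Exchange right arms at the core; attL x attR additionally needs Xis."""
    pair = {name(s1), name(s2)}
    if pair not in ({"attB", "attP"}, {"attL", "attR"}):
        raise ValueError("not a compatible pair of sites")
    if pair == {"attL", "attR"} and not xis_present:
        raise ValueError("attL x attR requires the excisionase Xis")
    return Site(s1.left, s2.right), Site(s2.left, s1.right)


att_l, att_r = recombine(ATTB, ATTP)                       # integration
back_b, back_p = recombine(att_l, att_r, xis_present=True)  # excision direction
print(name(att_l), name(att_r), name(back_b), name(back_p))
```

Starting from attL in the chromosome and attR on the plasmid therefore ends with attB and attP, the mirror image of the attB × attP route.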
- AttB sites were inserted at several positions in the chromosome of B. licheniformis, each site was inserted along with and downstream of a heterologous tandem promoter (as disclosed in WO 1999/043835).
- a vector comprising an attP site and an amylase encoding gene was then integrated into the chromosomal attB site by the integrase.
- the orientation of the amylase gene in the vector with respect to the attP site ensured that the gene became operably linked with the tandem promoter, when the vector was integrated into the chromosome through the recombination of the attB and attP sites.
- amylase encoding gene was inserted in the attP-int containing plasmid pAEB267 in such a way that integration of the plasmid via site-specific recombination between attB and attP catalysed by the TP901-1 integrase would result in the amylase gene being inserted into the chromosome so that it would be expressed from the heterologous tandem promoter, separated from the gene by the attL site ( FIG. 1 ) after the recombination.
- a strain where such an integration event had taken place (verified by PCR as described in example 4) was streaked on amylose containing plates, and clearing zones were formed, demonstrating that the amylase was expressed from the tandem promoter next to the attL site. No clearing zones were observed when pAEB267 without amylase gene was integrated in a similar manner as a control.
- AttB min sites were inserted at three positions in the chromosome of B. licheniformis (the amyL, xyl and gnt loci). Each attB min site was inserted along with and downstream of a heterologous tandem promoter (disclosed in WO 1999/043835).
- the cryIIIA region was located upstream of the amylase gene, and the orientation of the amylase gene in the vector with respect to the attP site ensured that the gene was located and oriented in all three loci as shown for “genX” in the middle part of FIG. 2 .
- Subsequent crossing out of the vector parts of the integrated plasmids in all three loci by means of homologous recombination between the two cryIIIA regions in each locus resulted in a strain, in which each of the three loci contained the region shown in the bottom of FIG. 2 : promoter, cryIIIA, amylase, and attR.
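The two-step route, site-specific integration followed by crossing out between the cryIIIA repeats, can be traced with a list-based Python sketch. FIG. 2 is not reproduced here, so the element order on the chromosome and on the plasmid is an assumption, chosen so that the model ends with the order stated in the text (promoter, cryIIIA, amylase, attR); the function names are hypothetical.

```python
# Sketch: integration at attB followed by homologous recombination between repeats.
def integrate(chromosome: list[str], plasmid: list[str]) -> list[str]:
    """Insert a circular plasmid at attB; attB x attP become attL and attR."""
    i = chromosome.index("attB")
    k = plasmid.index("attP")
    opened = plasmid[k + 1:] + plasmid[:k]   # circle opened at attP
    return chromosome[:i] + ["attL"] + opened + ["attR"] + chromosome[i + 1:]


def cross_out(locus: list[str], repeat: str) -> list[str]:
    """Recombine two direct repeats, keeping one copy and dropping what lies between."""
    first = locus.index(repeat)
    second = locus.index(repeat, first + 1)
    return locus[:first + 1] + locus[second + 1:]


# Assumed layouts (not taken from FIG. 2).
chromosome = ["promoter", "cryIIIA", "attB"]
plasmid = ["attP", "vector", "cryIIIA", "amylase"]

integrated = integrate(chromosome, plasmid)
final = cross_out(integrated, "cryIIIA")
print(integrated)
print(final)
```

In this model the attL site and the vector part are lost together with one cryIIIA copy, which leaves the amylase gene directly downstream of the promoter and the remaining cryIIIA region.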
Abstract
Methods of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter using a site specific recombinase and in vivo integration by recombination; means for carrying out the methods, resulting cells, and methods for producing a polypeptide of interest using the resulting cells.
Description
- A large number of naturally-occurring organisms have been found to produce useful polypeptide products, e.g., enzymes, the large scale production of which is desirable for research and commercial purposes. Once such a product has been identified, efforts are made to develop production methods leading to a high production of the product. One widely used method, which is based on recombinant DNA techniques, is to clone a gene encoding the product, insert the gene into a suitable expression system permitting the expression of the product, and culture a suitable host cell comprising the expression system, either integrated in the chromosome or as an extrachromosomal entity, under conditions conducive for the expression of the product.
- Irrespective of which production method is used, it is normally desirable to be able to increase the production level of a given polypeptide or protein. Thus, efforts are being made to increase the production, e.g. by inserting the gene encoding the product under the control of a strong expression signal, or by increasing the number of copies of the gene in the production organism in question. This latter approach may be accomplished by inserting the gene into a multicopy plasmid which generally, however, tends to be unstable in the host cell in question, or by integrating multiple copies of the gene into the chromosome of the production organism, an approach which generally is considered more attractive because the stability of the construct tends to be higher, allowing the gene to be stably maintained in the production organism.
- EP 0 284 126 and EP 166 628 disclose methods for stably integrating one or more copies of a gene into the chromosome of a prokaryotic cell already harbouring at least one copy of the gene in question in its chromosome. According to EP 0 284 126, a host cell comprising said gene is transformed with a DNA construct comprising another copy of the gene, whereby, after a suitable selection procedure, a cell is obtained which in its chromosome comprises two copies of the gene separated by an endogenous chromosomal sequence which is vital to the host cell and thereby ensures stable maintenance of the integrated gene. This procedure may be repeated so as to produce cells harbouring multiple copies of the gene in its chromosome. WO 2002/000907 describes methods to site-specifically integrate polynucleotides into inactivated chromosomal loci that are conditionally essential to the cell, where these loci are restored by the integration process.
- It has been shown that infection of host cells having a natural attachment site, attB, as well as an ectopically introduced attB site, with a derivative of the Streptomyces phage ΦC31 resulted in the integration of the phage into both attB sites (Smith et al., 2004. Switching the polarity of a bacteriophage integration system. Mol Microbiol 51(6): 1719-1728).
- In fact, multiple copies of a gene can be introduced into a cell comprising multiple attachment sites recognized by the Mx9 integrase using the Mx9 phage transformation system (WO 2004/018635 A2).
- It may often be difficult to achieve proper chromosomal integration in a host cell of a gene encoding a polypeptide of interest, when the gene is introduced into the cell on a DNA construct, even a low-copy number construct, while being actively transcribed from a promoter on the construct. This is particularly true for polypeptides that are inhibitory or perhaps even toxic to the host cell above a certain concentration. One way of avoiding this problem is to silence the gene while it is on the construct, so that transcription is only initiated when the gene has been properly integrated into the chromosome.
- Usually it is of interest to integrate several copies of a gene encoding a polypeptide of interest into the host cell chromosome, sometimes up to 10 or more copies. A method for simultaneously integrating the desired number of copies would be time-saving compared with stepwise methods like those mentioned above.
- The present invention provides a combined solution to these problems; it allows the simultaneous chromosomal site-specific integration of multiple copies of a gene (or operon) encoding a polypeptide(s) of interest, while also providing the means for initiating transcription of said gene after the proper integration of each copy via a heterologous promoter, which becomes operably linked with the gene only after the successful integration.
- Accordingly, in a first aspect, the present invention relates to a method of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter, said method comprising the steps of:
- (a) providing a cell comprising in its chromosome one or more copies of a first recognition sequence (RS1) of a site specific recombinase, wherein each copy of RS1 is located downstream of a copy of said heterologous promoter;
- (b) introducing into said cell a polynucleotide construct comprising the ORF or operon and a second recognition sequence (RS2) of the site specific recombinase, where RS2 is located and oriented with respect to the ORF or operon so that an in vivo recombination of RS2 with a copy of RS1 in the chromosome of the cell will integrate the construct into the chromosome and place the ORF or operon downstream of and in the same orientation as the heterologous promoter; and
- (c) recombining RS2 with the one or more copies of RS1 in the presence of the site specific recombinase, whereby one or more copies of the ORF or operon of interest are integrated into the chromosome and placed (i) either directly under the transcriptional control of the heterologous promoter, or (ii) downstream of and in the same orientation as the promoter but separated from it by a region, which can be excised after one or more optional recombination events, whereby the ORF or operon of interest is placed under the transcriptional control of the heterologous promoter.
- One of the means for carrying out the invention is a host cell specifically designed for this purpose; the cell has been engineered to comprise one or more copies of a recognition sequence (RS) of a site specific recombinase, as exemplified below, wherein each copy of the RS is located downstream of a copy of a heterologous promoter. This arrangement ensures that when a polynucleotide construct of the invention recombines into the chromosome by the action of the site specific recombinase, the ORF or operon comprised in the construct will be operably linked with the heterologous promoter already present in the chromosome.
- Accordingly, in a second aspect, the invention relates to a cell comprising in its chromosome one or more copies of a recognition sequence (RS) of a site specific recombinase, wherein each copy of the RS is located downstream of a copy of a heterologous promoter.
- In a third aspect, the invention relates to a cell produced by a method of the first aspect, or a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase.
- Another means for carrying out the invention is of course the polynucleotide construct mentioned in the method of the first aspect.
- Consequently, a fourth aspect of the invention relates to a polynucleotide construct comprising a promoterless open reading frame (ORF) or operon encoding at least one polypeptide of interest, the construct also comprising a recognition sequence (RS) of a site specific recombinase located upstream or downstream of said ORF or operon.
- In a final aspect, the invention relates to a method of producing a polypeptide of interest, said method comprising:
- (a) cultivating a cell of the third aspect or a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase; and
- (b) isolating the polypeptide of interest.
-
FIG. 1. A schematic overview of a preferred embodiment of the invention:
- A circular polynucleotide construct comprising the recognition sequence of the TP901-1 phage integrase, attP, located upstream of an open reading frame, genX. The construct further comprises an optional marker, a temperature sensitive origin of replication, oriTS, as well as a region located downstream of the open reading frame in the construct, which is indicated with a small arrow denoted “repeat”.
- A chromosome of a host cell is also shown comprising a heterologous promoter and the TP901-1 phage integrase recognition sequence, attB, corresponding to the recognition sequence in the construct, which is located downstream of the promoter. In addition, a region is indicated in the chromosome with a small arrow denoted “repeat”.
- The “repeat” regions of the chromosome and the polynucleotide construct should be sufficiently homologous to effectuate in vivo homologous recombination between the two homologous regions when both regions are present in the cell.
- In the presence of a suitable integrase (+integrase), e.g. the TP901-1 phage integrase, the attP and attB sites are recombined, whereby the construct is integrated into the chromosome, placing the open reading frame, genX, under the transcriptional control of the heterologous promoter, creating the resulting attL and attR sites in the process.
- In an optional next step, the two homologous “repeat” regions recombine, whereby the DNA in between the two regions is excised from the chromosome, leaving just the open reading frame, genX, in the chromosome along with the newly created attL site.
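- The two steps described for FIG. 1 can be illustrated with a toy model in which DNA molecules are plain lists of feature names. This is only a sketch under stated assumptions: the function names `integrate` and `excise_between_repeats`, the "spacer" feature, and the relative order of the marker and oriTS on the construct are hypothetical choices made for illustration, and no real sequence or recombinase chemistry is modelled.

```python
def integrate(chromosome, construct):
    """Model attB x attP recombination of a circular construct into a chromosome.

    Both arguments are lists of feature names. The circular construct is
    opened at attP and inserted at attB, which creates attL and attR.
    """
    b = chromosome.index("attB")
    p = construct.index("attP")
    # Linearize the circle at attP: everything after attP, then everything before it.
    insert = construct[p + 1:] + construct[:p]
    return chromosome[:b] + ["attL"] + insert + ["attR"] + chromosome[b + 1:]


def excise_between_repeats(chromosome, repeat="repeat"):
    """Model homologous recombination between two direct repeats.

    The DNA between the first and last repeat is removed and a single
    repeat copy is left behind.
    """
    positions = [i for i, feature in enumerate(chromosome) if feature == repeat]
    if len(positions) < 2:
        return list(chromosome)  # nothing to recombine
    first, last = positions[0], positions[-1]
    return chromosome[:first] + chromosome[last:]


# Hypothetical layouts loosely following FIG. 1 (feature order is an assumption).
chromosome = ["promoter", "attB", "spacer", "repeat"]
construct = ["attP", "genX", "repeat", "marker", "oriTS"]

integrated = integrate(chromosome, construct)
resolved = excise_between_repeats(integrated)

print(integrated)
print(resolved)
```

- In this model the integrated chromosome carries genX directly downstream of the promoter and attL, and the optional second step removes the marker, oriTS and attR, which is the outcome the figure description states.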
- For purposes of the present invention, alignments of sequences and calculation of homology scores may be done using a full Smith-Waterman alignment, useful for both protein and DNA alignments. The default scoring matrices BLOSUM50 and the identity matrix are used for protein and DNA alignments respectively. The penalty for the first residue in a gap is −12 for proteins and −16 for DNA, while the penalty for additional residues in a gap is −2 for proteins and −4 for DNA. Alignment may be made with the FASTA package version v20u6 (W. R. Pearson and D. J. Lipman (1988), “Improved Tools for Biological Sequence Analysis”, PNAS 85:2444-2448, and W. R. Pearson (1990) “Rapid and Sensitive Sequence Comparison with FASTP and FASTA”, Methods in Enzymology, 183:63-98).
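- The gap scheme described above (one penalty for the first residue in a gap, a smaller one for each additional residue) corresponds to an affine-gap local alignment. The following is a minimal sketch of such a score calculation, not the FASTA package itself. The function names and the DNA match/mismatch values (+5/−4) are assumptions chosen for illustration; only the gap penalties of −16 and −4 for DNA are taken from the text. A protein scorer such as a BLOSUM50 lookup could be passed in the same way.

```python
def smith_waterman_score(a, b, score, gap_first=-16, gap_extend=-4):
    """Return the best local alignment score with affine gap penalties.

    score(x, y) gives the substitution score for two residues.
    A gap of k residues costs gap_first + (k - 1) * gap_extend.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap from H, or extend an existing gap.
            E[i][j] = max(H[i][j - 1] + gap_first, E[i][j - 1] + gap_extend)
            F[i][j] = max(H[i - 1][j] + gap_first, F[i - 1][j] + gap_extend)
            diagonal = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0, diagonal, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


def dna_identity_score(x, y, match=5, mismatch=-4):
    """Identity-style DNA scoring; the numeric values are assumed, not from the text."""
    return match if x == y else mismatch


if __name__ == "__main__":
    s1 = "AAAAAAAAGGGGGGGG"
    s2 = "AAAAAAAACCCGGGGGGGG"  # same as s1 with a 3-residue insertion
    print(smith_waterman_score(s1, s2, dna_identity_score))
```

- For protein alignments the same function would be called with `gap_first=-12` and `gap_extend=-2` and a scorer backed by the BLOSUM50 matrix.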
- Multiple alignments of protein sequences may be made using “ClustalW” (Thompson, J. D., Higgins, D. G. and Gibson, T. J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Research, 22:4673-4680). Multiple alignment of DNA sequences may be done using the protein alignment as a template, replacing the amino acids with the corresponding codon from the DNA sequence.
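- Using a protein alignment as a template for a DNA alignment can be sketched as follows: each aligned amino acid is replaced by its codon from the coding sequence, and each gap character by three gap characters. The function name `codon_align` and the use of "-" as the gap symbol are assumptions made for this illustration.

```python
def codon_align(aligned_protein, coding_dna, gap="-"):
    """Thread an ungapped coding DNA sequence onto one row of a protein alignment.

    aligned_protein: protein sequence containing gap characters.
    coding_dna: the corresponding coding sequence, three nucleotides per residue,
                without a stop codon.
    """
    residues = sum(1 for aa in aligned_protein if aa != gap)
    if len(coding_dna) != 3 * residues:
        raise ValueError("coding DNA length must be three times the residue count")
    pieces = []
    pos = 0
    for aa in aligned_protein:
        if aa == gap:
            pieces.append(gap * 3)  # one protein gap becomes a codon-sized gap
        else:
            pieces.append(coding_dna[pos:pos + 3])
            pos += 3
    return "".join(pieces)


if __name__ == "__main__":
    print(codon_align("M-K", "ATGAAA"))
```

- Applying the function to every row of a ClustalW protein alignment yields DNA rows of equal length in which gaps fall only between codons.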
- “Promoter” is defined herein as a nucleic acid sequence involved in the binding of RNA polymerase to initiate transcription of a gene. “Tandem promoter” is defined herein as two or more promoter sequences each of which is operably linked to a coding sequence and mediates the transcription of the coding sequence into mRNA. “Operably linked” is defined herein as a configuration in which a control sequence, e.g., a promoter sequence, is appropriately placed at a position relative to a coding sequence such that the control sequence directs the production of a polypeptide encoded by the coding sequence. “Coding sequence” is defined herein as a nucleic acid sequence which is transcribed into mRNA and translated into a polypeptide when placed under the control of the appropriate control sequences. The boundaries of the coding sequence are generally determined by a ribosome binding site located just upstream of the open reading frame at the 5′ end of the mRNA and a transcription terminator sequence located just downstream of the open reading frame at the 3′ end of the mRNA. A coding sequence can include, but is not limited to, genomic DNA, cDNA, semisynthetic, synthetic, and recombinant nucleic acid sequences.
- “Heterologous” DNA in a host cell, in the present context refers to exogenous DNA not originating from the cell.
- “Nucleic acid construct” is defined herein as a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acid which are combined and juxtaposed in a manner which would not otherwise exist in nature. The term nucleic acid construct is synonymous with the term expression cassette when the nucleic acid construct contains all the control sequences required for expression of a coding sequence.
- The term “control sequences” is defined herein to include all components, which are necessary or advantageous for the expression of a polynucleotide encoding a polypeptide of the present invention. Each control sequence may be native or foreign to the nucleotide sequence encoding the polypeptide. Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleotide sequence encoding a polypeptide.
- The term “operably linked” denotes herein a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence such that the control sequence directs the expression of the coding sequence of a polypeptide.
- When used herein the term “coding sequence” means a nucleotide sequence, which directly specifies the amino acid sequence of its protein product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon or alternative start codons such as GTG and TTG. The coding sequence may be a DNA, cDNA, or recombinant nucleotide sequence.
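- As an illustration of how open reading frame boundaries might be located from the start codons named above, the following sketch scans the three forward reading frames of a DNA string. The function name `find_orfs`, the `min_codons` parameter, and the use of the standard stop codons TAA, TAG and TGA are assumptions, since the text does not specify them; the reverse strand is not scanned.

```python
START_CODONS = {"ATG", "GTG", "TTG"}   # start codons named in the definition
STOP_CODONS = {"TAA", "TAG", "TGA"}    # assumed: standard genetic code stops


def find_orfs(dna, min_codons=2):
    """Return (start, end) index pairs of ORFs on the forward strand.

    start is the index of the first base of the start codon; end is one past
    the last base of the stop codon. min_codons counts codons before the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon in START_CODONS:
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    seq = "CCATGAAATAGCC"
    for begin, end in find_orfs(seq):
        print(begin, end, seq[begin:end])
```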
- The term “expression” includes any step involved in the production of the polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
- The term “expression vector” is defined herein as a linear or circular DNA molecule that comprises a polynucleotide encoding a polypeptide of the invention, and which is operably linked to additional nucleotides that provide for its expression.
- The term “host cell”, as used herein, includes any cell type which is susceptible to transformation, transfection, transduction, and the like with a nucleic acid construct comprising a polynucleotide of the present invention.
- A polypeptide of the present invention may be obtained from microorganisms of any genus. For purposes of the present invention, the term “obtained from” as used herein in connection with a given source shall mean that the polypeptide encoded by a nucleotide sequence is produced by the source or by a strain in which the nucleotide sequence from the source has been inserted. In a preferred aspect, the polypeptide obtained from a given source is secreted extracellularly.
- A polypeptide of the present invention may be a bacterial polypeptide. For example, the polypeptide may be a gram positive bacterial polypeptide such as a Bacillus polypeptide, e.g., a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis polypeptide; or a Streptomyces polypeptide, e.g., a Streptomyces lividans or Streptomyces murinus polypeptide; or a gram negative bacterial polypeptide, e.g., an E. coli or a Pseudomonas sp. polypeptide.
- A polypeptide of the present invention may also be a fungal polypeptide, and more preferably a yeast polypeptide such as a Candida, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide; or more preferably a filamentous fungal polypeptide such as an Acremonium, Aspergillus, Aureobasidium, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, or Trichoderma polypeptide.
- In a preferred aspect, the polypeptide is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, or Saccharomyces oviformis polypeptide.
- In another preferred aspect, the polypeptide is an Aspergillus aculeatus, Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride polypeptide.
- It will be understood that for the aforementioned species, the invention encompasses both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs, regardless of the species name by which they are known. Those skilled in the art will readily recognize the identity of appropriate equivalents.
- Strains of these species are readily accessible to the public in a number of culture collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL).
- Furthermore, such polypeptides may be identified and obtained from other sources including microorganisms isolated from nature (e.g., soil, composts, water, etc.) using the above-mentioned probes. Techniques for isolating microorganisms from natural habitats are well known in the art. The polynucleotide may then be obtained by similarly screening a genomic or cDNA library of another microorganism. Once a polynucleotide sequence encoding a polypeptide has been detected with the probe(s), the polynucleotide can be isolated or cloned by utilizing techniques which are well known to those of ordinary skill in the art (see, e.g., Sambrook et al., 1989, supra).
- Polypeptides of the present invention also include fused polypeptides or cleavable fusion polypeptides in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide or fragment thereof. A fused polypeptide is produced by fusing a nucleotide sequence (or a portion thereof) encoding another polypeptide to a nucleotide sequence (or a portion thereof) of the present invention. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fused polypeptide is under control of the same promoter(s) and terminator.
- The present invention also relates to nucleic acid constructs comprising an isolated polynucleotide of the present invention operably linked to one or more control sequences which direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
- An isolated polynucleotide encoding a polypeptide of the present invention may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotide's sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector. The techniques for modifying polynucleotide sequences utilizing recombinant DNA methods are well known in the art.
- The control sequence may be an appropriate promoter sequence, a nucleotide sequence which is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention. The promoter sequence contains transcriptional control sequences which mediate the expression of the polypeptide. The promoter may be any nucleotide sequence which shows transcriptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
- Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the present invention, especially in a bacterial host cell, are the promoters obtained from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus subtilis levansucrase gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA and xylB genes, and prokaryotic beta-lactamase gene (Villa-Komaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proceedings of the National Academy of Sciences USA 80: 21-25). Further promoters are described in “Useful proteins from recombinant bacteria” in Scientific American, 1980, 242: 74-94; and in Sambrook et al., 1989, supra.
- Examples of suitable promoters for directing the transcription of the nucleic acid constructs of the present invention in a filamentous fungal host cell are promoters obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans acetamidase, Fusarium venenatum amyloglucosidase (WO 00/56900), Fusarium venenatum Daria (WO 00/56900), Fusarium venenatum Quinn (WO 00/56900), Fusarium oxysporum trypsin-like protease (WO 96/00787), Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, Trichoderma reesei endoglucanase I, Trichoderma reesei endoglucanase II, Trichoderma reesei endoglucanase III, Trichoderma reesei endoglucanase IV, Trichoderma reesei endoglucanase V, Trichoderma reesei xylanase I, Trichoderma reesei xylanase II, Trichoderma reesei beta-xylosidase, as well as the NA2-tpi promoter (a hybrid of the promoters from the genes for Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose phosphate isomerase); and mutant, truncated, and hybrid promoters thereof.
- In a yeast host, useful promoters are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH1, ADH2/GAP), Saccharomyces cerevisiae triose phosphate isomerase (TPI), Saccharomyces cerevisiae metallothionein (CUP1), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful promoters for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
- The control sequence may also be a suitable transcription terminator sequence, a sequence recognized by a host cell to terminate transcription. The terminator sequence is operably linked to the 3′ terminus of the nucleotide sequence encoding the polypeptide. Any terminator which is functional in the host cell of choice may be used in the present invention.
- Preferred terminators for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Aspergillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like protease.
- Preferred terminators for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are described by Romanos et al., 1992, supra.
- The control sequence may also be a suitable leader sequence, a nontranslated region of an mRNA which is important for translation by the host cell. The leader sequence is operably linked to the 5′ terminus of the nucleotide sequence encoding the polypeptide. Any leader sequence that is functional in the host cell of choice may be used in the present invention.
- Preferred leaders for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
- Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
- The control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3′ terminus of the nucleotide sequence and which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence which is functional in the host cell of choice may be used in the present invention.
- Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Fusarium oxysporum trypsin-like protease, and Aspergillus niger alpha-glucosidase.
- Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, Molecular Cellular Biology 15: 5983-5990.
- The control sequence may also be a signal peptide coding region that codes for an amino acid sequence linked to the amino terminus of a polypeptide and directs the encoded polypeptide into the cell's secretory pathway. The 5′ end of the coding sequence of the nucleotide sequence may inherently contain a signal peptide coding region naturally linked in translation reading frame with the segment of the coding region which encodes the secreted polypeptide. Alternatively, the 5′ end of the coding sequence may contain a signal peptide coding region which is foreign to the coding sequence. The foreign signal peptide coding region may be required where the coding sequence does not naturally contain a signal peptide coding region. Alternatively, the foreign signal peptide coding region may simply replace the natural signal peptide coding region in order to enhance secretion of the polypeptide. However, any signal peptide coding region which directs the expressed polypeptide into the secretory pathway of a host cell of choice may be used in the present invention.
- Effective signal peptide coding regions for bacterial host cells are the signal peptide coding regions obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus stearothermophilus alpha-amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described by Simonen and Palva, 1993, Microbiological Reviews 57: 109-137.
- Effective signal peptide coding regions for filamentous fungal host cells are the signal peptide coding regions obtained from the genes for Aspergillus oryzae TAKA amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola lanuginosa lipase.
- Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding regions are described by Romanos et al., 1992, supra.
- The control sequence may also be a propeptide coding region that codes for an amino acid sequence positioned at the amino terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to a mature active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding region may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Saccharomyces cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase, and Myceliophthora thermophila laccase (WO 95/33836).
- Where both signal peptide and propeptide regions are present at the amino terminus of a polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide and the signal peptide region is positioned next to the amino terminus of the propeptide region.
- It may also be desirable to add regulatory sequences which allow the regulation of the expression of the polypeptide relative to the growth of the host cell. Examples of regulatory systems are those which cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory systems in prokaryotic systems include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and Aspergillus oryzae glucoamylase promoter may be used as regulatory sequences. Other examples of regulatory sequences are those which allow for gene amplification. In eukaryotic systems, these include the dihydrofolate reductase gene which is amplified in the presence of methotrexate, and the metallothionein genes which are amplified with heavy metals. In these cases, the nucleotide sequence encoding the polypeptide would be operably linked with the regulatory sequence.
- The present invention also relates to recombinant expression vectors comprising a polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals. The various nucleic acids and control sequences described above may be joined together to produce a recombinant expression vector which may include one or more convenient restriction sites to allow for insertion or substitution of the nucleotide sequence encoding the polypeptide at such sites. Alternatively, a nucleotide sequence of the present invention may be expressed by inserting the nucleotide sequence or a nucleic acid construct comprising the sequence into an appropriate vector for expression. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
- The recombinant expression vector may be any vector (e.g., a plasmid or virus) which can be conveniently subjected to recombinant DNA procedures and can bring about expression of the nucleotide sequence. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vectors may be linear or closed circular plasmids.
- The vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host cell, or a transposon may be used.
- The vectors of the present invention preferably contain one or more selectable markers which permit easy selection of transformed cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
- Examples of bacterial selectable markers are the dal genes from Bacillus subtilis or Bacillus licheniformis, or markers which confer antibiotic resistance such as ampicillin, kanamycin, chloramphenicol, or tetracycline resistance. Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in a filamentous fungal host cell include, but are not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5′-phosphate decarboxylase), sC (sulfate adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof. Preferred for use in an Aspergillus cell are the amdS and pyrG genes of Aspergillus nidulans or Aspergillus oryzae and the bar gene of Streptomyces hygroscopicus.
- The vectors of the present invention preferably contain an element(s) that permits integration of the vector into the host cell's genome.
- For integration into the host cell genome, or subsequent excision of parts of the vector, the vector may rely on the polynucleotide's sequence encoding the polypeptide or any other element of the vector for integration into the genome by homologous or nonhomologous recombination. Alternatively, the vector may contain additional nucleotide sequences for directing integration or excision by homologous recombination into or from the genome of the host cell at a precise location(s) in the chromosome(s). To increase the likelihood of integration at or excision from a precise location, the integrational elements should preferably contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, preferably 400 to 10,000 base pairs, and most preferably 800 to 10,000 base pairs, which have a high degree of identity with the corresponding target sequence to enhance the probability of homologous recombination. The integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding nucleotide sequences. On the other hand, the vector may be integrated into the genome of the host cell by non-homologous recombination.
- Examples of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB110, pE194, pTA1060, and pAMβ1 permitting replication in Bacillus.
- Examples of origins of replication for use in a yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6.
- Examples of origins of replication useful in a filamentous fungal cell are AMA1 and ANS1 (Gems et al., 1991, Gene 98: 61-67; Cullen et al., 1987, Nucleic Acids Research 15: 9163-9175; WO 00/24883). Isolation of the AMA1 gene and construction of plasmids or vectors comprising the gene can be accomplished according to the methods disclosed in WO 00/24883.
- More than one copy of a polynucleotide of the present invention may be inserted into the host cell to increase production of the gene product. An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
- The procedures used to ligate the elements described above to construct the recombinant expression vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et al., 1989, supra).
- The present invention also relates to recombinant host cells, comprising a polynucleotide of the present invention, which are advantageously used in the recombinant production of the polypeptides. A vector comprising a polynucleotide of the present invention is introduced into a host cell so that the vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The term “host cell” encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
- The host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., a eukaryote.
- Useful unicellular microorganisms are bacterial cells such as gram positive bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., Streptomyces lividans and Streptomyces murinus, or gram negative bacteria such as E. coli and Pseudomonas sp. In a preferred aspect, the bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus, or Bacillus subtilis cell. In another preferred aspect, the Bacillus cell is an alkalophilic Bacillus.
- The introduction of a vector into a bacterial host cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular and General Genetics 168: 111-115), using competent cells (see, e.g., Young and Spizizen, 1961, Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, e.g., Koehler and Thorne, 1987, Journal of Bacteriology 169: 5271-5278).
- The host cell may also be a eukaryote, such as a mammalian, insect, plant, or fungal cell.
- In a preferred aspect, the host cell is a fungal cell. “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as defined by Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al., 1995, supra, page 171) and all mitosporic fungi (Hawksworth et al., 1995, supra).
- In a more preferred aspect, the fungal host cell is a yeast cell. “Yeast” as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may change in the future, for the purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, F. A., Passmore, S. M., and Davenport, R. R., eds, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
- In an even more preferred aspect, the yeast host cell is a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell.
- In a most preferred aspect, the yeast host cell is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces oviformis cell. In another most preferred aspect, the yeast host cell is a Kluyveromyces lactis cell. In another most preferred aspect, the yeast host cell is a Yarrowia lipolytica cell.
- In another more preferred aspect, the fungal host cell is a filamentous fungal cell. “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
- In an even more preferred aspect, the filamentous fungal host cell is an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell.
- In a most preferred aspect, the filamentous fungal host cell is an Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger or Aspergillus oryzae cell. In another most preferred aspect, the filamentous fungal host cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, or Fusarium venenatum cell. In another most preferred aspect, the filamentous fungal host cell is a Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Coprinus cinereus, Coriolus hirsutus, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
- Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus and Trichoderma host cells are described in EP 238 023 and Yelton et al., 1984, Proceedings of the National Academy of Sciences USA 81: 1470-1474. Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156, and WO 96/00787. Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J. N. and Simon, M. I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of Bacteriology 153: 163; and Hinnen et al., 1978, Proceedings of the National Academy of Sciences USA 75: 1920.
- The present invention also relates to methods for producing a polypeptide of the present invention, comprising (a) cultivating a cell, which in its wild-type form is capable of producing the polypeptide, under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
- The present invention also relates to methods for producing a polypeptide of the present invention, comprising (a) cultivating a host cell under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
- In the production methods of the present invention, the cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods well known in the art. For example, the cell may be cultivated by shake flask cultivation, and small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
- The polypeptides may be detected using methods known in the art that are specific for the polypeptides. These detection methods may include use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme assay may be used to determine the activity of the polypeptide as described herein.
- The resulting polypeptide may be recovered using methods known in the art. For example, the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
- The polypeptides of the present invention may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publishers, New York, 1989).
- The first aspect of the invention relates to a method of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter, said method comprising the steps of:
- (a) providing a cell comprising in its chromosome one or more copies of a first recognition sequence (RS1) of a site specific recombinase, wherein each copy of RS1 is located downstream of a copy of said heterologous promoter;
- (b) introducing into said cell a polynucleotide construct comprising the ORF or operon and a second recognition sequence (RS2) of the site specific recombinase, where RS2 is located and oriented with respect to the ORF or operon so that an in vivo recombination of RS2 with a copy of RS1 in the chromosome of the cell will integrate the construct into the chromosome and place the ORF or operon downstream of and in the same orientation as the heterologous promoter; and
- (c) recombining RS2 with the one or more copies of RS1 in the presence of the site specific recombinase, whereby one or more copies of the ORF or operon of interest are integrated into the chromosome and placed (i) either directly under the transcriptional control of the heterologous promoter, or (ii) downstream of and in the same orientation as the promoter but separated from it by a region, which can be excised after one or more optional recombination events, whereby the ORF or operon of interest is placed under the transcriptional control of the heterologous promoter.
- In the cell of step (a) each copy of RS1 is located downstream of a copy of the heterologous promoter. How far downstream of the promoter RS1 may be located in the cell is a matter of trial and error; the only limiting factor is that the promoter must be operably linked with the ORF or operon after the construct has been integrated into the chromosome. Preferably RS1 is located up to 10,000 bp downstream of the promoter, even more preferably up to 5,000 bp downstream of the promoter, and most preferably no more than 500 bp downstream of the promoter.
- Correspondingly, in the polynucleotide construct “RS2 is located and oriented with respect to the ORF or operon so that an in vivo recombination of RS2 with a copy of RS1 in the chromosome of the cell will integrate the construct into the chromosome and bring the ORF or operon under the transcriptional control of the heterologous promoter”. This is to ensure that the ORF or operon and RS2 have the correct orientation with respect to each other and with respect to the polarity of RS1 in the chromosome, so that the recombinase mediated recombination between RS1 and RS2 will place the ORF or operon under the transcriptional control of the promoter. RS2 is preferably located up to 10,000 bp upstream of the ORF or operon, even more preferably up to 5,000 bp upstream of the ORF or operon, and most preferably no more than 500 bp upstream of the ORF or operon.
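- As an illustration only, steps (a) to (c) and the placement requirement above can be sketched as a toy model in which the chromosome and the circular construct are ordered lists of element labels. The labels (`P`, `ORF`, `marker`, `ori`, `geneA`, `geneB`), the function names, and the hybrid-site labels `RS1/RS2` and `RS2/RS1` are assumptions made for this sketch and do not appear in the specification; nucleotide sequence, strand orientation, and base-pair distances are not modelled.

```python
from typing import List


def integrate(chromosome: List[str], construct: List[str],
              rs1: str = "RS1", rs2: str = "RS2") -> List[str]:
    """Insert a circular construct into the chromosome at the first RS1.

    The circle is opened at RS2, and the elements that follow RS2 end up
    between two hybrid sites (labelled RS1/RS2 and RS2/RS1 here).
    """
    i = chromosome.index(rs1)   # position of RS1 in the chromosome
    j = construct.index(rs2)    # position of RS2 in the circular construct
    # Linearise the circle so that it starts immediately after RS2.
    opened = construct[j + 1:] + construct[:j]
    return (chromosome[:i] + [f"{rs1}/{rs2}"] + opened
            + [f"{rs2}/{rs1}"] + chromosome[i + 1:])


def elements_between(chromosome: List[str], upstream: str,
                     downstream: str) -> List[str]:
    """Return the elements lying between two labels, in chromosome order."""
    i = chromosome.index(upstream)
    j = chromosome.index(downstream)
    if j <= i:
        raise ValueError(f"{downstream} is not downstream of {upstream}")
    return chromosome[i + 1:j]


# Step (a): RS1 sits downstream of the heterologous promoter P.
chromosome = ["geneA", "P", "RS1", "geneB"]
# Step (b): the construct carries RS2 directly upstream of the ORF.
construct = ["RS2", "ORF", "marker", "ori"]
# Step (c): recombination between RS1 and RS2 integrates the construct.
integrated = integrate(chromosome, construct)
```

- In this sketch only the hybrid site separates `P` from `ORF` after integration, which corresponds to alternative (i) of step (c); any additional elements returned by `elements_between` would correspond to the intervening region of alternative (ii).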
- The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source. The host cell may be a unicellular microorganism, e.g., a prokaryote, or a non-unicellular microorganism, e.g., a eukaryote. Useful unicellular cells are bacterial cells such as Gram positive bacteria including, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis; or a Streptomyces cell, e.g., Streptomyces lividans or Streptomyces murinus, or Gram negative bacteria such as E. coli and Pseudomonas sp. In a preferred embodiment, the bacterial host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus or Bacillus subtilis cell.
- In a preferred embodiment of any of the aspects of the invention, the cell is a prokaryotic cell, preferably a Bacillus cell, and more preferably a Bacillus subtilis or a Bacillus licheniformis cell.
- The ORF or operon in any aspects of the invention preferably encodes at least one enzyme; preferably an oxidoreductase, a transferase, a hydrolase, a lyase, an isomerase, or a ligase; more preferably an amylolytic enzyme, a lipolytic enzyme, a proteolytic enzyme, a cellulolytic enzyme, an oxidoreductase or a plant cell-wall degrading enzyme, and most preferably an enzyme with an activity selected from the group consisting of aminopeptidase, amylase, amyloglucosidase, carbohydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, cyclodextrin glycosyltransferase, deoxyribonuclease, esterase, galactosidase, beta-galactosidase, glucoamylase, glucose oxidase, glucosidase, haloperoxidase, hemicellulase, invertase, isomerase, laccase, ligase, lipase, lyase, mannosidase, oxidase, pectinase, peroxidase, phytase, phenoloxidase, polyphenoloxidase, protease, ribonuclease, transferase, transglutaminase, or xylanase.
- WO 1993/010249 discloses various promoter variants, and WO 1999/043835 discloses tandem and triple promoter constructions with improved properties. Each promoter sequence of the tandem promoter may be any nucleic acid sequence which shows transcriptional activity in the Bacillus cell of choice including a mutant, truncated, and hybrid promoter, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the Bacillus cell. Each promoter sequence may be native or foreign to the nucleic acid sequence encoding the polypeptide and native or foreign to the Bacillus cell. The promoter sequences may be the same promoter sequence or different promoter sequences.
- In a preferred embodiment, the promoter sequences may be obtained from a bacterial source. In a more preferred embodiment, the promoter sequences may be obtained from a gram positive bacterium such as a Bacillus strain, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis; or a Streptomyces strain, e.g., Streptomyces lividans or Streptomyces murinus; or from a gram negative bacterium, e.g., E. coli or Pseudomonas sp.
- An example of a suitable promoter for directing the transcription of a nucleic acid sequence in the methods of the present invention is the promoter obtained from the E. coli lac operon. Another example is the promoter of the Streptomyces coelicolor agarase gene (dagA). Another example is the promoter of the Bacillus lentus alkaline protease gene (aprH). Another example is the promoter of the Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene). Another example is the promoter of the Bacillus subtilis levansucrase gene (sacB). Another example is the promoter of the Bacillus subtilis alpha-amylase gene (amyE). Another example is the promoter of the Bacillus licheniformis alpha-amylase gene (amyL). Another example is the promoter of the Bacillus stearothermophilus maltogenic amylase gene (amyM). Another example is the promoter of the Bacillus amyloliquefaciens alpha-amylase gene (amyQ). Another example is a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region. Another example is the promoter of the Bacillus licheniformis penicillinase gene (penP). Other examples are the promoters of the Bacillus subtilis xylA and xylB genes. Another example is the promoter of the Bacillus thuringiensis subsp. tenebrionis CryIIIA gene (cryIIIA, SEQ ID NO. 1) or portions thereof. Another example is the promoter of the prokaryotic beta-lactamase gene (Villa-Komaroff et al., 1978, Proceedings of the National Academy of Sciences USA 75: 3727-3731). Another example is the spo1 bacterial phage promoter. Another example is the tac promoter (DeBoer et al., 1983, Proceedings of the National Academy of Sciences USA 80: 21-25). Further promoters are described in “Useful proteins from recombinant bacteria” in Scientific American, 1980, 242: 74-94; and in Sambrook, Fritsch, and Maniatis, 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, New York.
- The two or more promoter sequences of the tandem promoter may simultaneously promote the transcription of the nucleic acid sequence. Alternatively, one or more of the promoter sequences of the tandem promoter may promote the transcription of the nucleic acid sequence at different stages of growth of the Bacillus cell.
- In a preferred embodiment, the tandem promoter contains at least the amyQ promoter of the Bacillus amyloliquefaciens alpha-amylase gene. In another preferred embodiment, the tandem promoter contains at least a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region. In another preferred embodiment, the tandem promoter contains at least the amyL promoter of the Bacillus licheniformis alpha-amylase gene. In another preferred embodiment, the tandem promoter contains at least the cryIIIA promoter or portions thereof (Agaisse and Lereclus, 1994, supra).
- In a more preferred embodiment, the tandem promoter contains at least the amyL promoter and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least the amyQ promoter and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region and the cryIIIA promoter. In another more preferred embodiment, the tandem promoter contains at least two copies of the amyL promoter. In another more preferred embodiment, the tandem promoter contains at least two copies of the amyQ promoter. In another more preferred embodiment, the tandem promoter contains at least two copies of a “consensus” promoter having the sequence TTGACA for the “-35” region and TATAAT for the “-10” region. In another more preferred embodiment, the tandem promoter contains at least two copies of the cryIIIA promoter.
- The construction of a “consensus” promoter may be accomplished by site-directed mutagenesis to create a promoter which conforms more perfectly to the established consensus sequences for the “-10” and “-35” regions of the vegetative “sigma A-type” promoters for Bacillus subtilis (Voskuil et al., 1995, Molecular Microbiology 17: 271-279). The consensus sequence for the “-35” region is TTGACA and for the “-10” region is TATAAT. The consensus promoter may be obtained from any promoter which can function in a Bacillus host cell.
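- As a minimal sketch of how closely a given promoter conforms to the consensus hexamers stated above, the following compares a “-35” or “-10” hexamer with TTGACA or TATAAT and lists the substitutions that site-directed mutagenesis would have to introduce. The function names and the example hexamer `TTGTTA` are hypothetical and are not taken from any SEQ ID NO of this specification; locating the hexamers within a promoter sequence is assumed to have been done beforehand.

```python
from typing import List, Tuple

CONSENSUS_35 = "TTGACA"  # consensus "-35" region stated in the text
CONSENSUS_10 = "TATAAT"  # consensus "-10" region stated in the text


def matches(hexamer: str, consensus: str) -> int:
    """Count the positions at which a hexamer agrees with the consensus."""
    if len(hexamer) != len(consensus):
        raise ValueError("hexamer and consensus must have the same length")
    return sum(a == b for a, b in zip(hexamer.upper(), consensus))


def changes_to_consensus(hexamer: str,
                         consensus: str) -> List[Tuple[int, str, str]]:
    """List (position, current base, consensus base) for each mismatch.

    Positions are 1-based and counted within the hexamer only.
    """
    if len(hexamer) != len(consensus):
        raise ValueError("hexamer and consensus must have the same length")
    return [(pos, old, new)
            for pos, (old, new) in enumerate(zip(hexamer.upper(), consensus),
                                             start=1)
            if old != new]


# Hypothetical "-35" hexamer that differs from the consensus at two positions.
example_35 = "TTGTTA"
score = matches(example_35, CONSENSUS_35)
edits = changes_to_consensus(example_35, CONSENSUS_35)
```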
- In a preferred embodiment, the “consensus” promoter is obtained from a promoter obtained from the E. coli lac operon, Streptomyces coelicolor agarase gene (dagA), Bacillus lentus alkaline protease gene (aprH), Bacillus licheniformis alkaline protease gene (subtilisin Carlsberg gene), Bacillus subtilis levansucrase gene (sacB), Bacillus subtilis alpha-amylase gene (amyE), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis penicillinase gene (penP), Bacillus subtilis xylA and xylB genes, Bacillus thuringiensis subsp. tenebrionis CryIIIA gene (cryIIIA, SEQ ID NO. 1) or portions thereof, prokaryotic beta-lactamase gene, or spo1 bacterial phage promoter.
- In a more preferred embodiment, the “consensus” promoter is obtained from Bacillus amyloliquefaciens alpha-amylase gene (amyQ). In a most preferred embodiment, the consensus promoter is the “consensus” amyQ promoter contained in nucleotides 1 to 185 of SEQ ID NO. 3 or SEQ ID NO. 4. In another most preferred embodiment, the consensus promoter is the short “consensus” amyQ promoter contained in nucleotides 86 to 185 of SEQ ID NO. 3 or SEQ ID NO. 4. The “consensus” amyQ promoter of SEQ ID NO. 3 contains the following mutations of the nucleic acid sequence containing the wild-type amyQ promoter (SEQ ID NO. 2): T to A and T to C in the -35 region (with respect to the transcription start site) at positions 135 and 136, respectively, and an A to T change in the -10 region at position 156 of SEQ ID NO. 2. The “consensus” amyQ promoter (SEQ ID NO. 2) further contains a T to A change at position 116 approximately 20 base pairs upstream of the -35 region as shown in FIG. 21 (SEQ ID NO. 4). This change apparently had no detrimental effect on promoter function since it is well removed from the critical -10 and -35 regions.
- Accordingly, in a preferred embodiment of any aspects of the invention, the heterologous promoter comprises two or more promoters; preferably the two or more promoters comprise one or more promoters derived from one or more Bacillus genes; more preferably the two or more promoters comprise one or more of the following: the amyQ promoter, the amyL promoter, the cryIIIA promoter, and a consensus promoter comprising the nucleotide sequence TTGACA for the -35 region and the nucleotide sequence TATAAT for the -10 region.
- Site specific recombinases, including phage integrases, are well-known in the art, where they are usually grouped into tyrosine recombinases or serine recombinases. A sub-group of the serine recombinases is the large serine recombinases, which contains all the known serine recombinase-type phage integrases. The large serine recombinases contain the resolvase/invertase-like N-terminal catalytic domains of all serine recombinases, but their C-terminal regions are much larger and very diverse (Smith and Thorpe, 2002, Diversity in the serine recombinases, Mol Microbiol 44: 299-307). A review of phage integrases is given by Groth and Calos (J. Mol. Biol. 2004, 335: 667-678).
- Accordingly, a preferred embodiment of all aspects of the invention relates to where the site specific recombinase comprises a phage integrase, preferably a tyrosine recombinase or a serine recombinase, more preferably a large serine recombinase, and most preferably the TP901-1 integrase.
- The TP901-1 integrase is well-characterized, e.g. in Breüner et al. 2001. Resolvase-like recombination performed by the TP901-1 integrase. Microbiology 147: 2051-2063. In addition, the recognition sequences of TP901-1 integrase (attP, attB, attL and attR) are well-known.
- A preferred embodiment relates to the method of the first aspect, wherein RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB161 (SEQ ID NO: 21) or attBmin (SEQ ID NO: 22), RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase.
- Since the attP and attB recognition sequences may be switched around, another preferred embodiment relates to the method of the first aspect, wherein RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attPmin (SEQ ID NO: 23), RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB161 (SEQ ID NO: 21) or attBmin (SEQ ID NO: 22), and the site specific recombinase comprises the phage TP901-1 integrase.
- The attP and attB sequences may also be substituted with the corresponding attL and attR sequences in the method of the invention, which in turn may also be switched around, provided that the integrase is supplemented with the excisionase, Xis.
- Accordingly, a preferred embodiment of the invention relates to the first aspect, wherein RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24), RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attRmin (SEQ ID NO: 25), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
- Another preferred embodiment of the invention relates to the first aspect, wherein RS1 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attRmin (SEQ ID NO: 25), RS2 comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
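- The identity thresholds recited in the preceding embodiments (70%, 75%, 80%, 85%, 90%, 95%, 98%) can be illustrated with a simple position-by-position comparison. This is a sketch under stated assumptions: it compares two sequences of equal length without gaps, whereas this passage does not specify the alignment method, and a gapped alignment program would normally be needed for sequences of different lengths. The function names and the example sequences are hypothetical and are not the att sequences of SEQ ID NOs 21 to 25.

```python
from typing import Optional

# Identity thresholds recited in the text, in ascending order.
THRESHOLDS = (70, 75, 80, 85, 90, 95, 98)


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Ungapped percent identity between two equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    same = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * same / len(seq_a)


def highest_threshold_met(identity: float) -> Optional[int]:
    """Return the highest recited threshold that the identity reaches."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Hypothetical 10-nucleotide sequences differing at one position.
reference = "ACGTACGTAC"
candidate = "ACGTACGTAA"
identity = percent_identity(reference, candidate)
tier = highest_threshold_met(identity)
```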
- Agaisse and Lereclus (1994, Molecular Microbiology 13: 97-107) disclose a structural and functional analysis of the promoter region involved in the full expression of the cryIIIA toxin gene of Bacillus thuringiensis. WO 94/25612 discloses an mRNA stabilizer region downstream of the promoter and upstream of the coding sequence of the cryIIIA gene which increases expression of the gene.
- Hue et al. (1995, Journal of Bacteriology 177: 3465-3471) disclose a 5′ mRNA stabilizer sequence which stabilized several heterologous RNA sequences when present at the 5′ end and increased expression of downstream coding sequences several-fold in Bacillus subtilis.
- “An mRNA processing/stabilizing sequence” is defined herein as a sequence located downstream of one or more promoter sequences and upstream of a coding sequence to which each of the one or more promoter sequences are operably linked such that all mRNAs synthesized from each promoter sequence may be processed to generate mRNA transcripts with a stabilizer sequence at the 5′ end of the transcripts. The presence of such a stabilizer sequence at the 5′ end of the mRNA transcripts increases their half-life (Agaisse and Lereclus, 1994, supra, Hue et al., 1995, supra). The mRNA processing/stabilizing sequence is complementary to the 3′ extremity of a bacterial 16S ribosomal RNA. In a preferred embodiment, the mRNA processing/stabilizing sequence generates essentially single-size transcripts with a stabilizing sequence at the 5′ end of the transcripts.
- In a more preferred embodiment, the mRNA processing/stabilizing sequence is the Bacillus thuringiensis cryIIIA mRNA processing/stabilizing sequence disclosed in WO 94/25612 and Agaisse and Lereclus, 1994, supra, or portions thereof which retain the mRNA processing/stabilizing function. In another more preferred embodiment, the mRNA processing/stabilizing sequence is the Bacillus subtilis SP82 mRNA processing/stabilizing sequence disclosed in Hue et al., 1995, supra, or portions thereof which retain the mRNA processing/stabilizing function.
- When the cryIIIA promoter and its mRNA processing/stabilizing sequence are employed in the methods of the present invention, a DNA fragment containing the sequence disclosed in WO 94/25612 and Agaisse and Lereclus, 1994, supra, or portions thereof which retain the promoter and mRNA processing/stabilizing functions, may be used. Furthermore, DNA fragments containing only the cryIIIA promoter or only the cryIIIA mRNA processing/stabilizing sequence may be prepared using methods well known in the art to construct various tandem promoter and mRNA processing/stabilizing sequence combinations. In this embodiment, the cryIIIA promoter and its mRNA processing/stabilizing sequence are preferably placed downstream of the other promoter sequence(s) constituting the tandem promoter and upstream of the coding sequence of the gene of interest.
- In a preferred embodiment of the method of the first aspect of the invention, at least one mRNA stabilizing region is located between the heterologous promoter and RS1 in the chromosome of the cell in step (a); preferably the at least one mRNA stabilizing region comprises an mRNA stabilizing region derived from cryIIIA; more preferably the at least one mRNA stabilizing region comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to the sequence shown in positions 35-580 of SEQ ID NO: 26.
- Another preferred embodiment relates to the method of the first aspect, wherein the polynucleotide construct further comprises at least one mRNA stabilizing region located upstream of the ORF or operon between the ORF or operon and RS2.
- Another preferred embodiment relates to the cell of the second aspect, wherein at least one mRNA stabilizing region is located between the heterologous promoter and RS in the chromosome of the cell.
- Yet another preferred embodiment relates to the cell of the third aspect, wherein at least one mRNA stabilizing region is located between the heterologous promoter and the one or more copies of the ORF or operon in the chromosome of the cell.
- Preferably, in an embodiment of the polynucleotide construct of the fourth aspect, at least one mRNA stabilizing region is located upstream of the ORF or operon.
- To be able to cross out the vector part of the integrated polynucleotide construct, including an optional marker, regions of homology can be designed at the proper positions in the construct and next to the recognition sequence in the chromosome of the host cell prior to integration. This is illustrated by the regions designated “repeat” in FIG. 1. The regions may either be inserted heterologous polynucleotide regions, or one region may be designed on the basis of a corresponding region, which may be naturally found in the other sequence. Homologous recombination between these two regions subsequent to the integration of the plasmid via site specific recombination between the recognition sequences RS1 and RS2 will then lead to excision of the polynucleotide between the two regions, leaving only the ORF or operon of interest on the chromosome, next to the recognition sequence site resulting from the first integration event.
- Accordingly, in a preferred embodiment of the method of the first aspect, the polynucleotide construct further comprises a region located upstream or downstream of the ORF or operon in the construct, said region being sufficiently homologous with a corresponding region located upstream or downstream, correspondingly, of RS1 in the chromosome of the cell to effectuate in vivo homologous recombination between the two homologous regions when both regions are present in the cell.
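- The crossing-out step described above can be pictured with a toy string model. The sketch below is an illustration only: the element labels and their order are hypothetical (they are not taken from FIG. 1), and the function name `excise_between_repeats` is invented for this example. It collapses two direct copies of a repeat into one and deletes everything between them, which is the net outcome of homologous recombination between two direct repeats.

```python
def excise_between_repeats(chromosome: str, repeat: str) -> str:
    """Collapse two direct copies of `repeat` into one copy,
    deleting the segment that lies between them."""
    first = chromosome.find(repeat)
    if first == -1:
        raise ValueError("repeat not found")
    second = chromosome.find(repeat, first + len(repeat))
    if second == -1:
        raise ValueError("a second copy of the repeat is required")
    # Keep everything up to and including the first repeat, then resume
    # immediately after the second repeat.
    return chromosome[: first + len(repeat)] + chromosome[second + len(repeat):]


# Hypothetical layout after site-specific integration (labels are assumptions).
integrated = "<promoter><attL><ORF><repeat><vector><marker><attR><repeat><downstream>"
crossed_out = excise_between_repeats(integrated, "<repeat>")
print(crossed_out)
```

- In this toy layout the vector part and the marker are lost, while the promoter, the recognition site formed by the first integration event, and the ORF remain.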
- Alternatively, two regions that are recognition sites of a site specific recombinase, different from the one used for integration, can be inserted at the same positions as the aforementioned repeats. In the presence of the proper site specific recombinase, recombination between the two sites will then lead to excision of the region between the two sites, leaving only the gene of interest on the chromosome. Non-limiting examples are the well-known resolvase systems, with two res sites and a specific resolvase, which performs the recombination between the two sites. The concept of using site specific recombination systems for excision of sequences from the bacterial chromosome was described, e.g., for the recombination system of the broad-host range plasmid RP4 (Eberl, L., Kristensen, C. S., Givskov, M., Grohmann, E., Gerlitz, M., Schwab, H. (1994), Analysis of the multimer resolution system encoded by the parCBA operon of broad-host-range plasmid RP4, Mol. Microbiol., 12, 131-141). Stark, W. M., Boocock, M. R., Sherratt, D. J. ((1992), Catalysis by site-specific recombinases, Trends in Genetics, 8, 432-439) is a review article on the mechanism of resolvase action. Camilli et al. ((1994), Use of genetic recombination as a reporter of gene expression, Proc. Natl. Acad. Sci. USA, 91, 2634-2638) describe the use of res sites and resolvase from the γδ-transposon in Vibrio cholerae as a permanent, heritable marker of gene expression from a chromosomal gene. Chang, L.-K. et al. ((1994), Construction of Tn917ac1, a transposon useful for mutagenesis and cloning of Bacillus subtilis genes, Gene, 150, 129-134) describe the plasmids (pE194) containing erm-res-tnpA (transposase)-tnpR (resolvase) and IR-res-ori colE1-ABR1-ABR2-IR (pD917; Tn917ac1).
- The broad host range, gram-positive plasmid pAMβ1 (Clewell, D. B., Yagi, Y., Dunny, G. M., Schultz, S. K. (1974) Characterization of three plasmid deoxyribonucleic acid molecules in a strain of Streptococcus faecalis: identification of a plasmid determining erythromycin resistance. J. Bacteriol. 117, 283-289) has been described to contain a resolution system that resolves plasmid multimers into monomers via a site specific recombination event, requiring a specific plasmid encoded enzyme (resolvase) and a site, res, on the plasmid (Swinfield, T.-J., Janniere, L., Ehrlich, S. D., Minton, N. P. (1991). Characterization of a region of the Enterococcus faecalis plasmid pAMβ1 which enhances the segregational stability of pAMβ1-derived cloning vectors in Bacillus subtilis. Plasmid 26, 209-221; Janniere, L., Gruss, A., Ehrlich, S. D. (1993) Plasmids, pp. 625-644 in Sonenshein, A. L., Hoch, J. A., Losick, R. (eds.) Bacillus subtilis and other gram-positive bacteria: Biochemistry, Physiology and molecular genetics. American society for microbiology, Washington D.C.). It has been suggested to use a site-specific recombination system to remove a single selectable marker gene from the genome of a bacterial cell. For instance, Dale, et al. ((1991) Gene transfer with subsequent removal of the selection gene from the host genome, Proc. Natl. Acad. Sci. USA, 88, 10558-10562) describe the use of the cre/lox system for removal of markers from transgenic plants and mention that the use of this system would obviate the need for different selectable markers in subsequent rounds of gene transfer into the same host. Kristensen, C. S. et al. (1995), J. Bacteriol., 177, 52-58, describe the use of the multimer resolution system of the plasmid RP4 for the precise excision of chromosomal segments (such as marker genes introduced with heterologous DNA) from gram-negative bacteria. It is stated that the system is envisaged to be of interest in the generation of chromosomal insertions of heterologous DNA segments eventually devoid of any selection marker. WO 95/02058 describes a new transposon (Tn5401) from B. thuringiensis containing transposase, resolvase, and res site. The transposon is used in a plasmid which contains B. thuringiensis DNA (e.g. origin and toxin gene) and, flanked by res sites, non-B. thuringiensis DNA (e.g. E. coli origin, selectable marker genes). The plasmid is introduced into B. thuringiensis. Subsequently, a plasmid expressing the resolvase is introduced (e.g. a thermosensitive plasmid containing the entire transposon, but only used as resolvase donor) whereby the non-B. thuringiensis DNA is excised from the first plasmid.
- To ease identification of the clones in which crossing out has taken place, a counterselectable marker, such as the ysbC gene (Danish patent application PA 2004 00227; filed 13 Feb. 2004; Novozymes A/S), can be present on the vector-part of the polynucleotide construct. When all the constructs integrated in the chromosomes have crossed out and are lost from the cell, the marker will no longer be present in the cell, which then becomes resistant to the selection.
- Alternatively to a counterselectable marker, a gene that gives a screenable phenotype can be used, such as an antibiotic selection marker, GFP, or an amylase. Loss of all integrated constructs by excision will then lead to loss of resistance to the antibiotic, loss of green fluorescence, or loss of the amylase phenotype.
- Accordingly, in a preferred embodiment of the first aspect, the polynucleotide construct further comprises at least one selectable marker, at least one counterselectable marker, or at least one screenable marker; preferably the at least one selectable marker, counterselectable marker, or the screenable marker is flanked on both sides by a recognition sequence(s) of a second site specific recombinase, preferably a resolvase.
- The second aspect of the invention relates to a cell comprising in its chromosome one or more copies of a recognition sequence (RS) of a site specific recombinase, wherein each copy of the RS is located downstream of a copy of a heterologous promoter.
- The third aspect of the invention relates to a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase.
- The fourth aspect of the invention relates to a polynucleotide construct comprising a promoterless open reading frame (ORF) or operon encoding at least one polypeptide of interest, the construct also comprising a recognition sequence (RS) of a site specific recombinase located upstream or downstream of said ORF or operon.
- In a preferred embodiment of the cell of second or third aspects, the polynucleotide of the fourth aspect, or the method of the final aspect, RS comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attB 161 (SEQ ID NO: 21), attBmin (SEQ ID NO: 22), or attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase; or RS comprises a nucleotide sequence at least 70%, preferably at least 75%, 80%, 85%, 90%, 95%, or most preferably at least 98% identical to attLmin (SEQ ID NO: 24) or attRmin (SEQ ID NO: 25), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
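- The identity thresholds recited above (70% up to 98%) can be illustrated with a minimal sketch. It assumes that the two sequences have already been aligned to equal length with "-" marking gaps, and that identity is counted over all alignment columns; the alignment method and this denominator convention are assumptions of the example, not something specified in this section. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: gaps are written as '-', and identity is the number of
    matching columns divided by the number of alignment columns
    (columns where both sequences have a gap are ignored).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """True if the pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy sequences (not taken from the sequence listing): one mismatch in ten columns.
print(percent_identity("ACGTACGTAC", "ACGTACGTTC"))
```

- With this convention, a candidate recognition sequence with one mismatch in ten aligned positions would satisfy the 70%, 75%, 80%, 85% and 90% thresholds but not the 95% or 98% thresholds.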
- Bacillus subtilis DN1885 is described in Diderichsen, B., Wedsted, U., Hedegaard, L., Jensen, B. R., Sjøholm, C. (1990). Cloning of aldB, which encodes acetolactate decarboxylase, an exoenzyme from Bacillus brevis. Journal of Bacteriology 172, 4315-4321.
- B. subtilis PL1801 is the B. subtilis DN1885 strain with disrupted apr and npr genes.
- AEB43: B. subtilis PL1801 with a 161 bp attB fragment integrated in the xyl locus.
- AEB165: AEB43 with the 43 bp minimal attB integrated in the amyE locus.
| Primer | SEQ ID NO | Sequence |
|---|---|---|
| pattB19DraIII | 1 | CCCCCACTAAGTGCCTGACTTTCAACTAC |
| pattB179NotI | 2 | CCCCGCGGCCGCAAAAAAAGCAAAAAGC |
| PEP140 | 3 | AATATTGGCCGGGGAAGCGGAAGAATGAAG |
| PEP218 | 4 | CTATACTAGTCATCCTTGCAGGGTATGTTTC |
| pamyE-EI | 5 | GGGGGAATTCAACGGCCTCAACCTACTACTG |
| M13-forward | 6 | GTTTTCCCAGTCACGAC |
| M13-revers | 7 | CAGCTATGACCATGATTACGC |
| pCI-5 | 8 | CTTCTACCCATTATTACAGCAGGA |
| pCI-9 | 9 | AGTAGTTCGCCAGTTAATAGTTTG |
| p1224seq-2 | 10 | GCCATACAGCTACTCACTCG |
| pPxyl-up | 11 | CACTATGAATTCAGAAATACTCCTA |
| pPxyl-down | 12 | GATTGAGTCATGAGATTTCCCCCTTA |
| pattPcry3A | 13 | CTCGAGTCCAACTCGCTTAATTGCGAGTTTTTATTTCGTTTATTTCAATTAAGGTAATTAAAGATAATATCTTTGAATTG |
| pcry3AClaI | 14 | ATCGATTGTTGTTTCATGATTCTCCTC |
| pint-up | 15 | GGGGTCATGACTAAGAAAGTAGCAATC |
| pint-down | 16 | GGGGAAGCTTAAGCGAGTTGGAATTTA |
| pxylint-2 | 17 | CAGGTCTTCTTCCGCCACTTG |
| pM1632-1 | 18 | AGCGAAAATGCCTCACA |
| pattP-ExtTerm | 19 | GGGGGGTACCTCCAACTCGCTTAATTGCGAGTTTTTATTTCGTTTATTTCAATTAAGGTAATTAAACCATGGCGGCCGCTAGCGTCGACTAGTCAAAGATAGAAGAGCAGAGAG |
| pTermBI | 20 | CCCCGGATCCCCCGCGATACCGTCATTTTC |
- pLB44: E. coli plasmid containing a 2 kb region of the phage TP901-1 genome, including the int gene and attP site (Christiansen et al. (1996). J. Bacteriol. 178(17): 5164-5173).
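- Several of the primers listed above carry a restriction site that is used in the cloning steps (for example BspHI in pint-up and pPxyl-down, XhoI and ClaI in pattPcry3A and pcry3AClaI). The sketch below is a simple self-check, not part of the described method: it scans a few of the primer sequences for the standard recognition sequences of the enzymes named in this section. The dictionary and function names are hypothetical. Because all of the listed recognition sequences are palindromic, scanning one strand is sufficient.

```python
import re

# Standard recognition sequences; N stands for any base.
SITES = {
    "DraIII": "CACNNNGTG",
    "NotI": "GCGGCCGC",
    "EcoRI": "GAATTC",
    "ClaI": "ATCGAT",
    "BspHI": "TCATGA",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "KpnI": "GGTACC",
    "XhoI": "CTCGAG",
}

# A subset of the primers from the list above (SEQ ID NOs 1, 2, 5, 14, 15, 16, 20).
PRIMERS = {
    "pattB19DraIII": "CCCCCACTAAGTGCCTGACTTTCAACTAC",
    "pattB179NotI": "CCCCGCGGCCGCAAAAAAAGCAAAAAGC",
    "pamyE-EI": "GGGGGAATTCAACGGCCTCAACCTACTACTG",
    "pcry3AClaI": "ATCGATTGTTGTTTCATGATTCTCCTC",
    "pint-up": "GGGGTCATGACTAAGAAAGTAGCAATC",
    "pint-down": "GGGGAAGCTTAAGCGAGTTGGAATTTA",
    "pTermBI": "CCCCGGATCCCCCGCGATACCGTCATTTTC",
}


def sites_in(sequence: str) -> list:
    """Return the names of all enzymes in SITES whose site occurs in `sequence`."""
    found = []
    for enzyme, site in SITES.items():
        pattern = site.replace("N", "[ACGT]")  # expand the degenerate base
        if re.search(pattern, sequence.upper()):
            found.append(enzyme)
    return found


for name, seq in PRIMERS.items():
    print(name, sites_in(seq))
```

- A check of this kind is a quick way to catch transcription errors in a primer list before ordering oligonucleotides.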
- pBC16 is commercially available from DSMZ (DSM 4424); (Kreft, J. et al. (1978) Recombinant plasmids capable of replication in B. subtilis and E. coli. Mol. Gen. Genet 162: 59-67).
- pSJ2739 (described in U.S. Pat. No. 6,100,063) is derived from pE194, which is naturally temperature-sensitive for replication. The part of pSJ2739 which is relevant for this invention consists of the pE194 replicon, as well as a fragment derived from plasmid pUB110, enabling conjugation into B. licheniformis.
- pAEB142: The int gene of TP901-1, encoding the phage integrase, is inserted after the xylose-inducible Pxyl promoter in the pCR®-BluntII-TOPO® (Invitrogen) vector. The Pxyl and int fragments were first amplified by two separate PCR-reactions. The Pxyl fragment was obtained with chromosomal DNA from B. subtilis PL1801 as template and the primers were pPxyl-up & pPxyl-down, giving a fragment of 1.5 kb. To amplify the int gene, the plasmid pLB44 was used as template and the primers were pint-up & pint-down, again giving a fragment of 1.5 kb. The two fragments were digested with BspHI, joined by ligation, and used as template in a third PCR-reaction with primers pPxyl-up & pint-down, resulting in a fragment of 2.9 kb. This fragment was then inserted in the pCR®-BluntII-TOPO® vector in the Zero Blunt® TOPO® PCR cloning kit (Invitrogen).
- pAEB146: The Pxyl-int fragment of pAEB142 was inserted in pSJ2739. This fragment was obtained by a two-step process, where the Pxyl fragment and the upstream part of int were obtained from pAEB142 on a 1.7 kb EcoRI-HindIII-fragment and ligated to the 4.3 kb EcoRI-HindIII-fragment of pSJ2739, and subsequently the downstream part of int was obtained on a HindIII fragment from pAEB142 and inserted in the HindIII site of the first plasmid. pAEB146 contains the erm gene, providing resistance to erythromycin (Em), and a temperature-sensitive replicon, as well as the factors required for conjugation, making it possible to use both in B. subtilis and B. licheniformis.
- pAEB148: A PCR-fragment with the minimal attP (attPmin) and the cryIIIA region inserted in the pCR®-BluntII-TOPO® vector. The PCR fragment was obtained using primers pattPcry3A and pcry3AClaI, and the template was a plasmid containing the cryIIIA region. This gave a fragment of approximately 650 bp, which was cloned in the vector by using the Zero Blunt® TOPO® PCR cloning kit (Invitrogen).
- pAEB153: A 636 bp attPmin-cryIIIA-fragment (SEQ ID NO: 26) was obtained from pAEB148 by digestion with XhoI and ClaI, and was inserted in the 2.1 kb SalI-ClaI fragment of pMOL1632. This plasmid contains the same replication origin as the integrase donor plasmid pAEB146 but does not encode the replication protein. Thus, replication of pAEB153 is dependent on donation of the replication protein from another vector, such as pAEB146, i.e. pAEB153 is a so-called “slave” of pAEB146.
- pAEB267: A 360 bp fragment containing the minimal attP site of TP901-1 and a region of the B. licheniformis chromosome was obtained by PCR using primers pattP-ExtTerm and pTermBI, and chromosomal DNA from B. licheniformis as template. The fragment was digested with BamHI and KpnI and inserted in BamHI-KpnI digested pAEB146.
- pAEB288: pAEB267 with an amylase encoding gene, amyL, which is inserted into an NcoI-NheI digested pAEB267.
- As the first step, a 161 bp attB site (attB161, SEQ ID NO: 21) was integrated in the xyl locus in B. subtilis strain PL1801, resulting in the strain AEB43. Integration was obtained by double cross-over of a DNA fragment which contains attB161 adjacent to the cat gene, surrounded by an upstream and a downstream region of the xyl locus. The upstream and downstream xyl fragments were obtained with PCR on chromosomal DNA from B. subtilis using primers that are suitable for amplifying regions of sufficient size for an efficient integration by homologous recombination (0.5 kb or more). By PCR, these fragments were joined with the attB161 fragment (obtained from Lactococcus lactis subsp. cremoris 3-107) and with the cat gene, which confers chloramphenicol (Cm) resistance.
- This xylup-attB-cat-xyldown fragment was introduced into PL1801 by transformation and the transformants were plated on Cm containing plates. Cells in which recombination between the DNA-fragment and the chromosome had occurred in both xyl regions would have retained the cat gene and would thus be CmR. Transformants with this phenotype were isolated and by PCR and sequencing they were found to have the attB161 site integrated in the xyl locus.
- Secondly, the minimal attB site of 43 bp (attBmin, Breüner et al. (2001) Microbiology 147 2051-2063; SEQ ID NO: 22) was integrated in the amyE locus in AEB43, resulting in strain AEB165, which had two functional versions or copies of the TP901-1 attB site integrated in the chromosome, attB161 and attBmin. Integration of attBmin was obtained by transformation and subsequent double cross-over into the chromosome of AEB43 of the amyup-tet-attB-amydown PCR fragment, which was obtained much as described for integration of attB161 in the xyl locus, except that upstream and downstream regions of the amyE locus were flanking the tet-attB-fragment, and these regions were obtained by PCR from pBC16 with the primers pattB-tet & ptet-down.
- When this fragment was transformed into AEB43, TcR transformants could only arise if double crossover took place between the PCR-fragment and the bacterial chromosome at both ends of the PCR-fragment, leaving the tet gene and attBmin in the chromosome. A number of TcR transformants were isolated and found by PCR to contain the attBmin site integrated at the intended position in the amyE locus.
- The TP901-1 integrase is needed to perform the recombination between the attB and attP sites. The expression of the integrase can be placed under the control of a constitutive or an inducible promoter. In plasmid pAEB146, expression of the integrase is under the control of the Pxyl-promoter, which is induced from a low to a high level of activity upon the addition of xylose. pAEB146 has a temperature sensitive replicon functional in Bacillus and the oriT region from plasmid pUB110 which enables conjugation, and thus can be used in both B. subtilis and B. licheniformis. Alternatively, the integrase can be expressed from a plasmid which has a different kind of replicon, or it could be integrated into the chromosome.
- pAEB153 contains the minimal attP site of TP901-1 (SEQ ID NO: 23) determined in Brøndsted and Hammer (1999) Appl. Environ. Microbiol. 65 752-758, but a larger attP region can also be used, or a smaller one, if it is still active in recombination. Replication of pAEB153 is dependent on donation of replication protein from another plasmid with the pE194 replicon such as pAEB146. Alternatively the attP site can be cloned on a different plasmid vector, e.g. one with an origin which is not dependent on other plasmids for replication, and/or a thermosensitive origin. The attP site can also be included on the plasmid from which integrase is expressed.
- The plasmid containing attP can be used as a vector for cloning genes in such a way, that integration of the plasmid in attB in the chromosome will lead to expression of the gene from a promoter present in the chromosome next to the attB site. We therefore included the mRNA-stabilizing cryIIIA region in pAEB153 to obtain maximal expression of said gene. To make the distance between the promoter and the cryIIIA region as short as possible attPmin and cryIIIA are overlapping. To obtain optimal overlap a single base in the attP region was changed. The mutation in attP did not interfere with the ability of the region to participate in recombination with attB, as is shown in example 5.
- AEB165 (2× attB) was transformed with pAEB146 (Int-donor) resulting in strain AEB182. AEB182 was in turn transformed with pAEB153 (attP). Transformants were grown and streaked at 33° C. (permissive temperature) and with selection for both plasmids to allow recombination between the attP and attB sites to take place. Then, a number of colonies were streaked on plates with selection only for pAEB153 and the incubation temperature was increased to 50° C., which disables replication of pAEB146 and thereby also of pAEB153. The only cells that can grow under these conditions are the ones where pAEB153 has integrated into the chromosome. The isolates were also checked for the presence of the Int-donor plasmid pAEB146 by streaking on selective plates (Em).
- Eight colonies were streaked in this way, and six of the isolates were found both to be able to grow at 50° C. and to have lost the Int donor. Recombination between attP and attB was checked by PCR on chromosomal DNA from these six strains. Both the presence of the intact attB sites and of the attL site (SEQ ID NO: 24), which is the result of recombination between attB and attP, were investigated. The primers used are shown in Table 1 and the results of the PCR-reactions are summarized in Table 2.
TABLE 1 primers used to check recombination
| Site | xyl-locus | amyE-locus |
|---|---|---|
| attB | pxylint-2 & pCI-5 | pamy-1 & ptet-1 |
| attL | pM1632-1 & pxylint-2 | pM1632-1 & pamy-1 |
TABLE 2 PCR-checks on the integration-strains AEB182/pAEB153
| PCR-fragment | A | B | C | D | E | F |
|---|---|---|---|---|---|---|
| attB161 in xyl | ✓ | — | — | — | ✓ | — |
| attL in xyl | — | ✓ | ✓ | ✓ | — | ✓ |
| attBmin in amyE | — | ✓ | — | — | ✓ | ✓ |
| attL in amyE | ✓ | — | ✓ | ✓ | — | — |
✓: a PCR fragment of the correct size was observed. —: a PCR fragment of the correct size was not observed.
- Integration of pAEB153 had occurred in the following attB sites: A: in amyE; B: in xyl; C and D: in both sites; E: in none of the attB sites; and F: in xyl. Thus, double integration occurred in two out of the six strains tested.
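- The interpretation of Table 2 given above can be reproduced mechanically. The following sketch transcribes the table as booleans and derives, for each isolate, the loci in which an attL site (and hence an integration) was detected. It also checks that each locus shows exactly one of the two PCR products, intact attB or attL. The variable and function names are hypothetical and are used only for this illustration.

```python
# Table 2 transcribed as booleans: True means a PCR fragment of the
# correct size was observed for that isolate (columns A to F).
ISOLATES = "ABCDEF"
TABLE_2 = {
    "attB161 in xyl":  (True,  False, False, False, True,  False),
    "attL in xyl":     (False, True,  True,  True,  False, True),
    "attBmin in amyE": (False, True,  False, False, True,  True),
    "attL in amyE":    (True,  False, True,  True,  False, False),
}
LOCI = (
    ("xyl", "attB161 in xyl", "attL in xyl"),
    ("amyE", "attBmin in amyE", "attL in amyE"),
)


def integrated_loci(isolate: str) -> list:
    """Return the loci in which the attP plasmid has integrated."""
    column = ISOLATES.index(isolate)
    loci = []
    for locus, intact_row, attl_row in LOCI:
        intact = TABLE_2[intact_row][column]
        recombined = TABLE_2[attl_row][column]
        # A locus should carry either an intact attB site or an attL site.
        if intact == recombined:
            raise ValueError(f"inconsistent PCR results for {isolate} at {locus}")
        if recombined:
            loci.append(locus)
    return loci


summary = {isolate: integrated_loci(isolate) for isolate in ISOLATES}
double_integrants = [iso for iso, loci in summary.items() if len(loci) == 2]
print(summary)
print(double_integrants)
```

- Run on the data of Table 2, the check is consistent for all six isolates and identifies C and D as the two double integrants.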
- This experiment was performed without the addition of xylose to the medium. To increase integration efficiency further, the production of integrase could be increased by adding xylose and thereby activating the Pxyl promoter, leading to a higher expression of the integrase. The xylose concentration could, e.g., be between 0.05 and 5%; at high concentrations of xylose the integrase is overexpressed and becomes toxic to the cell.
- To ease identification of cells where the attP plasmid has integrated site-specifically in all available attB sites, a counterselectable marker such as the ysbC gene (Danish patent application PA 2004 00227, filed 13 Feb. 2004) can be positioned downstream of a promoter next to all the attB sites in the chromosome, but separated from the promoter by the attB site. Expression of the counterselectable marker from the promoter will lead to the cell being sensitive to the selective pressure (for ysbC: fluoro-orotate). When integration of the attP plasmid happens in such an attB site, the counterselectable marker is separated from the promoter, and the marker will no longer be expressed from this locus. However, only when integration has occurred in all attB sites will no marker be produced, and the cell will become resistant to the selection.
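- The selection logic described above reduces to a simple rule: the marker is expressed as long as at least one attB site is still unoccupied, so only cells with every site occupied survive counterselection. A minimal sketch of that rule follows; the function names are hypothetical, and expression is treated as all-or-nothing, which is a simplifying assumption.

```python
from typing import Sequence


def marker_expressed(site_occupied: Sequence[bool]) -> bool:
    """The counterselectable marker is expressed if any attB site is
    still unoccupied (promoter and marker not yet separated)."""
    return not all(site_occupied)


def survives_counterselection(site_occupied: Sequence[bool]) -> bool:
    """A cell is resistant only when no locus expresses the marker."""
    return not marker_expressed(site_occupied)


# Three attB sites: two occupied, one still free, so the cell stays sensitive.
print(survives_counterselection([True, True, False]))
# All three sites occupied, so the cell becomes resistant.
print(survives_counterselection([True, True, True]))
```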
- Alternatively to a counterselectable marker, a gene that gives a screenable phenotype can be used, such as an antibiotic selection marker, green fluorescence protein (GFP), beta-galactosidase, an amylase, or others. Integration of the attP plasmid in all of the attB sites will then lead to loss of resistance to the antibiotic, of green fluorescence, of colour on X-gal plates, of the amylase phenotype, or of what other phenotype was expressed from the marker.
- In a manner similar to the one described in the above examples, the sites can be interchanged, so that one or more attP sites are inserted in the host genome, and the attB site is present in a vector to be integrated into the attP sites on the chromosome.
- In another setup attP and attB can be exchanged with copies of attL (SEQ ID NO: 24) in the chromosome of the host and attR (SEQ ID NO: 25) on the plasmid; or vice versa. Recombination between the attL and attR sites will result in the creation of attP and attB sites. However, effective recombination of the TP901-1 attL and attR sites requires the presence of the excisionase, Xis, in addition to the integrase (Breüner et al. (1999) Novel Organization of Genes Involved in Prophage Excision Identified in the Temperate Lactococcal Bacteriophage TP901-1. J Bacteriol 181(23): 7291-7297).
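- The site pairings and protein requirements described above can be summarised in a small lookup table. The sketch below is a simplification for illustration: it treats recombination as a yes/no event and encodes only the two pairings mentioned in the text (attB with attP requiring the integrase; attL with attR requiring the integrase and Xis). The names `RULES` and `recombine` are hypothetical.

```python
# Each unordered pair of sites maps to (products, proteins required).
RULES = {
    frozenset({"attB", "attP"}): (("attL", "attR"), {"Int"}),
    frozenset({"attL", "attR"}): (("attB", "attP"), {"Int", "Xis"}),
}


def recombine(site_a: str, site_b: str, proteins: set):
    """Return the product sites, or None if the pair is not a substrate
    or a required protein is missing."""
    rule = RULES.get(frozenset({site_a, site_b}))
    if rule is None:
        return None
    products, required = rule
    if not required <= proteins:
        return None
    return products


print(recombine("attB", "attP", {"Int"}))         # integration
print(recombine("attL", "attR", {"Int"}))         # Xis is missing
print(recombine("attL", "attR", {"Int", "Xis"}))  # attL x attR with Xis
```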
- Using an approach similar to the one described in example 1, attB sites were inserted at several positions in the chromosome of B. licheniformis; each site was inserted along with and downstream of a heterologous tandem promoter (as disclosed in WO 1999/043835). A vector comprising an attP site and an amylase encoding gene was then integrated into the chromosomal attB site by the integrase. The orientation of the amylase gene in the vector with respect to the attP site ensured that the gene became operably linked with the tandem promoter, when the vector was integrated into the chromosome through the recombination of the attB and attP sites.
- The amylase encoding gene was inserted in the attP-int containing plasmid pAEB267 in such a way that integration of the plasmid via site-specific recombination between attB and attP catalysed by the TP901-1 integrase would result in the amylase gene being inserted into the chromosome so that it would be expressed from the heterologous tandem promoter, separated from the gene by the attL site (FIG. 1) after the recombination. A strain where such an integration event had taken place (verified by PCR as described in example 4) was streaked on amylose containing plates, and clearing zones were formed, demonstrating that the amylase was expressed from the tandem promoter next to the attL site. No clearing zones were observed when pAEB267 without amylase gene was integrated in a similar manner as a control.
- Using an approach similar to the one described in example 1, but adapted to B. licheniformis in design of the primers and chromosomal fragments used, attBmin sites were inserted at three positions in the chromosome of B. licheniformis (the amyL, xyl and gnt loci). Each attBmin site was inserted along with and downstream of a heterologous tandem promoter (disclosed in WO 1999/043835). A vector comprising the corresponding attP site and an amylase encoding gene, designed as the plasmid in FIG. 2 with the amylase in the place of “genX”, was then integrated into all the chromosomal attB sites by the integrase. The cryIIIA region was located upstream of the amylase gene, and the orientation of the amylase gene in the vector with respect to the attP site ensured that the gene was located and oriented in all three loci as shown for “genX” in the middle part of FIG. 2. Subsequent crossing out of the vector parts of the integrated plasmids in all three loci by means of homologous recombination between the two cryIIIA regions in each locus resulted in a strain, in which each of the three loci contained the region shown in the bottom of FIG. 2: promoter, cryIIIA, amylase, and attR.
Claims (22)
1-79. (canceled)
80. A method of constructing a cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon encoding at least one polypeptide of interest, each copy being under the transcriptional control of a heterologous promoter, said method comprising the steps of:
(a) providing a cell comprising in its chromosome one or more copies of a first recognition sequence (RS1) of a site specific recombinase, wherein each copy of RS1 is located downstream of a copy of said heterologous promoter;
(b) introducing into said cell a polynucleotide construct comprising the ORF or operon and a second recognition sequence (RS2) of the site specific recombinase, where RS2 is located and oriented with respect to the ORF or operon so that an in vivo recombination of RS2 with a copy of RS1 in the chromosome of the cell will integrate the construct into the chromosome and place the ORF or operon downstream of and in the same orientation as the heterologous promoter; and
(c) recombining RS2 with the one or more copies of RS1 in the presence of the site specific recombinase, whereby one or more copies of the ORF or operon are integrated into the chromosome and placed:
(i) either directly under the transcriptional control of the heterologous promoter, or
(ii) downstream of and in the same orientation as the heterologous promoter but separated from it by a region, which can be excised after one or more optional recombination events, whereby the ORF or operon of interest is placed under the transcriptional control of the heterologous promoter.
81. The method of claim 80, wherein the cell is a prokaryotic cell.
82. The method of claim 81, wherein the prokaryotic cell is a Bacillus cell.
83. The method of claim 80, wherein the ORF or operon encodes at least one enzyme.
84. The method of claim 83, wherein the ORF or operon encodes an oxidoreductase, a transferase, a hydrolase, a lyase, an isomerase, or a ligase.
85. The method of claim 80, wherein the site specific recombinase comprises a phage integrase.
86. The method of claim 80, wherein the site specific recombinase comprises the TP901-1 integrase.
87. The method of claim 80, wherein RS1 comprises a nucleotide sequence at least 70% identical to attB161 (SEQ ID NO: 21) or attBmin (SEQ ID NO: 22), RS2 comprises a nucleotide sequence at least 70% identical to attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase.
88. The method of claim 80, wherein RS1 comprises a nucleotide sequence at least 70% identical to attPmin (SEQ ID NO: 23), RS2 comprises a nucleotide sequence at least 70% identical to attB161 (SEQ ID NO: 21) or attBmin (SEQ ID NO: 22), and the site specific recombinase comprises the phage TP901-1 integrase.
89. The method of claim 80, wherein RS1 comprises a nucleotide sequence at least 70% identical to attLmin (SEQ ID NO: 24), RS2 comprises a nucleotide sequence at least 70% identical to attRmin (SEQ ID NO: 25), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
90. The method of claim 80, wherein RS1 comprises a nucleotide sequence at least 70% identical to attRmin (SEQ ID NO: 25), RS2 comprises a nucleotide sequence at least 70% identical to attLmin (SEQ ID NO: 24), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
91. A cell comprising in its chromosome one or more copies of an open reading frame (ORF) or operon of interest, wherein each copy is under the transcriptional control of a heterologous promoter, and (i) wherein each copy of the ORF or operon is located in the chromosome upstream of a recognition sequence (RS) of a site specific recombinase, or (ii) wherein each copy of the ORF or operon is located in the chromosome downstream of a recognition sequence (RS) of a site specific recombinase.
92. The cell of claim 91, wherein the cell is a prokaryotic cell.
93. The cell of claim 92, wherein the prokaryotic cell is a Bacillus cell.
94. The cell of claim 91, wherein the site specific recombinase comprises a phage integrase.
95. The cell of claim 91, wherein the site specific recombinase comprises the TP901-1 integrase.
96. The cell of claim 91, wherein the RS comprises a nucleotide sequence at least 70% identical to attB161 (SEQ ID NO: 21), attBmin (SEQ ID NO: 22), or attPmin (SEQ ID NO: 23), and the site specific recombinase comprises the phage TP901-1 integrase.
97. The cell of claim 91, wherein the RS comprises a nucleotide sequence at least 70% identical to attLmin (SEQ ID NO: 24) or attRmin (SEQ ID NO: 25), and the site specific recombinase comprises the phage TP901-1 integrase and excisionase Xis.
98. The cell of claim 91, wherein the ORF or operon encodes at least one enzyme.
99. The cell of claim 98, wherein the ORF or operon encodes an oxidoreductase, a transferase, a hydrolase, a lyase, an isomerase, or a ligase.
100. A method of producing a polypeptide of interest, said method comprising:
(a) cultivating a cell as defined in claim 91; and
(b) isolating the polypeptide of interest.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/221,005 US20140273237A1 (en) | 2004-10-22 | 2014-03-20 | Stable Genomic Integration of Multiple Polynucleotide Copies |
| US15/481,879 US11174487B2 (en) | 2004-10-22 | 2017-04-07 | Stable genomic integration of multiple polynucleotide copies |
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DKPA200401621 | 2004-10-22 | ||
| DKPA200401785 | 2004-11-17 | ||
| PCT/DK2005/000673 WO2006042548A1 (en) | 2004-10-22 | 2005-10-19 | Stable genomic integration of multiple polynucleotide copies |
Related Parent Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/DK2005/000673 A-371-Of-International WO2006042548A1 (en) | 2004-10-22 | 2005-10-19 | Stable genomic integration of multiple polynucleotide copies |
Related Child Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/221,005 Continuation US20140273237A1 (en) | 2004-10-22 | 2014-03-20 | Stable Genomic Integration of Multiple Polynucleotide Copies |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20080085535A1 (en) | 2008-04-10 |
Family
ID=35596262
Family Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/576,896 Abandoned US20080085535A1 (en) | 2004-10-22 | 2005-10-19 | Stable Genomic Integration of Multiple Polynucleotide Copies |
| US14/221,005 Abandoned US20140273237A1 (en) | 2004-10-22 | 2014-03-20 | Stable Genomic Integration of Multiple Polynucleotide Copies |
| US15/481,879 Expired - Fee Related US11174487B2 (en) | 2004-10-22 | 2017-04-07 | Stable genomic integration of multiple polynucleotide copies |
Country Status (7)
| Country | Link |
|---|---|
| US (3) | US20080085535A1 (en) |
| EP (1) | EP1805296B1 (en) |
| CN (1) | CN101061214B (en) |
| AT (1) | ATE443128T1 (en) |
| DE (1) | DE602005016707D1 (en) |
| DK (1) | DK1805296T3 (en) |
| WO (1) | WO2006042548A1 (en) |
Families Citing this family (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101061214B (en) * | 2004-10-22 | 2012-12-05 | 诺维信公司 | Stable genomic integration of multiple polynucleotide copies |
| EP3246409A1 (en) | 2007-05-29 | 2017-11-22 | Nature Technology Corporation | Antibiotic-resistance-free vectors |
| WO2010024905A1 (en) * | 2008-08-27 | 2010-03-04 | Massachusetts Institute Of Technology | Genetically stabilized tandem gene duplication |
| EP2527448A1 (en) * | 2011-05-23 | 2012-11-28 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
| CN105324488A (en) | 2013-06-21 | 2016-02-10 | 诺维信公司 | Production of polypeptides without secretion signal in bacillus |
| EP3019607B2 (en) | 2013-07-12 | 2022-09-07 | Novozymes A/S | Direct transfer of polynucleotides between genomes |
| WO2016180928A1 (en) | 2015-05-12 | 2016-11-17 | Novozymes A/S | Bacillus licheniformis host cell with deleted lantibiotic gene(s) |
| CN106906238B (en) * | 2015-12-22 | 2020-08-14 | 中国科学院分子植物科学卓越创新中心 | Multi-copy amplification method and application of streptomycete antibiotic biosynthesis gene cluster |
| US20190276855A1 (en) * | 2016-10-25 | 2019-09-12 | Novozymes A/S | Flp-mediated genomic integration in bacillus licheniformis |
| CN108251344A (en) * | 2016-12-28 | 2018-07-06 | 中国科学院上海生命科学研究院 | The structure of the serial efficient heterogenous expression host of streptomyces coelicolor and application |
| WO2018134386A1 (en) | 2017-01-23 | 2018-07-26 | Novozymes A/S | Host cells and methods for producing double-stranded rna |
| CN110651046A (en) | 2017-02-22 | 2020-01-03 | 艾欧生物科学公司 | Nucleic acid constructs comprising gene editing multiple sites and uses thereof |
| WO2019092042A1 (en) | 2017-11-10 | 2019-05-16 | Novozymes A/S | Temperature-sensitive cas9 protein |
| US20210017544A1 (en) | 2017-12-22 | 2021-01-21 | Novozymes A/S | Counter-Selection by Inhibition of Conditionally Essential Genes |
| CN113939588A (en) | 2019-05-15 | 2022-01-14 | 诺维信公司 | Temperature sensitive RNA guided endonucleases |
| WO2020260061A1 (en) | 2019-06-25 | 2020-12-30 | Novozymes A/S | Counter-selection by inhibition of conditionally essential genes |
| EP4133086A4 (en) * | 2020-04-07 | 2024-06-05 | IO Biosciences, Inc. | Nucleic acid constructs comprising gene editing multi-sites |
| CN120265779A (en) | 2022-12-05 | 2025-07-04 | 诺维信公司 | Modified RNA polymerase activity |
| WO2026017381A1 (en) | 2024-07-17 | 2026-01-22 | Novozymes A/S | Optimized bacillus host cells |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6017694A (en) * | 1997-12-19 | 2000-01-25 | American Cyanamid Company | Methods of screening for modulators of respiratory syncytial virus matrix protein interaction |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FR2563533B1 (en) | 1984-04-27 | 1986-08-22 | Centre Nat Rech Scient | Method for amplifying the expression of a determined gene in Bacillus subtilis and strains obtained |
| HUT50877A (en) | 1987-02-27 | 1990-03-28 | Gist Brocades Nv | Process for producing stable gene amplification in chromosomal dna of procaryote microorganisms |
| US5470727A (en) * | 1993-12-21 | 1995-11-28 | Celtrix Pharmaceuticals, Inc. | Chromosomal expression of non-bacterial genes in bacterial cells |
| US5955310A (en) * | 1998-02-26 | 1999-09-21 | Novo Nordisk Biotech, Inc. | Methods for producing a polypeptide in a bacillus cell |
| EP1100885A4 (en) * | 1998-07-24 | 2001-12-12 | Baylor College Medicine | FAST SUBCLONING USING A SITE SPECIFIC RECOMBINATION |
| WO2001032899A1 (en) * | 1999-10-29 | 2001-05-10 | Takara Shuzo Co., Ltd. | Gene transfer method |
| WO2002000907A1 (en) | 2000-06-23 | 2002-01-03 | Novozymes A/S | Method for stable chromosomal multi-copy integration of genes |
| CN1455817A (en) * | 2000-07-21 | 2003-11-12 | (由农业部部长代表的)美利坚合众国 | Methods for replacement, translocation and stacking of DNA in eukaryotic genomes |
| AU2003265608A1 (en) | 2002-08-21 | 2004-03-11 | Kosan Biosciences, Inc. | Myxococcus xanthus bacteriophage mx9 transformation and integration system |
| EP1405908A1 (en) * | 2002-10-04 | 2004-04-07 | ProBioGen AG | Creation of high yield heterologous expression cell lines |
| US20040209370A1 (en) * | 2002-12-19 | 2004-10-21 | Wonchul Suh | Method for chromosomal engineering |
| CN101061214B (en) * | 2004-10-22 | 2012-12-05 | 诺维信公司 | Stable genomic integration of multiple polynucleotide copies |
| EP2527448A1 (en) * | 2011-05-23 | 2012-11-28 | Novozymes A/S | Simultaneous site-specific integrations of multiple gene-copies in filamentous fungi |
2005
- 2005-10-19 CN CN2005800362425A patent/CN101061214B/en not_active Expired - Lifetime
- 2005-10-19 AT AT05794647T patent/ATE443128T1/en not_active IP Right Cessation
- 2005-10-19 DE DE602005016707T patent/DE602005016707D1/en not_active Expired - Lifetime
- 2005-10-19 EP EP05794647A patent/EP1805296B1/en not_active Expired - Lifetime
- 2005-10-19 WO PCT/DK2005/000673 patent/WO2006042548A1/en not_active Ceased
- 2005-10-19 DK DK05794647.7T patent/DK1805296T3/en active
- 2005-10-19 US US11/576,896 patent/US20080085535A1/en not_active Abandoned
2014
- 2014-03-20 US US14/221,005 patent/US20140273237A1/en not_active Abandoned
2017
- 2017-04-07 US US15/481,879 patent/US11174487B2/en not_active Expired - Fee Related
Non-Patent Citations (1)
| Title |
|---|
| Lee et al. (Sequential δ-Integration for the Regulated Insertion of Cloned Genes in Saccharomyces cerevisiae, Biotechnol. Prog. 1997, 13, 368-373) * |
Cited By (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080044853A1 (en) * | 2004-06-21 | 2008-02-21 | Novozymes A/S | Stably Maintained Multiple Copies of at Least Two Orf in the Same Orientation |
| US10640757B2 (en) | 2004-06-21 | 2020-05-05 | Novozymes A/S | Stably maintained multiple copies of at least two ORF in the same orientation |
| US10752930B2 (en) * | 2007-05-17 | 2020-08-25 | Boehringer Ingelheim Rcv Gmbh & Co Kg | Method for producing a recombinant protein on a manufacturing scale |
| US20150020235A1 (en) * | 2012-03-12 | 2015-01-15 | Dsm Ip Assets B.V. | Rasamsonia transformants |
| US9631197B2 (en) * | 2012-03-12 | 2017-04-25 | Dsm Ip Assets B.V. | Rasamsonia transformants |
| US9657309B2 (en) | 2012-03-12 | 2017-05-23 | Dsm Ip Assets B.V. | Recombination system |
| US10793850B2 (en) | 2012-03-12 | 2020-10-06 | Dsm Ip Assets B.V. | Recombination system |
| CN111417726A (en) * | 2017-10-23 | 2020-07-14 | 诺维信公司 | Improving protease expression by co-expression with propeptides |
| WO2022269084A1 (en) | 2021-06-24 | 2022-12-29 | Basf Se | Improved bacillus host cell with altered rema/remb protein |
| WO2023285348A1 (en) | 2021-07-13 | 2023-01-19 | Novozymes A/S | Recombinant cutinase expression |
| WO2023064778A1 (en) * | 2021-10-11 | 2023-04-20 | The Board Of Trustees Of The Leland Stanford Junior University | Dna element responsive to extrachromosomal dna in cancer cells |
| WO2023104846A1 (en) | 2021-12-10 | 2023-06-15 | Novozymes A/S | Improved protein production in recombinant bacteria |
Also Published As
| Publication number | Publication date |
|---|---|
| CN101061214A (en) | 2007-10-24 |
| WO2006042548A1 (en) | 2006-04-27 |
| US20140273237A1 (en) | 2014-09-18 |
| EP1805296A1 (en) | 2007-07-11 |
| DK1805296T3 (en) | 2010-01-18 |
| DE602005016707D1 (en) | 2009-10-29 |
| CN101061214B (en) | 2012-12-05 |
| ATE443128T1 (en) | 2009-10-15 |
| US20170218381A1 (en) | 2017-08-03 |
| EP1805296B1 (en) | 2009-09-16 |
| US11174487B2 (en) | 2021-11-16 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| US11174487B2 (en) | Stable genomic integration of multiple polynucleotide copies | |
| US20190185847A1 (en) | Improving a Microorganism by CRISPR-Inhibition | |
| US8227227B2 (en) | DNase expression in recombinant host cells | |
| US20160304905A1 (en) | Fungal Gene Library By Double Split-Marker Integration | |
| US20220010305A1 (en) | Genome Editing by Guided Endonuclease and Single-stranded Oligonucleotide | |
| US20170313997A1 (en) | Filamentous Fungal Double-Mutant Host Cells | |
| US6762040B2 (en) | Method for increasing gene copy number in a host cell and resulting host cell | |
| US20250230469A1 (en) | Counter-Selection by Inhibition of Conditionally Essential Genes | |
| EP2078078B1 (en) | Selection of well-expressed synthetic genes | |
| US20220298517A1 (en) | Counter-selection by inhibition of conditionally essential genes | |
| CN104837993A (en) | Method for generating site-specific mutations in filamentous fungi | |
| US20190078097A1 (en) | Polynucleotide Constructs For In Vitro and In Vivo Expression | |
| US20220267783A1 (en) | Filamentous fungal expression system | |
| WO2020173817A1 (en) | Calcite binding proteins | |
| US20240271175A1 (en) | Leader peptides and polynucleotides encoding the same | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AS | Assignment | Owner name: NOVOZYMES A/S, DENMARK. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: BREUNER, ANNE; RASMUSSEN, MICHAEL DOLBERG; REEL/FRAME: 019136/0358; SIGNING DATES FROM 20070308 TO 20070312 |
| | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |