AU2001277918A1 - Plant Acyl-CoA synthetases - Google Patents
Plant Acyl-CoA synthetasesInfo
- Publication number
- AU2001277918A1 AU2001277918A1 AU2001277918A AU7791801A AU2001277918A1 AU 2001277918 A1 AU2001277918 A1 AU 2001277918A1 AU 2001277918 A AU2001277918 A AU 2001277918A AU 7791801 A AU7791801 A AU 7791801A AU 2001277918 A1 AU2001277918 A1 AU 2001277918A1
- Authority
- AU
- Australia
- Prior art keywords
- seq
- nucleic acid
- acid sequence
- ofthe
- acs
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010011449 Long-chain-fatty-acid-CoA ligase Proteins 0.000 title claims description 317
- 102000005870 Coenzyme A Ligases Human genes 0.000 title claims description 275
- 108090000623 proteins and genes Proteins 0.000 claims description 335
- 150000007523 nucleic acids Chemical group 0.000 claims description 208
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 126
- 238000000034 method Methods 0.000 claims description 92
- 102000004169 proteins and genes Human genes 0.000 claims description 81
- 239000013598 vector Substances 0.000 claims description 76
- 230000000694 effects Effects 0.000 claims description 61
- 239000000203 mixture Substances 0.000 claims description 58
- 239000000194 fatty acid Substances 0.000 claims description 42
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 40
- 229930195729 fatty acid Natural products 0.000 claims description 40
- 150000004665 fatty acids Chemical class 0.000 claims description 40
- 230000009261 transgenic effect Effects 0.000 claims description 26
- 230000000295 complement effect Effects 0.000 claims description 22
- 230000027455 binding Effects 0.000 claims description 21
- 230000000692 anti-sense effect Effects 0.000 claims description 17
- 238000012216 screening Methods 0.000 claims description 17
- 238000004519 manufacturing process Methods 0.000 claims description 15
- 239000000758 substrate Substances 0.000 claims description 15
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 7
- 150000001875 compounds Chemical class 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 241000196324 Embryophyta Species 0.000 description 192
- 210000004027 cell Anatomy 0.000 description 137
- 239000013615 primer Substances 0.000 description 88
- 150000001413 amino acids Chemical group 0.000 description 79
- 102000039446 nucleic acids Human genes 0.000 description 77
- 108020004707 nucleic acids Proteins 0.000 description 77
- 102000053602 DNA Human genes 0.000 description 75
- 230000014509 gene expression Effects 0.000 description 75
- 108020004414 DNA Proteins 0.000 description 74
- 235000018102 proteins Nutrition 0.000 description 73
- 210000001519 tissue Anatomy 0.000 description 72
- 108091011073 AMP binding proteins Proteins 0.000 description 60
- 239000002299 complementary DNA Substances 0.000 description 57
- 102000021435 AMP binding proteins Human genes 0.000 description 55
- 238000010367 cloning Methods 0.000 description 50
- 239000000047 product Substances 0.000 description 48
- 108090000765 processed proteins & peptides Proteins 0.000 description 44
- 241000219194 Arabidopsis Species 0.000 description 39
- 239000000523 sample Substances 0.000 description 39
- 102000004190 Enzymes Human genes 0.000 description 38
- 108090000790 Enzymes Proteins 0.000 description 38
- 229920002477 rna polymer Polymers 0.000 description 38
- 102000004196 processed proteins & peptides Human genes 0.000 description 36
- 125000003729 nucleotide group Chemical group 0.000 description 35
- 229940024606 amino acid Drugs 0.000 description 34
- 235000001014 amino acid Nutrition 0.000 description 34
- 210000003763 chloroplast Anatomy 0.000 description 34
- 239000002773 nucleotide Substances 0.000 description 34
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 31
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 31
- 229920001184 polypeptide Polymers 0.000 description 31
- 241000589158 Agrobacterium Species 0.000 description 27
- 108091026890 Coding region Proteins 0.000 description 27
- 108020004705 Codon Proteins 0.000 description 24
- 239000003550 marker Substances 0.000 description 24
- 108700019146 Transgenes Proteins 0.000 description 23
- 239000012634 fragment Substances 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 23
- 238000003556 assay Methods 0.000 description 22
- 238000013518 transcription Methods 0.000 description 22
- 230000035897 transcription Effects 0.000 description 22
- 230000015572 biosynthetic process Effects 0.000 description 21
- 238000006243 chemical reaction Methods 0.000 description 21
- 108091081024 Start codon Proteins 0.000 description 20
- 241000588724 Escherichia coli Species 0.000 description 19
- 230000000875 corresponding effect Effects 0.000 description 19
- 239000013604 expression vector Substances 0.000 description 19
- 230000009466 transformation Effects 0.000 description 19
- 108091034117 Oligonucleotide Proteins 0.000 description 18
- 238000004458 analytical method Methods 0.000 description 18
- 241000894007 species Species 0.000 description 17
- 108020001507 fusion proteins Proteins 0.000 description 16
- 102000037865 fusion proteins Human genes 0.000 description 16
- 238000009396 hybridization Methods 0.000 description 16
- 238000003780 insertion Methods 0.000 description 16
- 230000037431 insertion Effects 0.000 description 16
- 230000035772 mutation Effects 0.000 description 15
- 238000003786 synthesis reaction Methods 0.000 description 15
- 230000003321 amplification Effects 0.000 description 14
- 238000000338 in vitro Methods 0.000 description 14
- 238000003199 nucleic acid amplification method Methods 0.000 description 14
- 102000040430 polynucleotide Human genes 0.000 description 14
- 108091033319 polynucleotide Proteins 0.000 description 14
- 239000002157 polynucleotide Substances 0.000 description 14
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 14
- 230000006870 function Effects 0.000 description 13
- 239000002245 particle Substances 0.000 description 13
- 239000000126 substance Substances 0.000 description 13
- 230000008685 targeting Effects 0.000 description 13
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 13
- 108020004635 Complementary DNA Proteins 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 108090000994 Catalytic RNA Proteins 0.000 description 11
- 102000053642 Catalytic RNA Human genes 0.000 description 11
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 11
- 125000000539 amino acid group Chemical group 0.000 description 11
- 239000003623 enhancer Substances 0.000 description 11
- 239000000499 gel Substances 0.000 description 11
- 230000004048 modification Effects 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 230000008488 polyadenylation Effects 0.000 description 11
- 108091092562 ribozyme Proteins 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 10
- 239000012528 membrane Substances 0.000 description 10
- 239000008188 pellet Substances 0.000 description 10
- 238000002864 sequence alignment Methods 0.000 description 10
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 9
- 101000830691 Homo sapiens Protein tyrosine phosphatase type IVA 2 Proteins 0.000 description 9
- 108700026244 Open Reading Frames Proteins 0.000 description 9
- 102100024602 Protein tyrosine phosphatase type IVA 2 Human genes 0.000 description 9
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 9
- 239000005516 coenzyme A Substances 0.000 description 9
- 229940093530 coenzyme a Drugs 0.000 description 9
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 9
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 229940083575 sodium dodecyl sulfate Drugs 0.000 description 9
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 9
- 238000001890 transfection Methods 0.000 description 9
- 108091034057 RNA (poly(A)) Proteins 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 8
- 101150047711 acs gene Proteins 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 230000001965 increasing effect Effects 0.000 description 8
- 238000002955 isolation Methods 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 238000002703 mutagenesis Methods 0.000 description 8
- 231100000350 mutagenesis Toxicity 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 238000003757 reverse transcription PCR Methods 0.000 description 8
- 108020005544 Antisense RNA Proteins 0.000 description 7
- 241000894006 Bacteria Species 0.000 description 7
- 240000002791 Brassica napus Species 0.000 description 7
- 108700008625 Reporter Genes Proteins 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000012010 growth Effects 0.000 description 7
- 238000002744 homologous recombination Methods 0.000 description 7
- 230000006801 homologous recombination Effects 0.000 description 7
- 230000008676 import Effects 0.000 description 7
- 150000002632 lipids Chemical class 0.000 description 7
- 230000001404 mediated effect Effects 0.000 description 7
- -1 nucleic acid phosphodiester Chemical class 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- 230000001052 transient effect Effects 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 6
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 230000001413 cellular effect Effects 0.000 description 6
- 239000003184 complementary RNA Substances 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 230000010354 integration Effects 0.000 description 6
- 239000006166 lysate Substances 0.000 description 6
- 229930182817 methionine Natural products 0.000 description 6
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- 108010016192 4-coumarate-CoA ligase Proteins 0.000 description 5
- 241000219195 Arabidopsis thaliana Species 0.000 description 5
- 235000011293 Brassica napus Nutrition 0.000 description 5
- 101710132601 Capsid protein Proteins 0.000 description 5
- 101710094648 Coat protein Proteins 0.000 description 5
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 5
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 5
- 101000780205 Homo sapiens Long-chain-fatty-acid-CoA ligase 5 Proteins 0.000 description 5
- 101000780202 Homo sapiens Long-chain-fatty-acid-CoA ligase 6 Proteins 0.000 description 5
- 101710125418 Major capsid protein Proteins 0.000 description 5
- 101710181812 Methionine aminopeptidase Proteins 0.000 description 5
- 101710141454 Nucleoprotein Proteins 0.000 description 5
- 101710083689 Probable capsid protein Proteins 0.000 description 5
- 108020004511 Recombinant DNA Proteins 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 230000003115 biocidal effect Effects 0.000 description 5
- 239000012472 biological sample Substances 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- 238000004817 gas chromatography Methods 0.000 description 5
- 208000015181 infectious disease Diseases 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 210000001938 protoplast Anatomy 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- 239000001993 wax Substances 0.000 description 5
- 210000005253 yeast cell Anatomy 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 4
- 108010049926 Acetate-CoA ligase Proteins 0.000 description 4
- 102100035709 Acetyl-coenzyme A synthetase, cytoplasmic Human genes 0.000 description 4
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 235000011331 Brassica Nutrition 0.000 description 4
- 241000219198 Brassica Species 0.000 description 4
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 4
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 4
- 239000007995 HEPES buffer Substances 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 4
- 102100034337 Long-chain-fatty-acid-CoA ligase 6 Human genes 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- 238000000636 Northern blotting Methods 0.000 description 4
- 239000005642 Oleic acid Substances 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 108091081021 Sense strand Proteins 0.000 description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 4
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 4
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 4
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 4
- 239000004473 Threonine Substances 0.000 description 4
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 240000008042 Zea mays Species 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 150000007513 acids Chemical class 0.000 description 4
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 4
- 229950006790 adenosine phosphate Drugs 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 229940098773 bovine serum albumin Drugs 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 4
- 238000001952 enzyme assay Methods 0.000 description 4
- 235000021588 free fatty acids Nutrition 0.000 description 4
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 231100001222 nononcogenic Toxicity 0.000 description 4
- 238000007899 nucleic acid hybridization Methods 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- HMFHBZSHGGEWLO-UHFFFAOYSA-N pentofuranose Chemical group OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 210000001236 prokaryotic cell Anatomy 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 239000006152 selective media Substances 0.000 description 4
- 239000013605 shuttle vector Substances 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 229940035893 uracil Drugs 0.000 description 4
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 3
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 3
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 229920001817 Agar Polymers 0.000 description 3
- 244000105624 Arachis hypogaea Species 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 244000068988 Glycine max Species 0.000 description 3
- 235000010469 Glycine max Nutrition 0.000 description 3
- 244000299507 Gossypium hirsutum Species 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 3
- 238000010222 PCR analysis Methods 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 241000219843 Pisum Species 0.000 description 3
- 235000010582 Pisum sativum Nutrition 0.000 description 3
- 108010029485 Protein Isoforms Proteins 0.000 description 3
- 102000001708 Protein Isoforms Human genes 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 125000002252 acyl group Chemical group 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 229940009098 aspartate Drugs 0.000 description 3
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000002742 combinatorial mutagenesis Methods 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 229930182830 galactose Natural products 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 229930195712 glutamate Natural products 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 229960000310 isoleucine Drugs 0.000 description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 3
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 3
- 239000012139 lysis buffer Substances 0.000 description 3
- 230000009871 nonspecific binding Effects 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 229910052760 oxygen Inorganic materials 0.000 description 3
- 150000002972 pentoses Chemical class 0.000 description 3
- 210000002824 peroxisome Anatomy 0.000 description 3
- 238000002823 phage display Methods 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 3
- 239000010452 phosphate Substances 0.000 description 3
- 150000004713 phosphodiesters Chemical group 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000005204 segregation Methods 0.000 description 3
- 239000000600 sorbitol Substances 0.000 description 3
- 230000001629 suppression Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 239000001226 triphosphate Substances 0.000 description 3
- 235000011178 triphosphate Nutrition 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- GVEZIHKRYBHEFX-MNOVXSKESA-N 13C-Cerulenin Natural products CC=CCC=CCCC(=O)[C@H]1O[C@@H]1C(N)=O GVEZIHKRYBHEFX-MNOVXSKESA-N 0.000 description 2
- HCWLJSDMOMMDRF-SZWOQXJISA-N 3-[(3s,6r)-2-oxo-6-[(1s,2r,3r)-1,2,3,4-tetrahydroxybutyl]morpholin-3-yl]propanamide Chemical compound NC(=O)CC[C@@H]1NC[C@H]([C@@H](O)[C@H](O)[C@H](O)CO)OC1=O HCWLJSDMOMMDRF-SZWOQXJISA-N 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 2
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- ILQOASSPNGAIBC-UHFFFAOYSA-N Agropine Natural products OC(O)C(O)CC(O)C1CN2C(CCC2=O)C(=O)O1 ILQOASSPNGAIBC-UHFFFAOYSA-N 0.000 description 2
- 235000010777 Arachis hypogaea Nutrition 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 2
- 241001301148 Brassica rapa subsp. oleifera Species 0.000 description 2
- 244000020518 Carthamus tinctorius Species 0.000 description 2
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 2
- 244000060011 Cocos nucifera Species 0.000 description 2
- 235000013162 Cocos nucifera Nutrition 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 229920000742 Cotton Polymers 0.000 description 2
- IMXSCCDUAFEIOE-UHFFFAOYSA-N D-Octopin Natural products OC(=O)C(C)NC(C(O)=O)CCCN=C(N)N IMXSCCDUAFEIOE-UHFFFAOYSA-N 0.000 description 2
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 2
- IMXSCCDUAFEIOE-RITPCOANSA-N D-octopine Chemical compound [O-]C(=O)[C@@H](C)[NH2+][C@H](C([O-])=O)CCCNC(N)=[NH2+] IMXSCCDUAFEIOE-RITPCOANSA-N 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 235000001950 Elaeis guineensis Nutrition 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 241000701959 Escherichia virus Lambda Species 0.000 description 2
- 241000724791 Filamentous phage Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 101150094690 GAL1 gene Proteins 0.000 description 2
- 102100028501 Galanin peptides Human genes 0.000 description 2
- 238000012357 Gap analysis Methods 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 244000020551 Helianthus annuus Species 0.000 description 2
- 235000003222 Helianthus annuus Nutrition 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 2
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- 240000006240 Linum usitatissimum Species 0.000 description 2
- 235000004431 Linum usitatissimum Nutrition 0.000 description 2
- 108030000162 Long-chain-fatty-acid-[acyl-carrier-protein] ligases Proteins 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 235000021360 Myristic acid Nutrition 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 108700001094 Plant Genes Proteins 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 235000004443 Ricinus communis Nutrition 0.000 description 2
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 2
- 108020005543 Satellite RNA Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 101001078419 Solanum lycopersicum Fatty acid hydroperoxide lyase, chloroplastic Proteins 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 244000299461 Theobroma cacao Species 0.000 description 2
- 235000009470 Theobroma cacao Nutrition 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 102100023048 Very long-chain acyl-CoA synthetase Human genes 0.000 description 2
- 206010052428 Wound Diseases 0.000 description 2
- 208000027418 Wounds and injury Diseases 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 229960004150 aciclovir Drugs 0.000 description 2
- MKUXAQIIEYXACX-UHFFFAOYSA-N aciclovir Chemical compound N1C(N)=NC(=O)C2=C1N(COCCO)C=N2 MKUXAQIIEYXACX-UHFFFAOYSA-N 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- GVEZIHKRYBHEFX-UHFFFAOYSA-N caerulein A Natural products CC=CCC=CCCC(=O)C1OC1C(N)=O GVEZIHKRYBHEFX-UHFFFAOYSA-N 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 2
- 210000004671 cell-free system Anatomy 0.000 description 2
- GVEZIHKRYBHEFX-NQQPLRFYSA-N cerulenin Chemical compound C\C=C\C\C=C\CCC(=O)[C@H]1O[C@H]1C(N)=O GVEZIHKRYBHEFX-NQQPLRFYSA-N 0.000 description 2
- 229950005984 cerulenin Drugs 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 231100000433 cytotoxic Toxicity 0.000 description 2
- 230000001472 cytotoxic effect Effects 0.000 description 2
- 230000007123 defense Effects 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000000151 deposition Methods 0.000 description 2
- 229960002086 dextran Drugs 0.000 description 2
- 239000008121 dextrose Substances 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000001747 exhibiting effect Effects 0.000 description 2
- 230000004129 fatty acid metabolism Effects 0.000 description 2
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 2
- 230000035558 fertility Effects 0.000 description 2
- IRSCQMHQWWYFCW-UHFFFAOYSA-N ganciclovir Chemical compound O=C1NC(N)=NC2=C1N=CN2COC(CO)CO IRSCQMHQWWYFCW-UHFFFAOYSA-N 0.000 description 2
- 229960002963 ganciclovir Drugs 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 108010064894 hydroperoxide lyase Proteins 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000000099 in vitro assay Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- ZNJFBWYDHIGLCU-HWKXXFMVSA-N jasmonic acid Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-HWKXXFMVSA-N 0.000 description 2
- 230000037356 lipid metabolism Effects 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 108091005601 modified peptides Proteins 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 150000003833 nucleoside derivatives Chemical class 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 238000004091 panning Methods 0.000 description 2
- 239000012188 paraffin wax Substances 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 210000002706 plastid Anatomy 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 235000015112 vegetable and seed oil Nutrition 0.000 description 2
- 239000008158 vegetable oil Substances 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- TWJNQYPJQDRXPH-UHFFFAOYSA-N 2-cyanobenzohydrazide Chemical compound NNC(=O)C1=CC=CC=C1C#N TWJNQYPJQDRXPH-UHFFFAOYSA-N 0.000 description 1
- 125000000954 2-hydroxyethyl group Chemical group [H]C([*])([H])C([H])([H])O[H] 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- UBPVOHPZRZIJHM-WOUKDFQISA-N 5'-acetylphosphoadenosine Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 UBPVOHPZRZIJHM-WOUKDFQISA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 101150060993 ACT8 gene Proteins 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000724328 Alfalfa mosaic virus Species 0.000 description 1
- 241000234282 Allium Species 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 235000003840 Amygdalus nana Nutrition 0.000 description 1
- 244000296825 Amygdalus nana Species 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 101100000149 Arabidopsis thaliana AAE3 gene Proteins 0.000 description 1
- 235000003911 Arachis Nutrition 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000726301 Avocado sunblotch viroid Species 0.000 description 1
- 239000007989 BIS-Tris Propane buffer Substances 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 229920002799 BoPET Polymers 0.000 description 1
- 244000060924 Brassica campestris Species 0.000 description 1
- 241000220243 Brassica sp. Species 0.000 description 1
- 241000724256 Brome mosaic virus Species 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 235000007516 Chrysanthemum Nutrition 0.000 description 1
- 240000005250 Chrysanthemum indicum Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108091028075 Circular RNA Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 1
- 108091027551 Cointegrate Proteins 0.000 description 1
- 241000724252 Cucumber mosaic virus Species 0.000 description 1
- 241000612153 Cyclamen Species 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 1
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 238000007900 DNA-DNA hybridization Methods 0.000 description 1
- 241000283715 Damaliscus lunatus Species 0.000 description 1
- 240000003421 Dianthus chinensis Species 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 240000003133 Elaeis guineensis Species 0.000 description 1
- 244000127993 Elaeis melanococca Species 0.000 description 1
- 102000002494 Endoribonucleases Human genes 0.000 description 1
- 108010093099 Endoribonucleases Proteins 0.000 description 1
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000221079 Euphorbia <genus> Species 0.000 description 1
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 241000956293 Fulda Species 0.000 description 1
- 229920002148 Gellan gum Polymers 0.000 description 1
- 241000735332 Gerbera Species 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 1
- UYTPUPDQBNUYGX-UHFFFAOYSA-N Guanine Natural products O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- 235000021506 Ipomoea Nutrition 0.000 description 1
- 241000207783 Ipomoea Species 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000234435 Lilium Species 0.000 description 1
- 101710153103 Long-chain-fatty-acid-CoA ligase FadD13 Proteins 0.000 description 1
- 241000724705 Lucerne transient streak virus Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 102000009151 Luteinizing Hormone Human genes 0.000 description 1
- 108010073521 Luteinizing Hormone Proteins 0.000 description 1
- 241000220225 Malus Species 0.000 description 1
- 239000005041 Mylar™ Substances 0.000 description 1
- TUNFSRHWOTWDNC-UHFFFAOYSA-N Myristic acid Natural products CCCCCCCCCCCCCC(O)=O TUNFSRHWOTWDNC-UHFFFAOYSA-N 0.000 description 1
- 101710202365 Napin Proteins 0.000 description 1
- 241000234479 Narcissus Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 101150103068 P2 gene Proteins 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 241000218996 Passiflora Species 0.000 description 1
- 102000009913 Peroxisomal Targeting Signal 2 Receptor Human genes 0.000 description 1
- 108010077056 Peroxisomal Targeting Signal 2 Receptor Proteins 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- 241000219833 Phaseolus Species 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 235000011432 Prunus Nutrition 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108020004518 RNA Probes Proteins 0.000 description 1
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 1
- 239000003391 RNA probe Substances 0.000 description 1
- 241000700157 Rattus norvegicus Species 0.000 description 1
- 208000035415 Reinfection Diseases 0.000 description 1
- 206010038997 Retroviral infections Diseases 0.000 description 1
- 240000000528 Ricinus communis Species 0.000 description 1
- 235000011449 Rosa Nutrition 0.000 description 1
- 241001092459 Rubus Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 101100127688 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FAA1 gene Proteins 0.000 description 1
- 101100127692 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FAA4 gene Proteins 0.000 description 1
- 101100465559 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) PRE7 gene Proteins 0.000 description 1
- 241001496113 Santalum Species 0.000 description 1
- 235000008631 Santalum Nutrition 0.000 description 1
- 241000724704 Solanum nodiflorum mottle virus Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000187435 Streptomyces griseolus Species 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 241000724703 Subterranean clover mottle virus Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 241000723677 Tobacco ringspot virus Species 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- 101150011790 UBI3 gene Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 241000724701 Velvet tobacco mottle virus Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 241000726445 Viroids Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- RGJOEKWQDUBAIZ-HDCXRZRFSA-N [[(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-4-hydroxy-3-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(3s)-3-hydroxy-2,2-dimethyl-4-oxo-4-[[3-oxo-3-(2-sulfanylethylamino)propyl]amino]butyl] hydrogen phosphate Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-HDCXRZRFSA-N 0.000 description 1
- 230000035508 accumulation Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 101150067366 adh gene Proteins 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 150000001335 aliphatic alkanes Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 229930013930 alkaloid Natural products 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 239000008346 aqueous phase Substances 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 229940072107 ascorbate Drugs 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 238000010256 biochemical assay Methods 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- HHKZCCWKTZRCCL-UHFFFAOYSA-N bis-tris propane Chemical compound OCC(CO)(CO)NCCCNC(CO)(CO)CO HHKZCCWKTZRCCL-UHFFFAOYSA-N 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229930186364 cyclamen Natural products 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 229960001484 edetic acid Drugs 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000008157 edible vegetable oil Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- CCIVGXIOQKPBKL-UHFFFAOYSA-N ethanesulfonic acid Chemical compound CCS(O)(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-N 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000004136 fatty acid synthesis Effects 0.000 description 1
- 229930003935 flavonoid Natural products 0.000 description 1
- 150000002215 flavonoids Chemical class 0.000 description 1
- 235000017173 flavonoids Nutrition 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 230000030279 gene silencing Effects 0.000 description 1
- 238000012226 gene silencing method Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 239000012869 germination medium Substances 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 150000002313 glycerolipids Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- IVSXFFJGASXYCL-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=NC=N[C]21 IVSXFFJGASXYCL-UHFFFAOYSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000001744 histochemical effect Effects 0.000 description 1
- 101150047832 hpt gene Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 238000012872 hydroxylapatite chromatography Methods 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 238000005286 illumination Methods 0.000 description 1
- 238000011532 immunohistochemical staining Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- ZNJFBWYDHIGLCU-UHFFFAOYSA-N jasmonic acid Natural products CCC=CCC1C(CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-UHFFFAOYSA-N 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 231100000225 lethality Toxicity 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000006372 lipid accumulation Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000005567 liquid scintillation counting Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 238000002803 maceration Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 150000004667 medium chain fatty acids Chemical class 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 150000004702 methyl esters Chemical class 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 238000012261 overproduction Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 150000002943 palmitic acids Chemical class 0.000 description 1
- 235000020232 peanut Nutrition 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 230000000858 peroxisomal effect Effects 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- 229940080469 phosphocellulose Drugs 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 238000013081 phylogenetic analysis Methods 0.000 description 1
- 230000003032 phytopathogenic effect Effects 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 239000004014 plasticizer Substances 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- NLJUOWMZPAVXHU-UHFFFAOYSA-N prop-2-enamide;urea Chemical compound NC(N)=O.NC(=O)C=C NLJUOWMZPAVXHU-UHFFFAOYSA-N 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000030788 protein refolding Effects 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 235000014774 prunus Nutrition 0.000 description 1
- 208000009305 pseudorabies Diseases 0.000 description 1
- 101150076896 pts1 gene Proteins 0.000 description 1
- 238000004080 punching Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000003345 scintillation counting Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- GUGNSJAORJLKGP-UHFFFAOYSA-K sodium 8-methoxypyrene-1,3,6-trisulfonate Chemical compound [Na+].[Na+].[Na+].C1=C2C(OC)=CC(S([O-])(=O)=O)=C(C=C3)C2=C2C3=C(S([O-])(=O)=O)C=C(S([O-])(=O)=O)C2=C1 GUGNSJAORJLKGP-UHFFFAOYSA-K 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- TUNFSRHWOTWDNC-HKGQFRNVSA-N tetradecanoic acid Chemical compound CCCCCCCCCCCCC[14C](O)=O TUNFSRHWOTWDNC-HKGQFRNVSA-N 0.000 description 1
- 238000009482 thermal adhesion granulation Methods 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 239000010937 tungsten Substances 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 150000004669 very long chain fatty acids Chemical class 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23D—EDIBLE OILS OR FATS, e.g. MARGARINES, SHORTENINGS OR COOKING OILS
- A23D9/00—Other edible oils or fats, e.g. shortenings or cooking oils
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8247—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Nutrition Science (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Food Science & Technology (AREA)
- Polymers & Plastics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Enzymes And Modification Thereof (AREA)
Description
PLANT ACYL-COA SYNTHETASES
This application claims priority from provisional application U. S. Serial No. 60/220,474 filed on July 21 , 2000.
FIELD OF THE INVENTION
The present invention relates to genes and proteins encoding plant acyl-CoA synthetases and methods of their use.
BACKGROUND
Plant metabolism has evolved the ability to produce a diverse range of structures, including more than 20,000 different terpenoids, flavonoids, alkaloids, and fatty acids. Fatty acids have been extensively exploited for industrial uses in products such as lubricants, plasticizers, and surfactants, hi fact, approximately one-third of vegetable oils produced in the world are already used for non-food purposes (Ohlrogge, J (1994) Plant Physiol. 104:821-26).
In 1999, approximately 40 million hectares of transgenic crops were planted worldwide. Included in this figure is approximately 50 percent ofthe soybean acreage in the United States, over 70percent ofthe Canola acreage in Canada, about 20 percent ofthe United States corn crop, and about 33 percent ofthe United States cotton crop (Ohlrogge, J (1999) Curr. Opin. Plant Biol. 2:121-22).
Various laboratories around the world have attempted to modify triacylglycerol (TAG) content in oilseed crops by manipulating the genes involved in TAG biosynthesis. The TAG biosynfhetic pathway involves many enzymatic reactions. An increasing number ofthe genes that encode these enzymes have been cloned and studied in detail with respect to the quantitative and qualitative contributions they make to the TAG composition of a particular oilseed. There are still several genes in the TAG pathway, however, that have not been cloned and characterized in detail.
Most ofthe efforts to modify TAG content have focused on either increasing the nutritional characteristics and chemical stability of edible oils or on introducing new and unusual fatty acids into TAGs for use in various industrial applications. Progress has been achieved through over-expression and/or suppression of a modestly small number
of genes in the TAG synthesis pathway. However, to date, the alterations in fatty acid content have not been substantial enough to create truly meaningful new oilseed lines. Thus, there remains a need to identify and characterize additional genes in the TAG synthesis pathway, the manipulation of which can contribute to altered or increased fatty acid content in oilseeds.
SUMMARY OF THE INVENTION
The present invention relates to nucleic acid sequences encoding plant acyl-CoA synthetases (ACS) and to methods of their use, and to the proteins encoded by these nucleic acid sequences. The present invention is not limited to any particular nucleic acid or amino acid sequence.
Accordingly, in some embodiments of one aspect, the present invention an isolated nucleic acid sequence comprising SEQ ID NO:l or SEQ ID NO:2 of SEQ ID NO:3 or SEQ ID NO:4 or SEQ ID NO:5 or SEQ ID NO:6 or SEQ ID NO:7 or SEQ ID NO:8 or SEQ ID NO:9 or SEQ ID NO:10 or SEQ ID NO:l 1. The present invention is not limited to the nucleic acid sequences having SEQ ID NOs:l-l 1. Indeed, it is contemplated that the present invention encompasses homologs, variants, and portions or fragments ofthe nucleic acids having SEQ ID NOs:l-l 1. Accordingly, in some embodiments the present invention provides isolated nucleic acid sequences that hybridize to the nucleic acids encoded by SEQ ID NOs:l-l 1 under conditions of low to high stringency. In other embodiments, the present invention provides isolated nucleic acid sequences that compete with or inhibit the binding ofthe nucleic acid sequences encoded by SEQ ID NOs: 1-11 to their complements. In some preferred embodiments, the nucleic acid sequences encode a protein with acyl-CoA synthetase activity. In some particularly preferred embodiments, the nucleic acid sequences encode a protein that catalyzes the esterification of a fatty acid and coenzyme A. In other particularly preferred embodiments, the nucleic acid sequences encode a protein comprising an amino acid sequence from SEQ ID NOs: 12-22.
In some embodiments ofthe present invention, the nucleic acids described above are operably linked to a heterologous promoter. In further embodiments, the nucleic acid sequences described above are contained within a vector. In some embodiments, the nucleic acid sequence in the vector is in a sense orientation; in other embodiments, the nucleic acid sequence in the vector is in an antisense orientation. The present
invention also provides compositions comprising isolated nucleic acids or vectors as described above.
In still further embodiments, the present invention provides a host cell transfected with a nucleic acid sequence or vector or composition as described above. The present invention is not limited to any particular host cell. Indeed, a variety of host cells are contemplated, including, but not limited to, prokaryotic cells, eukaryotie cells, plant tissue cells, and cells inplanta. The present invention also provides a plant transfected with a nucleic acid sequence or a vector or a composition as described above, as well as a seed or oil from such a transfected plant. The present invention also provides an isolated nucleic acid sequence or a composition or a vector as described above for use in transfecting a plant or for use in altering a phenotype of a plant. In some embodiments, a method for making a transgenic plant comprises providing a nucleic acid sequence or a vector or a composition as described above, and plant tissue, and transfecting the plant tissue with the nucleic acid sequence or the vector or the composition under conditions such that a transgenic plant is generated. In other embodiments, a method for altering a phenotype of a plant comprises providing a nucleic acid sequence or a vector or a composition as described above, and plant tissue, and transfecting the plant tissue with the nucleic acid sequence or the vector or the composition under conditions such that a transgenic plant is generated and the phenotype is altered.
In other embodiments, the present invention provides methods for assaying acyl- CoA synthetase activity comprising providing a nucleic acid sequence or vector or composition as described above, expressing the nucleic acid sequence under conditions such that a protein is produced; and assaying the activity ofthe protein. The present invention also provides methods for producing variants of acyl-CoA synthetases comprising providing any ofthe nucleic acid sequences described above, mutagenizing the nucleic acid sequence, and screening a variant encoded by the mutagenized nucleic acid sequence for activity.
The present invention also provides methods for screening acyl-CoA synthetases comprising providing a candidate acyl-CoA synthetase and analyzing the candidate acyl-CoA synthetase for the presence of at least one of ACS motifs 1-9.
In additional embodiments, the present invention provides an isolated nucleic acid sequence encoding a plant acyl-CoA synthetase, wherein the plant acyl-CoA
synthetase competes for binding to a fatty acid substrate with a protein encoded by a nucleic acid sequence as described above, and composition comprising such an isolated nucleic acid sequence.
In other embodiments, the present invention provides a first isolated nucleic acid sequence, wherein the first nucleic acid sequence inhibits the binding of at least a portion of a second nucleic acid sequence to its complementary sequence and wherein the second nucleic acid sequence has a nucleic sequence as described above, and compositions comprising such a first nucleic acid sequence.
In other embodiments, the present invention provides a purified protein comprising an amino sequence encoded by a nucleic acid sequence as described above, and compositions comprising such a purified protein.
The present invention also relates to nucleic acid sequences encoding plant AMP binding proteins (AMP-BPs) and to methods of their use and to the proteins encoded by the nucleic acid sequences. The present invention is not limited to any particular nucleic acid or amino acid sequence.
In this aspect, the present invention provides compositions comprising an isolated nucleic acid sequence having SEQ ID NOs:23-32; these sequences encode an AMP binding protein. The present invention is not limited to nucleic acid sequences having SEQ ID NOs:23-32. Indeed, it is contemplated that the present invention encompasses homologs, variants, and portions or fragments ofthe nucleic acid sequences having SEQ ID NOs:23-32. Accordingly, in some embodiments, the present invention provides nucleic acid sequences that hybridize to the nucleic acids having SEQ ID NOs. "23-32 under conditions of low to high stringency. In other embodiments, the present invention provides nucleic acid sequences that compete with or inhibit the binding ofthe nucleic acid sequences having SEQ ID NOs:23-32 to their complements. In some preferred embodiments, the nucleic acids encode a protein with AMP binding activity. In other particularly preferred embodiments, the nucleic acid sequences encode a protein comprising an amino acid sequence from SEQ ID NOs: 33-42.
In some embodiments ofthe present invention, the nucleic acids described above are operably linked to a heterologous promoter. In further embodiments, the sequences described above are contained within a vector. In some embodiments, the nucleic acid sequence is in a sense orientation; in other embodiments, the nucleic acid sequence is in an antisense orientation.
In still further embodiments, the vectors are within a host cell. The present invention is not limited to any particular host cell. Indeed, a variety of host cells are contemplated, including, but not limited to, prokaryotic cells, eukaryotie cells, plant tissue cells, and cells inplanta. In yet further embodiments, the present invention provides compositions comprising nucleic acid sequences or vectors as described above in a second aspect ofthe present invention.
In other embodiments, the present invention provides a nucleic acid sequence or vector or composition as described above in a second aspect ofthe present invention for use in generating a transgenic plant or for use in altering a phenotype of a plant. A method for generating a transgenic plant comprises providing a nucleic acid sequence or a vector or a composition as described above in a second aspect ofthe present invention, and plant tissue, and transfecting the plant tissue with the nucleic acid sequence or the vector or the composition under conditions such that a transgenic plant is generated. A method for altering a phenotype of a plant comprising providing a nucleic acid sequence or vector or composition described above for the second aspect ofthe present invention, and plant tissue; and transfecting the plant tissue with the nucleic acid altered.
BRIEF DESCRIPTION OF THE DRAWINGS Figures 1 A - ID present an amino acid sequence alignment for Arabidopsis ACS and AMP-binding protein sequences.
Figure 2 a comparison ofthe degree of conservation ofthe deduced amino acid sequences of and around the insertional elements of each ACS. The residues corresponding to the predicted borders ofthe insertional element are numbered and denoted with arrows. These residues were determined by comparing the sequences of the candidate ACS genes to those ofthe other AMP-BP genes that were identified in the original data base screen and which lacked the insertional element. For clarity, Figure 2 displays only the first few amino acid residues that flank the upstream and downstream borders ofthe insertional region. Figure 3 provides the AtACSIA nucleic acid sequence (SEQ ID NO: 1).
Figure 4 provides the AtACSIB nucleic acid sequence (SEQ ID NO: 2). Figure 5 provides the AtACSIC nucleic acid sequence (SEQ ID NO: 3). Figure 6 provides the AtACS2 nucleic acid sequence (SEQ ID NO: 4).
Figure 7 provides the AtACS3A nucleic acid sequence (SEQ ID NO: 5).
Figure 8 provides the AtACS3B nucleic acid sequence (SEQ ID NO: 6).
Figure 9 provides the AtACS4A nucleic acid sequence (SEQ ID NO: 7).
Figure 10 provides the A1ACS4B nucleic acid sequence (SEQ ID NO: 8). Figure 11 provides the A1ACS5 nucleic acid sequence (SEQ ID NO: 9).
Figure 12 provides the AtACS6A nucleic acid sequence (SEQ ID NO: 10).
Figure 13 provides the AtACS6B nucleic acid sequence (SEQ ID NO: 11).
Figure 14 provides the AtACSIA amino acid sequence (SEQ ID NO: 12).
Figure 15 provides the AtACSIB amino acid sequence (SEQ ID NO: 13). Figure 16 provides the AtACSIC amino acid sequence (SEQ ID NO: 14).
Figure 17 provides the AtACS2 amino acid sequence (SEQ ID NO: 15).
Figure 18 provides the AtACS3A amino acid sequence (SEQ ID NO: 16).
Figure 19 provides the AtACS3B amino acid sequence (SEQ ID NO: 17).
Figure 20 provides the AtACS4A amino acid sequence (SEQ ID NO: 18). Figure 21 provides the AtACS4B amino acid sequence (SEQ ID NO: 19).
Figure 22 provides the AtACS5 amino acid sequence (SEQ ID NO: 20).
Figure 23 provides the AtACS6A amino acid sequence (SEQ ID NO: 21).
Figure 24 provides the AtACS6B amino acid sequence (SEQ ID NO: 22).
Figure 25 provides the predicted AMP-BP 1 nucleic acid sequence (SEQ ID NO:
23)
Figure 26 provides the predicted AMP-BP2 nucleic acid sequence (SEQ ID NO:
24). Figure 27 provides the predicted AMP-BP3 nucleic acid sequence (SEQ ID NO:
25), Figure 28 provides the predicted AMP-BP4 nucleic acid sequence (SEQ ID NO:
26), Figure 29 provides the predicted AMP-BP5 nucleic acid sequence (SEQ ID NO:
27). Figure 30 provides the predicted AMP-BP6 nucleic acid sequence (SEQ ID NO:
28), Figure 31 provides the predicted AMP-BP7 nucleic acid sequence (SEQ ID NO:
29)
Figure 32 provides the predicted AMP-BP8 nucleic acid sequence (SEQ ID NO:
30).
Figure 33 provides the predicted AMP-BP9 nucleic acid sequence (SEQ ID NO:
31).
Figure 34 provides the predicted AMP-BP 10 nucleic acid sequence (SEQ ID
NO: 32)
Figure 35 provides the predicted AMP-BP 1 amino acid sequence (SEQ ID NO:
33).
Figure 36 provides the predicted AMP-BP2 amino acid sequence (SEQ ID NO:
35).
Figure 37 provides the predicted AMP-BP3 amino acid sequence (SEQ ID NO:
35).
Figure 38 provides the predicted AMP-BP4 amino acid sequence (SEQ ID NO: 36). Figure 39 provides the predicted AMP-BP5 amino acid sequence (SEQ ID NO:
37)
Figure 40 provides the predicted AMP-BP6 amino acid sequence (SEQ ID NO: 38).
Figure 41 provides the predicted AMP-BP7 amino acid sequence (SEQ ID NO: 39).
Figure 42 provides the predicted AMP-BP8 amino acid sequence (SEQ ID NO: 40).
Figure 43 provides the predicted AMP-BP9 amino acid sequence (SEQ ID NO: 41).
Figure 44 provides the predicted AMP-BP 10 amino acid sequence (SEQ ID NO: 42).
Figure 45 is an amino acid sequence alignment for ACS motif 1 (SEQ ID NO:43).
Figure 46 is an amino acid sequence alignment for ACS motif 2 (SEQ ID NO:44).
Figure 47 is an amino acid sequence alignment for ACS motif 3 (SEQ ID NO:45).
Figure 48 is an amino acid sequence alignment for ACS motif 4(SEQ ID NO:46).
Figure 49 is an amino acid sequence alignment for ACS motif 5 (SEQ ID NO:47). Figure 50 is an amino acid sequence alignment for ACS motif 6 (SEQ ID
NO:48).
Figure 51 is an amino acid sequence alignment for ACS motif 7 (SEQ ID NO:49).
Figure 52 is an amino acid sequence alignment for ACS motif 8 (SEQ ID NO:50).
Figure 53 is an amino acid sequence alignment for ACS motif 9 (SEQ ID NO:51).
Figure 54 shows a phylogenetic tree constructed to visually compare the relationship between each ofthe candidate ACS genes. Figure 55 shows the results of acyl-CoA synthetase activity from in vitro assays.
Figure 56 shows the results of a fatty acid analysis ofthe siliques from wild-type and AtACS6B knockout mutant Arabidopsis 42 day old plants grown under 14:10 photoperiod. The total lipids were derivatized with an internal standard using 2.5 percent H2SO4 in methanol and the fatty acid methyl esters were analyzed by gas chromatography. Values are means +/- SE (n = 12).
DESCRIPTION OF THE INVENTION
The present invention relates to genes encoding plant acyl-CoA synthetases (ACSs) and methods of their use. The present invention encompasses both native and recombinant wild-type forms ofthe enzyme, as well as mutant and variant forms, some of which possess altered characteristics relative to the wild-type enzyme. The present invention also relates to methods of using ACSs, including altered expression in transgenic plants and expression in prokaryotes and cell culture systems. After the "Definitions," the following description ofthe invention is divided into: I. Acyl-CoA Synthetases; II. Uses of Acyl-CoA Synthetase Nucleic Acids and Polypeptides; III. Identification of Other Acyl-CoA Synthetase Homologs; and IN. AMP Binding Proteins.
DEFINITIONS
To facilitate understanding ofthe invention, a number of terms are defined below.
The term "plant" as used herein refers to a plurality of plant cells which are largely differentiated into a structure that is present at any stage of a plant's development. Such structures include, but are not limited to, a fruit, shoot, stem, leaf, flower petal, etc. The term "plant tissue" includes differentiated and undifferentiated tissues of plants including, but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various types of cells in culture (for example, single cells, protoplasts, embryos, callus, etc.). Plant tissue may be inplanta, in organ culture, tissue culture, or cell culture.
"Oil-producing species" as used herein refers to plant species which produce and store triacylglycerol in specific organs, primarily in seeds. Such species include soybean (Glycine max), rapeseed and canola (including Brassica napus and B. campestris), sunflower (Helianthus annus), cotton (Gossypium hirsutum), corn (Zea mays), cocoa (Theobroma cacao), safflower (Carthamus tinctorius), oil palm (Elaeis guineensis), coconut palm (Cocos nucifera), flax (Linum usitatissimum), castor (Ricinus communis) and peanut (Arachis hypogaea). The group also includes non-agronomic species which are useful in developing appropriate expression vectors such as tobacco, rapid cycling Brassica species, and Arabidopsis thaliana, and wild species which may be a source of unique fatty acids.
As used herein, the term "acyl-CoA synthetase (ACS)" refers to an enzymatic activity that catalyzes the formation of an acyl-CoA-fatty acid ester from a free fatty acid and coenzyme A (CoA). As used herein, the term "plastidial acyl-CoA synthetase" refers to an enzymatic activity that catalyzes the formation of an acyl-CoA-fatty acid ester from a free fatty acid and coenzyme A and that is localized to the chloroplast. As used herein, the term "plant acyl-CoA synthetase" refers to an acyl-CoA synthetase derived from a plant. The term plant acyl-CoA synthetases encompasses both acyl CoA synthetases that are identical to wild-type plant acyl-CoA synthetases and those that are derived from wild type plant acyl-CoA synthetases (for example, variants of plant acyl CoA synthetases or chimeric genes constructed with portions of plant acyl CoA synthetase coding regions).
As used herein, the term "AMP binding protein" ("AMP-BP") refers to a protein comprising an AMP-binding motif, which is found in all ACS genes. This motif is associated with the ability of a protein to bind ATP and to create an acyl- or acetyl- adenylate intermediate. However, not all AMP-BPs are ACSs; thus, in addition to ACS, the AMP-BP superfamily also contains several other classes of genes, at least some of which, such as 4-coumarate-CoA ligases and acetyl-CoA synthetases, are known to exist in plants.
The term "gene" as used herein, refers to a DNA sequence that comprises control and coding sequences necessary for the production of a polypeptide or protein precursor. The polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence, as long as the desired protein activity is retained.
"Nucleoside", as used herein, refers to a compound consisting of apurine [guanine (G) or adenine (A)] or pyrimidine [thymine (T), uridine (TJ), or cytidine (C)] base covalently linked to a pentose, whereas "nucleotide" refers to a nucleoside phosphorylated at one of its pentose hydroxyl groups.
A "nucleic acid", as used herein, is a covalently linked sequence of nucleotides in which the 3' position ofthe pentose of one nucleotide is joined by a phosphodiester group to the 5' position ofthe pentose ofthe next, and in which the nucleotide residues (bases) are linked in specific sequence; in other words, a linear order of nucleotides. A "polynucleotide", as used herein, is a nucleic acid containing a sequence that is greater than about 100 nucleotides in length. An "oligonucleotide", as used herein, is a short polynucleotide or a portion of a polynucleotide. An oligonucleotide typically contains a sequence of about two to about one hundred bases. The word "oligo" is sometimes used in place ofthe word "oligonucleotide". Nucleic acid molecules are said to have a "5'-terminus" (5' end) and a "3'- terminus" (3' end) because nucleic acid phosphodiester linkages occur to the 5' carbon and 3' carbon ofthe pentose ring ofthe substituent mononucleotides. The end of a nucleic acid at which a new linkage would be to a 5' carbon is its 5' terminal nucleotide. The end of a nucleic acid at which a new linkage would be to a 3' carbon is its 3' terminal nucleotide. A terminal nucleotide, as used herein, is the nucleotide at the end position ofthe 3'- or 5'-terminus.
DNA molecules are said to have "51 ends" and "3' ends" because mononucleotides are reacted to make oligonucleotides in a manner such that the 5'
phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides referred to as the "5' end" if its 5' phosphate is not linked to the 3' oxygen of a mononucleotide pentose ring and as the "3" end" if its 3' oxygen is not linked to a 5' phosphate of a subsequent mononucleotide pentose ring.
As used herein, a nucleic acid sequence, even if internal to a larger oligonucleotide or polynucleotide, also may be said to have 5' and 3' ends. In either a linear or circular DNA molecule, discrete elements are referred to as being "upstream" or 5' ofthe "downstream" or 3' elements. This terminology reflects the fact that transcription proceeds in a 5' to 3' fashion along the DNA strand. Typically, promoter and enhancer elements that direct transcription of a linked gene are generally located 5' or upstream ofthe coding region. However, enhancer elements can exert their effect even when located 3' ofthe promoter element and the coding region. Transcription termination and polyadenylation signals are located 3' or downstream ofthe coding region.
The term "wild-type" when made in reference to a gene refers to a gene which has the characteristics of a gene isolated from a naturally occurring source. The term "wild-type" when made in reference to a gene product refers to a gene product which has the characteristics of a gene product isolated from a naturally occurring source. A wild-type gene is that which is most frequently observed in a population and is thus arbitrarily designated the "normal" or "wild-type" form ofthe gene. In contrast, the term "modified" or "mutant" when made in reference to a gene or to a gene product refers, respectively, to a gene or to a gene product which displays modifications in sequence and or functional properties (in other words, altered characteristics) when compared to the wild-type gene or gene product. It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that they have altered characteristics when compared to the wild-type gene or gene product.
The term "antisense" as used herein refers to a deoxyribonucleotide sequence whose sequence of deoxyribonucleotide residues is in reverse 5' to 3' orientation in relation to the sequence of deoxyribonucleotide residues in a sense strand of a DNA duplex. A "sense strand" of a DNA duplex refers to a strand in a DNA duplex which is transcribed by a cell in its natural state into a "sense mRNA." Thus an "antisense" sequence is a sequence having the same sequence as the non-coding strand in a DNA
duplex. The term "antisense RNA" refers to a RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks the expression of a target gene by interfering with the processing, transport and/or translation of its primary transcript or mRNA. The complementarity of an antisense RNA may be with any part ofthe specific gene transcript, in other words, at the 5' non-coding sequence, 3' non- coding sequence, introns, or the coding sequence. In addition, as used herein, antisense RNA may contain regions of ribozyme sequences that increase the efficacy of antisense RNA to block gene expression. "Ribozyme" refers to a catalytic RNA and includes sequence-specific endoribonucleases. "Antisense inhibition" refers to the production of antisense RNA transcripts capable of preventing the expression ofthe target protein. As used herein, the term "over-expression" refers to the production of a gene product in transgenic organisms that exceeds levels of production in normal or non- transformed organisms. As used herein, the term "cosuppression" refers to the expression of a foreign gene which has substantial homology to an endogenous gene resulting in the suppression of expression of both the foreign and the endogenous gene. As used herein, the term "altered levels" refers to the production of gene product(s) in transgenic organisms in amounts or proportions that differ from that of normal or non-transformed organisms.
The term "recombinant" when made in reference to a DNA molecule refers to a DNA molecule which is comprised of segments of DNA j oined together by means of molecular biological techniques. The term "recombinant" when made in reference to a protein or a polypeptide refers to a protein molecule which is expressed using a recombinant DNA molecule.
The term "nucleotide sequence of interest" refers to any nucleotide sequence, the manipulation of which may be deemed desirable for any reason (for example, confer improved qualities), by one of ordinary skill in the art. Such nucleotide sequences include, but are not limited to, coding sequences of structural genes (for example, reporter genes, selection marker genes, oncogenes, drug resistance genes, growth factors, etc.), and non-coding regulatory sequences which do not encode an mRNA or protein product, (for example, promoter sequence, polyadenylation sequence, termination sequence, enhancer sequence, etc.).
As used herein the term "coding region" when used in reference to structural gene refers to the nucleotide sequences which encode the amino acids found in the
nascent polypeptide as a result of translation of a mRNA molecule. Typically, the coding region is bounded on the 5' side by the nucleotide triplet "ATG" which encodes the initiator methionine and on the 3' side by a stop codon (e.g., TAA, TAG, TGA). In some cases the coding region is also known to initiate by a nucleotide triplet "TTG". As used herein, the terms "complementary" or "complementarity" when used in reference to polynucleotides refer to polynucleotides which are related by the base- pairing rules. For example, for the sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity may be "partial," in which only some ofthe nucleic acids' bases are matched according to the base pairing rules. Or, there may be "complete" or "total" complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods which depend upon binding between nucleic acids. A "complement" of a nucleic acid sequence as used herein refers to a nucleotide sequence whose nucleic acids show total complementarity to the nucleic acids ofthe nucleic acid sequence.
The term "homology" when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology or complete homology (in other words, identity). "Sequence identity" refers to a measure of relatedness between two or more nucleic acids or proteins, and is given as a percentage with reference to the total comparison length. The identity calculation takes into account those nucleotide or amino acid residues that are identical and in the same relative positions in their respective larger sequences. Calculations of identity may be performed by algorithms contained within computer programs such as "GAP" (Genetics Computer Group,
Madison, Wis.) and "ALIGN" (DNAStar, Madison, Wis.). A partially complementary sequence is one that at least partially inhibits (or competes with) a completely complementary sequence from hybridizing to a target nucleic acid is referred to using the functional term "substantially homologous." The inhibition of hybridization ofthe completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe will compete for and inhibit the binding (in other words, the hybridization) of a sequence
which is completely homologous to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (in other words, selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (for example, less than about 30 percent identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term "substantially homologous" refers to any probe which can hybridize to either or both strands ofthe double-stranded nucleic acid sequence under conditions of low stringency as described infra.
Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42_C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH2PO4*H2O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1 SDS, 5X Denhardt's reagent [50X Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 5X SSPE, 0.1 percent SDS at 42°C when a probe of about 500 nucleotides in length is employed.
High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH2PO4«H2O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.5 percent SDS, 5X Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0. IX SSPE, 1.0 percent SDS at 42°C when a probe of about 500 nucleotides in length is employed.
When used in reference to nucleic acid hybridization the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions; factors such as the length and nature (DNA, RNA, base composition) ofthe probe and nature ofthe target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration ofthe salts and other components (for example, the presence or absence of formamide, dextran sulfate,
polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above listed conditions.
"Stringency" when used in reference to nucleic acid hybridization typically occurs in a range from about Tm-5_C (5_C below the Tm ofthe probe) to about 20_C to 25_C below Tm. As will be understood by those of skill in the art, a stringent hybridization can be used to identify or detect identical polynucleotide sequences or to identify or detect similar or related polynucleotide sequences. Under "stringent conditions" a nucleic acid sequence of interest will hybridize to its exact complement and closely related sequences.
Polypeptide molecules are said to have an "amino terminus" (N-terminus) and a "carboxy terminus" (C-terminus) because peptide linkages occur between the backbone amino group of a first amino acid residue and the backbone carboxyl group of a second amino acid residue. Typically, the terminus of a polypeptide at which a new linkage would be to the carboxy-terminus ofthe growing polypeptide chain, and polypeptide sequences are written from left to right beginning at the amino terminus.
As used herein in reference to an amino acid sequence or a protein, the term "portion" (as in "a portion of an amino acid sequence") refers to fragments of that protein. The fragments may range in size from four amino acid residues to the entire amino acid sequence minus one amino acid.
As used herein, the term "fusion protein" refers to a chimeric protein containing the protein of interest (for example, ACSs and fragments thereof) joined to an exogenous protein fragment (for example, the fusion partner which consists of a non- ACS protein). The fusion partner may enhance the solubility of ACS protein as expressed in a host cell, may provide an affinity tag to allow purification ofthe recombinant fusion protein from the host cell or culture supernatant, or both. If desired, the fusion protein may be removed from the protein of interest (for example, ACS or fragments thereof) by a variety of enzymatic or chemical means know to the art.
As used herein, the term "transit peptide" refers to the N-terminal extension of a protein that serves as a signal for uptake and transport of that protein into an organelle such as a plastid or mitochondrion.
The term "isolated" when used in relation to a nucleic acid, as in "an isolated nucleic acid sequence" refers to a nucleic acid sequence that is identified and separated
from at least one contaminant nucleic acid with which it is ordinarily associated in its natural source. Isolated nucleic acid is nucleic acid present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA which are found in the state they exist in nature. For example, a given DNA sequence (for example, a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs which encode a multitude of proteins. However, an isolated nucleic acid sequence comprising SEQ ID NO:l includes, by way of example, such nucleic acid sequences in cells which ordinarily contain SEQ ID NO: 1 where the nucleic acid sequence is in a chromosomal or extrachromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid sequence may be present in single-stranded or double-stranded form. When an isolated nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will contain at a minimum at least a portion ofthe sense or coding strand (in other words, the nucleic acid sequence may be single- stranded). Alternatively, it may contain both the sense and anti-sense strands (in other words, the nucleic acid sequence may be double-stranded).
As used herein, the term "purified" refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated. An "isolated nucleic acid sequence" is therefore a purified nucleic acid sequence. "Substantially purified" molecules are at least 60 percent free, preferably at least 75 percent free, and more preferably at least 90 percent free from other components with which they are naturally associated. As used herein, the terms "vector" and "vehicle" are used interchangeably in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another. Vectors may include plasmids, bacteriophages, viruses, cosmids, and the like. The term "expression vector" or "expression cassette" as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression ofthe operably linked coding sequence in a particular host organism. Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome
binding site, often along with other sequences. Eukaryotie cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
The terms "targeting vector" or "targeting construct" refer to oligonucleotide sequences comprising a gene of interest flanked on either side by a recognition sequence which is capable of homologous recombination ofthe DNA sequence located between the flanking recognition sequences.
The terms "in operable combination", "in operable order" and "operably linked" as used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced. The term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
The term "selectable marker" as used herein, refer to a gene which encodes an enzyme having an activity that confers resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed. Selectable markers may be "positive" or "negative." Examples of positive selectable markers include the neomycin phosphotrasferase (NPTII) gene which confers resistance to G418 and to kanamycin, and the bacterial hygromycin phosphotransferase gene (hyg), which confers resistance to the antibiotic hygromycin. Negative selectable markers encode an enzymatic activity whose expression is cytotoxic to the cell when grown in an appropriate selective medium. For example, the HSN-tΛ: gene is commonly used as a negative selectable marker. Expression ofthe HSN-t& gene in cells grown in the presence of gancyclovir or acyclovir is cytotoxic; thus, growth of cells in selective medium containing gancyclovir or acyclovir selects against cells capable of expressing a functional HSN TK enzyme. Transcriptional control signals in eukaryotes comprise "promoter" and "enhancer" elements. Promoters and enhancers consist of short arrays of DΝA sequences that interact specifically with cellular proteins involved in transcription (Maniatis, et al, Science 236:1237, 1987). Promoter and enhancer elements have been isolated from a variety of eukaryotie sources including genes in yeast, insect, mammalian and plant cells. Promoter and enhancer elements have also been isolated from viruses and analogous control elements, such as promoters, are also found in prokaryotes. The selection of a particular promoter and enhancer depends on the cell type used to express the protein of interest. Some eukaryotie promoters and enhancers have a broad host range while others are functional in a limited subset of cell types (for
review, see Noss, et al, Trends Biochem. Sci., 11:287, 1986; and Maniatis, et al, supra 1987).
The terms "promoter element," "promoter," or "promoter sequence" as used herein, refer to a DΝA sequence that is located at the 5' end (in other words precedes) the protein coding region of a DΝA polymer. The location of most promoters known in nature precedes the transcribed region. The promoter functions as a switch, activating the expression of a gene. If the gene is activated, it is said to be transcribed, or participating in transcription. Transcription involves the synthesis of mRΝA from the gene. The promoter, therefore, serves as a transcriptional regulatory element and also provides a site for initiation of transcription ofthe gene into mRΝA.
Promoters may be tissue specific or cell specific. The term "tissue specific" as it applies to a promoter refers to a promoter that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (for example, seeds) in the relative absence of expression ofthe same nucleotide sequence of interest in a different type of tissue (for example, leaves). Tissue specificity of a promoter may be evaluated by, for example, operably linking a reporter gene to the promoter sequence to generate a reporter construct, introducing the reporter construct into the genome of a plant such that the reporter construct is integrated into every tissue ofthe resulting transgenic plant, and detecting the expression ofthe reporter gene (for example, detecting mRΝA, protein, or the activity of a protein encoded by the reporter gene) in different tissues ofthe transgenic plant. The detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression ofthe reporter gene in other tissues shows that the promoter is specific for the tissues in which greater levels of expression are detected. The term "cell type specific" as applied to a promoter refers to a promoter which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression ofthe same nucleotide sequence of interest in a different type of cell within the same tissue. The term "cell type specific" when applied to a promoter also means a promoter capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue. Cell type specificity of a promoter may be assessed using methods well known in the art, for example, immunohistochemical staining. Briefly, tissue sections are embedded in paraffin, and paraffin sections are reacted with a primary antibody which is specific for the polypeptide product encoded by the nucleotide sequence of
interest whose expression is controlled by the promoter. A labeled (for example, peroxidase conjugated) secondary antibody which is specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (for example, with avidin/biotin) by microscopy. Promoters may be constitutive or regulatable. The term "constitutive" when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (for example, heat shock, chemicals, light, etc.). Typically, constitutive promoters are capable of directing expression of a transgene in substantially any cell and any tissue. Exemplary constitutive plant promoters include, but are not limited to SD Cauliflower Mosaic Virus (CaMN SD; see for example, U.S. Pat. No. 5,352,605), mannopine synthase, octopine synthase (ocs), superpromoter (see for example, WO 95/14098), and ubi3 (see for example, Garbarino and Belknap (1994) Plant Mol. Biol. 24:119-127) promoters. Such promoters have been used successfully to direct the expression of heterologous nucleic acid sequences in transformed plant tissue.
In contrast, a "regulatable" promoter is one which is capable of directing a level of transcription of an operably linked nuclei acid sequence in the presence of a stimulus (for example, heat shock, chemicals, light, etc.) which is different from the level of transcription ofthe operably linked nucleic acid sequence in the absence ofthe stimulus. As used herein, the term "regulatory element" refers to a genetic element that controls some aspect ofthe expression of nucleic acid sequence(s). For example, a promoter is a regulatory element that facilitates the initiation of transcription of an operably linked coding region. Other regulatory elements are splicing signals, polyadenylation signals, termination signals, etc. The enhancer and/or promoter may be "endogenous" or "exogenous" or
"heterologous." An "endogenous" enhancer or promoter is one that is naturally linked with a given gene in the genome. An "exogenous" or "heterologous" enhancer or promoter is one that is placed in juxtaposition to a gene by means of genetic manipulation (in other words, molecular biological techniques) such that transcription of the gene is directed by the linked enhancer or promoter. For example, an endogenous promoter in operable combination with a first gene can be isolated, removed, and placed in operable combination with a second gene, thereby making it a "heterologous promoter" in operable combination with the second gene. A variety of such
combinations are contemplated (for example, the first and second genes can be from the same species, or from different species.
The presence of "splicing signals" on an expression vector often results in higher levels of expression ofthe recombinant transcript in eukaryotie host cells. Splicing signals mediate the removal of introns from the primary RNA transcript and consist of a splice donor and acceptor site (Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold Spring Harbor Laboratory Press, New York) pp. 16.7-16.8). A commonly used splice donor and acceptor site is the splice junction from the l6S RNA ofSV40. Efficient expression of recombinant DNA sequences in eukaryotie cells requires expression of signals directing the efficient termination and polyadenylation ofthe resulting transcript. Transcription termination signals are generally found downstream ofthe polyadenylation signal and are a few hundred nucleotides in length. The term "poly(A) site" or "poly(A) sequence" as used herein denotes a DNA sequence which directs both the termination and polyadenylation of the nascent RNA transcript. Efficient polyadenylation ofthe recombinant transcript is desirable, as transcripts lacking a poly(A) tail are unstable and are rapidly degraded. The poly(A) signal utilized in an expression vector may be "heterologous" or "endogenous." An endogenous poly(A) signal is one that is found naturally at the 3' end ofthe coding region of a given gene in the genome. A heterologous poly(A) signal is one which has been isolated from one gene and positioned 3' to another gene. A commonly used heterologous poly(A) signal is the SV40 poly(A) signal. The SV40 poly(A) signal is contained on a 237 bp BamH Bcll restriction fragment and directs both termination and polyadenylation (Sambrook, supra, at 16.6-16.7). The terms "infecting" and "infection" with a bacterium refer to co-incubation of a target biological sample, (for example, cell, tissue, etc.) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells ofthe target biological sample.
The term "Agrobacterium" refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall. The term "Agrobacterium" includes, but is not limited to, the strains Agrobacterium tumefaciens, (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium
generally results in the production of opines (for example, nopaline, agropine, octopine etc.) by the infected cell. Thus, Agrobacterium strains which cause production of nopaline (for example, strain LBA4301, C58, A208, GV3101) are referred to as
"nopaline-type" Agrobacteria; Agrobacterium strains which cause production of octopine (for example, strain LBA4404, Ach5, B6) are referred to as "octopine-type"
Agrobacteria; and Agrobacterium strains which cause production of agropine (e.g., strain EHA105, EHA101, A281) are referred to as "agropine-type" Agrobacteria.
The terms "bombarding, "bombardment," and "biolistic bombardment" refer to the process of accelerating particles towards a target biological sample (for example, cell, tissue, etc.) to effect wounding ofthe cell membrane of a cell in the target biological sample and/or entry ofthe particles into the target biological sample.
Methods for biolistic bombardment are known in the art (for example, U.S. Patent No.
5,584,807), and are commercially available (for example, the helium gas-driven microprojectile accelerator (PDS-1000/He, BioRad). The term "microwounding" when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle bombardment as described herein.
The term "transfection" as used herein refers to the introduction of foreign DNA into eukaryotie cells. Transfection may be accomplished by a variety of means known to the art including calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated transfection, polybrene-mediated transfection, electroporation, microinjection, liposome fusion, lipofection, protoplast fusion, retroviral infection, and biolistics.
The term "transgenic" when used in reference to a cell refers to a cell which contains a transgene, or whose genome has been altered by the introduction of a transgene. The term "transgenic" when used in reference to a tissue or to a plant refers to a tissue or plant, respectively, which comprises one or more cells that contain a transgene, or whose genome has been altered by the introduction of a transgene.
Transgenic cells, tissues and plants may be produced by several methods including the introduction of a "transgene" comprising nucleic acid (usually DNA) into a target cell or integration of the transgene into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
The term "transgene" as used herein refers to any nucleic acid sequence which is introduced into the genome of a cell by experimental manipulations. A transgene may
be an "endogenous DNA sequence," or a "heterologous DNA sequence" (in other words, "foreign DNA"). The term "endogenous DNA sequence" refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (for example, a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring sequence. The term "heterologous DNA sequence" refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature. Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell. Heterologous DNA also includes an endogenous DNA sequence which contains some modification. Generally, although not necessarily, heterologous DNA encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins (for example, proteins which confer drug resistance), etc.
The term "foreign gene" refers to any nucleic acid (for example, gene sequence) which is introduced into the genome of a cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene contains some modification (for example, a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring gene.
The term "transformation" as used herein refers to the introduction of a transgene into a cell. Transformation of a cell may be stable or transient. The term "transient transformation" or "transiently transformed" refers to the introduction of one or more transgenes into a cell in the absence of integration ofthe transgene into the host cell's genome. Transient transformation may be detected by, for example, enzyme- linked immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one or more ofthe transgenes. Alternatively, transient transformation may be detected by detecting the activity ofthe protein (for example, beta-glucuronidase) encoded by the transgene. The term "transient transformant" refers to a cell which has transiently incorporated one or more transgenes. In contrast, the term "stable transformation" or "stably transformed" refers to the introduction and integration of one or more transgenes into the genome of a cell. Stable transformation of a cell may be detected by Southern blot hybridization of genomic DNA ofthe cell with nucleic acid
sequences which are capable of binding to one or more ofthe transgenes. Alternatively, stable transformation of a cell may also be detected by the polymerase chain reaction of genomic DNA ofthe cell to amplify transgene sequences. The term "stable transformant" refers to a cell which has stably integrated one or more transgenes into the genomic DNA. Thus, a stable transformant is distinguished from a transient transformant in that, whereas genomic DNA from the stable transformant contains one or more transgenes, genomic DNA from the transient transformant does not contain a transgene.
The term "amplification" is defined as the production of additional copies of a nucleic acid sequence and is generally carried out using polymerase chain reaction technologies well known in the art (Dieffenbach and GS Dvekler, (1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview NY). As used herein, the term "polymerase chain reaction" ("PCR") refers to the methods disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,965,188, which describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands ofthe double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule. Following annealing, the primers are extended with a polymerase so as to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerase extension can be repeated many times (in other words, denaturation, annealing and extension constitute one "cycle"; there can be numerous "cycles") to obtain a high concentration of an amplified segment ofthe desired target sequence. The length ofthe amplified segment ofthe desired target sequence is determined by the relative positions ofthe primers with respect to each other, and therefore, this length is a controllable parameter. By virtue ofthe repeating aspect ofthe process, the method is referred to as the "polymerase chain reaction" (hereinafter "PCR"). Because the desired amplified segments ofthe target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be "PCR amplified."
With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (for example, hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; and/or incorporation of 32P-labeled deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any oligonucleotide sequence can be amplified with the appropriate set of primer molecules. In particular, the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications. Amplified target sequences may be used to obtain segments of DNA (for example, genes) for the construction of targeting vectors, transgenes, etc.
As used herein, the term "sample template" refers to a nucleic acid originating from a sample which is analyzed for the presence of "target". In contrast, "background template" is used in reference to nucleic acid other than sample template, which may or may not be present in a sample. Background template is most often inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic acid contaminants sought to be purified away from the sample. For example, nucleic acids other than those to be detected may be present as background in a test sample.
As used herein, the term "primer" refers to an oligonucleotide, whether occurring naturally (for example, as in a purified restriction digest) or produced synthetically, which is capable of acting as a point of initiation of nucleic acid synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced (in other words, in the presence of nucleotides, an inducing agent such as DNA polymerase, and under suitable conditions of temperature and pH). The primer is preferably single-stranded for maximum efficiency in amplification, but may alternatively be double-stranded. If double- stranded, the primer is first treated to separate its strands before being used to prepare extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact lengths ofthe primers will depend on many factors, including temperature, source of primer and use ofthe method.
As used herein, the term "probe" refers to an oligonucleotide (in other words, a sequence of nucleotides), whether occurring naturally (for example, as in a purified restriction digest) or produced synthetically, recombinantly or by PCR amplification,
which is capable of hybridizing to another oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular gene sequences. It is contemplated that the probe used in the present invention is labeled with any "reporter molecule," so that it is detectable in a detection system, including, but not limited to enzyme (in other words, ELISA, as well as enzyme-based histochemical assays), fluorescent, radioactive, and luminescent systems. It is not intended that the present invention be limited to any particular detection system or label. The terms "reporter molecule" and "label" are used herein interchangeably. In addition to probes, primers and deoxynucleoside triphosphates may contain labels; these labels may comprise, but are not limited to, 32P, 33P, SD, enzymes, or fluorescent molecules (e.g., fluorescent dyes).
I. Acyl-CoA Synthetases
Acyl-CoA synthetases (ACSs) catalyze the following reaction:
Fatty acid + CoASH + ATP ! acyl-CoA + AMP + PPi
wherein free fatty acids are activated through ATP-dependent thioesterification to coenzyme A. This reaction is critical to most fatty acid metabolism, since all but a few fatty acid-utilizing enzymes require activated forms of these molecules as substrates. The ACSs are particularly important to plant fatty acid metabolism. The present invention is not limited to any particular mechanism. Indeed, an understanding ofthe mechanism is not required to practice the present invention. However, it is contemplated that free fatty acids synthesized in the chloroplasts undergo activation by ACS at the plastid outer envelope membrane before being incorporated into TAG in the endoplasmic reticulum. Therefore, modifications of fatty acid distribution in TAG pools within a seed are likely affected by the various isoforms of ACS.
In addition to their roles in TAG biosynthesis, ACSs are thought to perform other important functions within the plant cell. It is contemplated that altered expression ofthe ACSs ofthe present invention may be utilized to alter these functions. For example, ACS is necessary for activating fatty acids released from oil bodies in newly germinated seedlings. These acyl-CoAs serve as substrates for the beta-oxidation cycle, which supplies the plant with cellular energy until it becomes photosynthetically
competent. ACS may also play a role in cuticle wax synthesis. The cuticle waxes are a mixture of hydrophobic lipid compounds found on the surfaces ofthe aerial tissues of most plants. These waxes retard water loss, protect the plants from pests, and provide signaling molecules needed for fertility. ACS is also a necessary component ofthe process of protein acylation. Several essential proteins and enzymes characterized in other eukaryotie organisms undergo coupling between myristic and/or palmitic acids and specific amino acid residues near their N-termini. These fatty acid modifications are necessary for proper targeting and function of these proteins. Most ofthe acylated target proteins are involved in signal fransduction or metabolic regulation. The fatty acids used for these modifications must be supplied as acyl-CoAs.
ACS also catalyzes the first step in the biosynthetic pathway of biotin, a vitamin cofactor necessary for many carboxy lation/decarboxylation reactions. ACS may also play an important role in the synthesis of jasmonic acid, an important fatty acid-derived signaling compound involved in reproduction, plant defense, and a number of other plant response reactions.
One ofthe major goals of modern plant biotechnology is to manipulate lipid metabolism in oilseed crops to produce new and improved edible and industrial vegetable oils. Lipids constitute the structural components of cellular membranes and act as sources of energy for the germinating seed. Both de novo synthesis and modification of existing lipids are dependent on the activity of ACSs, as described above. To date, ACSs have been recalcitrant to traditional methods of purification due to their association with membranes.
Despite their crucial role in lipid metabolism, ACSs have not been well- characterized in plants. To date, the only molecular information regarding plant ACSs is provided by Fulda et al, Plant Molec. Biol. 33:911-22 (1997), who describe five cDNA clones from Brassica napus, only two of which had ACS activity when expressed in E. coli. The present inventors have identified and cloned over 20 different genes, eleven of which are identified as ACSs; the remaining genes are AMP-BPs. These results indicate that, surprisingly, ACS exists as a much larger gene family in plants than could have been predicted from the results of Fulda et al.
The ACS genes were discovered by a step-wise procedure. The first step was computer-assisted homology comparisons between amino acid sequences of known
eukaryotie ACS sequences and EST sequences of Arabidopsis genome databases. Potential candidates, or ACS homologs, were then screened for the presence of a unique 40-50 position amino acid insertion near the middle of proteins encoded by ACS genes from Bassica napus; the results identified eleven genes as encoding ACSs. The sequences ofthe ACS genes were then compared by GAP analysis to establish that each gene was unique. The results of this analysis were also utilized to determine the relationships between the different genes; these relationships formed the basis on which to name the genes. The ACS homologs were also screened for activity by functional expression in Saccharomyces cerevisiae YB525 and for in vitro activity. Additional information about the identity and role ofthe ACS genes was obtained from analysis of their tissue-specific expression pattern, and chloroplast import assays. Furthermore, T- DNA Arabidopsis mutants lacking an ACS gene have been identified and are described.
Eleven ACS genes have been identified. This family therefore represents the largest ACS gene family yet described in a single species, surpassing even that of humans, which family is known to contain at least six genes that encode ACS or VLCS (very long chain acyl-CoA synthetase) ((Steinberg, S.J et al. (2000) Journal of Biological Chemistry 275(45): 35162-35169).
Accordingly, the present invention describes the isolation of several isoforms of ACS genes from Arabidopsis thaliana. It is contemplated that these genes and their homologs and variants will find use in the development of plants containing specialized fatty acid compositions. Each of these genes is discussed in further detail below.
A. ACS Nucleic Acids
Nucleic acids encoding plant ACSs were identified in the following manner. BLAST searches ofthe Arabidopsis genome database were conducted for EST sequences encoding polypeptides having homology to amino acid sequences of E. coli, rat, and yeast ACSs. ESTs having homology to the ACS genes were then ordered from the Arabidopsis Biological Resource Center (ABRC, Ohio State University) and used to screen a 2-3 kb size selected library (also from the ABRC). Full-length cDNAs were cloned into pPCR-Script Cam vectors (Stratagene) or pYES2 vectors (Stratagene) and sequenced.
Computer-assisted homology comparisons between known eukaryotie ACS sequences and the Arabidopsis sequences found either in library screens or in the public
databases revealed more than 40 genes containing significant homology to known ACSs from other eukaryotie organisms. Each of these genes contained the AMP-binding protein signature motif, which is found in all ACS genes; therefore, these genes were considered "ACS homologs." However, the identification of ACS genes from this simple sequence analysis was not possible. This is because other groups of proteins also contain the AMP-binding protein signature motif; thus, while all ACSs are AMP- binding proteins, the reverse is not true. In addition to ACS, the AMP-BP superfamily also contains several other classes of genes, some of which, such as 4-coumarate-CoA ligases and acetyl-CoA synthetases, are known to exist in plants. Therefore, what was needed was a more definitive ACS-specific sequence determinate with which to identify more likely ACS candidate genes.
Previous studies identified a unique 40-50 amino acid insertion near the middle of ACS enzymes in Brassica napus ((Fulda, M et al. (1997) Plant Mol Biol 33(5): 911- 22) and rat (Iijirna, H. et al. (1996) Eur J Biochem 242(2): 186-90). Although the precise function ofthe insertion was unknown, evidence indicated that it might be a necessary component of eukaryotie ACS gene function. Moreover, both the length and the location of this insertion is quite closely conserved between the rapeseed and rat clones, spanning approximately amino acid residues 330 to 380 within proteins of about 660 amino acids total. This sequence insertion was also found in many other eukaryotie ACSs known to activate long-chain (C14-C20) fatty acids (Fujino and Yamamoto,
1992), (Johnson, DR et al. (1994) J Cell Biol 127(3): 751-62), (Kang, MJ et al. (1997) Proc Natl Acad Sci U S A 94(7): 2880-4), but it was not found in the VLCS genes (very long chain fatty acyl-CoA synthetases, acyl chains >C22) (Uchiyama, A et al. (1996) J Biol Chem 271(48): 30360-5), (Berger, J et al. (1998) FEBS Lett 425(2): 305-9), (Min, KT and Benzer, S (1999) Science 284(5422): 1985-8), (Choi, JY and Martin, CE (1999) J Biol Chem 274(8): 4671-83), (Steinberg, SJ et al. (1999) Biochem and Biophys Res Comm 257(2): 615-621). It was also not found in any ofthe acetyl-CoA synthetases ((Ke et al, 2000)) or 4-coumarate-CoA ligases ((Lee, M et al. (1995) Science 280(5365): 915-918), (Ehlting, J et al. (1999) Plant J 19(1): 9-20) that had been cloned from Arabidopsis (data not shown). The maintenance of this sequence element in ACS genes from such evolutionarily distant species as Brassica napus and Rattus norvegicus, combined with its absence in genes that encode enzymes with specificity for short, or
very long, but not long chain, fatty acids, suggested that this sequence element might be very useful as a long chain ACS-specific sequence "probe".
Therefore, the presence of this sequence element was used as a probe to analyze the entire set of Arabidopsis genes that contained the AMP-BP signature motif. Eleven ofthe forty uncharacterized genes, or ACS homologs, contained insertions near the predicted sites within the deduced amino acid sequences. These eleven genes were therefore tentatively identified as ACS genes.
The amino acid sequences of these genes were then compared by GAP analysis; the results (as shown in Figure 1) established that each gene was unique. The results were also used as the basis for naming these genes. The genes are named AtACS for
Arabidopsis thaliana acyl-CoA synthetase. The genes are numbered starting with the number 1. If a gene possesses greater than 66 percent amino acid identity to any other gene(s), the number is maintained between the genes and each is lettered progressively
(1 A, IB, 1C etc.). A phylogenetic tree was constructed to visually compare the relationship between each ofthe candidate ACS genes. This tree is shown in Figure 54.
A summary ofthe information pertaining to each ofthe AtACS genes, including the corresponding EST sequences, is shown in Table 1.
The ACS genes were isolated generally as follows:
AtAMP-BP3 (SEQ ID NO: 25), AtACS3A (SEQ ID NO: 5), and AtACS 6A (SEQ ID NO: 10) were isolated from the library based on homology to ESTs FAFM13, 205M6T7, and G2B10T7, respectively. cDNAs corresponding to AtACS2 (SEQ ID NO: 4), AtACS6b (SEQ ID NO:
11), AtACS5 (SEQ ID NO: 9) were cloned from the library based on homology to ESTs
229E14T7, 203 Jl 1T7, and GbGel 15a, respectively. The 5' ends ofthe cDNAs were not
10 present in the isolated clones and were cloned by 5' RACE amplifications with total phage DNA isolated from the cDNA library.
cDNAs corresponding to AtACS3B (SEQ ID NO:6), AtACSIA (SEQ ID NO:l), and AtACS 1C (SEQ ID NO: 3) were cloned from the genomic library based on homology to ESTs 123N12T7, 240K22T7, and 119E14T7, respectively. Full length cDNAs were amplified using primers designed from the genomic sequences. Corresponding cDNA clones were apparently not present in the cDNA library.
AtACSIB (SEQ ID NO:2) was identified by a BLAST search from the Arabidopsis Genome Initiative database as a homologous sequence to AtACS 1 A and lC. Primers designed to the putative start and stop codons amplify an appropriately sized product from genomic DNA but do not amplify a cDNA clone when utilized for RT-PCR. The amplified clone was longer than the predicted cDNA.
AtACS4A (SEQ ID NO:7), which was originally named AMP-BP3 and later correctly identified as AtACS4A, was identified from the Arabidopsis databases using the sequence ofthe Brassica AMP-BP clone pMF28P (Genbank Accession # Z72151).
AtACS4B (SEQ ID NO:8) was found in the Arabidopsis database by homology to AtACS4A.
As described above, ACSs bear strong homology to other AMP-binding proteins. Therefore, it was necessary to screen candidate ACSs to determine if they did indeed encode ACS activity. The screens were conducted by screening for complementation ofthe mutant Saccharomyces cerevisiae strain YB525 (Johnson et al, (1994) J. Cell. Biol. 127:751-762), which is deficient in two ACS genes. In some cases, cDNAs originally suspected of encoding ACS activity were found not to be true ACSs (for example, AtAMP-BPl, SEQ ID NO:23, and AtAMP-BP3, SEQ ID NO: 25).
Accordingly, the present invention provides nucleic acids encoding plant ACSs (for example, such as the nucleic acid sequences SEQ ID NOs: 1-11, as shown in Figures 3-13, or which encode amino acid sequences SEQ ID NOS: 12-22, as shown in Figures 14-24). Other embodiments ofthe present invention provide nucleic acid sequences that are capable of hybridizing to SEQ ID NOs: 1-11 under conditions of high to low stringency. In some embodiments, the hybridizing nucleic acid sequence encodes a protein that retains at least one biological activity ofthe naturally occurring ACS it is derived from. In preferred embodiments, hybridization conditions are based on the melting temperature (Tm) ofthe nucleic acid binding complex and confer a defined "stringency" as explained above.
In other embodiments ofthe present invention, variants ofthe disclosed ACSs are provided. In preferred embodiments, variants result from mutation, (in other words, a change in the nucleic acid sequence) and generally produce altered mRNAs or polypeptides whose structure or function may or may not be altered. Any given gene may have none, one, or many variant forms. Common mutational changes that give rise to variants are generally ascribed to deletions, additions or substitutions of nucleic acids. Each of these types of changes may occur alone, or in combination with the others, and at the rate of one or more times in a given sequence.
It is contemplated that is possible to modify the structure of a peptide having an activity (for example, ACS activity) for such purposes as increasing synthetic activity or altering the affinity ofthe ACS for a particular fatty acid substrate. Such modified peptides are considered functional equivalents of peptides having an activity of an ACS as defined herein. A modified peptide can be produced in which the nucleotide sequence encoding the polypeptide has been altered, such as by substitution, deletion, or addition. In some preferred embodiments ofthe present invention, the alteration increases synthetic activity or alters the affinity ofthe ACS for a particular fatty acid substrate. In particularly preferred embodiments, these modifications do not significantly reduce the synthetic activity ofthe modified enzyme. In other words, construct "X" can be evaluated in order to determine whether it is a member ofthe genus of modified or variant ACSs ofthe present invention as defined functionally, rather than structurally. In preferred embodiments, the activity of variant ACSs is evaluated by the methods described in Examples 4 and 5. Accordingly, in some embodiments the present invention provides nucleic acids encoding plant acyl-CoA synthetases that complement yeast strain YB525. In other embodiments, the present invention provides nucleic acids encoding plant acyl-CoA synthetases that compete for the binding of fatty acid substrates with the proteins encoded by SEQ ID NOs: 1-11.
Moreover, as described above, variant forms of ACSs are also contemplated as being equivalent to those peptides and DNA molecules that are set forth in more detail herein. For example, it is contemplated that isolated replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an amino acid with a structurally related amino acid (in other words, conservative mutations) will not have a major effect on the biological activity ofthe resulting molecule. Accordingly, some embodiments ofthe present invention provide
variants of ACSs disclosed herein containing conservative replacements. Conservative replacements are those that take place within a family of amino acids that are related in their side chains. Genetically encoded amino acids can be divided into four families: (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine, histidine); (3) nonpolar (alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan); and (4) uncharged polar (glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine). Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be grouped as (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine, histidine), (3) aliphatic (glycine, alanine, valine, leucine, isoleucine, serine, threonine), with serine and threonine optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic (phenylalanine, tyrosine, tryptophan); (5) amide (asparagine, glutamine); and (6) sulfur - containing (cysteine and methionine) (e.g., Stryer ed., Biochemistry, pg. 17-21, 2nd ed, WH Freeman and Co., 1981). Whether a change in the amino acid sequence of a peptide results in a functional homolog can be readily determined by assessing the ability ofthe variant peptide to function in a fashion similar to the wild-type protein. Peptides having more than one replacement can readily be tested in the same manner.
More rarely, a variant includes "nonconservative" changes (for example, replacement of a glycine with a tryptophan). Analogous minor variations can also include amino acid deletions or insertions, or both. Guidance in determining which amino acid residues can be substitoted, inserted, or deleted without abolishing biological activity can be found using computer programs (for example, LASERGENE software, DNASTAR Inc., Madison, Wis.).
As described in more detail below, variants may be produced by methods such as directed evolution or other techniques for producing combinatorial libraries of variants, described in more detail below. In still other embodiments ofthe present invention, the nucleotide sequences ofthe present invention may be engineered in order to alter an ACS coding sequence including, but not limited to, alterations that modify the cloning, processing, localization, secretion, and/or expression ofthe gene product. For example, mutations may be introduced using techniques that are well known in the art (e.g., site-directed mutagenesis to insert new restriction sites, alter glycosylation patterns, or change codon preference, etc.).
B. ACS Polypeptides
The family of ACS genes provided by the present invention represents a very diverse group of genes, as indicated by the results ofthe ACS amino acid sequence analysis summarized in Figure 1 and Table 1. While half of the gene family members are nearly identical in length (approximately 665 amino acids) (AtACSIA, IB, IC, 2, and 5), the other half all contain N-terminal extensions of between 30 and 60 amino acid residues (AtACS3A, 3B, 4A, 4B, 6A, and 6B). As a group, the family of genes share only 30 percent identical amino acids and is clearly delineated into several distinct subgroupings. The number of ESTs associated with each ofthe ACS genes also varied considerably, with some genes represented by numerous ESTs and others not represented at all. Collectively, these observations support the biochemical evidence laid out in previous reports that the ACS gene family is responsible for providing acyl- CoA substrates for a number of distinct metabolic pathways that are carried out under conditions that vary considerably with respect to tissue type, cell type, and organelle, with varied levels of demand upon particular isoforms compared to others. It is interesting to note that all ofthe ACS amino acid sequences appear to lack a typical plastidial targeting consensus sequence, yet subsequent analysis has demonstrated that at least some of these ACSs can be imported into the chloroplast, and at least one ACS may be associated with the chloroplast envelope membranes (see Example 7). The degree of conservation ofthe deduced amino acid sequences of and around the insertional elements of each ACS gene ofthe present invention were also compared. The results of this comparison are shown in Figure 2. The residues corresponding to the predicted borders ofthe insertional element are numbered and denoted with arrows. These residues were determined by comparing the sequences ofthe candidate ACS genes to those ofthe other AMP-BP genes that were identified in the original data base screen and which lacked the insertional element. For clarity, Figure 2 displays only the first few amino acid residues that flank the upstream and downstream borders ofthe insertional region. Taking into account the N-terminal extensions present in some ofthe ACS genes, the comparison ofthe insertional element sequences confirmed the conservation of location of this element within the open reading frames of all members of this set of genes. The homology between the entire set of full-length insertional elements is quite weak, displaying approximately 30 percent identical amino acids between all eleven genes, which closely matches the degree of conservation between the
eleven full-length proteins. Surprisingly, the regions immediately flanking the insertional element, are highly conserved across the whole family of eleven candidate ACS genes (see Figure 2). These data suggest that amino acid residues encoded by the insertional element are necessary for proper ACS function in the plant, with the residues in the middle ofthe element evolving with the rest ofthe gene to diversify and specialize the enzymatic function of each gene, while the residues near the borders of the element constitute a more invariable region ofthe enzyme that is essential to the core reaction.
Accordingly, the present invention also provides ACS polypeptides (for example, SEQ ID NOs: 12-22 as shown in Figures 14-24). Still further embodiments of the present invention provide fragments, fusion proteins or functional equivalents of ACSs. Functional equivalents of ACSs may be screened for as described in Examples 4 and 5. In still other embodiments ofthe present invention, nucleic acid sequences corresponding to a selected ACS may be used to generate recombinant DNA molecules that direct the expression of an ACS and variants in appropriate host cells. In some embodiments ofthe present invention, the polypeptide may be a naturally purified product, while in other embodiments it may be a product of chemical synthetic procedures, and in still other embodiments it may be produced by recombinant techniques using a prokaryotic or eukaryotie host cell (for example, by bacterial cells in culture). In other embodiments, the polypeptides ofthe invention may also include an initial methionine amino acid residue.
In one embodiment ofthe present invention, due to the inherent degeneracy of the genetic code, DNA sequences other than SEQ ID NOs: 1-11 encoding substantially the same or a functionally equivalent amino acid sequence, may be used to clone and express an ACS. In general, such nucleic acid sequences hybridize to SEQ ID NOs: 1- 11 under conditions of high to low stringency as described above. As will be understood by those of skill in the art, it may be advantageous to produce ACS-encoding nucleotide sequences possessing non-naturally occurring codons. Therefore, in some preferred embodiments, codons preferred by a particular prokaryotic or eukaryotie host are selected, for example, to increase the rate of ACS expression or to produce recombinant RNA transcripts having desirable properties, such as increased synthetic activity or altered affinity ofthe ACS for a particular fatty acid substrate.
II. Uses of ACS Polynucleotides and Polypeptides 1. Vectors for Expression of ACSs
In some embodiments ofthe present invention, the ACS nucleic acids are used to construct vectors for the expression of ACS polypeptides. Accordingly, the nucleic acids of the present invention may be employed for producing polypeptides by recombinant techniques. Thus, for example, the nucleic acid may be included in any one of a variety of expression vectors for expressing a polypeptide.
In some embodiments ofthe present invention, vectors are provided for the transfection of plant hosts to create transgenic plants. In general, these vectors comprise an ACS nucleic acid (for example, SEQ ID NOs: 1-11) operably linked to a promoter and other regulatory sequences (for example, enhancers, polyadenylation signals, etc.) required for expression in a plant. The ACS nucleic acid can be oriented to produce sense or antisense transcripts, depending on the desired use. In some embodiments, the promoter is a constitutive promoter (for example, superpromoter or SD promoter). In other embodiments, the promoter is a seed specific promoter (for example, phaseolin promoter [See for example, U.S. Pat. No. 5,589,616], napin promoter [See for example, U.S. Pat. No. 5,608,152], or acyl-CoA carrier protein promoter [See for example, 5,767,363]).
In some preferred embodiments, the vector is adapted for use in an Agrobacterium mediated transfection process (See for example, U.S. Pat.
Nos.5,981,839; 6,051,757; 5,981,840; 5,824,877; and 4,940,838). Construction of recombinant Ti and Ri plasmids in general follows methods typically used with the more common bacterial vectors, such as pBR322. Additional use can be made of accessory genetic elements sometimes found with the native plasmids and sometimes constructed from foreign sequences. These may include but are not limited to structural genes for antibiotic resistance as selection genes.
There are two systems of recombinant Ti and Ri plasmid vector systems now in use. The first system is called the "cointegrate" system. In this system, the shuttle vector containing the gene of interest is inserted by genetic recombination into a non- oncogenic Ti plasmid that contains both the cis-acting and trans-acting elements required for plant transformation as, for example, in the pMLJl shuttle vector and the non-oncogenic Ti plasmid pGV3850. The second system is called the "binary" system in which two plasmids are used; the gene of interest is inserted into a shuttle vector
containing the cis-acting elements required for plant transformation. The other necessary functions are provided in trans by the non-oncogenic Ti plasmid as exemplified by the pBIN19 shuttle vector and the non-oncogenic Ti plasmid PAL4404. Some of these vectors are commercially available. It may be desirable to target the nucleic acid sequence of interest to a particular locus on the plant genome. Site-directed integration ofthe nucleic acid sequence of interest into the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium-deήveά sequences. Generally, plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA (T-DNA) sequences, as previously described (U.S. Pat. No. 5,501,967). One of skill in the art knows that homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part ofthe targeted plant gene, whether belonging to the regulatory elements ofthe gene, or the coding regions ofthe gene. Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid sequence of regions flanking the site to be targeted is known.
The nucleic acids ofthe present invention may also be utilized to construct vectors derived from plant (+) RNA viruses (for example, brome mosaic virus, tobacco mosaic virus, alfalfa mosaic virus, cucumber mosaic virus, tomato mosaic virus, and combinations and hybrids thereof). Generally, the inserted ACS polynucleotide can be expressed from these vectors as a fusion protein (for example, coat protein fusion protein) or from its own subgenomic promoter or other promoter. Methods for the construction and use of such viruses are described in U.S. Pat. Nos. 5,846,795; 5,500,360; 5,173,410; 5,965,794; 5,977,438; and 5,866,785.
Alternatively, vectors can be constructed for expression in hosts other plants (for example, prokaryotic cells such as E. coli, yeast cells, C. elegans, and mammalian cell culture cells). In some embodiments ofthe present invention, vectors include, but are not limited to, chromosomal, nonchromosomal and synthetic DNA sequences (for example, derivatives of SV40, bacterial plasmids, phage DNA; baculo virus, yeast plasmids, vectors derived from combinations of plasmids and phage DNA, and viral DNA such as vaccinia, adenoviras, fowl pox virus, and pseudorabies). Large numbers of suitable vectors that are replicable and viable in the host are known to those of skill in
the art, and are commercially available. Any other plasmid or vector may be used as long as they are replicable and viable in the host.
In some preferred embodiments ofthe present invention, bacterial expression vectors comprise an origin of replication, a suitable promoter and optionally an enhancer, and also any necessary ribosome binding sites, polyadenylation sites, transcriptional termination sequences, and 5' flanking nontranscribed sequences. Promoters useful in the present invention include, but are not limited to, retroviral LTRs, S V40 promoter, CMV promoter, RSV promoter, E. coli lac or trp promoters, phage lambda P and PR promoters, T3, SP6 and T7 promoters. In other embodiments ofthe present invention, recombinant expression vectors include origins of replication and selectable markers, (e.g., tetracycline or ampicillin resistance in E. coli, or neomycin phosphotransferase gene for selection in eukaryotie cells).
2. Expression of ACSs in Transgenic Plants Vectors described above can be utilized to express the ACSs of the present invention in transgenic plants. A variety of methods are known for producing transgenic plants.
In some embodiments, Agrobacterium mediated transfection is utilized to create transgenic plants. Since most dicotyledonous plant are natural hosts for Agrobacterium, almost every dicotyledonous plant may be transformed by Agrobacterium in vitro. Although monocotyledonous plants, and in particular, cereals and grasses, are not natural hosts to Agrobacterium, work to transform them using Agrobacterium has also been carried out (Hooykas-Van Slogteren et al. (1984) Nature 311 :763-764). Plant genera that may be transformed by Agrobacterium include Arabidopsis, Chrysanthemum, Dianthus, Gerbera, Euphorbia, Pelaronium, Ipomoea, Passiflora,
Cyclamen, Malus, Prunus, Rosa, Rubus, Populus, Santalum, Allium, Lilium, Narcissus, Ananas, Arachis, Phaseolus and Pisum.
For transformation with Agrobacterium, disarmed Agrobacterium cells are transformed with recombinant Ti plasmids of Agrobacterium tumefaciens or Ri plasmids of Agrobacterium rhizogenes (such as those described in U.S. Patent No.
4,940,838). The nucleic acid sequence of interest is then stably integrated into the plant genome by infection with the transformed Agrobacterium strain. For example, heterologous nucleic acid sequences have been introduced into plant tissues using the
natural DNA transfer system of Agrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et al. (1987) Ann. Rev. Plant Phys. 38:467- 486).
There are three common methods to transform plant cells with Agrobacterium. The first method is co-cultivation of Agrobacterium with cultured isolated protoplasts. This method requires an established culture system that allows culturing protoplasts and plant regeneration from cultured protoplasts. The second method is transformation of cells or tissues with Agrobacterium. This method requires (a) that the plant cells or tissues can be transformed by Agrobacterium and (b) that the transformed cells or tissues can be induced to regenerate into whole plants. The third method is transformation of seeds, apices or meristems with Agrobacterium. This method requires micropropagation.
One of skill in the art knows that the efficiency of transformation by Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency wi Agrobacterium tumefaciens (Shahla et al., (1987) Plant Molec. Biol. 8:291-298). Alternatively, transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. (See e.g., Bidney et al, (1992) Plant Molec. Biol. 18:301-313).
In still further embodiments, the plant cells are transfected with vectors via particle bombardment (in other words, with a gene gun). Particle mediated gene transfer methods are known in the art, are commercially available, and include, but are not limited to, the gas driven gene delivery instrument descried in McCabe, U.S. Pat. No. 5,584,807. This method involves coating the nucleic acid sequence of interest onto heavy metal particles, and accelerating the coated particles under the pressure of compressed gas for delivery to the target tissue.
Other particle bombardment methods are also available for the introduction of heterologous nucleic acid sequences into plant cells. Generally, these methods involve depositing the nucleic aqid sequence of interest upon the surface of small, dense particles of a material such as gold, platinum, or tungsten. The coated particles are themselves then coated onto either a rigid surface, such as a metal plate, or onto a carrier
sheet made of a fragile material such as mylar. The coated sheet is then accelerated toward the target biological tissue. The use ofthe flat sheet generates a uniform spread of accelerated particles which maximizes the number of cells receiving particles under uniform conditions, resulting in the introduction ofthe nucleic acid sample into the target tissue.
Plants, plant cells and tissues transformed with a heterologous nucleic acid sequence of interest are readily detected using methods known in the art including, but not limited to, restriction mapping ofthe genomic DNA, PCR-analysis, DNA-DNA hybridization, DNA-RNA hybridization, DNA sequence analysis and the like. Additionally, selection of transformed plant cells may be accomplished using a selection marker gene. It is preferred, though not necessary, that a selection marker gene be used to select transformed plant cells. A selection marker gene may confer positive or negative selection.
A positive selection marker gene may be used in constructs for random integration and site-directed integration. Positive selection marker genes include antibiotic resistance genes, and herbicide resistance genes and the like. In one embodiment, the positive selection marker gene is the NPTII gene which confers resistance to geneticin (G418) or kanamycin. In another embodiment the positive selection marker gene is the HPT gene which confers resistance to hygromycin. The choice ofthe positive selection marker gene is not critical to the invention as long as it encodes a functional polypeptide product. Positive selection genes known in the art include, but are not limited to, the ALS gene (chlorsulphuron resistance), and the DHFR-gene (methothrexate resistance).
A negative selection marker gene may also be included in the constructs. The use of one or more negative selection marker genes in combination with a positive selection marker gene is preferred in constructs used for homologous recombination. Negative selection marker genes are generally placed outside the regions involved in the homologous recombination event. The negative selection marker gene serves to provide a disadvantage (preferably lethality) to cells that have integrated these genes into their genome in an expressible manner. Cells in which the targeting vectors for homologous recombination are randomly integrated in the genome will be harmed or killed due to the presence ofthe negative selection marker gene. Where a positive selection marker
gene is included in the construct, only those cells having the positive selection marker gene integrated in their genome will survive.
The choice ofthe negative selection marker gene is not critical to the invention as long as it encodes a functional polypeptide in the transformed plant cell. The negative selection gene may for instance be chosen from the aux-2 gene from the Ti- plasmid of Agrobacterium, the tfc-gene from SV40, cytochrome P450 from Streptomyces griseolus, the Adh-gene from Maize or Arabidopsis, etc. Any gene encoding an enzyme capable of converting a substance which is otherwise harmless to plant cells into a substance which is harmful to plant cells may be used. It is contemplated that the ACS polynucleotides ofthe present invention may be utilized to either increase or decrease the level of ACS mRNA and/or protein in transfected cells as compared to the levels in wild-type cells. Accordingly, in some embodiments, expression in plants by the methods described above leads to the over- expression of ACS in transgenic plants, plant tissues, or plant cells. The present invention is not limited to any particular mechanism. Indeed, an understanding of a mechanism is not required to practice the present invention. However, it is contemplated that over-expression ofthe ACS polynucleotides ofthe present invention will overcome limitations in the accumulation of fatty acids in oilseeds.
In other embodiments ofthe present invention, the ACS polynucleotides are utilized to decrease the level of ACS protein or mRNA in transgenic plants, plant tissues, or plant cells as compared to wild-type plants, plant tissues, or plant cells. One method of reducing ACS expression utilizes expression of antisense transcripts. Antisense RNA has been used to inhibit plant target genes in a tissue-specific manner (for example, van der Krol et al (1988) Biotechniques 6:958-976). Antisense inhibition has been shown using the entire cDNA sequence as well as a partial cDNA sequence
(for example, Sheehy et al. (1988) Proc. Natl. Acad. Sci. USA 85:8805-8809; Cannon et al. (1990) Plant Mol. Biol. 15:39-47). There is also evidence that 3' non-coding sequence fragment and 5' coding sequence fragments, containing as few as 41 base-pairs of a 1.87 kb cDNA, can play important roles in antisense inhibition (Ch'ng et al. (1989)Proc. Natl. Acad. Sci. USA 86:10006-10010).
Accordingly, in some embodiments, the ACS nucleic acids ofthe present invention (for example, SEQ ID NOs: 1-11, and fragments and variants thereof) are oriented in a vector and expressed so as to produce antisense transcripts. To accomplish
this, a nucleic acid segment from the desired gene is cloned and operably linked to a promoter such that the antisense strand of RNA will be transcribed. The expression cassette is then transformed into plants and the antisense strand of RNA is produced. The nucleic acid segment to be introduced generally will be substantially identical to at least a portion ofthe endogenous gene or genes to be repressed. The sequence, however, need not be perfectly identical to inhibit expression. The vectors ofthe present invention can be designed such that the inhibitory effect applies to other proteins within a family of genes exhibiting homology or substantial homology to the target gene. Furthermore, for antisense suppression, the introduced sequence also need not be full length relative to either the primary transcription product or fully processed mRNA. Generally, higher homology can be used to compensate for the use of a shorter sequence. Furthermore, the introduced sequence need not have the same intron or exon pattern, and homology of non-coding segments maybe equally effective. Normally, a sequence of between about 30 or 40 nucleotides and about full length nucleotides should be used, though a sequence of at least about 100 nucleotides is preferred, a sequence of at least about 200 nucleotides is more preferred, and a sequence of at least about 500 nucleotides is especially preferred.
Catalytic RNA molecules or ribozymes can also be used to inhibit expression of the target gene or genes. It is possible to design ribozymes that specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA. In carrying out this cleavage, the ribozyme is not itself altered, and is thus capable of recycling and cleaving other molecules, making it a true enzyme. The inclusion of ribozyme sequences within antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the activity ofthe constructs.
A number of classes of ribozymes have been identified. One class of ribozymes is derived from a number of small circular RNAs which are capable of self-cleavage and replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper virus (satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the satellite RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco mottle virus, Solanum nodiflorum mottle virus and subterranean clover mottle virus.
The design and use of target RNA-specific ribozymes is described in Haseloff, et al., Nature 334:585-591 (1988).
Another method of reducing ACS expression utilizes the phenomenon of cosuppression or gene silencing (See for example, U.S. Pat. No. 6,063,947). The phenomenon of cosuppression has also been used to inhibit plant target genes in a tissue-specific manner. Cosuppression of an endogenous gene using a full-length cDNA sequence as well as a partial cDNA sequence (730 bp of a 1770 bp cDNA) are known (for example, Napoli et al. (1990) Plant Cell 2:279-289 ; van der Krol et al. (1990) Plant Cell 2:291-299; Smith et al, (1990) Mol. Gen. Genetics 224:477-481). Accordingly, in some embodiments the Arabidopsis ACS nucleic acids (for example, SEQ ID NOs: 1- 10, and fragments and variants thereof) are expressed in another species of plant to effect cosuppression of a homologous gene.
Generally, where inhibition of expression is desired, some transcription ofthe introduced sequence occurs. The effect may occur where the introduced sequence contains no coding sequence per se, but only intron or untranslated sequences homologous to sequences present in the primary transcript ofthe endogenous sequence. The introduced sequence generally will be substantially identical to the endogenous sequence intended to be repressed. This minimal identity will typically be greater than about 65 percent, but a higher identity might exert a more effective repression of expression ofthe endogenous sequences. Substantially greater identity of more than about 80 percent is preferred, though about 95 percent to absolute identity would be most preferred. As with antisense regulation, the effect should apply to any other proteins within a similar family of genes exhibiting homology or substantial homology. For cosuppression, the introduced sequence in the expression cassette, needing less than absolute identity, also need not be full length, relative to either the primary transcription product or fully processed mRNA. This may be preferred to avoid concurrent production of some plants which are over-expressers. A higher identity in a shorter than frill length sequence compensates for a longer, less identical sequence. Furthermore, the introduced sequence need not have the same intron or exon pattern, and identity of non-coding segments will be equally effective. Normally, a sequence of the size ranges noted above for antisense regulation is used.
3. Other Host Cells and Systems for Production of ACSs
The present invention also contemplates that the vectors described above can be utilized to express plant ACS genes and variants in prokaryotic and eukaryotie cells. In some embodiments ofthe present invention, the host cell can be a prokaryotic cell (for example, a bacterial cell). Specific examples of host cells include, but are not limited to, E. coli, Salmonella typhimurium, Bacillus subtilis, and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus. The constructs in host cells can be used in a conventional manner to produce the gene product encoded by the recombinant sequence. In some embodiments, introduction ofthe construct into the host cell can be accomplished by any suitable method known in the art (e.g., calcium phosphate transfection, DΕAΕ-Dextran mediated transfection, or electroporation (for example, Davis et al (1986) Basic Methods in Molecular Biology). Alternatively, in some embodiments ofthe present invention, the polypeptides ofthe invention can be synthetically produced by conventional peptide synthesizers. In some embodiments ofthe present invention, following transformation of a suitable host strain and growth ofthe host strain to an appropriate cell density, the selected promoter is induced by appropriate means (e.g., temperature shift or chemical induction), and the host cells are cultured for an additional period. In other embodiments ofthe present invention, the host cells are harvested (for example, by centrifugation), disrupted by physical or chemical means, and the resulting crude extract retained for further purification. In still other embodiments ofthe present invention, microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. It is not necessary that a host organism be used for the expression ofthe nucleic acid constructs ofthe invention. For example, expression ofthe protein encoded by a nucleic acid construct may be achieved through the use of a cell-free in vitro transcription/translation system. An example of such a cell-free system is the commercially available TnT™ Coupled Reticulocyte Lysate System (Promega; this cell- free system is described in U.S. Patent No. 5,324,637).
4. Purification of ACSs
The present invention also provides methods for recovering and purifying ACSs from native and recombinant cell cultures including, but not limited to, ammonium sulfate precipitation, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography and lectin chromatography. In other embodiments of the present invention, protein refolding steps can be used as necessary, in completing configuration ofthe mature protein. In still other embodiments ofthe present invention, high performance liquid chromatography (HPLC) can be employed as one or more purification steps.
In other embodiments ofthe present invention, the nucleic acid construct containing DNA encoding the wild-type or a variant ACS further comprises the addition of exogenous sequences (in other words, sequences not encoded by the ACS coding region) to either the 5' or 3' end ofthe ACS coding region to allow for ease in purification of the resulting polymerase protein (the resulting protein containing such an affinity tag is termed a "fusion protein"). Several commercially available expression vectors are available for attaching affinity tags (for example, an exogenous sequence) to either the amino or carboxy-termini of a coding region. In general these affinity tags are short stretches of amino acids that do not alter the characteristics ofthe protein to be expressed (in other words, no change to enzymatic activities results).
For example, the pET expression system (Novagen) utilizes a vector containing the T7 promoter operably linked to a fusion protein with a short stretch of histidine residues at either end ofthe protein and a host cell that can be induced to express the T7 DNA polymerase (in other words, a DE3 host strain). The production of fusion proteins containing a histidine tract is not limited to the use of a particular expression vector and host strain. Several commercially available expression vectors and host strains can be used to express protein sequences as a fusion protein containing a histidine tract (for example, the pQE series [pQE-8, 12, 16, 17, 18, 30, 31, 32, 40, 41, 42, 50, 51, 52, 60 and 70] of expression vectors (Qiagen) used with host strains M15[pREP4] [Qiagen] and SG13009[pREP4] [Qiagen]) can be used to express fusion proteins containing six histidine residues at the amino-terminus ofthe fusion protein). Additional expression systems which utilize other affinity tags are known to the art.
Once a suitable nucleic acid construct has been made, the ACS may be produced from the construct. The examples below and standard molecular biological teachings known in the art enable one to manipulate the construct by a variety of suitable methods. Once the desired ACS has been expressed, the enzyme may be tested for activity as described Examples 4 and 5.
5. Deletion Mutants of ACSs
The present invention further provides fragments of ACSs. In some embodiments ofthe present invention, when expression of a portion of an ACS is desired, it may be necessary to add a start codon (ATG) to the oligonucleotide fragment containing the desired sequence to be expressed. It is well known in the art that a methionine at the N-terminal position can be enzymatically cleaved by the use ofthe enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli (Ben- Bassat et al. (1987) J. Bacteriol. 169:751-757) and S. typhimurium, and its in vitro activity has been demonstrated on recombinant proteins (Miller et al. (1990)PNAS 84:2718-1722). Therefore, removal of an N-terminal methionine, if desired, can be achieved either in vivo by expressing such recombinant polypeptides in a host producing MAP (for example, E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP. It is contemplated that deletion mutants of ACSs can be screened for activity as described above.
6. Use of ACS Nucleic Acids in Directed Evolution
It is contemplated that the ACS nucleic acids (for example, SEQ ID NOs: 1-11, and fragments and variants thereof) can be utilized as starting nucleic acids for directed evolution. These techniques can be utilized to develop ACS variants having desirable properties such as increased synthetic activity or altered affinity for a particular fatty acid substrate.
In some embodiments, artificial evolution is performed by random mutagenesis (for example, by utilizing error-prone PCR to introduce random mutations into a given coding sequence). This method requires that the frequency of mutation be finely tuned. As a general rule, beneficial mutations are rare, while deleterious mutations are common. This is because the combination of a deleterious mutation and a beneficial mutation often results in an inactive enzyme. The ideal number of base substitutions for
targeted gene is usually between 1.5 and 5 (Moore and Arnold (1996) Nat. Biotech., 14, 458-67; Leung et al. (1998)Technique, 1:11-15; Eckert and Kunkel (1991) PCR Methods Appl., 1 :17-24; Caldwell and Joyce (1992) PCR Methods Appl., 2:28-33; and Zhao and Arnold (1997) Nuc. Acids. Res., 25:1307-08). After mutagenesis, the resulting clones are selected for desirable activity (for example, screened for ACS activity as described above). Successive rounds of mutagenesis and selection are often necessary to develop enzymes with desirable properties. It should be noted that only the useful mutations are carried over to the next round of mutagenesis.
In other embodiments ofthe present invention, the polynucleotides ofthe present invention are used in gene shuffling or sexual PCR procedures (for example, Smith
(1994) Nature, 370:324-25; U.S. Pat. Nos. 5,837,458; 5,830,721; 5,811,238; 5,733,731). Gene shuffling involves random fragmentation of several mutant DNAs followed by their reassembly by PCR into full length molecules. Examples of various gene shuffling procedures include, but are not limited to, assembly following DNAse treatment, the staggered extension process (STEP), and random priming in vitro recombination. In the DNAse mediated method, DNA segments isolated from a pool of positive mutants are cleaved into random fragments with DNAsel and subjected to multiple rounds of PCR with no added primer. The lengths of random fragments approach that ofthe uncleaved segment as the PCR cycles proceed, resulting in mutations in present in different clones becoming mixed and accumulating in some ofthe resulting sequences. Multiple cycles of selection and shuffling have led to the functional enhancement of several enzymes (Stemmer (1994) Nature, 370:398-91; Stemmer, (1994) Proc. Natl. Acad. Sci. USA, 91, 10747-51; Crameri et al. (1996) Nat. Biotech., 14:315-19; Zhang et al. (1997) Proc. Natl. Acad. Sci. USA, 94:4504-09; and Crameri et al. (1997) Nat. Biotech., 15:436-38). Variants produced by directed evolution can be screened for ACS activity by the methods described in Examples 4 and 5.
In further embodiments ofthe present invention, other combinatorial mutagenesis approaches are applied. For example, the amino acid sequences for a population of ACS homologs or other related proteins can be aligned, preferably to promote the highest homology possible. Such a population of variants can include, for example, ACS homologs from one or more species, or ACS homologs from the same species but which differ due to mutation. Amino acids appearing at each position ofthe aligned sequences are selected to create a degenerate set of combinatorial sequences.
hi a preferred embodiment ofthe present invention, the combinatorial ACS library is produced by way of a degenerate library of genes encoding a library of polypeptides including at least a portion of potential ACS-protein sequences. For example, a mixture of synthetic oligonucleotides are enzymatically ligated into gene sequences such that the degenerate set of potential ACS sequences are expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (for example, for phage display) containing the set of ACS sequences therein.
There are many ways in which the library of potential ACS homologs can be generated from a degenerate oligonucleotide sequence. In some embodiments, chemical synthesis of a degenerate gene sequence is carried out in an automatic DNA synthesizer, and the synthetic genes are ligated into an appropriate gene for expression. The purpose of a degenerate set of genes is to provide, in one mixture, all ofthe sequences encoding the desired set of potential ACS sequences. The synthesis of degenerate oligonucleotides is well known in the art (for example, Narang, Tetrahedron 39:39, 1983; Itakura et al (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromol., Walton, ed., Elsevier, Amsterdam, pp 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; and Ike et al. (1983) Nucleic Acid Res. 11 :477). Such techniques have been employed in the directed evolution of other proteins (for example, Scott et al. (1980) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Pat. Nos. 5,223,409, 5,198,346, and 5,096,815).
A wide range of techniques are known in the art for screening gene products of combinatorial libraries generated by point mutations, and for screening cDNA libraries for gene products having a particular property of interest. Such techniques are generally adaptable for rapid screening of gene libraries generated by the combinatorial mutagenesis of ACS homologs. The most widely used techniques for screening large gene libraries typically comprise cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions such that detection of a desired activity facilitates relatively easy isolation ofthe vector encoding the gene whose product was detected. The illustrative assays described below are amenable to high
through-put analysis as necessary to screen large numbers of degenerate sequences created by combinatorial mutagenesis techniques.
In some embodiments ofthe present invention, the gene library is expressed as a fusion protein on the surface of a viral particle. For example, foreign peptide sequences can be expressed on the surface of infectious phage in the filamentous phage system, thereby conferring two significant benefits. First, since these phage can be applied to affinity matrices at very high concentrations, a large number of phage can be screened at one time. Second, since each infectious phage displays the combinatorial gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of viral replication. The group of almost identical E. coli filamentous phages M13, fd, and fl are most often used in phage display libraries, as either ofthe phage gill or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging ofthe viral particle (for example, WO 90/02909; WO 92/09690; Marks et al. (1992) J. Biol. Chem., 267:16007-16010; Griffths et al. (1993) EMBO J., 12:725-734; Clackson et al. (1991) Nature, 352:624- 628; and Barbas et al. (1992) PNAS 89:4457-4461).
In another embodiment ofthe present invention, the recombinant phage antibody system (e.g., RPAS, Pharmacia Catalog number 27-9400-01) is modified for use in expressing and screening ACS combinatorial libraries. The pCANTAB 5 phagemid of the RPAS kit contains the gene encoding the phage gill coat protein. In some embodiments ofthe present invention, the ACS combinatorial gene library is cloned into the phagemid adjacent to the gill signal sequence such that it will be expressed as a gill fusion protein. In other embodiments ofthe present invention, the phagemid is used to transform competent E. coli TGI cells after ligation. In still other embodiments ofthe present invention, transformed cells are subsequently infected with M13KO7 helper phage to rescue the phagemid and its candidate ACS gene insert. The resulting recombinant phage contain phagemid DNA encoding a specific candidate ACS-protein and display one or more copies ofthe corresponding fusion coat protein. In some embodiments ofthe present invention, the phage-displayed candidate proteins that are capable of, for example, binding a particular acyl-CoA, are selected or enriched by panning. The bound phage is then isolated, and if the recombinant phage express at least one copy ofthe wild type gill coat protein, they will retain their ability to infect E.
coli. Thus, successive rounds of reinfection of E. coli and panning greatly enriches for ACS homologs, which are then screened for further biological activities.
In light ofthe present disclosure, other forms of mutagenesis generally applicable will be apparent to those skilled in the art in addition to the aforementioned rational mutagenesis based on conserved versus non-conserved residues. For example, ACS homologs can be generated and screened using, for example, alanine scanning mutagenesis, linker scanning mutagenesis, or saturation mutagenesis.
7. Chemical Synthesis of ACS Polypeptides In an alternate embodiment ofthe invention, the coding sequence of an ACS is synthesized, whole or in part, using chemical methods well known in the art (e.g., Caruthers et al. (1980) Nuc. Acids Res. Symp. Ser., 7:215-233; Crea and Horn (1980) Nuc. Acids Res., 9:2331; Matteucci and Caruthers (1980) Tetrahedron Lett., 21:719; and Chow and Kempe (1981) Nuc. Acids Res., 9:2807-2817). In other embodiments of the present invention, the protein itself is produced using chemical methods to synthesize either a full-length ACS amino acid sequence or a portion thereof. For example, peptides can be synthesized by solid phase techniques, cleaved from the resin, and purified by preparative high performance liquid chromatography (for example, Creighton, Proteins Structures and Molecular Principles, W H Freeman and Co, New York N.Y., 1983). In other embodiments ofthe present invention, the composition of the synthetic peptides is confirmed by amino acid analysis or sequencing (for example, Creighton, supra).
Direct peptide synthesis can be performed using various solid-phase techniques (Roberge et al, Science 269:202-204, 1995) and automated synthesis may be achieved, for example, using ABI 431 A Peptide Synthesizer (Perkin Elmer) in accordance with the instructions provided by the manufacturer. Additionally, the amino acid sequence of an ACS, or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with other sequences to produce a variant polypeptide.
III. Identification of Other Acyl-CoA Synthetase Homologs
As described above, plant ACSs are members of a larger family of AMP-binding proteins (AMP-BPs). Therefore, methods for discriminating between AMP-BPs and true ACSs are desirable. Figure 1 provides an amino acid comparison ofthe ACSs of
the present invention (SEQ ID NOs: 12-22) and ten putative Arabidopsis AMP-binding proteins (SEQ ID NOs: 33-42). The AMP-BP sequences were determined by BLAST searches ofthe TAIR database (The Arabidopsis Information Resource; http://www.arabidopsis.org/blast/) with ACS sequences. Most ofthe AMP-BP sequences were identified as BAC hits. The presumed cDNA sequences for these were deduced by homology comparisons to the ACSs and other AMP-BPs using GCG (Genetic Computer Group, Madison, WI). The sequences were then aligned using Pileup (Genetic Computer Group, Madison, WI) and shaded using the Boxshade. server. The AMP-BP genes have also been isolated and sequenced, as described below (see Example 2).
This comparison led to the identification of at least nine conserved motifs in ACS, which are described in more detail below. Of these nine motifs, some are conserved between ACSs and AMBPS, while others are conserved only in ACSs; other motifs are conserved only in AMP-BPs, but these are not included in the nine motifs. The motifs are numbered from 1 to 9, in going from the amino to the carboxy terminal ofthe proteins. Where more than one amino acid occurs at a particular position in a motif, the most common amino acid is listed first, followed by less common amino acids, separated by a slash, which indicates that these amino acids occupy the same position in the motif. If more than four different amino acids occupy the same position, the position is indicated by an "X", with the amino acids which occur at that position listed at the end ofthe sequence.
Accordingly, in some embodiments, the present invention provides plant ACSs comprising at least one of ACS motifs 1-9, or nucleic acid sequences encoding such plant ACSs.
ACS motif 1 (Figure 45; SEQ ID NO:43, V-P/T-L/I-Y-D/A/S-T/S-L-G) is present in ACSs and absent in AMP-BPs. ACS motif 2 (Figure 46; SEQ ID NO:44, 1-M/C-Y/F/K-T- S-G-T/S-T/S-G-XrP-K-G-V, where X! is D, L, T, N, or E) is conserved in both ACSs and AMP-BPs. The motif found in both ACSs and AMP-BPs is well known (PROSITE PS00455 = [LIVMFY] - X2 - [STG] - [STAG] - G - [ST] - [STEI] - [SG] - X - [PASLIVM] - [KR]), is very highly conserved, and acts as the unifying feature ofthe AMP-binding protein (AMP-BP) superfamily (Babbitt PC et al. (1992) Biochemistry 31(24): 5594-604; Fulda M et al. (1994) Mol Gen Genet 242(3): 241-9) to which ACS belongs. The sequence shown, SEQ ID NO:44, is for ACSs alone, as the motif in ACSs differs slightly from that in AMP-BPs, particularly in amino positions 1, 2, 9, and 10. ACS motif 3 (Figure 47; SEQ ID NO:45, L-P-L/A-A-W-H) is present in ACSs and absent in AMP-BPs. ACS motif 4
(Figure 48; SEQ ID NO:46; L/Q-K-P-T/P) is present in ACSs and absent in AMP-BPs. ACS motif 5 (Figure 47; SEQ ID NO:49, S/G/V-G-A/G/S-A L/A-P-L/T/M) is present in ACSs and absent in AMP-BPs. ACS motif 6 (Figure 50; SEQ ID NO:48, G-Y-G-L-T-E- T/S/A) is present in both ACSs and AMP-BPs. Note that only G occupies the first position in ACSs, while several different amino acids occupy this position in AMP-BPs. ACS motif 7 (Figure 51; SEQ ID NO:49, P/S/A-PJK-G/A-E/I-I-C/K/V-V/I-R/G-G) is present in ACSs and is absent in AMP-BPs. ACS motif 8 (Figure 52; SEQ ID NO:50, 1-I-D-R-K-K) is present in ACSs, except AtACS4A and AtACS4B, and absent in AMP-BPs. The 25 amino acid consensus sequence shown at the top of Figure 52 is a consensus sequence derived from several genes (for example, from E. coli, yeast, and human) which are known to bind fatty acids; this 25 amino acid sequence is implicated in fatty acid binding in E. coli genes, based upon experiments in which mutagenesis of 15 ofthe 25 amino acids resulted in absent or different specificity fatty acid binding (Black, PN (1997) J Biol Chem 272: 4896-4903). ACS motif 9 (Figure 53; SEQ ID NO:51, L-L/V -P/A-T/A/S-F/L/M/Y-K-I/K/M/L-K/R- R) is present in ACSs and absent in AMP-BPs.
It is contemplated that the sequences described herein can be utilized to clone and characterize ACS homologs from other species of plants. Accordingly, in some embodiments, the ACS nucleic acids or fragments thereof are utilized to screen cDNA or genomic libraries prepared from the RNA or DNA of another plant species. In other embodiments, primers that are completely or partially complementary to portions of SEQ ID NOs: 1-11 are utilized to amplify ACS homologs from nucleic acid isolated from other plant species. For example, degenerate primers may be utilized to amplify ACS homologs for genomic DNA samples or cDNA samples from other species. Alternatively, RT-PCR may be utilized to directly amplify homologs from RNA isolated from other species.
It is also contemplated that the sequences described herein (for example, both nucleic acid and polypeptide sequences, SEQ ID NOs: 1-22), may be utilized to search computer databases for homologous sequences from other species. For example, BLAST searches (Altshul et al. (1997) Nucleic Acids Res. 25:3389-3402; http://www.ncbi.nlm.nih.gov/blast) may be utilized to search for nucleic acids and proteins having homology (for example, greater than 60 percent, 70 percent, 80 percent, or 90 percent) to SEQ ID NOs: 1-22.
In some embodiments, nucleic acids suspected of being ACS homologs are screened by comparing motifs. In some embodiments, the protein sequence can be analyzed for the presence or absence of one or more of ACS motifs 1-9 (SEQ ID NOs: 43-51, respectively). The presence or absence of these motifs indicates that the candidate ACS is a true ACS. In still further embodiments, the nucleic acids can be utilized in genetic screens for ACS activity. For example, the nucleic acids can be analyzed for complementation ofthe mutant S. cerevisiae strain YB525. In other embodiments, the nucleic acids can be expressed and analyzed for complementation or biochemical activity as described in Example 4 and 5. Within the ACS group, AtACS4A and AtACS4B are somewhat divergent from the other ACS genes. This conclusion is based upon the observation that in motifs 3, 4, 5, and7, the amino acids for AtACS4A and AtACS4B are likely to be different from those ofthe other ACSs, yet these different amino acids are generally identical to each other in AtACS4A and AtACS4B. This conclusion is also supported by the observation that AtACS4A and AtACS4B do not contain motif 8. Moreover, this conclusion is also supported by the inability to observe ACS enzyme activity, either by complementation or by an in vitro assay, with these two clones (see Examples 4 and 5). Yet these two genes are more closely related to the ACSs than to any ofthe other genes in the superfamily. It is possible that these genes encode ACSs that activate specialized substrates, or are inactive under the conditions used in these experiments due to special requirements, such as folding or multimer formation requirements, or the need for post-translational modifications not met by the cellular machinery of Saccharomyces cerevisiae. Alternatively, these genes may encode a different type of enzyme related to ACS. For example, in yet another possibility, it is contemplated that these two enzymes are acyl-ACP synthetases. This function can be examined by over-expressing the ACS4A and ACS4B in yeast, and then assaying yeast extract for acyl ACP synthase activity, in a manner similar to that described in Examples 4 and 5, in which ACP is used as a substrate instead of CoA.
IV. AMP Binding Proteins A construction of the phylogenetic relationship between all 44 members of the
Arabidopsis AMP-BP superfamily (Shockey, J et al, manuscript in preparation) revealed several interesting phenomenon. Only three genes (At3gl6170, At3g48990, and Atlg30520) align independently, while the other 41 members ofthe superfamily
separate into three main groups: The ACS subfamily; a subfamily containing the three known 4-coumarate-CoA ligases plus ten other related genes; and a subfamily of fourteen previously unknown genes.
The discovery ofthe third subfamily was unexpected. This subfamily as a whole was more closely related than the other two groups, containing at least 42 percent amino acid identity, while bearing weak and roughly equal homology (approximately 20-25 percent amino acid identity) to the ACS, acetyl-CoA synthetase, and 4-coumarate-CoA ligase genes. Searches of all public databases revealed that higher plants (including rice and Brassica sp.) are the only organisms that contain genes highly homologous to those of this third subfamily. This subfamily thus represents a unique class of enzymes that may play a specialized role in a plant-specific aspect of carboxylic acid activation. It is also possible that this subfamily represents a functionally equivalent but structurally unrelated counterpart to the ACS subfamily.
In order to characterize this subfamily of genes, full-length cDNAs for ten ofthe fourteen members of this subfamily were cloned into pYES2 and transformed into Saccharomyces cerevisiae YB525, as described in the following examples (see particularly Examples 1 and 4 and 5). These constructs were used in the complementation and in vitro enzyme activity analyses, exactly as described for the ACS genes in the following examples. In the complementation assays, the genes ofthe AtAMP-BP subfamily were unable to activate exogenous myristic acid, and all ten genes were therefore unable to complement the YB525 phenotype. In the in vitro enzyme assays, cell-free lysates prepared from these transformed yeast lines containing one of these ten genes were also inactive against oleic acid in the in vitro enzyme assays. These data do not rule out the possibility that the genes of this group are ACSs.
In fact, the phylogenetic analysis ofthe AMP-BP superfamily as a whole supports the hypothesis that these genes catalyze the coenzyme A-dependent activation of some type of carboxylic acid, given the fact that each ofthe other classes in the phylogenetic tree contain representative genes that do exactly that. It is contemplated that AMP-BPs are very long chain ACSs. Medium chain- or very long chain-CoA synthetases have been characterized in other organisms ((Min and Benzer, 1999)). While medium-chain fatty acids are very rare in Arabidopsis ((Ohlrogge and Browse, 1995)), a critical role for very long chain acyl groups is obvious. Very long chain fatty acids (>C24) are the
substrates for the biosynthesis ofthe complex mixture of esters, alcohols, ketones, aldehydes, and alkanes that make the cuticular wax layer present on the surface of plants. Cuticular waxes also play essential roles in plant fertility and insect defense ((Preuss, D et al. (1993) Genes Dev 7(6): 974-85). This function can be examined by over-expressing the AMP-BPs in yeast, and then assaying yeast extract for very long chain ACS activity, in a manner similar to that described in Examples 4 and 5.
EXAMPLES
The following examples serve to illustrate certain preferred embodiments and aspects ofthe present invention and are not to be construed as limiting the scope thereof. In the disclosure which follows, the following abbreviations apply: M (molar); mM (millimolar); μM (micromolar); nM (nanomolar); mol (moles); mmol (millimoles); μmol (micromoles); nmol (nanomoles); gm (grams); mg (milligrams); μg (micrograms); pg (picograms); L (liters); ml (milliliters); μl (microliters); cm (centimeters); mm (millimeters); μm (micrometers); nm (nanometers); °C (degrees Centigrade); ATP (adenosine 5'-monophosphate); BSA (bovine serum albumin); cDNA (copy or complimentary DNA); CS (calf serum); DNA (deoxyribonucleic acid); ssDNA (single stranded DNA); dsDNA (double stranded DNA); dNTP (deoxyribonucleotide triphosphate); LH (luteinizing hormone); NIH (National Institutes of Health, Besthesda, MD); RNA (ribonucleic acid); PBS (phosphate buffered saline); g (gravity); OD (optical density); HEPES (N-[2-Hydroxyethyl]piperazine-N-[2-ethanesulfonic acid]); SDS (sodium dodecylsulfate); Tris-HCl (tris[Hydroxymethyl]aminomethane-hydrochloride); rpm (revolutions per minute); EDTA (ethylenediaminetetracetic acid); bla (β-lactamase or ampicillin-resistance gene); ORI (plasmid origin of replication); and Sigma (Sigma Chemical Company, St. Louis, MO); GC (gas chromatography); fames (fatty acid methyl esters).
Example 1
This Example describes the procedures utilized to identify and clone the ACS genes ofthe present invention.
Sequencing and Homology Analysis
All DNA sequencing was conducted in the Macromolecular Analysis Laboratory at Washington State University using automated sequencing equipment (Applied Biosystems, Foster City, CA). Sequences were assembled and modified using the GCG suite of programs (Wisconsin Package Version 10.0, Genetics Computer Group, Madison, WI). Database searches were conducted against the AtDB Illustra database (genome-www.standford.edu/Arabidopsis), its successor at The Arabidopsis Information Resource (TAIR) (www.arabidopsis.org), and the Munich Information Center for Protein Sequences Arabidopsis thaliana database (MATDB) (mips.gsf.de/proj/thal/db/search search_frame.html).
Identification and cloning of genes
Full-length ACS clones were isolated by first screening the EST databases ((Newman et al. (1994) Plant Physiol 106(4): 1241-55) to identify partial cDNA clones with homology to known ACSs. The inserts from these clones were used to screen for full length clones present in any of various cDNA libraries available from the
Arabidopsis Biological Resource Center ((Weigel, D et al. (1992) Cell 69(5): 843-59; and Kieber, JJ et al. (1993) Cell 72(3): 427-41). When full-length clones could not be identified using this approach, the missing portions ofthe genes were identified by isolation of genomic clones from an Arabidopsis thaliana genomic DNA library ((Voytas, DF et al. (990) Genetics 126(3): 713-21).
Once the initiator codon of each gene had been determined, a new gene-specific oligonucleotide primer pair was used to amplify RT-PCR products spanning the full- length open reading frame. Briefly, 2 ug of total RNA from mature seeds, tissue- cultore-grown roots, stems, young rosette leaves, flowers, and siliques were used as template for a scaled-up first-strand cDNA synthesis, using an equimolar mixture of capped oligo-dT primers (T 0C, T20A, and T20G) and Superscript II reverse transcriptase as described in the Hieroglyph differential display manual (Genomyx Corp.). Aliquots of these reactions were used as template in amplifications using Pfu Turbo polymerase
(Stratagene, La Jolla, CA), or with ExTaq polymerase (PanVera, Madison, WI), as described in the respective manufacturer's protocol. The Pfu Turbo-generated products were cloned into the pCR-ScriptCam vector supplied in the blunt cloning kit (Stratagene). The ExTaq-generated products were cloned into the pCR2.1 vector supplied in the TOPO-TA cloning kit (Invitrogen). These clones were sequenced to verify the fidelity of amplification.
Cloning of Arabidopsis ACS genes in E. coli and Saccharomyces cerevisiae
For expression in yeast, one of two methods was used to reamplify the open reading frames ofthe Arabidopsis cDNAs for re-cloning. Some genes were amplified from the original plasmids using new oligonucleotide primer pairs that introduced restriction sites compatible for insertion into the multiple cloning site ofthe Saccharomyces cerevisiae inducible expression vector pYES2 (Invitrogen). The PCR products were restricted with appropriate enzymes then gel-purified. Concentrated solutions ofthe insert DNAs were ligated to appropriately digested pYES2 DNA and transformed into competent E. coli. Plasmid DNA from the resulting bacterial colonies was resequenced to ensure accurate reamplification then transformed into S. cerevisiae YB525 cells (provided by Prof. Jl Gordon, Washington University, St. Louis, MO) ((Knoll, LJ et al. (1995) Genetics 126(3): 713-21) that had been made competent for chemical transformation using the S. c. EasyComp kit (Invitrogen). Alternatively, PCR products for some ofthe ACS cDNAs were generated using the sticky end PCR technique ((Zeng, G (1998) Biotechniques 25(2): 206- 8). These products were ligated, transformed, and sequenced as described above.
Acyl CoA Synthetases
AtACSIA
The cDNA clone corresponding to 240K22T7 was ordered from ABRC and unsuccessfully used to screen the Lambda PRL2 cDNA library. The remaining sequence was determined by isolation of a genomic clone from the genomic library using 240K22T7 insert as probe. The full-length cDNA was amplified using the new sequence information and cloned into a pPCR-Script Amp vector (Stratagene) and sequenced. Due to problems encountered when recloning this construct, the cDNA was reamplified from pooled RT reactions. The primers used for this amplification added
Kpnl and Sphl sites to the 5' and 3' ends ofthe gene, respectively. The resulting PCR product was then cut with these two enzymes and cloned into the same sites in the yeast expression vector pYES2 (Invitrogen) and sequenced.
AtACSIB
AtACS9 was found by searching the AGI database for sequences homologous to AtACS 7 and 8. Primers were designed based on the putative start and stop codons. The primers successfully amplified an appropriately sized product from genomic DNA, but to date attempts to RT-PCR a cDNA clone have been unsuccessful. The genomic product itself has not yet been cloned.
AtACSIC
The cDNA clone corresponding to 119E14T7 and unsuccessfully used to screen the Lambda PRL2 cDNA library. The remaining sequence was determined by isolation of a genomic clone from a genomic library using the 119E14T7 insert as probe. The sequence determined from the genomic clone was used to design primers for amplification ofthe full-length cDNA from DNA prepared from the cDNA libraries. This cDNA was cloned into pYES2 in a fashion similar to that described for AtACS7.
AtACS2
The cDNA clone corresponding to EST 229E14T7 was ordered from ABRC. The insert DNA was excised and used as probe for screening the Lambda PRL2 cDNA library. A clone was isolated with an approximately 2 kb insert and excised from the plasmid DNA. Sequencing revealed that the 5' end ofthe cDNA was missing based on homology to Brassica sequences. Five prime RACE amplifications were performed with total phage DNA isolated from the cDNA library. This led to the cloning and sequencing ofthe 5' sequence.
AtACS3A The cDNA clone corresponding to EST 205M6T7 from ABRC and used to isolate an apparently full length clone from the Lambda PRL2 cDNA library.
AtACS3B
The cDNA clone corresponding to EST 123N12T7 was ordered from ABRC and unsuccessfully used to screen the Lambda PRL2 cDNA library. The remaining sequence was determined by isolation of a genomic clone from the genomic library using the 123N12T7 insert as probe. The full-length cDNA was amplified using the new sequence information, cloned into the pPCR-Script Cam vector (Stratagene), and sequenced.
AtACS4A This gene, originally named AMP-BP3 and later renamed AtACS4A, was identified from the Arabidopsis databases using the sequence ofthe Brassica AMP-BP clone pMF28P (Genbank Accession # Z72151). The presumed start codon and stop codon were identified by homology. The full-length cDNA was amplified by RT-PCR using the primers AMP- BPS 5 S acICut (5 '-TGCATGGAGCTCATGGCTTCGACTTCTTCTTTG GGAC-3') (SEQ ID NO: 73) and AMP-BP33XhoICut (5'-
ACGATCCTCGAGTTAACTGTAGAGTTGATCAATCTC-3') (SEQ ID NO: 74). The resulting PCR product was cut with Sad and Xhol and ligated into the same sites in the yeast expression vector pYES2 (Invitrogen) and sequenced.
AtACS4B
The presence of this gene was found in the Arabidopsis database by homology to AtACS4A. The start and stop codons were deduced and primers designed according to them. The primers 4B-KpnI (5'-CGAATGGTACCAATGGCTTCAACGTCTCTCG GAGCTTCG-3') (SEQ ID NO: 75) and 4B-3SphI (5'- ATACTGCATGCCTACTTGTAGAGTCTTTCTATTTCA-3') (SEQ ID NO: 76) were used to amplify the full-length cDNA by RT-PCR. The resulting PCR product was cloned directly into the blunt-end vector pCRScript-Cam (Stratagene) and sequenced. The insert was cut using Kpnl and Sphl. Unfortunately, this cut the gene into two pieces. The 5' Kpn- Sph fragment was cloned into pYES2 first. The resulting construct was cut with Sphl and the 3' Sph-Sph fragment of AtACS4B was ligated into it.
AtACS5
The cDNA clone corresponding to EST GbGel 15a was ordered from ABRC. The insert DNA was excised and used as probe for screening the Lambda PRL2 cDNA library. A clone was isolated and again found to be missing sequence from the 5' end of the OR, which was determined by 5' RACE. The full-length cDNA was cloned into pPCR-Script Cam vector (Stratagene) and sequenced.
AtACS6A
The cDNA clone corresponding to EST G2B10T7 from ABRC and used to isolate an apparently full length clone from the Lambda PRL2 cDNA library.
AtACS6B
The cDNA clone corresponding to EST 203 Jl 1T7 was ordered from ABRC. The insert DNA was excised and used as probe for screening the Lambda PRL2 cDNA library. A almost full-length clone was isolated. Sequence missing from the 5' end of the open reading frame was determined by isolating a genomic clone from a genomic DNA library (ABRC) using the 203J11T7 insert as a probe. The full-length cDNA open reading frame was amplified with new primers designed from sequence from the 3' end ofthe partial cDNA clone and the 5' sequence of genomic clone. The cDNA was cloned into pPCR-Script Cam vector (Stratagene) and sequenced.
Example 2 This Example describes the cloning often AMP-BPs. These ten AMP-BPs were selected from a total of fourteen members of AMP-BPs discovered through the grouping ofthe original 44 genes into subfamilies as determined by phylogenetic relationships among the 44 genes (Shockey, J et al. (2001), manuscript in preparation), as described above. The methods of sequencing and homology analysis, identification and cloning of genes, and cloning of Arabidopsis genes in E. coli and Saccharomyces cerevisiae are described in Example 1, with additional details provided below. Total RNA was isolated from Arabidopsis dry seeds, roots, old stems, young stems, young leaves, old leaves, young stems, old stems, flowers, new siliques, and old siliques. First strand cDNA was prepared from each of these RNA preps with Superscript II reverse transcriptase (Gibco-BRL) as described in the Hieroglyph mRNA
Profile Kit (Genomyx). Using gene specific primers designed from the expected start codon and stop codon of each gene (Example 3), the open reading frame for each gene was amplified from a pool of all ofthe RT reaction.
The PCR reactions were carried out on an MJ Research PTC 100 thermal cycler. The polymerase was ExTAQ (Panvera Corp.). The reactions (50 μl) contained 5 μl of the lOXTaq buffer, 4μl ofthe 10 mM dNTP mix (Panvera) 5 μl each of 5 μM stocks of the 5' and 3' primers and 2 μl ofthe pooled RT reactions. The conditions were: 95°C for 3 minutes, followed by 30 cycles of 95°C for 20 sec, 58°C for 30 sec, 72°C for 1 minute. A final 72°C incubation of 2 minutes was followed by an indefinite 4°C hold until samples were removed.
A small amount of each reaction was analyzed by agarose gel electrophoresis to ascertain successful amplification. The remainder of each successful amplification was electrophoresed and the band cut out followed by purification ofthe DNA from the gel slice using Qiagen gel extraction columns. A 4 μl aliquot of each DNA was ligated to TOPO-activated pCR2.1 vector (Invitrogen), using their standard conditions and transformed into TOPI OF' competent cells supplied with the kit. Positive transformants were selected by growth on agar plates containing either 100 (g/ml carbenicillin or 50 μg/ml kanamycin plus X-GAL and IPTG for blue/white screening. Colonies containing plasmids with AMP-BP inserts were identified by colony PCR screening several white colonies, using the same PCR conditions as described above. Representative positive colonies for each gene were grown in 50 ml of liquid L-broth plus appropriate antibiotic overnight at 37°C, followed by isolation of plasmid DNA using Promega's Wizard MidiPrep kit.
Plasmid DNA was quantified spectrophotometrically and sequenced with several vector- and gene-specific primers.
AtAMP-BPl
The full length gene was isolated from 2-3 Kb size-selected cDNA library
(Kieber et al. (1993) Cell 72(3): 427-441) obtained from the Arabidopsis Biological Resource Center (ABRC) at Ohio State University. The insert from the partial cDNA clone 99N9T7 (Genbank Accession # T22607) was used as the probe. After sequencing, the full-length open reading frame was amplified from this plasmid with
Pfu Turbo Polymerase (Stratagene) with primers containing restriction sites compatible
for cloning into the yeast expression vector pYES2 (Invitrogen). The product was cut out and ligated into pYES2 using standard procedures.
AtAMP-BP3 The cDNA clone corresponding to EST FAFM13 was ordered from the
Arabidopsis Biological Resource Center (ABRC, Ohio State University). The insert DNA was excised and used as probe for screening a Lambda PRL2 cDNA library (also obtained from the ARBC). A clone was identified and isolated. The insert DNA from the lambda phage clone was excised by in vivo excision as described in library instructions resulting in the gene fused in pBlueScript SK+.
All other AMP-BPs
All other AMP-BP genes were cloned by identification in the databases by homology to cloned Arabidopsis ACS genes. The start codon and stop codon were identified and primers designed to these spots. These primers may or may not have contained restriction sites to facilitate cloning. The full-length open reading frames were amplified by RT-PCR from total RNA. These PCR reactions were carried out with one of two different DNA polymerases: ExTaq (Panvera) or Pfu Turbo (Stratagene). Those products (AtAMP-BPs 2, 4,5, 6, and 7) generated with ExTaq were cloned directly into the A-overhand vector pCR2.1 (Invitrogen). These genes were later cut out of pCR2.1 and ligated into ρYES2. The Pfu Turbo generated AtAMP-BP8 product was cloned into the blunt-end vector pCRScript-CAM (Stratagene), then cut out of this vector and ligated into pYES2. The PfU Turbo products for AtAMP-BP9 and 10 were cut with Kpnl and Sphl and cloned directly into pYES2.
Example 3 This Example describes primers useful for amplifying full-length ACSs and AMP-BPs and for use in RNAse protection assays.
AtACSIA
(SEQ ID NO: 52) AAGGCGATTCATCTTGAC - AtACSIA gene specific RPA primer (SEQ ID NO: 53) CTGGTACCATGACGCAGCAGAAGAAATAC - 5' yeast vector cloning primer + Kpnl restriction site.
(SEQ ID NO: 54) CTCTCGAGCTACCCTCTGGAAGCAAATT
AtACSIB
(SEQ ID NO: 55) ATGACGTCGCAGAAAAGATTCATCTTTG - 5' start codon cloning primer
(SEQ ID NO: 56) TTACTGTCCGGAAGCTAGACTTTCCTTTC -3' stop codon cloning primer
AtACSIC (SEQ ID NO: 57) GAGTCTATCTGCCGAAACC - AtACSIC gene specific RPA primer
(SEQ ID NO: 58) ATGGCGACTGGTCGATACATCGTTGAGGTTG - 5' start codon cloning primer (SEQ ID NO: 59) TTACACTCGTAGCTGCACTTCTC - 3' stop codon cloning primer
AtACS2
(SEQ ID NO: 60) 6RPA - AACTCAATTACCAATCTCCC
(SEQ ID NO: 61) CGCCATGAACACCGAGTCAG - 5' Start codon cloning primer
(SEQ ID NO: 62) GAGCCATTCAGAGCTTCGACG - 3' Stop codon cloning primer
AtACS3A
(SEQ ID NO: 63) ATCCGAGAGTGAAAGCAG - AtACS3A gene specific RPA primer
(SEQ ID NO: 64) CTGGTACCATGGATTCTTCTTCTTCGTC - 5' start codon for cloning into yeast expression vector pYES2, Kpnl restriction site included.
(SEQ ID NO: 65) AGCTCGAGTTCACAAACCTCTATTAGCAG - 3' stop codon for cloning into pYES2, Xhol restriction site included.
AtACS3B (SEQ ID NO: 66) CTTGCTGAGATGGATGAC - AtACS3B gene specific RPA primer (SEQ ID NO: 67) CATGGAATTTGCTTCGCCGGAAC
(SEQ ID NO: 68) GTACCATGGAATTTGCTTCGCCGGAAC - 5' Kpnl overhang sticky-end primers for cloning into yeast expression vector pYES2 (Invitrogen).
(SEQ ID NO: 69) CTCACAGTTTAGAAGGAATGGGG
(SEQ ID NO: 70) CATGCTCACAGTTTAGAAGGAATGGGG - 3' Sphl overhang sticky end cloning primers for cloning into pYES2.
AtACS4A
(SEQ ID NO: 71) ATGGCTTCGACTTCTTCTTTGGGA (SEQ ID NO: 72) CAAATGTCTTAACTGTAGAGTTGATCA (SEQ ID NO: 73) TGCATGGAGCTCATGGCTTCGACTTCTTCTTTGGGAC AMP- BP35SacICut (SEQ ID NO: 74) ACGATCCTCGAGTTAACTGTAGAGTTGATCAATCTC-3') AMP- BP33XhoICut
AtACS4B
(SEQ ID NO: 75) CGAATGGTACCAATGGCTTCAACGTCTCTCGGAGCTTCG - 4B- Kpnl
(SEQ ID NO: 76) ATACTGCATGCCTACTTGTAGAGTCTTTCTATTTCA - 4B-3SphI
AtACS5
(SEQ ID NO: 77) ACGGCAGAAAAGAACAAG - AtACS5 gene specific RPA primer (SEQ ID NO: 78) CTGGTACCATGAAGTCTTTTGCGGCTAAG - 5' start codon primer for cloning into pYES2, Kpnl restriction site included. (SEQ ID NO: 79) ACTCTAGATTATTGATACATATAACGTAC - 3' stop codon primer for cloning into pYES2, Xbal restriction site included.
AtACS6A
(SEQ ID NO: 80) ATGGAAGATTCTGGAGTGAATCCAATG - 5' start codon cloning primer
(SEQ ID NO: 81) TTAGGCATATAACTTGCTGAGTTCATC - 3' stop codon cloning primer
AtACS6B
(SEQ ID NO: 82) CTTCAAAGCAAGGAATAGAC - AtACS6B gene specific RPA primer
(SEQ ID NO: 83) ATGATTCCTTATGCTGCTGGTG - AtACS6B 5' Start codon cloning primer
(SEQ ID NO: 84) TTAGGCATATAACTTGGTGAGATC - 3' stop codon cloning primer
AtAMP-BPl
(SEQ ID NO: 85) ATGGAGGGAACTATCAAATCTC - 5' start codon cloning primer (SEQ ID NO: 86) TCATAACTTGCTTCTGCCTTTC - 3' stop codon cloning primer
AtAMP-BP2
(SEQ ID NO: 87) ATGAGATTCT TGTTAACCAA AAG - 5' start codon cloning primer
(SEQ ID NO: 88) TTACAAGCTA CCCATTTCAT CAG - 3' stop codon cloning primer
AtAMP-BP3
(SEQ ID NO: 89) TGAGAAATATGGGGAAGAG - AtAAMP-BP gene specific RPA primer
(SEQ ID NO: 90) ATGGATAGCGATACTCTCTCAG - 5' Start codon cloning primer (SEQ ID NO: 91) TCAGGGCTTCTCAAGGAAATG - 3' Stop codon cloning primer
AtAMP-BP4
(SEQ ID NO: 92) ATGGAACTTT TACTCCCACA CG - 5' start codon cloning primer
(SEQ ID NO: 93) TCATCAAGGCAAGGACTTAG C - 3' stop codon cloning primer
AtAMP-BP5
(SEQ ID NO: 94) GAAAACAATACATTGACCACTCAAGATG - 5' gene specific cloning primer
(SEQ ID NO: 95) TCGCAAGTTCTAATTTTACATCCGACTC - 3 ' gene specific cloning primer.
AMP-BP5 and AMP-BP6 are very similar, therefore the gene-specific cloning primers were moved "outward" from the start and stop codons a bit, to ensure gene- specificity.
AtAMP-BP6
(SEQ ID NO: 96) TTTGATTACCACTAGGAGGAAGAGATG - 5' gene specific cloning primer
(SEQ ID NO: 97) CGGTGAAAGAAAGACGTTTAAGAAATTG - 3' gene specific cloning primer
AtAMP-BP7
(SEQ ID NO: 98) ATGGCGGCAACGAAGTGGCGTG - 5' start codon cloning primer (SEQ ID NO: 99) CTATAACCTGCTTCTTGGTACTGGTCCC - 3' stop codon cloning primer
AtAMP-BP8
(SEQ ID NO: 100) ATGGAAGATTTGAAGCCAAG TGCC - 5' start codon cloning primer
(SEQ ID NO: 101) TTACATGTTTTTGGCAATCT CTTTAAGC - 3' stop codon cloning primer
AtAMP-BP9
(SEQ ID NO: 102) TACAAAACATTAACAAAAATCAAAGTATGG (SEQ ID NO: 103) ATAACTCAAGCGAATCTTTAAGGCAGAGA
AtAMP-BPl 0
(SEQ ID NO: 104) ACGATACTATAGTTTCTTGCAGCTAACTAA
(SEQ ID NO: 105) TTATTTAATGGACTTGTTCAAGACAGGGT
AMP-BP9 and 10 are so similar that primers upstream ofthe start codon and downstream ofthe stop codon had to be used to ensure gene-specific amplifications.
Example 4
This Example describes the detection of ACS enzyme activity by complementation. Eleven candidate ACS genes were cloned into the galactose- inducible Saccharomyces cerevisiae expression vector pYES2. These constructs were tested for their ability to complement the phenotype of Saccharomyces cerevisiae strain YB525. This yeast strain contains insertional disruptions in two of its ACS genes, FAA1 and FAA4 ((Knoll, LJ et al. (1995) J Biol Chem 270(18): 10861-7), which are responsible for the majority of ACS enzyme activity in S. cerevisiae. Thus, these cells are completely dependent on complementation by an active ACS when grown on media containing fatty acids as a sole carbon source and cerulenin to inhibit endogenous fatty acid synthesis by the fatty acid synthase complex.
A culture of YB525 was grown in YBD liquid media until approximately mid-log phase. Cells were harvested and made competent for transformation using the S.c. EasyComp kit (Invitrogen). Arabidopsis cDNAs were ligated into the pYES2 vector (Invitrogen), then checked for proper orientation and sequence. Any base pairs that did not match the AGI database sequence were corrected using the Quickchange site-directed mutagenesis kit (Stratagene). The expression constructs were transformed into chemically competent YB525 cells and uracil auxotrophs selected on DOBA-ura plates (DOBA: 2 percent yeast nitrogen base, 2 percent dextrose, 0.1 percent complete supplement mixture lacking uracil, 17g/L agar) (BIOIOI). Representative colonies were chosen at random and grown until mid- to late-log phase in DOB liquid media (DOBA minus agar). Galactose was added to a concentration of 2 percent to induce high-level expression ofthe transgenes from the GAL1 promoter ofthe vector. The cultures were then grown for an additional 2 to 4 hours. Aliquots of each culture were diluted 1:1 (vol/vol) with 2 M sorbitol and 5 ul aliquots plated on DOBA plates containing galactose plus 500 uM myristic acid and 25 uM cerulenin, followed by incubation at 30 °C for 3-4 days.
The results ofthe complementation experiment show that after four days at 30 °C, seven ofthe eleven candidate ACS genes had complemented the mutant phenotype and restored growth rates to wild-type levels, as compared to the wild-type strain Invisc (Invitrogen) that was used as a positive control. Only AtACS3A, 3B, 4A, and 4B did not complement the mutant phenotype.
The explanation for the inability of some ofthe genes to restore cerulenin- insensitive growth to this strain was obvious. The AtACS3A and AtACS3B genes contain
PTS2 and PTS1 peroxisome targeting sequences, respectively. Targeting of an ACS to the peroxisome renders the enzyme inaccessible to the pool of exogenous fatty acid, as evidenced by the inability of Faa2p, the endogenous peroxisomal Saccharomyces ACS ((Johnson, DR et al. (1994) J Cell Biol 127(3): 751-62; and Knoll, LJ et al. (1995) J Biol Chem 270(18): 10861-7), to support growth under the conditions used in this experiment.
The inability ofthe AtACS4A and AtACS4B genes to complement the YB525 strain was less easily explained. The deduced amino acid sequences for these two proteins did not contain recognizable peroxisome targeting sequences. AtACS4A and 4B do contain N- terminal extensions, however, that may target the encoded enzymes to other sites within the yeast cell that are separated from the pool of exogenous fatty acids. These two genes also contain abnormally long insertional elements, as seen in Figure 2. This difference in length was also observed in bnapmf28, the Brassica napus homolog of AtACS4A, which was also inactive in ACS assays when over-expressed in E. coli ((Fulda, M et al. (1997) Plant Mol Biol 33(5): 911-22). In general, the results ofthe complementation experiment indicate that most ofthe candidate genes are in fact ACSs, and that the insertional element described above is a reliable tool for distinguishing ACS genes from other related AMP-binding protein genes.
Example 5 This Example describes a biochemical assay for ACS activity. The results ofthe yeast complementation experiment clearly demonstrated that many ofthe candidate genes chosen from the initial library screens and database searches did encode ACS enzymes. However, additional analysis was necessary to address the inability ofthe AtACS3A, 3B, 4A, and 4B genes to complement the ACS deficiency in the S. cerevisiae YB525. In order to directly test the ability of this family of genes to produce active ACS enzymes, cell-free lysates were prepared from S. cerevisiae YB525 cells over-expressing each ofthe eleven candidate ACS genes, as described below. These lysates served as enzyme sources in ACS enzyme activity assays, using 14C-labeled oleic acid as a substrate.
Enzyme overproduction in Saccharomyces cerevisiae
Transformed YB525 cells were selected on solid selective media lacking uracil. Several colonies from each transformation were restreaked on a new selective media plate.
Representative colonies were randomly chosen to inoculate liquid media cultures. This media lacked uracil and contained dextrose as the carbon source, which suppressed the GAL1 promoter ofthe pYES2 vector. These cultures were grown at 30 °C with vigorous shaking to an optical density at 600 nm of about 0.7 - 1.0. Galactose (20 percent w/v) was added to a final concentration of 2 percent to induce gene expression. The cultures were shaken at 30 °C for an additional 2-4 hours and the cells harvested by centrifugation. The yeast cells were washed once with distilled water and harvested again for spheroplast production. Spheroplasts were generated from intact cells using lytic enzyme (ICN) following the manufacturers protocol. The spheroplasts were lysed by sonication on ice (2 x 1 min) followed by removal of solid debris by centrifugation at 8,000xg for 15 min at 4 °C. The resulting supernatants were used as enzyme sources for the ACS assay.
ACS enzyme assay
The assay conditions were similar to those described previously (Fulda, M et al. (1997) Plant Mol Biol 33(5): 911-22. The assay was conducted in 1.5 ml Eppendorf tubes in a volume of 100 ul. The assay mixture contained 100 mM Bis-Tris-propane (pH 7.6), 10 mM MgCl2, 5 mM ATP, 2.5 mM dithiothreitol, 1 mM CoA, 10 uM 1 -Relabeled oleic acid (specific activity 50-57 mCi/mmol, DuPont-NEN), and 20 ug of crude yeast cell lysate protein. The assay was initiated by addition ofthe fatty acid and incubated at room temperature for 15 minutes. The reactions were stopped by addition of 100 ul of 10 percent acetic acid in isopropanol and extracted twice with 900 ul of hexane (previously saturated with 50 percent isopropanol). Enzyme activity was measured by analyzing aliquots ofthe aqueous phase by liquid scintillation counting. Lysates from yeast cells bearing the empty pYES2 vector served as a negative control, while commercial ACS enzyme from Pseudomonas sp. (Sigma) served as the positive control.
Results
The results of these assays are shown in Figure 55, and demonstrate that all cell lines except those containing the AtACS4A and AtACS4B constructs produced significant levels of ACS activity. The results for these two genes was consistent with those observed in the yeast complementation experiment and in the E. coli expression studies ((Fulda, M et al. (1997) Plant Mol Biol 33(5): 911-22). Thus, in contrast to the
complementation study, cells containing constructs AtACS3A and AtACS3B produced active enzymes. The levels of activity produced by these two constructs was somewhat lower than that produced by the other active genes; thus, the activity of AtACS3A and 3B was approximately 5 -6-fold higher than that ofthe empty pYES2 negative control, compared to 12- and 20-fold higher activity for AtACS 1 A and AtACS6A, respectively. These levels of activity demonstrate that the AtACS3A and AtACS3B genes encode ACS. These results also further demonstrate that the other seven members of this family are ACSs as well.
The lack of enzyme activity for cells containing AtACS4A and 4B constructs provide further support to the hypothesis that the enzymes encoded by these genes are unique with respect to the other nine ACS genes. These genes may encode ACSs that activate specialized substrates, or the may encode a different type of enzyme related to ACS. It is also possible that these enzymes are indeed ACSs, but are inactive under the conditions used in these experiments due to special folding or multimer formation requirements, or the need for post-translational modifications not met by the cellular machinery of Saccharomyces cerevisiae.
Alternatively, it is contemplated that these two genes encode acyl ACP synthetases, as described previously.
Example 6
This Example describes the cellular location of ACS transcription as assayed by RNAse protection assays and by RNA expression profiles.
RNAse protection assays In vitro transcription and RNAse protection assays were performed basically as described in the Maxiscript and RPA II manuals (Ambion), respectively. Briefly, several different tissues (for example, seed, cultured roots, stem, young leaves [post- bolting], silique, flowers and buds, green rosette [pre-bolting], and older leaves [post- bolting]) were harvested from wild-type Arabidopsis ecotype Columbia plants. Tissues were frozen in liquid nitrogen and stored at -80°C until use.
Total RNA was isolated from the tissues using standard methods. The RNA pellets were dissolved in DEPC-treated water and quantified spectrophotometrically. Gene specific RPA probes templates were produced by PCR amplifying small (200-500
bp) fragments of each ACS gene from the full-length or partial cDNA clones obtained from ABRC. Primer sequences are provided in Example 3. The PCR products were electrophoresed through TAE-agarose gels and gel-purified using Qiaquick spin columns (Qiagen). The PCR products were transcribed in vitro in 20 μl reactions containing: 2 μl
10 X transcription buffer, approximately 1 μg of template DNA, 1 μl each ATP, CTP, and GTP, 5 μl 12.5 μM 32P labeled UTP, and 2μl either SP6, T3, or T7 RNA polymerase. The contents were mixed and incubated at 37°C for 1 hour. DNAse I was added to stop the reaction and remove template DNA. The radiolabeled RNA probe was then gel-purified on 5 percent TBE, 8 M Urea acrylamide gels. The RNA was eluted in elution buffer (0.5 M ammonium acetate, 1 mM EDTA, 0.1 percent SDS) overnight. An aliquot ofthe eluted probe was quantified by scintillation counting and, according to the manufacturer's calculation methods, the number of counts corresponding to 2 femptomoles of probe was determined. Twenty micrograms of total RNA from each tissue was co-precipitated with 2 femptomoles of probe and resuspended in 20 μl hybridization buffer (Solution A from the kit). After heating at 95°C for 3-4 minutes, the RNA/probe mixture was incubated overnight at 45°C.
Unprotected RNA was digested by adding to the RNA/probe mixture 200 ml RNAse solution (1/100 dilution of stock RNAse A/RNAse Tl mixture) and incubating the mix at 37°C for 30 minutes. Three hundred microliters of solution Dx was then added to each tube to stop the reaction. Two microliters of carrier yeast RNA was added to increase pellet visibility. The mixture was chilled at -20°C for at least 15 minutes, and then centrifuged at maximum speed for minutes in a cold room. The pellets were dissolved in nondenaturing gel sample buffer and electrophoresed through a nondenaturing 5 percent TBE acrylamide gel. After running, the gel was dried in a gel drier and the images were developed in a Bio-Rad Phosphorimager.
The results are summarized in Table 2 below. A relatively strong signal for a given tissue is designated by (+++), a relatively weak signal is designated by (+), and the apparent absence of a signal is indicated by (-). As can be seen, the RNAs for the different ACSs localize to a variety of tissues.
RNA expression profiles
The tissue-specific RNA expression profiles of each ofthe ACS genes was also examined by semi-quantitative RT-PCR ((Kong, SE et al. (1999) Anal Biochem 271(1): 111-4). This technique was chosen because careful control ofthe PCR conditions allows for easy and sensitive comparisons ofthe expression levels for each ofthe different genes while eliminating the risk of cross-hybridization between related genes on a Northern blot. Each gene was analyzed using RNA from mature seeds, tissue culture-grown roots, leaves, stems, flowers, and siliques.
RNA preparations from mature seed, roots, young leaves, stems, siliques, and flowers were quantified spectrophotometrically and 1 ug aliquots of each used as template for reverse transcription, as described above. One ul of each RT reaction was used as template in a 50 ul PCR reaction containing gene-specific primers. The amplification conditions were as follows: 95 °C 3 min, and 30 cycles of 94 °C 15 sec, 55 °C 30 sec, 72 °C 1 min. One-third of each reaction was analyzed by TAE-agarose gel electrophoresis and the degree of gene expression correlated to the relative intensity of each band as determined by visual comparison ofthe ethidium bromide staining intensity when the gels were visualized under UN illumination. The actin gene ACTS ((An et al, 1996)) was used as a control to insure that equal amounts of RΝA were used in both the RT and PCR portions ofthe experiments.
The results are summarized in Table 3 below. The relative strength ofthe signal is scored from 3 plusses ("+++"), denoting the strongest signal, to a negative sign ("-"), denoting the apparent absence of a signal.
The relative intensities ofthe bands for the positive control, the Arabidopsis actin ACT8 gene, were almost equivalent, with slight reductions in mature seed and siliques. This profile closely parallels the relative Northern blot signal intensities for this gene ((An, YQ et al. (1996) Anal Biochem 271(1): 111-4), thus validating the accuracy of this technique. As seen in Table 3, most ofthe ACS genes are expressed in a variety of tissues at widely varying levels.
Close inspection of Table 3 reveals several interesting phenomena. First, several ACS genes are expressed in the mature seed ofthe plant. The deposition of transcripts for these genes in the mature seed indicates that the ACS enzymes encoded by them are needed during the very early stages of germination. This is consistent with a strong demand for the enzymes of beta-oxidation and membrane lipid biosynthesis in the emerging seedling. The second interesting pattern observed is the strength of expression of all eleven ACS genes in flowers. These data are consistent with the high level of metabolic activity in flowers. The overall complexity of expression for the genes in this group suggests that at least some ofthe ACSs may have overlapping functions within the plant. Only AtACSIB seems to be highly specific, showing extremely high expression in flowers, but no expression in any ofthe other tissues tested. Nearly all the ACS genes, with the exception of AtACSIB and possibly AtACS2, are expressed in siliques.
In other experiments, the RNA expression pattern of AtACS6A (the closest paralog of AtACS6B) is similar to 6B in that highest levels of expression were observed
in young, developing leaves and seeds; this is consistent with the belief that de novo FAS is most active in these tissues. This observation suggests that many genes in this gene family may participate in glycerolipid synthesis in the developing seed.
Example 7
This Example describes the analysis ofthe subcellular localization of ACSs by a chloroplast import assay. Briefly, intact chloroplasts were isolated from young pea seedling extracts by centrifugation through Percoll gradients, and incubated with labeled expression products from an in vitro transcription/translation reaction mixture with an ACS encoding sequence. The chloroplasts were then separated from the labeled expression products by centrifugation through a Percoll cushion, lysed, and the different fractions ofthe chloroplast separated. The import ofthe labeled ACS was determined by the presence of label in chloroplast lysates, the location was determined by the presence of label in different fractions, and the identification of labeled ACS was confirmed by gel electrophoresis.
Chloroplasts are isolated from nine to ten day old pea seedlings by first removing the seedlings from a growth chamber and placing them in lab light for at least one hour to allow for starch degradation before grinding the tissue (this minimizes disruption of intact chloroplasts). Next, a standard Percoll gradient was formed by adding 1 mg glutathione to a 50 ml open top centrifuge tubes, followed by the addition of 17.5 ml 2X GR buffer (IX GR buffer is 50 mM HEPES/KOH pH 8.0, 10 mM EDTA, 0.33 M sorbitol, 5 mM Na+ ascorbate, pH 7.5, and 0.05 percent BSA) and 17.5 ml Percoll. The mixture was then covered with parafilm and mixed. Next, the tubes were centrifuged in SS34 rotors at 4°C min at 19,000 rpm (no brake).
When the gradient was almost complete, the aerial portions ofthe plants were cut and placed in a pre-weighed flask (about 40 g of tissue from a flat planted with -200 ml peas). The tissue was placed in a chilled blender containing 250 ml IX GR and pulsed three times for one second each. The extract was filtered through a funnel lined with cheesecloth and Miracloth. The process was then repeated with a second 40 g batch. The pooled extracts were placed in chilled 250 ml bottles and pelleted in a swinging bucket rotor for 3 min at 3200 rpm. The supernatant was decanted, and the pellet resuspended in 5 ml IX GR. The pellets (containing chloroplasts) were then
layered onto the gradients with a glass pipette and centrifuged in a swinging bucket rotor at 2600 rpm for 15 min. The lower intact chloroplast band was removed and placed into two 50 ml tubes. The tubes were filled to top with IX IB (IX IB buffer is 50 mM HEPES/KOH, pH 8.0, 0.33 M sorbitol) and centrifuged in a swinging bucket rotor at 2600 rpm for 5 min. The supernatant was removed and the pellet resuspended in 10 ml of IB.
The concentration of chloroplasts was determined by placing 1 ml acetone in each of three 1.5 ml tubes. Water (250 μl) was added to the first tube, 225 μl water and 25 μl chloroplasts were added to the second tube, and 200 μl water and 50 μl chloroplasts were added to the third tube. The tubes were mixed well and centrifuged to pellet the proteins. The OD at 652 nm was determined and the concentration of chloroplasts calculated by the following formula: (OD652/34.5) X 1.25)/ sample amount X 10 ml = mg total. The chloroplasts samples were then repelleted and resuspended to 1 mg/ml in IX IB. Labeled ACS gene products were prepared by in vitro transcription and translation of ACS cDNAs using a TNT kit (Promega) according to the manufacturer's instructions. Labeled control proteins for the import assay were also prepared in the same manner; these control proteins included luciferase, which is not imported into chloroplasts, the small subunit of RiBisCO, which is imported and is localized to the sfroma, with concomitant cleavage ofthe signal peptide (Froelich, JE et al. (2001) Plant Physiol 125: 306-317), and LeHPL, a tomato hydroperoxide lyase which is associated with the chloroplast envelopes, despite its lack of a typical signal peptide (Froelich, JE et al (2001) Plant Physiol 125: 306-317).
Import assays were performed in following reaction mixtures: 75μl IX IB, 5 μl 2X IB, 15 μl 50 mM Mg-ATP (in IB), 50 μl 2X chloroplasts (1 mg/ml), and 5 μl translation product. The reaction mixtures were incubated in water bath at 25 °C for 15- 30 min in the presence of light. The import reaction mixtures were then loaded onto 1 ml of 40 percent Percoll and centrifuged at 3,000Xg for 8 min. The supernatant was removed, the pellet resuspended, and centrifuged again. Next, 600 μl lysis buffer (25 mM HEPES + 5 mM MgCl2) was added to the pellet. This mixture was incubated on ice, in the dark, for about 20 min. The mixture was then divided into 3 equal parts in microfuge tubes and centrifuged in an Airfuge at 100,000Xg for 40 min at 4°C. The pellets were then resuspended in either 200 μl lysis buffer, 200 μl 2M NaCl, or 100 mM
Na2CO3. The mixtures were then centrifuged in an Airfuge at 100,000Xg for 30 min at 4°C. The supernatant was removed and 100 percent TCA added to 10 percent. The mixtures were stored overnight.
The next day, the mixtures were centrifuged at 20,000Xg for 10 min, washed with cold acetone, and resuspended in 30 μl 5X SDS Loading dye. Ten microliters of the chloroplast import assays were then loaded onto 10 percent nondenaturing gels and electrophoresed. Following electrophoresis, the gels were dried and exposed to film.
The results indicate that despite the lack of a typical chloroplast targeting signal, labeled AtACS6B was targeted to intact chloroplast, and was only present in the membrane fractions. Treatment of the lysed membranes with lysis buffer and NaCl did not dissociate AtACS6B from the membranes, whereas treatment with Na CO3 extracted a portion of it from the membranes. This pattern was similar to that observed with a control protein, LeHPL, a hydroperoxide lyase from tomato which has been shown to associate with chloroplast outer envelope, even though it too lacks a signal peptide (Froelich, JE et al. (2001) Plant Physiol 125: 306-317). Thus, the results suggest that AtACS6B is associated with the chloroplast envelope membranes. Moreover, ATACS6B does not appear to be proteolytically processed during plastidial targeting, because the gel mobility ofthe AtACS6B associated with the chloroplast was identical to that ofthe starting product, produced by in vitro translation. Additional results indicated that AtACS2 is also imported into chloroplasts.
Example 8
This Example describes identification and analyses of ACS knock-out mutant Arabidopsis plants. Two different mutants were found in two different lines of T-DNA Arabidopsis plants.
The first population, a T-DNA tagged population, available through the Arabidopsis Biological Resource Center (http://aims.cps.msu.edu/aims/), represents 6,000 individual transformants, each containing one or more T-DNA insertions. The T-DNA is a 17.0 kb DNA fragment that contains the nptll gene, which confers resistance to kanamycin. Insertions ofthe large T-DNA fragment in a gene of interest effectively prevents transcription of that gene.
This population was searched using a Pl/KFLB primer combination (primers listed below), and resulted in the identification of a mutant line in the CD5-7 population
(Feldmann lines) that contains a T-DNA interrupted AtACSόB coding region. The T-DNA insertional event occurs in the third exon, 1120 bp downstream from the start codon in the genomic sequence. From a sample of pooled seeds, two mutants were identified by using Pl/KFLB and the P1/P2 gene specific primer combinations in PCR analysis first on pooled and later on individual plants: a heterozygous mutant containing one copy of a T-DNA interrupted AtACSόB gene, and a homozygous mutant lacking both native copies of AtACSόB (both designated the T1 generation). The seeds were germinated after surface sterilization in 20 percent bleach + 0.1 percent SDS for 20 minutes, followed by rinsing 3 times in sterile water. The sterilized seeds suspended in 0.1 percent agarose were plated on germination medium (MS salts, 1 percent sucrose, 3.5 g/L Phytagel, 75 mg/L kanamycin, pH 5.7). PCR analysis and protocols were performed according to the protocols at http://www.biotech.wisc.edu/Arabidopsis/ using PanVera ExTaq.
PI primer (GAAAGTTAAACTCAATTCCTCCGTCGATCA) (SEQ ID NO: 106) P2 primer (GCATATAACTTGGTGAGATCTTCAGAGAATT) (SEQ ID NO: 107)
KFLB primer (TGCACTCGAAATCAGCCAATTTTAGACAA). (SEQ ID NO:
108)
In order to screen for the presence of multiple T-DNA insertions, progeny from the heterozygous T\ plants were subjected to segregation analysis. The kanamycin segregation ratios ofthe T2 seed ofthe heterozygous mutant indicated that only one T-DNA insertional event was present. Of 471 seed, 121 were kanamycin-sensitive, while 370 were resistant to kanamycin. This ratio represents a 3:1 hypothesis for a single insertion (χ2= 0.033; P>0.8). Southern blot analysis of 5 T2 plants from homozygous mutant showed identical restriction patterns to the heterozygous plants when probed with a LB fragment, confirming that the homozygous T\ individual also contained only one insert.
Results from a Northern blot analysis showed the lack of full-length AtACSόB transcript in the acsόb/acsόb mutant. Total RNA was isolated from floral and bud tissues of wild type, heterozygous, and homozygous AtACSόB plants. As expected, transcripts of full- length AtACSόB were present only in wild-type and heterozygous mutant plants. A truncated transcript corresponding to the length of transcript preceding the T-DNA insertion was present in the heterozygous and homozygous mutants.
A comparison ofthe phenotypes ofthe homozygous mutant and the wild-type plants showed that at all stages ofthe life cycle, the homozygous mutant was indistinguishable
from wild type plants grown under the same conditions. Quantitative measurements of growth rate also showed no difference between the homozygous mutant and wild-type plants.
Fatty acid analysis of above-ground portions of wild type and homozygous mutant plants at 19 days of age revealed no significant differences between any ofthe fatty acid species typically found in Arabidopsis leaves (fatty acids were analyzed as methyl esters of total extracted lipids).
Northern analysis showed that the AtACSόB transcript was more abundant in developing seeds than in leaves. Therefore, lipids of developing seeds from homozygous and wild-type plants were analyzed. The plants were grown under 14 hour photoperiod, and secondary and axillary floral stems were removed as they appeared in order to facilitate the cataloging and collection of siliques. At 42 days, intact siliques of varying developmental stages were removed and the total fatty acids analyzed. The lipid content ofthe homozygous mutant from 2 to 13 DAF did not differ significantly from that of wild type plants (see Figure 56). The peak of lipid accumulation (8-9 days after flowering, or DAF) corresponds to the highest level of AtACSόB transcripts at 6 to 11 DAF developing siliques. ACS activity was measured in chloroplasts isolated from wild type and homozygous mutant plants. Intact chloroplasts were isolated from 19 day old leaf tissue as described in Example 7. ACS was assayed as described in Example 6; the assay included isolated chloroplasts, CoenzymeA, ATP, and l-14C-oleic acid (18:1). When compared with wild type, the homozygous mutant chloroplasts exhibited a 13.75-fold decrease in ACS activity in this assay.
In summary, these results indicate that in the AtACSόB knock-out mutant, there were no visible phenotypic differences or measurable changes in fatty acid quantity or species between wild type and homozygous mutant plants, yet the homozygous mutant chloroplasts exhibited significantly less ACS activity than did the wild-type plants.
Another mutant, an A CS2 T-DNA knockout mutant, was also discovered, but in a different population of T-DNA mutant plants. This population of T-DNA mutant plants was prepared in a glabrous plant line, which is a Columbia mutant which is missing the gene responsible for developing trichomes. Thus, the wild-type plant for this mutant is a glabrous plant, or one which does not have trichomes.
The phenotype ofthe ACS2 mutant is quite different from that ofthe wild-type, in that the mutant has smaller, curled leaves and flowers slightly later. Segregation analysis
indicated that the homozygous ACS2 knockout plant (11-4) contained multiple T-DNA insertions. To obtain a plant line which contained only insertions in the ACS2 genes, the plants were backcrossed with Columbia pollen. After several generations of selfing, plant lines which contained only insertions (homozygous) in ACS2 were obtained. These plants exhibited the small, puckered leaf phenotype ofthe original mutant, indicating that the absence of functional ACS2 transcript was responsible for the phenotype. On the other hand, even though phenotypically this mutant is quite different, the leaf fatty acids of this mutant do not appear to differ significantly from those ofthe wild-type plant.
Leaf fatty acids were analyzed by removing leaves from each of a wild-type plant (glabrous, "gib"), progeny ofthe original mutant plant with the same phenotype
(homozygous, "11 -4"), and progeny of the original mutant plant crossed with wild-type phenotype which exhibits a wild type phenotype (which is therefore believed to be hemizygous, "wt"), and placing them in individual glass screw-cap tubes. One and a half milliliters 2.5 percent H2SO4 in methanol were added to each tube and the tubes were incubated at 80°C for 1.5 hours. Next, 1.5 ml water and 500 μl hexane were added to each tube. The tubes were vortexed and centrifuged to separate the phases. The hexane phases were then transferred to GC vials for GC analysis according to the following program: 150°C for 1 min, then ramp at 15 degrees/min to 240°C, then hold for 2 min.
The fatty acid profiles ofthe mutants did not differ significantly from those of wild- type plants (See Table 4).
Various modifications and variations ofthe described compositions and methods ofthe invention will be apparent to those skilled in the art without departing from the scope and spirit ofthe invention. Although the invention has been described in connection with particular preferred embodiments, it should be understood that the inventions claimed should not be unduly limited to such specific embodiments. Indeed, various modifications ofthe described modes for carrying out the invention which are obvious to those skilled in the art and in fields related thereto are intended to be within the scope ofthe following claims.
Claims
1. An isolated nucleic acid sequence comprising SEQ ID NO: 1 or SEQ ID NO:2 or SEQ ID NO:3 or SEQ ID NO:4 or SEQ ID NO:5 or SEQ ID NO:6 or SEQ ID NO:7 or SEQ ID NO:8 or SEQ ID NO:9 or SEQ ID NO:10 or SEQ ID NO: 11 or a sequence that hybridizes to at least one ofthe foregoing sequences under conditions of low stringency.
2. An isolated nucleic acid sequence encoding a protein comprising amino sequence SEQ ID NO:12 or SEQ ID NO:13 or SEQ ID NO:14 or SEQ ID NO:15 or SEQ ID NO:16 or SEQ ID NO:17 or SEQ ID NO:18 or SEQ ID NO:19 or SEQ ID NO:20 or SEQ ID NO:21 or SEQ ID NO:22.
3. The nucleic acid sequence of Claims 1-2, wherein the sequence is operably linked to a heterologous promoter.
4. The nucleic acid sequence of Claim 1-3, wherein the nucleic acid sequence is contained within a vector.
5. The vector of Claim 4, wherein the nucleic acid sequence is in a sense orientation.
6. The vector of Claim 4, wherein the nucleic acid sequence is in an antisense orientation.
7. A composition comprising the nucleic acid sequence of Claimsl-3.
8. A host cell transfected with a nucleic acid sequence or a vector or a composition according to Claims 1-7
9. A plant transfected with a nucleic acid sequence or a vector or a composition according to Claims 1-7.
10. A seed from the plant of Claim 9.
11. Oil from the transgenic plant of Claim 9.
12. An isolated nucleic acid sequence or a vector or a composition according to Claims 1-7 for use in altering a phenotype of a plant.
13. A method for making a transgenic plant comprising a) providing a nucleic acid sequence or a vector or a composition according to Claims 1-7, and plant tissue, b) and transfecting the plant tissue with the nucleic acid sequence or the vector or the composition under conditions such that a transgenic plant is generated.
14. A method for altering a phenotype of a plant comprising a) providing a nucleic acid sequence or a vector or a composition according to Claims 1-7, and plant tissue, b) and transfecting the plant tissue with the nucleic acid sequence or the vector or the composition under conditions such that a transgenic plant is generated and the phenotype is altered.
15. A method for producing variants of acyl-CoA synthetases comprising: a) providing a nucleic acid sequence according to Claims 1-3, b) mutagenizing the nucleic acid sequence; and c) screening a variant encoded by the mutagenized nucleic acid sequence for activity.
16. A method for screening acyl CoA synthetases comprising: a) providing a candidate acyl-CoA synthetase; and b) analyzing the candidate acyl-CoA synthetase for the presence of at least one of ACS motifs 1-9.
17. An isolated nucleic acid sequence encoding a plant acyl-CoA synthetase, wherein the plant acyl CoA synthetase competes for binding to a fatty acid substrate with a protein encoded by a second nucleic acid sequence and wherein the second nucleic acid sequence comprises a nucleic sequence according to Claims 1-2.
18. A composition comprising a first isolated nucleic acid sequence, wherein the first nucleic acid sequence inhibits the binding of at least a portion of a second nucleic acid sequence to its complementary sequence and wherein the second nucleic acid sequence has a nucleic sequence according to Claims 1-2.
19. A purified protein comprising an amino acid sequence encoded by a nucleic acid sequence of Claims 1-2.
20. A composition comprising a protein of Claim 19.
21. A compound according to Claims 1-4 substantially as described herein in any ofthe examples.
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US22047400P | 2000-07-21 | 2000-07-21 | |
| US60/220,474 | 2000-07-21 | ||
| US09/906,419 US20030037357A1 (en) | 2000-07-21 | 2001-07-16 | Plant acyl-CoA synthetases |
| US09/906,419 | 2001-07-16 | ||
| PCT/US2001/022774 WO2002009295A2 (en) | 2000-07-21 | 2001-07-19 | Plant acyl-coa synthetases |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| AU2001277918A1 true AU2001277918A1 (en) | 2002-02-05 |
Family
ID=26914913
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2001277918A Abandoned AU2001277918A1 (en) | 2000-07-21 | 2001-07-19 | Plant Acyl-CoA synthetases |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20030037357A1 (en) |
| EP (1) | EP1356055A2 (en) |
| AU (1) | AU2001277918A1 (en) |
| BR (1) | BR0112639A (en) |
| CA (1) | CA2416558A1 (en) |
| MX (1) | MXPA03000621A (en) |
| WO (1) | WO2002009295A2 (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040010817A1 (en) * | 2000-07-21 | 2004-01-15 | Washington State University Research Foundation | Plant acyl-CoA synthetases |
| GB0421937D0 (en) * | 2004-10-02 | 2004-11-03 | Univ York | Acyl CoA synthetases |
| FR2904633B1 (en) * | 2006-08-04 | 2010-12-31 | Univ Grenoble 1 | PLASTIDIAL MICROARRAY |
| CA2907905A1 (en) | 2010-02-01 | 2011-08-04 | Suntory Holdings Limited | Polynucleotide encoding acyl-coa synthetase homolog and use thereof |
| CA3229999A1 (en) * | 2021-08-26 | 2023-03-02 | Andrew Conley | Large scale production of divarin, divarinic acid and other alkyl resorcinols by fermentation |
-
2001
- 2001-07-16 US US09/906,419 patent/US20030037357A1/en not_active Abandoned
- 2001-07-19 AU AU2001277918A patent/AU2001277918A1/en not_active Abandoned
- 2001-07-19 EP EP01955866A patent/EP1356055A2/en not_active Withdrawn
- 2001-07-19 BR BR0112639-3A patent/BR0112639A/en not_active IP Right Cessation
- 2001-07-19 WO PCT/US2001/022774 patent/WO2002009295A2/en not_active Ceased
- 2001-07-19 CA CA002416558A patent/CA2416558A1/en not_active Abandoned
- 2001-07-19 MX MXPA03000621A patent/MXPA03000621A/en unknown
Also Published As
| Publication number | Publication date |
|---|---|
| MXPA03000621A (en) | 2004-07-30 |
| US20030037357A1 (en) | 2003-02-20 |
| EP1356055A2 (en) | 2003-10-29 |
| WO2002009295A2 (en) | 2002-01-31 |
| WO2002009295A3 (en) | 2003-08-14 |
| CA2416558A1 (en) | 2002-01-31 |
| BR0112639A (en) | 2004-11-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6878861B2 (en) | Acyl coenzyme A thioesterases | |
| US7429473B2 (en) | Diacylglycerol acyltransferase genes, proteins, and uses thereof | |
| AU2001280652A1 (en) | Acyl coenzyme a thioesterases | |
| US8017386B2 (en) | Divinyl ether synthase gene and protein, and uses thereof | |
| JP2009536029A (en) | Molecular cloning and sequencing of the acetyl-CoA carboxylase (ACCase) gene from Jatrophacurcas | |
| WO2007149583A2 (en) | Acyltransferases for altering lipid production on plants | |
| US20040010817A1 (en) | Plant acyl-CoA synthetases | |
| US7446188B2 (en) | Plant cyclopropane fatty acid synthase genes, proteins, and uses thereof | |
| US8674176B2 (en) | ADS genes for reducing saturated fatty acid levels in seed oils | |
| US20030037357A1 (en) | Plant acyl-CoA synthetases | |
| US7105722B2 (en) | Plant acyl-CoA synthetases | |
| US7932433B2 (en) | Plant cyclopropane fatty acid synthase genes, proteins, and uses thereof | |
| WO2003060092A2 (en) | Modified fatty acid hydroxylase protein and genes |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MK5 | Application lapsed section 142(2)(e) - patent request and compl. specification not accepted |