[go: up one dir, main page]

CN1293711A - Genes encoding MLO proteins and conferring fungal resistance upon plants - Google Patents

Genes encoding MLO proteins and conferring fungal resistance upon plants Download PDF

Info

Publication number
CN1293711A
CN1293711A CN99804041A CN99804041A CN1293711A CN 1293711 A CN1293711 A CN 1293711A CN 99804041 A CN99804041 A CN 99804041A CN 99804041 A CN99804041 A CN 99804041A CN 1293711 A CN1293711 A CN 1293711A
Authority
CN
China
Prior art keywords
leu
ser
val
ala
phe
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN99804041A
Other languages
Chinese (zh)
Inventor
J·M·萨尔米罗
L·J·威斯罗
L·J·斯塔拉
C·M·克莱默
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novartis AG
Original Assignee
Novartis AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novartis AG filed Critical Novartis AG
Publication of CN1293711A publication Critical patent/CN1293711A/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8271Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
    • C12N15/8279Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
    • C12N15/8282Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for fungal resistance
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Cell Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medicinal Chemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Botany (AREA)
  • Peptides Or Proteins (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Agricultural Chemicals And Associated Chemicals (AREA)

Abstract

This invention describes genes encoding proteins which control resistance of plants to fungal pathogens. The invention also describes transgenic plants resistant to fungal pathogens and methods for making plants resistant to fungal pathogens. The invention further discloses a method to isolate additional genes coding for additional proteins controlling the resistance of plants to fungal pathogens.

Description

编码MLO蛋白质并赋予植物抗真菌抗性的基因The gene that encodes the MLO protein and confers resistance to the plant against the fungus

本发明描述了所编码蛋白质控制植物对真菌疾病的抗性的核苷酸序列。本发明还涉及抗真菌疾病的植物以及抗真菌疾病的植物的制备方法。The present invention describes nucleotide sequences encoding proteins that control plant resistance to fungal diseases. The invention also relates to plants resistant to fungal diseases and to methods for the preparation of plants resistant to fungal diseases.

真菌疾病在美国每年造成农作物损失约91亿美元,它们由多种生物学上差异很大的病原体引起。传统上已用不同的方法来控制它们。现已培育出农业上重要的抗性品种,从而提供了抗小范围或较大范围病原体分离物或品种的多水平抗性。不过,这项工作包括通过遗传杂交将所需性状引入商品化株系的长期辛苦的过程,且由于害虫可能进化成可克服天然植物抗性的风险,需要持续努力将新的抗性性状引入商品化品系中。此外,已通过应用化学杀真菌剂控制真菌疾病。这种方法通常产生有效的控制作用,但也与可能产生的抗性病原体相关,并可能对环境产生负面影响。而且,在某些农作物,诸如大麦和小麦中,用化学杀真菌剂控制真菌病原体是困难或不切实际的。目前的技术已能更好地在分子水平上理解植物及其病原体之间的相互作用,且已部分揭示了抗性机制。已对模型植物拟南芥进行了该分子鉴定的大部分工作,现已开始阐明重要经济农作物中的抗性机制。Fungal diseases cause an estimated $9.1 billion in annual crop losses in the United States, and they are caused by a variety of biologically diverse pathogens. Different methods have traditionally been used to control them. Agriculturally important resistant varieties have been bred to provide multi-level resistance against narrow or larger ranges of pathogen isolates or varieties. However, this work involves the long and laborious process of introducing desired traits into commercial lines through genetic crossing, and due to the risk that pests may evolve to overcome natural plant resistance, continued efforts are required to introduce new resistance traits into commercial products in the chemical line. In addition, fungal diseases have been controlled through the application of chemical fungicides. This approach often yields effective control, but is also associated with the possible development of resistant pathogens and can have negative environmental impacts. Furthermore, in certain crops, such as barley and wheat, it is difficult or impractical to control fungal pathogens with chemical fungicides. Current technologies have enabled a better understanding of the interactions between plants and their pathogens at the molecular level and have partially revealed resistance mechanisms. Much of this molecular characterization work has been carried out on the model plant Arabidopsis thaliana, and has now begun to elucidate resistance mechanisms in commercially important crops.

白粉病是影响大部分植物种类的主要疾病且已被广泛研究。它们的特征在于植物组织上生长的白色至浅灰色斑点或片状物,相应于真菌的菌丝体和闭囊壳。白粉病是由白粉菌目(Erysiphales)的数种真菌引起的。例如,禾白粉菌(Erysiphe graminis)引起谷类和草的白粉病。虽然白粉病在大部分农作物中难于控制,却有一些抗多数已知病原体分离菌的大麦品系。研究工作已显示,单位点即mlo位点的突变与抗性表型有关。Mlo抗性机制已部分阐明;包括在与病原体的接触位点形成被称为乳突的大细胞壁外加体(apposition),其中主要包含胼胝质,但也有碳水化合物、酚和蛋白质。在mlo植物中,细胞壁外加体阻止了病原体的侵入,因而产生了抗性。Powdery mildew is a major disease affecting most plant species and has been extensively studied. They are characterized by white to grayish spots or flakes that grow on plant tissue, corresponding to the fungal mycelium and ocystosis. Powdery mildew is caused by several fungi of the order Erysiphales. For example, Erysiphe graminis causes powdery mildew of cereals and grasses. Although powdery mildew is difficult to control in most crops, there are some barley lines that are resistant to most isolates of the known pathogen. Research work has shown that mutations at a single site, the mlo site, are associated with resistance phenotypes. The mechanism of Mlo resistance has been partially elucidated; it involves the formation of large cell wall appositions called papillae at sites of contact with the pathogen, which contain mainly callose but also carbohydrates, phenols and proteins. In mlo plants, cell wall extrasomes block pathogen entry and thus confer resistance.

不幸的是,此控制白粉病的有力工具只局限于针对大麦。由农业中真菌疾病,尤其是白粉病引起的问题看来,依然需要新的有效方法以控制其它农作物中的这些类型病原体,这些方法在经济上对农场主有吸引力且在环境上可接受。Unfortunately, this powerful tool for controlling powdery mildew is limited to barley. Given the problems caused by fungal diseases in agriculture, especially powdery mildew, it appears that there is still a need for new effective methods of controlling these types of pathogens in other crops that are economically attractive to farmers and environmentally acceptable.

本发明提出了对通过遗传工程技术的新型植物疾病控制方法的需求。更具体地说,本发明涉及控制白粉病的方法,优选是控制经济上重要的农作物中的白粉病的方法。The present invention addresses the need for new methods of plant disease control through genetic engineering techniques. More particularly, the present invention relates to methods of controlling powdery mildew, preferably powdery mildew in economically important crops.

本发明涉及编码Mlo蛋白质的分离DNA分子,其中所说的Mlo蛋白质赋予植物抗真菌病原体的抗性。更具体地说,本发明涉及含本发明发明者首次发现的保守氨基酸序列的Mlo蛋白质,并涉及编码该Mlo蛋白质的分离DNA分子。本发明还描述了用于在植物中表达本发明DNA分子的载体。本发明进一步涉及包含本发明任一种DNA分子的转基因植物。本发明也描述了具改善植物检疫特性的农产品,其中包括通过本发明任一种DNA分子的表达而抗真菌病原体的转基因植物。本发明还进一步涉及制备抗真菌疾病的植物的方法,该方法是通过改变由相应于本发明任一种DNA分子之内源基因拷贝编码的蛋白质在转基因植物中的表达,或通过改变由相应于本发明任一种DNA分子之内源基因拷贝编码的蛋白质的活性或稳定性而进行的。这样的转基因植物如预期地能抗感染植物活表皮细胞的病原体,尤其是来自白粉菌目的真菌(还已知为白粉菌),优选抗引发白粉病的白粉菌属(Erysiphe)真菌,更优选抗禾白粉菌的植物。本发明进一步描述了分离所编码蛋白质具有与本发明DNA分子编码蛋白质相同或相似功能和编码本发明所列保守氨基酸序列的DNA分子的方法。The present invention relates to isolated DNA molecules encoding Mlo proteins which confer resistance in plants against fungal pathogens. More specifically, the present invention relates to an Mlo protein comprising a conserved amino acid sequence discovered for the first time by the inventors of the present invention, and to an isolated DNA molecule encoding the Mlo protein. The invention also describes vectors for expressing the DNA molecules of the invention in plants. The invention further relates to transgenic plants comprising any of the DNA molecules of the invention. The invention also describes agricultural products having improved phytosanitary properties, including transgenic plants resistant to fungal pathogens by expression of any of the DNA molecules of the invention. The present invention still further relates to a method of producing plants resistant to fungal diseases by altering the expression in transgenic plants of a protein encoded by an endogenous gene copy corresponding to any one of the DNA molecules of the present invention, or by altering the The activity or stability of the protein encoded by the endogenous gene copy of any DNA molecule of the present invention is carried out. Such transgenic plants are expected to be resistant to pathogens infecting living epidermal cells of plants, especially fungi from the order Erysiphales (also known as Erysiphe), preferably against fungi of the genus Erysiphe that cause powdery mildew, more preferably against Plants of Erysipha graminearum. The present invention further describes the method of isolating the encoded protein having the same or similar function as the protein encoded by the DNA molecule of the present invention and encoding the conserved amino acid sequence listed in the present invention.

本发明因而提供了控制重要经济农作物中真菌疾病的新的有效方法,从而有可能减少用于农作物的化学药品量并降低对控制剂有抗性的病原体出现的危险。The present invention thus provides new and effective methods of controlling fungal diseases in economically important crops, making it possible to reduce the amount of chemicals used on the crops and reduce the risk of emergence of pathogens resistant to the control agents.

因而本发明提供了:The present invention thus provides:

编码赋予植物对真菌病原体的抗性之Mlo蛋白质的DNA分子,其中所说的蛋白质包含与SEQ ID NO:1或SEQ ID NO:2中所列氨基酸序列相同或基本相似的至少一个氨基酸序列,其中所说的DNA分子优选是cDNA分子。在优选的实施方案中,DNA分子优选不来自大麦而来自双子叶植物或以下植物:小麦、玉米、稻、燕麦、黑麦、高粱、甘蔗、小米、买罗高粱(milo)和棕榈科。在优选的实施方案中,本发明的DNA分子与SEQ ID NO:3、SEQ ID NO:5或SEQ ID NO:7中所列的任一种核苷酸序列相同或基本相似,或者编码与SEQ ID NO:4、SEQID NO:6或SEQ ID NO:8中所列Mlo蛋白质相同或基本相似的Mlo蛋白质。在更优选的实施方案中,包含SEQ ID NO:3、SEQ ID NO:5或SEQ ID NO:7中所列核苷酸序列的DNA分子来自小麦。在另一优选的实施方案中,本发明的DNA分子与SEQ ID NO:9、SEQ IDNO:11、SEQ ID NO:13、SEQ ID NO:15或SEQ ID NO:17中所列的任一种核苷酸序列相同或基本相似,或者编码与SEQ ID NO:10、SEQID NO:12、SEQ ID NO:14、SEQ ID NO:16或SEQ ID NO:18中所列任一种核苷酸序列编码的Mlo蛋白质相同或基本相似的Mlo蛋白质。在更优选的实施方案中,包含SEQ ID NO:9、SEQ ID NO:11、SEQ IDNO:13、SEQ ID NO:15或SEQ ID NO:17中所列核苷酸序列的DNA分子来自拟南芥。在另一优选实施方案中,前文所提及的DNA分子被修饰以致内源蛋白质的活性丧失。在本发明的特殊实施方案中,所说的DNA修饰导致相应蛋白质的氨基酸序列中发生以下改变中的一种、全部或多种改变的组合:A DNA molecule encoding the Mlo protein that confers resistance to fungal pathogens in plants, wherein said protein comprises at least one amino acid sequence identical or substantially similar to the amino acid sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 2, wherein Said DNA molecules are preferably cDNA molecules. In a preferred embodiment, the DNA molecule is preferably not from barley but from a dicotyledonous plant or the following plants: wheat, maize, rice, oats, rye, sorghum, sugar cane, millet, milo and palmaceae. In a preferred embodiment, the DNA molecule of the present invention is identical or substantially similar to any one of the nucleotide sequences listed in SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7, or the encoding is identical to that of SEQ ID NO: 7. A Mlo protein that is identical or substantially similar to the Mlo protein listed in ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8. In a more preferred embodiment, the DNA molecule comprising the nucleotide sequence set forth in SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 is from wheat. In another preferred embodiment, the DNA molecule of the present invention is combined with any one of listed in SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15 or SEQ ID NO: 17 The nucleotide sequence is identical or substantially similar, or the encoding is any one of the nucleotide sequences listed in SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16 or SEQ ID NO: 18 Mlo proteins that encode the same or substantially similar Mlo proteins. In a more preferred embodiment, the DNA molecule comprising the nucleotide sequence set forth in SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15 or SEQ ID NO: 17 is from Arabidopsis mustard. In another preferred embodiment, the aforementioned DNA molecule is modified such that the activity of the endogenous protein is lost. In a particular embodiment of the invention, said DNA modification results in one, all or a combination of the following changes in the amino acid sequence of the corresponding protein:

-色氨酸(163)改变为精氨酸- tryptophan (163) changed to arginine

-脯氨酸(396)后移码- Proline (396) post-frameshift

-色氨酸(160)后移码-Tryptophan (160) post-frameshift

-甲硫氨酸(1)改变为异亮氨酸- Methionine (1) changed to Isoleucine

-甘氨酸(227)改变为天冬氨酸- Glycine (227) changed to aspartic acid

-甲硫氨酸(1)改变为缬氨酸-Methionine (1) changed to valine

-精氨酸(11)改变为色氨酸- Arginine (11) changed to tryptophan

-丢失苯丙氨酸(183)、苏氨酸(184)- loss of phenylalanine (183), threonine (184)

-缬氨酸(31)改变为谷氨酸- Change of valine (31) to glutamic acid

-丝氨酸(32)改变为苯丙氨酸- Serine (32) changed to phenylalanine

-亮氨酸(271)改变为组氨酸。- Leucine (271) was changed to histidine.

在进一步优选的实施方案中,真菌病原体优选感染活表皮细胞,更优选真菌病原体来自白粉菌目(还已知为白粉菌),尤其是来自白粉菌属,最优选真菌病原体是禾白粉菌。In a further preferred embodiment, the fungal pathogen preferably infects living epidermal cells, more preferably the fungal pathogen is from the order Erysiphales (also known as Erysiphales), especially from the genus Erysipha, most preferably the fungal pathogen is Erysipha graminis.

在另外的实施方案中,分离的DNA分子是与上述分离分子反义的分子,如与编码包含和SEQ ID NO:1或SEQ ID NO:2中所列氨基酸序列相同或基本相似的至少一种氨基酸序列的Mlo蛋白质的DNA分子,如cDNA分子反义的分子,尤其是与和SEQ ID NO:3、5、7、9、11、13、15或17中所示DNA分子相同或基本相似、且所编码Mlo蛋白质与SEQ ID NO:4、6、8、10、12、14、16或18所列Mlo蛋白质相同或基本相似的DNA分子反义的DNA分子。In another embodiment, the isolated DNA molecule is a molecule that is antisense to the isolated molecule described above, such as encoding at least one amino acid sequence identical to or substantially similar to that set forth in SEQ ID NO: 1 or SEQ ID NO: 2. A DNA molecule of amino acid sequence Mlo protein, such as a molecule antisense to a cDNA molecule, in particular identical or substantially similar to the DNA molecule shown in SEQ ID NO: 3, 5, 7, 9, 11, 13, 15 or 17, And the encoded Mlo protein is an antisense DNA molecule that is identical or substantially similar to the Mlo protein listed in SEQ ID NO: 4, 6, 8, 10, 12, 14, 16 or 18.

本发明进一步提供了:The present invention further provides:

至少包含一种与SEQ ID NO:1或SEQ ID NO:2中所列氨基酸序列相同或基本相似的氨基酸序列的蛋白质,其中所说的蛋白质是Mlo蛋白质且赋予植物对真菌病原体的抗性。该蛋白质优选不来自大麦,而是来自双子叶植物或以下植物:小麦、玉米、稻、燕麦、黑麦、高粱、甘蔗、小米、买罗高粱和棕榈科。在优选的实施方案中,本发明的蛋白质由与SEQ ID NO:3、SEQ ID NO:5或SEQ ID NO:7中所列的任一种核苷酸序列相同或基本相似的核苷酸序列编码,或者与SEQID NO:4、SEQ ID NO:6或SEQ ID NO:8中所列任一种Mlo蛋白质相同或基本相似。在更优选的实施方案中,所说的蛋白质来自小麦。在另一优选的实施方案中,本发明的蛋白质由与SEQ ID NO:9、SEQ IDNO:11、SEQ ID NO:13、SEQ ID NO:15或SEQ ID NO:17中所列的任一种核苷酸序列相同或基本相似的核苷酸序列编码,或者与SEQ IDNO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16或SEQ IDNO:18中所列任一种Mlo蛋白质相同或基本相似。在更优选的实施方案中,该蛋白质来自拟南芥。在另一优选的实施方案中,优选真菌病原体感染活表皮细胞,更优选真菌病原体来自白粉菌目(还已知为白粉菌),尤其是来自白粉菌属,最优选真菌病原体是禾白粉菌。在进一步的优选实施方案中,本发明还包括由上述任一种DNA分子编码的蛋白质的突变形式或截短形式。A protein comprising at least one amino acid sequence identical or substantially similar to that set forth in SEQ ID NO: 1 or SEQ ID NO: 2, wherein said protein is an Mlo protein and confers resistance to fungal pathogens in plants. The protein is preferably not from barley, but from a dicotyledonous plant or the following plants: wheat, maize, rice, oats, rye, sorghum, sugar cane, millet, milo and palmaceae. In a preferred embodiment, the protein of the present invention consists of a nucleotide sequence identical or substantially similar to any one of the nucleotide sequences listed in SEQ ID NO: 3, SEQ ID NO: 5 or SEQ ID NO: 7 Coding, or identical or substantially similar to any one of the Mlo proteins listed in SEQ ID NO: 4, SEQ ID NO: 6 or SEQ ID NO: 8. In a more preferred embodiment, said protein is from wheat. In another preferred embodiment, the protein of the present invention is composed of any one of listed in SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15 or SEQ ID NO: 17 Nucleotide sequence codes with identical or substantially similar nucleotide sequences, or any of those listed in SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16 or SEQ ID NO: 18 The Mlo proteins are identical or substantially similar. In a more preferred embodiment, the protein is from Arabidopsis. In another preferred embodiment, it is preferred that the fungal pathogen infects living epidermal cells, more preferably the fungal pathogen is from the order Erysiphales (also known as Erysiphales), especially from the genus Erysipha, most preferably the fungal pathogen is Erysiphe graminis. In a further preferred embodiment, the present invention also includes mutant or truncated forms of the proteins encoded by any of the DNA molecules described above.

本发明进一步提供了:The present invention further provides:

含上述任一种DNA分子,如上述cDN A的表达盒,其中所说的DNA分子与启动子和能在植物中表达该DNA分子的终止信号有效连接。在优选的实施方案中,所说的表达盒是异源的。在更优选的实施方案中,启动子和终止信号是真核生物的。在更优选的实施方案中,启动子和终止信号相对于编码区而言是异源的。An expression cassette comprising any of the aforementioned DNA molecules, such as the aforementioned cDNA, wherein said DNA molecule is operably linked to a promoter and a termination signal capable of expressing said DNA molecule in a plant. In preferred embodiments, said expression cassette is heterologous. In more preferred embodiments, the promoter and termination signal are eukaryotic. In a more preferred embodiment, the promoter and termination signal are heterologous with respect to the coding region.

本发明还提供了:The present invention also provides:

含上述任一种表达盒的载体。在优选的实施方案中,该载体用于植物中所说表达盒的转化。在另一优选的实施方案中,本发明载体用于任一种上述DNA分子的扩增。A vector containing any of the above expression cassettes. In a preferred embodiment, the vector is used for the transformation of said expression cassette in plants. In another preferred embodiment, the vector of the present invention is used for the amplification of any one of the aforementioned DNA molecules.

本发明还提供了:The present invention also provides:

包括含本发明分离DNA分子之表达盒或其部分的细胞,其中所说表达盒内的该DNA分子在所说细胞内是可表达的。在优选的实施方案中,DNA分子不来自大麦。在另一优选实施方案中,该细胞是植物细胞。在更优选的实施方案中,该表达盒稳定整合入该细胞的基因组中,或包含在自主复制载体内并作为染色体外分子保留在细胞中。A cell comprising an expression cassette or portion thereof comprising an isolated DNA molecule of the invention, wherein the DNA molecule within said expression cassette is expressible in said cell. In a preferred embodiment, the DNA molecule is not from barley. In another preferred embodiment, the cell is a plant cell. In more preferred embodiments, the expression cassette is stably integrated into the genome of the cell, or contained within an autonomously replicating vector and retained in the cell as an extrachromosomal molecule.

本发明进一步提供了:The present invention further provides:

包含具有本发明分离DNA分子之表达盒或其部分的植物。在优选的实施方案中,该DNA分子不来自大麦。在另一优选实施方案中,表达盒内所含的DNA分子在植物中是可表达的。在另一优选的实施方案中,所述DNA分子稳定整合入植物基因组中,或包含在自我复制载体内并作为染色体外分子保留在细胞中。在另一优选的实施方案中,植物可抗真菌病原体,优选感染活表皮细胞的真菌病原体,更优选真菌病原体来自白粉菌目(还已知为白粉菌),尤其是来自白粉菌属,最优选真菌病原体是禾白粉菌。A plant comprising an expression cassette or part thereof having an isolated DNA molecule of the invention. In a preferred embodiment, the DNA molecule is not from barley. In another preferred embodiment, the DNA molecule contained within the expression cassette is expressible in plants. In another preferred embodiment, said DNA molecule is stably integrated into the plant genome, or contained within a self-replicating vector and retained in the cell as an extrachromosomal molecule. In another preferred embodiment, the plant is resistant to a fungal pathogen, preferably a fungal pathogen that infects living epidermal cells, more preferably a fungal pathogen from the order Erysiphales (also known as Erysiphales), especially from the genus Erysipha, most preferably The fungal pathogen is Erysiphe graminis.

本发明还涉及所说植物的种子,其中种子任选地已被处理(如接触过抗原的或被包被的)和/或包装,如放置于袋内并附有使用说明。The invention also relates to seeds of said plants, wherein the seeds have optionally been treated (eg contacted or coated) and/or packaged, eg placed in a bag with instructions for use.

本发明还提供了:The present invention also provides:

包括含本发明分离DNA分子之植物的农产品。在优选的实施方案中,所说的农产品用做如饲料、食品或青贮饲料,其中不含真菌病原体产生的真菌毒素,如黄曲霉毒素。因此,所说的农产品具有改善的植物检疫特性。Agricultural products including plants comprising the isolated DNA molecules of the invention. In a preferred embodiment, said agricultural product is used as feed, food or silage, free of mycotoxins such as aflatoxins produced by fungal pathogens. Thus, said agricultural product has improved phytosanitary properties.

本发明进一步提供了:The present invention further provides:

制备对真菌病原体有抗性的植物的方法,包括以下步骤:A method for preparing plants resistant to fungal pathogens, comprising the steps of:

a)在植物中以“有义”方向表达由上述任一种DNA分子编码的RNA转录物;或a) expressing in a plant an RNA transcript encoded by any of the DNA molecules described above in a "sense" orientation; or

b)在植物中以“反义”方向表达由上述任一种DNA分子编码的RNA转录物;或b) express in plants an RNA transcript encoded by any of the above DNA molecules in an "antisense" orientation; or

c)在植物中表达能特异切割由相应于任一种上述DNA分子之内源基因编码的信使RNA转录物的核酶;或c) expressing in a plant a ribozyme capable of specifically cleaving a messenger RNA transcript encoded by an endogenous gene corresponding to any of the aforementioned DNA molecules; or

d)在植物中表达特异于相应于任一种上述DNA分子之基因所编码的内源蛋白质的aptamer;或d) express in a plant an aptamer specific for an endogenous protein encoded by a gene corresponding to any one of the aforementioned DNA molecules; or

e)在植物中表达上述任一种DNA分子的突变或截短形式,从而使其能成为显性失活突变体;或e) expressing in plants a mutated or truncated form of any of the aforementioned DNA molecules, thereby enabling it to become a dominant negative mutant; or

f)在植物中通过同源重组修饰相应于上述任一种DNA分子的基因的至少一个染色体拷贝;或f) modifying in a plant by homologous recombination at least one chromosomal copy of a gene corresponding to any of the DNA molecules described above; or

g)在植物中通过同源重组修饰相应于上述任一种DNA分子的基因的调节元件至少一个染色体拷贝。g) modifying in a plant by homologous recombination at least one chromosomal copy of a regulatory element of a gene corresponding to any one of the DNA molecules described above.

本发明进一步提供了:The present invention further provides:

通过上述任一种方法获得的植物,包括所说植物的种子,其中种子任选地已被处理(如接触过抗原的或被包被的)和/或包装,如放置于袋内并附有使用说明。在另一优选的实施方案中,所获植物可抗真菌病原体,优选为感染活表皮细胞的真菌病原体,更优选真菌病原体来自白粉菌目(还已知为白粉菌),尤其是来自白粉菌属,最优选真菌病原体是禾白粉菌。Plants obtained by any of the above methods, including seeds of said plants, wherein the seeds have optionally been treated (e.g. contacted or coated) and/or packaged, e.g. placed in a bag with Instructions for use. In another preferred embodiment, the plants obtained are resistant to fungal pathogens, preferably fungal pathogens infecting living epidermal cells, more preferably fungal pathogens from the order Erysiphales (also known as Erysiphales), especially from the genus Erysipha , most preferably the fungal pathogen is Erysiphe graminis.

本发明还提供了:The present invention also provides:

用上述任一种方法获得的具改善的植物检疫特性的农产品。Agricultural products with improved phytosanitary properties obtained by any of the above methods.

本发明进一步提供了:The present invention further provides:

分离编码Mlo蛋白质之DNA分子的方法,包括以下步骤:A method for isolating a DNA molecule encoding an Mlo protein, comprising the steps of:

a)将编码SEQ ID NO:1中至少6个氨基酸的简并寡核苷酸和互补于编码SEQ ID NO:2中至少6个氨基酸之序列的简并寡核苷酸与提取自植物的DNA,在允许所说简并寡核苷酸与所述DNA杂交的条件下混合;a) combining a degenerate oligonucleotide encoding at least 6 amino acids in SEQ ID NO: 1 and a degenerate oligonucleotide complementary to a sequence encoding at least 6 amino acids in SEQ ID NO: 2 with DNA extracted from a plant , mixing under conditions that allow said degenerate oligonucleotides to hybridize to said DNA;

b)扩增所说植物DNA的DNA片段,其中所说的DNA片段在其左和右末端处含可与步骤a)中所说简并寡核苷酸退火的核苷酸序列;和b) amplifying a DNA fragment of said plant DNA, wherein said DNA fragment contains a nucleotide sequence capable of annealing to said degenerate oligonucleotide in step a) at its left and right ends; and

c)获得含步骤b)之DNA片段的全长cDNA克隆。c) Obtaining a full-length cDNA clone containing the DNA fragment of step b).

本发明还提供了:The present invention also provides:

通过“体外重组”或“DNA重排”生产本发明核苷酸序列的突变拷贝的方法。本发明核苷酸序列的突变拷贝可用于赋与更高的真菌病原体抗性。在优选的实施方案中,本发明核苷酸序列的突变拷贝用于使植物抵抗更宽范围的病原体。这样的一种方法描述如下:A method of producing mutated copies of the nucleotide sequences of the invention by "in vitro recombination" or "DNA rearrangement". Mutated copies of the nucleotide sequences of the invention can be used to confer greater resistance to fungal pathogens. In a preferred embodiment, mutated copies of the nucleotide sequences of the invention are used to render plants resistant to a wider range of pathogens. One such method is described as follows:

诱变本发明DNA分子的方法,其中所说的DNA分子已被切割成预期大小的双链随机片段,该方法包括以下步骤:A method for mutagenizing a DNA molecule of the present invention, wherein said DNA molecule has been cut into double-stranded random fragments of expected size, the method comprising the following steps:

a)向所得双链随机片段群中添加一或多种单链或双链寡核苷酸,其中所说寡核苷酸包含与双链模板多核苷酸相同的区域和异源的区域;a) adding to the resulting population of double-stranded random fragments one or more single-stranded or double-stranded oligonucleotides, wherein said oligonucleotides comprise regions identical to those of the double-stranded template polynucleotide and regions that are heterologous;

b)将所得双链随机片段和寡核苷酸的混合物变性成单链片段;b) denaturing the resulting mixture of double-stranded random fragments and oligonucleotides into single-stranded fragments;

c)在导致所说的单链片段在所说的相同区域退火而形成退火片段配对的条件下,将所得单链片段群与聚合酶保温,其中所说的相同区域足以使配对中的一成员引发另一成员的复制,从而形成经诱变的双链多核苷酸;并c) incubating the resulting population of single-stranded fragments with a polymerase under conditions that cause said single-stranded fragments to anneal at said identical region sufficient for one member of the pair to form an annealed fragment pair triggering the replication of another member, thereby forming a mutagenized double-stranded polynucleotide; and

d)再重复第二和第三步骤至少两个循环,其中下一循环中的第二步骤产生的混合物包括来自前一循环第三步骤的经诱变双链多核苷酸,并且该下一循环形成了进一步诱变的双链多核苷酸。d) repeating the second and third steps for at least two more cycles, wherein the mixture produced by the second step in the next cycle includes the mutagenized double-stranded polynucleotide from the third step of the previous cycle, and the next cycle A further mutagenized double-stranded polynucleotide is formed.

定义definition

“分离DNA分子”是因人工操作而存在于其天然环境之外、因此非天然产物的核苷酸序列。分离核苷酸序列可以纯化形式存在,或存在于如转基因宿主细胞之类的非天然环境中。An "isolated DNA molecule" is a nucleotide sequence that has been removed from its natural environment by human manipulation and is therefore not a natural product. An isolated nucleotide sequence may exist in purified form, or in a non-native environment such as a transgenic host cell.

本文限定的“蛋白质”是由相应核苷酸序列编码的完整蛋白质,或由核苷酸序列的相应部分编码的蛋白质部分。A "protein" as defined herein is the complete protein encoded by the corresponding nucleotide sequence, or a protein part encoded by the corresponding part of the nucleotide sequence.

“分离蛋白质”是由分离核苷酸序列编码的蛋白质,因而非天然产物。分离蛋白质可以纯化形式存在,或存在于非天然环境中,如转基因宿主细胞中,其中所说的蛋白质在等基因的非转基因宿主细胞中通常不表达或者以不同的形式或不同的量表达。An "isolated protein" is a protein encoded by an isolated nucleotide sequence and thus is not a natural product. An isolated protein may be present in purified form, or in a non-native environment, such as a transgenic host cell, where the protein is not normally expressed or is expressed in a different form or in a different amount than in an isogenic, non-transgenic host cell.

“抗真菌病原体”的植物通过抑制或限制真菌病原体在植物上生长的能力而无或较少有由该真菌引起的真菌感染症状。结果植物生长得更好,具有较高的产量且产生更多的种子。A plant that is "resistant to a fungal pathogen" exhibits no or fewer symptoms of fungal infection caused by the fungus by inhibiting or limiting the ability of the fungal pathogen to grow on the plant. As a result the plants grow better, have higher yields and produce more seeds.

“赋予植物对真菌病原体的抗性的蛋白质”表示蛋白质涉及负责植物对真菌病原体的抗性的植物遗传途径的调控。该蛋白质在其中可以是正调节物,可增强植物对真菌病原体的抗性,或者该蛋白质在其中可以是负调节物,可抑制植物对真菌病原体的抗性。赋予植物对真菌病原体的抗性的蛋白质一个特例是Mlo蛋白质。"Protein conferring resistance to fungal pathogens in plants" means proteins involved in the regulation of plant genetic pathways responsible for the resistance of plants to fungal pathogens. The protein may be a positive regulator in which the plant's resistance to the fungal pathogen is enhanced, or the protein may be a negative regulator in which the plant's resistance to the fungal pathogen is suppressed. A special case of a protein that confers resistance to fungal pathogens in plants is the Mlo protein.

本文的“Mlo蛋白质”指在疾病抗性途径中具基本相似功能并有一些结构同源性的蛋白质家族(Mlo家族)成员。结构同源性可以是,如,该家族成员具有至少一个保守区。"Mlo protein" herein refers to members of a family of proteins (Mlo family) with substantially similar functions and some structural homology in disease resistance pathways. Structural homology can be, eg, that the family members have at least one conserved region.

在最广的意义上说,术语“基本相似”在此用于核苷酸序列时,表示与参照核苷酸序列相当的核苷酸序列,其中该相当的序列编码与参照核苷酸序列编码的多肽具基本相同的结构和功能的多肽,如,只在不影响多肽功能的氨基酸处发生了改变。优选基本相似的核苷酸序列编码由参照核苷酸序列编码的多肽。基本相似的核苷酸序列与参照核苷酸序列之间的相同性百分率至少是80%,较好是至少85%,优选至少是90%,更优选至少为95%,还更优选至少为99%。In the broadest sense, the term "substantially similar" when applied to nucleotide sequences refers to a nucleotide sequence equivalent to a reference nucleotide sequence, wherein the equivalent sequence encodes the same sequence as the reference nucleotide sequence. A polypeptide having substantially the same structure and function, eg, only changes in amino acids that do not affect the function of the polypeptide. Preferably, substantially similar nucleotide sequences encode polypeptides encoded by the reference nucleotide sequence. The percent identity between the substantially similar nucleotide sequence and the reference nucleotide sequence is at least 80%, preferably at least 85%, preferably at least 90%, more preferably at least 95%, still more preferably at least 99% %.

术语“基本相似”在此用于蛋白质时,表示与参照蛋白质相当的蛋白质,其中所述蛋白质具有与参照蛋白质基本相同的结构和功能,如,只在不影响多肽功能的氨基酸处发生了改变。当用于蛋白质或氨基酸序列时,基本相似的蛋白质或氨基酸序列与参照蛋白质或氨基酸序列之间的相同性百分率至少是80%,较好是至少85%,优选至少是90%,更优选至少为95%,还更优选至少为99%。The term "substantially similar" when used herein for a protein means a protein that is comparable to a reference protein, wherein the protein has substantially the same structure and function as the reference protein, eg, only amino acids that do not affect the function of the polypeptide are changed. When applied to protein or amino acid sequences, the percent identity between a substantially similar protein or amino acid sequence and a reference protein or amino acid sequence is at least 80%, preferably at least 85%, preferably at least 90%, more preferably at least 95%, still more preferably at least 99%.

用基于动力学编程算法的计算机软件确定序列相同性的百分率。在本发明范围内优选的计算机软件包括BLAST(基本的局部序列对比检索工具)检索软件,它被设计成不管待查询的是蛋白质或DNA均可检索所有可获得的序列数据库。此检索工具的BLAST 2.0版本(缺刻BLAST)已公开于互联网上(目前是http:∥www.ncbi.nlm.nih.gov/BLAST/)。其中使用采用局部序列对比而非完整序列对比的渐进式算法,因而能探测只共有分离区域的序列间的关系。BLAST检索中指定的分值具有明确的统计学解释。所说的软件优选用设定为默认值的任选参数运行。The percent sequence identity is determined using computer software based on a kinetic programming algorithm. Preferred computer software within the scope of the present invention includes BLAST (Basic Local Alignment Search Tool) search software, which is designed to search all available sequence databases regardless of whether the query is protein or DNA. The BLAST 2.0 version of this search tool (Notched BLAST) has been published on the Internet (currently http:∥www.ncbi.nlm.nih.gov/BLAST/). It uses a progressive algorithm that employs partial sequence alignments rather than full sequence alignments, thereby enabling the detection of relationships between sequences that share only discrete regions. Scores assigned in BLAST searches have clear statistical interpretations. The software is preferably run with optional parameters set to default values.

术语“基因”指编码序列和相关调节序列,其中编码序列被转录成RNA,如mRNA、rRNA、tRNA、snRNA、有义RNA或反义RNA。调节序列的例子是启动子序列、5’和3’非翻译序列和终止序列。另外可存在的元件是,例如内含子。The term "gene" refers to a coding sequence and associated regulatory sequences, wherein the coding sequence is transcribed into RNA, such as mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 3' untranslated sequences and termination sequences. Additional elements that may be present are, for example, introns.

“表达”指植物中内源基因或转基因的转录和/或翻译。例如,在反义构建体情形下,表达可只指反义DNA的转录。"Expression" refers to the transcription and/or translation of an endogenous or transgene in a plant. For example, in the case of an antisense construct, expression may simply refer to transcription of the antisense DNA.

“表达盒”用于此处表示能指导特殊核苷酸序列在适当宿主细胞中表达的DNA序列,包含与该目的核苷酸序列有效连接的启动子,该目的核苷酸序列还与终止信号有效连接。其中通常还包含核苷酸序列正确翻译所要求的序列。编码区通常编码目的蛋白质,但也可编码有功能的目的RNA,例如反义RNA或在有义或反义方向抑制特殊基因表达的非翻译RNA,如反义RNA。含目的核苷酸序列的表达盒可以是嵌合的,意思是其中至少一种组分与其另一种组分异源。表达盒也可是天然存在的,但已按对异源表达有用的重组形式获得的表达盒。不过,一般地说,表达盒与宿主是异源的,即表达盒内的特殊DNA序列不是天然存在于宿主细胞中的,必须用转化方式引入宿主细胞或宿主细胞前体。表达盒中核苷酸序列的表达可在组成型启动子或只有当宿主细胞暴露于某些特殊外部刺激时才引发转录的诱导型启动子控制下。在多细胞生物体,如植物的情况下,启动子还可以是特殊组织或器官或发育阶段特异性的。"Expression cassette" is used herein to mean a DNA sequence capable of directing the expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operably linked to the nucleotide sequence of interest in combination with a termination signal active connection. It usually also contains sequences required for correct translation of the nucleotide sequence. The coding region usually encodes a protein of interest, but may also encode a functional RNA of interest, such as an antisense RNA or an untranslated RNA, such as an antisense RNA, that inhibits expression of a particular gene in a sense or antisense orientation. An expression cassette containing a nucleotide sequence of interest may be chimeric, meaning that at least one of its components is heterologous to another. An expression cassette may also be one that occurs in nature but has been obtained in a recombinant form useful for heterologous expression. However, generally speaking, the expression cassette is heterologous to the host, that is, the specific DNA sequence in the expression cassette does not naturally exist in the host cell, and must be introduced into the host cell or host cell precursor by transformation. Expression of the nucleotide sequence in the expression cassette may be under the control of a constitutive promoter or an inducible promoter which initiates transcription only when the host cell is exposed to certain specific external stimuli. In the case of multicellular organisms, such as plants, the promoter may also be specific for a particular tissue or organ or developmental stage.

此处所用“异源的”表示“不同的天然或合成来源”或代表非天然状态。例如,如果用来自另一生物体,尤其是另一物种的核酸序列转化宿主细胞,则该基因与宿主细胞异源,还与带此基因的宿主细胞后代异源。转化的核酸可包含异源启动子、异源编码序列或异源终止序列。或者,转化核酸可以是完全异源的,或可包含任何可能的异源和内源核酸序列组合。同样地,异源也指来自同一天然原始细胞类型并插入其中,但以非天然状态出现的核酸序列,如不同的拷贝数,或在不同调节元件的控制下。"Heterologous" as used herein means "of a different natural or synthetic origin" or to represent a non-natural state. For example, if a host cell is transformed with a nucleic acid sequence from another organism, especially another species, the gene is heterologous to the host cell and also heterologous to the progeny of the host cell carrying the gene. The transforming nucleic acid may comprise a heterologous promoter, heterologous coding sequence or heterologous termination sequence. Alternatively, the transforming nucleic acid may be entirely heterologous, or may comprise any possible combination of heterologous and endogenous nucleic acid sequences. Likewise, heterologous also refers to nucleic acid sequences inserted into and derived from the same native cell type of origin, but in a non-native state, such as a different copy number, or under the control of different regulatory elements.

术语“启动子”指可起始相关DNA序列转录的DNA序列。启动子区域还包含可作为诸如激活子、增强子和/或阻抑物之类基因表达调节物的元件。The term "promoter" refers to a DNA sequence that initiates transcription of the associated DNA sequence. The promoter region also contains elements that act as regulators of gene expression such as activators, enhancers and/or repressors.

“合成的核苷酸序列”用于此处表示含有天然序列中不存在的结构特征的核苷酸序列。例如,更近似于双子叶植物和/或单子叶植物基因的G+C含量和正常密码子分布的人工序列就称作合成的。"Synthetic nucleotide sequence" is used herein to mean a nucleotide sequence that contains structural features not found in the native sequence. For example, an artificial sequence that more closely approximates the G+C content and normal codon distribution of dicot and/or monocot genes is said to be synthetic.

如果两序列所处的位置使得DNA调节序列影响DNA编码序列的表达,即称调节DNA序列与编码RNA或蛋白质的DNA序列“有效连接”或“相关”。A regulatory DNA sequence is said to be "operably linked" or "associated" with a DNA sequence encoding an RNA or protein if the two sequences are located such that the DNA regulatory sequence affects the expression of the DNA coding sequence.

“调节元件”指涉及核苷酸序列表达的序列。调节元件含与目的核苷酸序列有效连接的启动子和终止信号。它们通常还包括核苷酸序列正确翻译需要的序列。"Regulatory element" refers to a sequence involved in the expression of a nucleotide sequence. Regulatory elements comprise a promoter and a termination signal operably linked to a nucleotide sequence of interest. They also generally include sequences required for proper translation of the nucleotide sequence.

“植物”指任何植物或植物的一部分,尤其是任何发育阶段的种子植物。其中还包括插条、细胞或组织培养物和种子。当用于本发明时,术语“植物组织”包括但不局限于整株植物、植物器官、植物种子、原生质体、愈伤组织、细胞培养物以及组织成结构和/或功能单元的任何植物细胞群。"Plant" means any plant or part of a plant, especially a seed plant at any stage of development. These also include cuttings, cell or tissue cultures and seeds. As used herein, the term "plant tissue" includes, but is not limited to, whole plants, plant organs, plant seeds, protoplasts, callus, cell cultures, and any plant cells organized into structural and/or functional units group.

“植物细胞”指植物的结构和生理单位,包含原生质体和细胞壁。植物细胞可以是分离的单个细胞或人工培养细胞的形式,或作为如植物组织或植物器官之类较高级组织单元的一部分。"Plant cell" refers to the structural and physiological unit of a plant, comprising the protoplast and cell wall. Plant cells may be in the form of isolated single cells or cultured cells, or as part of higher organizational units such as plant tissues or plant organs.

“转化”用于此处表示将核酸引入细胞内。尤其是,将DNA分子稳定整合入目的生物体的基因组。"Transformation" is used herein to mean the introduction of a nucleic acid into a cell. In particular, the stable integration of the DNA molecule into the genome of the organism of interest.

“选择标记”是由在植物细胞中的表达带给细胞选择优势的基因。与非转化细胞的生长相比,用选择标记基因转化的细胞具有的选择优势可能是由于它们在负选择剂(如抗生素或除草剂)存在时生长的能力。与非转化细胞相比,转化细胞具有的选择优势还可能是由于它们具有利用所添加的化合物作为营养物、生长因子或能量来源的增强的或新的能力。选择标记基因还指在植物细胞中的表达带给细胞正和负选择优势的基因或基因组合。A "selectable marker" is a gene whose expression in a plant cell confers a selective advantage on the cell. The selective advantage possessed by cells transformed with a selectable marker gene compared to the growth of non-transformed cells may be due to their ability to grow in the presence of negative selection agents such as antibiotics or herbicides. Transformed cells may also have a selective advantage over non-transformed cells due to their enhanced or novel ability to utilize the added compound as a source of nutrients, growth factors or energy. A selectable marker gene also refers to a gene or combination of genes whose expression in a plant cell confers positive and negative selective advantages on the cell.

“筛选标记”是由其表达未带给转化细胞选择优势,但使转化细胞的表型与未转化细胞截然不同的基因赋予的。A "selectable marker" is conferred by a gene whose expression does not confer a selective advantage on transformed cells, but makes transformed cells phenotype distinct from non-transformed cells.

序列表中的序列简述Sequence Brief Description in Sequence Listing

SEQ ID NO:1  保守氨基酸序列1SEQ ID NO: 1 Conserved amino acid sequence 1

SEQ ID NO:2  保守氨基酸序列2SEQ ID NO: 2 Conserved amino acid sequence 2

SEQ ID NO:3  小麦Mlo蛋白质TrMlo1的核苷酸序列SEQ ID NO: 3 Nucleotide sequence of wheat Mlo protein TrMlo1

SEQ ID NO:4  TrMlo1的蛋白质序列SEQ ID NO: 4 Protein sequence of TrMlo1

SEQ ID NO:5  小麦Mlo蛋白质TrMlo2的核苷酸序列SEQ ID NO: 5 Nucleotide sequence of wheat Mlo protein TrMlo2

SEQ ID NO:6  TrMlo2的蛋白质序列SEQ ID NO:6 Protein sequence of TrMlo2

SEQ ID NO:7  小麦Mlo蛋白质TrMlo3的核苷酸序列SEQ ID NO: 7 Nucleotide sequence of wheat Mlo protein TrMlo3

SEQ ID NO:8  TrMlo3的蛋白质序列SEQ ID NO:8 Protein sequence of TrMlo3

SEQ ID NO:9  拟南芥属Mlo蛋白质CIB10259的核苷酸序列SEQ ID NO: 9 Nucleotide sequence of Arabidopsis Mlo protein CIB10259

SEQ ID NO:10  CIB10259的蛋白质序列SEQ ID NO: 10 Protein sequence of CIB10259

SEQ ID NO:11  拟南芥属Mlo蛋白质CIB10295的核苷酸序列SEQ ID NO: 11 Nucleotide sequence of Arabidopsis Mlo protein CIB10295

SEQ ID NO:12  CIB10295的蛋白质序列SEQ ID NO:12 Protein sequence of CIB10295

SEQ ID NO:13  拟南芥属Mlo蛋白质CIB10296的核苷酸序列SEQ ID NO: 13 Nucleotide sequence of Arabidopsis Mlo protein CIB10296

SEQ ID NO:14  CIB10296的蛋白质序列SEQ ID NO:14 Protein sequence of CIB10296

SEQ ID NO:15  拟南芥属Mlo蛋白质F19850的核苷酸序列SEQ ID NO: 15 Nucleotide sequence of Arabidopsis Mlo protein F19850

SEQ ID NO:16  F19850的蛋白质序列Protein sequence of SEQ ID NO: 16 F19850

SEQ ID NO:17  拟南芥属Mlo蛋白质U95973的核苷酸序列SEQ ID NO: 17 Nucleotide sequence of Arabidopsis Mlo protein U95973

SEQ ID NO:18  U95973的蛋白质序列Protein sequence of SEQ ID NO: 18 U95973

SEQ ID NO:19  寡核苷酸MLO-1SEQ ID NO: 19 Oligonucleotide MLO-1

SEQ ID NO:20  寡核苷酸MLO-3SEQ ID NO:20 Oligonucleotide MLO-3

SEQ ID NO:21  寡核苷酸MLO-5SEQ ID NO: 21 Oligonucleotide MLO-5

SEQ ID NO:22  寡核苷酸MLO-7SEQ ID NO:22 Oligonucleotide MLO-7

SEQ ID NO:23  寡核苷酸MLO-10SEQ ID NO:23 Oligonucleotide MLO-10

SEQ ID NO:24  寡核苷酸MLO-15SEQ ID NO: 24 Oligonucleotide MLO-15

SEQ ID NO:25  寡核苷酸MLO-26SEQ ID NO:25 Oligonucleotide MLO-26

SEQ ID NO:26  寡核苷酸MLO-GSP1SEQ ID NO: 26 Oligonucleotide MLO-GSP1

SEQ ID NO:27  寡核苷酸MLO-GSP2SEQ ID NO:27 Oligonucleotide MLO-GSP2

SEQ ID NO:28  寡核苷酸ST27SEQ ID NO: 28 Oligonucleotide ST27

SEQ ID NO:29  寡核苷酸N37544-1SEQ ID NO:29 Oligonucleotide N37544-1

SEQ ID NO:30  寡核苷酸N37544-2SEQ ID NO: 30 Oligonucleotide N37544-2

SEQ ID NO:31  寡核苷酸T22146-1SEQ ID NO: 31 Oligonucleotide T22146-1

SEQ ID NO:32  寡核苷酸T22146-2SEQ ID NO: 32 Oligonucleotide T22146-2

SEQ ID NO:33  寡核苷酸H76041-1SEQ ID NO: 33 Oligonucleotide H76041-1

SEQ ID NO:34  寡核苷酸H76041-2SEQ ID NO: 34 Oligonucleotide H76041-2

SEQ ID NO:35  寡核苷酸SAS-1SEQ ID NO: 35 Oligonucleotide SAS-1

SEQ ID NO:36  寡核苷酸SAS-2SEQ ID NO:36 Oligonucleotide SAS-2

SEQ ID NO:37  寡核苷酸SAS-3SEQ ID NO: 37 Oligonucleotide SAS-3

SEQ ID NO:38  寡核苷酸SAS-4SEQ ID NO: 38 Oligonucleotide SAS-4

SEQ ID NO:39  寡核苷酸SAS-5SEQ ID NO:39 Oligonucleotide SAS-5

SEQ ID NO:40  寡核苷酸SAS-6SEQ ID NO: 40 Oligonucleotide SAS-6

SEQ ID NO:41  寡核苷酸SAS-7SEQ ID NO: 41 Oligonucleotide SAS-7

SEQ ID NO:42  寡核苷酸SAS-8SEQ ID NO: 42 Oligonucleotide SAS-8

保藏菌种 Preserved strains

所有的保藏菌种均保藏在北方地区研究中心,1815 NorthernUniversity Street,Peoria,Illinois 61604,USA。All deposited strains are deposited at the Northern Regional Research Center, 1815 Northern University Street, Peoria, Illinois 61604, USA.

本发明涉及编码Mlo蛋白质的DNA分子,它们赋予植物对真菌病原体的抗性。本发明的发明者首次鉴定了Mlo蛋白质中的保守氨基酸序列。本发明的保守氨基酸序列在来自小麦的三种Mlo蛋白质和来自拟南芥的三种Mlo蛋白质之间是保守的。在两种推测的拟南芥属Mlo蛋白质中这些氨基酸序列也是保守的。SEQ ID NO:1中所列的第一个保守氨基酸序列包括13个氨基酸。SEQ ID NO:1中的第四个氨基酸是L、V或I,第五个氨基酸是V或L,而第七个氨基酸是F或L。SEQ ID NO:1中的第十三个氨基酸不是I,而优选为T、S或A。SEQID NO:2中所列的第二个保守氨基酸序列包括14个氨基酸。SEQ IDNO:2中的第一个氨基酸不是M而优选为I、V、S或G。它的第三个氨基酸是F、L或V,它的第六个氨基酸是Y或N,它的第七个氨基酸是A或V,它的第八个氨基酸是L或I,它的第十个氨基酸是T或S。本发明包括含至少一种上述保守氨基酸序列的分离Mlo蛋白质和编码该Mlo蛋白质的分离DNA分子。本发明还包括含SEQ ID NO:1和SEQ ID NO:2中所列保守序列的分离Mlo蛋白质。在优选的实施方案中,编码本发明Mlo蛋白质的分离DNA分子是cDNA分子。The present invention relates to DNA molecules encoding Mlo proteins which confer resistance in plants against fungal pathogens. The inventors of the present invention identified for the first time a conserved amino acid sequence in the Mlo protein. The conserved amino acid sequences of the present invention are conserved among the three Mlo proteins from wheat and the three Mlo proteins from Arabidopsis. These amino acid sequences are also conserved in the two putative Arabidopsis Mlo proteins. The first conserved amino acid sequence listed in SEQ ID NO: 1 includes 13 amino acids. The fourth amino acid in SEQ ID NO: 1 is L, V or I, the fifth amino acid is V or L, and the seventh amino acid is F or L. The thirteenth amino acid in SEQ ID NO: 1 is not I, but preferably T, S or A. The second conserved amino acid sequence listed in SEQ ID NO: 2 includes 14 amino acids. The first amino acid in SEQ ID NO: 2 is not M but preferably I, V, S or G. Its third amino acid is F, L or V, its sixth amino acid is Y or N, its seventh amino acid is A or V, its eighth amino acid is L or I, and its tenth amino acid is amino acid is T or S. The present invention includes isolated Mlo proteins comprising at least one of the conserved amino acid sequences described above and isolated DNA molecules encoding the Mlo proteins. The present invention also includes the isolated Mlo protein containing the conserved sequences listed in SEQ ID NO: 1 and SEQ ID NO: 2. In a preferred embodiment, the isolated DNA molecule encoding the Mlo protein of the invention is a cDNA molecule.

在另一实施方案中,编码含至少一种所述保守氨基酸序列的Mlo蛋白质的DNA分子不是来自大麦的。在另一实施方案中,所说的DNA分子来自双子叶植物或来自小麦、玉米、稻、燕麦、黑麦、高粱、甘蔗、小米、买罗高粱或棕榈科。在优选的实施方案中,本发明的DNA分子与SEQ ID NO:3、5或7或者SEQ ID NO:9、11、13、15或17中所列DNA分子相同或基本相似,或编码与SEQ ID NO:4、6或8或者SEQ ID NO:10、12、14、16、18中所列的任一种Mlo蛋白质相同或基本相似的Mlo蛋白质。SEQ ID NO:3、5或7中所列的DNA分子来自小麦,分别编码SEQ ID NO:4、6或8中所列的Mlo蛋白质。这种DNA分子的分离进一步描述于实施例1中。SEQ ID NO:9、11、13、15或17中所列的DNA分子来自拟南芥属,分别编码SEQ IDNO:10、12、14、16或18中所列的Mlo蛋白质。这种DNA分子的分离进一步描述于实施例2中。In another embodiment, the DNA molecule encoding the Mlo protein comprising at least one of said conserved amino acid sequences is not from barley. In another embodiment, said DNA molecule is from a dicotyledonous plant or from wheat, maize, rice, oats, rye, sorghum, sugar cane, millet, milo or palmaceae. In a preferred embodiment, the DNA molecule of the present invention is identical or substantially similar to the DNA molecule listed in SEQ ID NO: 3, 5 or 7 or SEQ ID NO: 9, 11, 13, 15 or 17, or the encoding is identical to that of SEQ ID NO: 9, 11, 13, 15 or 17 A Mlo protein identical or substantially similar to any one of the Mlo proteins listed in ID NO: 4, 6 or 8 or SEQ ID NO: 10, 12, 14, 16, 18. The DNA molecule set forth in SEQ ID NO: 3, 5 or 7 is from wheat and encodes the Mlo protein set forth in SEQ ID NO: 4, 6 or 8, respectively. The isolation of this DNA molecule is further described in Example 1. The DNA molecule set forth in SEQ ID NO: 9, 11, 13, 15 or 17 is from Arabidopsis thaliana and encodes the Mlo protein set forth in SEQ ID NO: 10, 12, 14, 16 or 18, respectively. The isolation of this DNA molecule is further described in Example 2.

SEQ ID NO:3的编码被称为TrMlo 1之小麦Mlo蛋白质的DNA分子以TrMlo 1和TrMlo 1-5株系保藏,接收号分别为NRRL B-21948和NRRL B-21949。SEQ ID NO:5的编码被称为TrMlo 2之小麦Mlo蛋白质的DNA分子以TrMlo 2和TrMlo 2-5株系保藏,接收号分别为NRRL B-21950和NRRL B-21951。SEQ ID NO:7的编码被称为TrMlo3之小麦Mlo蛋白质的DNA分子以TrMlo 3和TrMlo 3-5株系保藏,接收号分别为NRRL B-21952和NRRL B-21953。TrMlo1和TrMlo3包含相应Mlo基因的全长cDNA,还包含相应5’和3’非翻译区的一部分。TrMlo2是被修复的相应基因的最长cDNA克隆。它包含整个编码区,只是缺少与TrMlo1和TrMlo3比较推断出的第一个甲硫氨酸(起始密码子)。TrMlo2还包含相应基因3’非翻译区的一部分。The DNA molecule of SEQ ID NO: 3 encoding the wheat Mlo protein known as TrMlo 1 was deposited as TrMlo 1 and TrMlo 1-5 strains with accession numbers NRRL B-21948 and NRRL B-21949, respectively. The DNA molecule of SEQ ID NO: 5 encoding the wheat Mlo protein known as TrMlo 2 was deposited as TrMlo 2 and TrMlo 2-5 strains with accession numbers NRRL B-21950 and NRRL B-21951, respectively. The DNA molecule of SEQ ID NO: 7 encoding the wheat Mlo protein known as TrMlo3 was deposited as TrMlo 3 and TrMlo 3-5 lines with accession numbers NRRL B-21952 and NRRL B-21953, respectively. TrMlo1 and TrMlo3 contain the full-length cDNA of the corresponding Mlo gene and also contain part of the corresponding 5' and 3' untranslated regions. TrMlo2 is the longest cDNA clone of the corresponding gene that was repaired. It contains the entire coding region, but lacks the first methionine (start codon) deduced from comparison with TrMlo1 and TrMlo3. TrMlo2 also contains part of the 3' untranslated region of the corresponding gene.

SEQ ID NO:9的编码被称为CIB10259之拟南芥属Mlo蛋白质的DNA分子以pCIB 10259株保藏,接收号为NRRL B-21945。SEQ IDNO:11的编码被称为CIB10295之拟南芥属Mlo蛋白质的DNA分子以pCIB 10295株保藏,接收号为NRRL B-21946。SEQ ID NO:13的编码被称为CIB10296之拟南芥属Mlo蛋白质的DNA分子以pCIB10296株保藏,接收号为NRRL B-21947。CIB10259、CIB10295和CIB10296包含相应Mlo基因的全长cDNA,还包含相应5’和3’非翻译区的一部分。编码拟南芥属Mlo蛋白质家族成员F19850和U95973的核苷酸序列是从Gembank中获得的。不过,对于这两个克隆均测定了预计的氨基酸序列并发现它与Gembank中预计的氨基酸序列不相匹配。因此本发明的发明者测定的Mlo蛋白质是新的且非显而易见的。两种新预计蛋白质均包含SEQ ID NO:1和2中所列的保守氨基酸,因此它们及编码它们的分离cDNA均包括于本发明中。The DNA molecule of SEQ ID NO: 9 encoding the Arabidopsis Mlo protein designated CIB10259 was deposited as strain pCIB10259 under accession number NRRL B-21945. The DNA molecule of SEQ ID NO: 11 encoding the Arabidopsis Mlo protein designated CIB10295 was deposited as strain pCIB10295 under accession number NRRL B-21946. The DNA molecule of SEQ ID NO: 13 encoding the Arabidopsis Mlo protein designated CIB10296 was deposited as strain pCIB10296 under accession number NRRL B-21947. CIB10259, CIB10295 and CIB10296 contain the full-length cDNA of the corresponding Mlo gene and also contain part of the corresponding 5' and 3' untranslated regions. Nucleotide sequences encoding Arabidopsis Mlo protein family members F19850 and U95973 were obtained from Gembank. However, the predicted amino acid sequence was determined for both clones and found to not match the predicted amino acid sequence in Gembank. The Mlo protein assayed by the inventors of the present invention is therefore novel and non-obvious. Both new predicted proteins contain the conserved amino acids listed in SEQ ID NO: 1 and 2, so they and the isolated cDNA encoding them are included in the present invention.

本发明DNA分子编码的Mlo蛋白质赋予植物对真菌病原体的抗性,优选可感染活的植物表皮细胞的真菌病原体,更优选真菌病原体来自白粉菌目(也已知为白粉菌)(Agrios G.(1988)植物病理学,第三版,Academic Press Inc.,尤其是在第271页)。优选地,本发明DNA分子编码的Mlo蛋白质赋予植物对来自白粉菌属的病原体的抗性,更优选真菌病原体是禾白粉菌。The Mlo protein encoded by the DNA molecule of the present invention confers resistance to a plant against a fungal pathogen, preferably a fungal pathogen capable of infecting living plant epidermal cells, more preferably a fungal pathogen from the order Erysiphales (also known as Erysiphales) (Agrios G.( 1988) Phytopathology, Third Edition, Academic Press Inc., especially at page 271). Preferably, the Mlo protein encoded by the DNA molecule of the invention confers resistance to the plant against a pathogen from the genus Erysiphe, more preferably the fungal pathogen is Erysiphe graminis.

本发明还包括含本发明任一种DNA分子的重组载体。在这些载体中,所说DNA分子优选包含于表达盒中,该表达盒含有在能表达所说DNA分子的宿主细胞中用于表达该DNA分子的表达元件。所说的调节元件通常是启动子和终止信号,还优选包括允许由本发明DNA分子编码的蛋白质有效翻译的元件。在优选的实施方案中,表达盒是异源的。所说的载体用于将含本发明任一种DNA分子的表达盒转化入宿主细胞。在优选的实施方案中,表达盒稳定整合入所说宿主细胞的DNA中。在另一优选实施方案中,表达盒优选包含于载体中,该载体能在宿主细胞中复制并作为染色体外分子保留于宿主细胞中。在进一步优选的实施方案中,利用所说的染色体外复制分子在宿主细胞中扩增本发明的DNA分子。在优选的实施方案中,所说的宿主细胞是微生物,如细菌,尤其是大肠杆菌。在另一优选实施方案中,宿主细胞是真核生物细胞,如酵母细胞、昆虫细胞或植物细胞。The invention also includes recombinant vectors comprising any of the DNA molecules of the invention. In these vectors, said DNA molecule is preferably contained in an expression cassette containing expression elements for expression of said DNA molecule in a host cell capable of expressing said DNA molecule. Said regulatory elements are usually promoters and termination signals, and preferably also include elements allowing efficient translation of the protein encoded by the DNA molecule of the invention. In preferred embodiments, the expression cassette is heterologous. Said vectors are used to transform an expression cassette containing any of the DNA molecules of the present invention into host cells. In a preferred embodiment, the expression cassette is stably integrated into the DNA of the host cell. In another preferred embodiment, the expression cassette is preferably contained in a vector capable of replicating in the host cell and retained in the host cell as an extrachromosomal molecule. In a further preferred embodiment, the DNA molecule of the invention is amplified in a host cell using said extrachromosomal replication molecule. In a preferred embodiment, said host cell is a microorganism, such as a bacterium, especially E. coli. In another preferred embodiment, the host cell is a eukaryotic cell, such as a yeast cell, an insect cell or a plant cell.

在进一步的实施方案中,用已知为体外重组或DNA重排的技术通过引入随机突变修饰本发明的DNA分子。该技术描述于此处收编作为参考的文献:Stemmer等,自然370:389-391(1994)和美国专利5605793中。以本文所述的原始核苷酸序列为基础产生数百万个核苷酸序列突变拷贝,并回收具有改良特性的变体,如增强了对真菌病原体的抗性或能抵抗更宽范围的病原体。该方法包括从含本发明核苷酸序列的模板双链多核苷酸形成经诱变的双链多核苷酸,其中模板双链多核苷酸已切割成预期大小的双链随机片段,还包括下述步骤:向所得双链随机片段群中添加一或多种单链或双链寡核苷酸,其中所说寡核苷酸包含与双链模板多核苷酸相同的区域和异源的区域;将所得双链随机片段和寡核苷酸的混合物变性成单链片段;在导致所说的单链片段在所说的相同区域退火而形成退火片段配对的条件下,将所得单链片段群与聚合酶保温,其中所说的相同区域足以使配对中的一成员引发另一成员的复制,从而形成经诱变的双链多核苷酸;并且再重复第二和第三步骤至少两个循环,其中下一循环中的第二步骤产生的混合物包括来自前一循环第三步骤的经诱变双链多核苷酸,并且该下一循环形成了进一步诱变的双链多核苷酸。在优选实施方案中,双链随机片段中的单种双链随机片段的浓度小于总DNA重量的1%。在进一步优选的实施方案中,模板双链多核苷酸包含至少约100种多核苷酸。在另一实施方案中,双链随机片段的大小为约5bp-5kb。在进一步的实施方案中,该方法的第四步包括重复第二和第三步骤至少10个循环。In a further embodiment, the DNA molecules of the invention are modified by introducing random mutations using techniques known as in vitro recombination or DNA shuffling. This technique is described in Stemmer et al., Nature 370:389-391 (1994) and US Patent 5,605,793, hereby incorporated by reference. Generate millions of mutated copies of the nucleotide sequence based on the original nucleotide sequence described herein and recover variants with improved properties, such as increased resistance to fungal pathogens or resistance to a wider range of pathogens . The method includes forming a mutagenized double-stranded polynucleotide from a template double-stranded polynucleotide containing a nucleotide sequence of the present invention, wherein the template double-stranded polynucleotide has been cut into double-stranded random fragments of expected size, and also includes the following The above steps: adding one or more single-stranded or double-stranded oligonucleotides to the obtained double-stranded random fragment population, wherein said oligonucleotides contain the same region and heterologous region as the double-stranded template polynucleotide; denaturing the resulting mixture of double-stranded random fragments and oligonucleotides into single-stranded fragments; and combining the resulting population of single-stranded fragments with polymerase incubation, wherein said identical region is sufficient for one member of the pair to initiate replication of the other member, thereby forming a mutagenized double-stranded polynucleotide; and repeating the second and third steps for at least two more cycles, wherein the mixture produced by the second step in the next cycle comprises the mutagenized double-stranded polynucleotide from the third step of the previous cycle, and this next cycle forms a further mutagenized double-stranded polynucleotide. In a preferred embodiment, the concentration of a single double-stranded random fragment in the double-stranded random fragment is less than 1% by weight of the total DNA. In a further preferred embodiment, the template double-stranded polynucleotides comprise at least about 100 polynucleotides. In another embodiment, the size of the double-stranded random fragments is about 5 bp-5 kb. In a further embodiment, the fourth step of the method comprises repeating the second and third steps for at least 10 cycles.

本发明还包括含本发明DNA分子的细胞,其中所述DNA分子不存在于其天然细胞环境中。在优选的实施方案中,所说的细胞是植物细胞。在另一优选实施方案中,本发明DNA分子可于所说细胞中表达,并包含于允许其在这种细胞内表达的表达盒中。在优选的实施方案中,表达盒稳定整合入这种宿主细胞的DNA中。在另一优选实施方案中,表达盒包含在载体内,其能在细胞中复制并作为染色体外分子保持于细胞中。The present invention also includes cells comprising a DNA molecule of the present invention, wherein said DNA molecule is not present in its natural cellular environment. In preferred embodiments, said cells are plant cells. In another preferred embodiment, the DNA molecule of the invention is expressible in said cell and is contained in an expression cassette allowing its expression in such cell. In preferred embodiments, the expression cassette is stably integrated into the DNA of such host cells. In another preferred embodiment, the expression cassette is contained within a vector, which is capable of replicating in the cell and is maintained in the cell as an extrachromosomal molecule.

本发明还包括含上述植物细胞的植物。在另外的实施方案中,本发明的DNA分子可表达于植物中,本发明任一种DNA分子或其部分在转基因植物中的表达赋予该转基因植物对真菌病原体的抗性。在优选的实施方案中,所述真菌病原体优选可感染活表皮细胞,更优选真菌病原体来自白粉菌目(还已知为白粉菌),尤其是来自白粉菌属,最优选真菌病原体是禾白粉菌。因此本发明还包含通过本发明任一种DNA分子或其部分的表达而可抵抗真菌病原体的转基因植物。The present invention also includes plants comprising the above-mentioned plant cells. In additional embodiments, the DNA molecules of the invention can be expressed in plants, and expression of any of the DNA molecules of the invention, or portions thereof, in transgenic plants confers resistance to fungal pathogens in the transgenic plants. In a preferred embodiment, the fungal pathogen is preferably capable of infecting living epidermal cells, more preferably the fungal pathogen is from the order Erysiphales (also known as Erysiphales), especially from the genus Erysiphe, most preferably the fungal pathogen is Erysipha graminis . The present invention therefore also encompasses transgenic plants which are resistant to fungal pathogens by expression of any of the DNA molecules of the present invention or parts thereof.

按本发明转化的植物可以是单子叶植物或双子叶植物,包括但不局限于玉米、小麦、大麦、黑麦、甘薯、菜豆、豌豆、菊苣、莴苣、甘蓝、花椰菜、嫩茎花椰菜、芜箐、萝卜、菠菜、芦笋、洋葱、大蒜、胡椒、芹菜、南瓜、西葫芦、大麻、小胡瓜、苹果、梨、温柏、甜瓜、李子、樱桃、桃、油桃、杏、草莓、葡萄、覆盆子、黑莓、凤梨、鳄梨、番木瓜、芒果、香蕉、大豆、西红柿、高粱、甘蔗、甜菜、向日葵、油菜籽(rapeseed)、苜蓿、烟草、胡萝卜、棉花、苜蓿、稻、马铃薯、茄子、黄瓜、拟南芥和诸如针叶树和落叶树之类的木本植物,尤其是玉米、小麦或甜菜。Plants transformed according to the present invention may be monocots or dicots, including but not limited to maize, wheat, barley, rye, sweet potato, kidney bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, turnip , radish, spinach, asparagus, onion, garlic, pepper, celery, pumpkin, zucchini, hemp, courgette, apple, pear, quince, melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry , blackberry, pineapple, avocado, papaya, mango, banana, soybean, tomato, sorghum, sugar cane, sugar beet, sunflower, rapeseed (rapeseed), alfalfa, tobacco, carrot, cotton, alfalfa, rice, potato, eggplant, cucumber , Arabidopsis and woody plants such as conifers and deciduous trees, especially maize, wheat or sugar beets.

一旦预期核苷酸已被转化入特殊种类植物中,即可用传统育种技术在该物种中将其增殖或转移入同一物种的其他变种内,尤其包括商业化品种。Once a desired nucleotide has been transformed into a plant of a particular species, it can be propagated in that species or transferred into other varieties of the same species, including especially commercial varieties, using conventional breeding techniques.

对于它们在转基因植物中的表达,DNA分子可能需要修饰和优化。本领域已知所有的生物体都有特殊偏爱使用的密码子,本发明DNA分子所含核苷酸序列中的密码子可在保持其编码氨基酸的同时被改变,以符合特殊植物的偏爱。具有至少35%GC含量,优选超过45%GC含量的编码序列可在植物中最好地达到高表达。由于存在可破坏信使稳定性的ATTTA基元和可能引起不恰当聚腺苷酸化的AATAAA基元,具低GC含量的核苷酸序列表达较差。尽管优选的基因序列可充分地表达于单子叶和双子叶种类植物中,因它们对密码子的偏爱不同(Murray等,核酸研究17:477-498(1989)),应修饰序列从满足单子叶或双子叶植物的特殊密码子偏爱和GC含量偏爱。此外,可筛选核苷酸序列以确定引起信使截短的不合理剪接位点的存在。用公开的专利申请EP 0385962、EP 0359472和WO 93/07278中的方法,通过众所周知的定位诱变、PCR和合成基因构建技术造成如上所述核苷酸序列中要求的所有改变。For their expression in transgenic plants, DNA molecules may require modification and optimization. It is known in the art that all organisms have special preferred codons, and the codons in the nucleotide sequence contained in the DNA molecule of the present invention can be changed while maintaining the encoded amino acids to meet the preference of special plants. High expression in plants is best achieved with coding sequences having a GC content of at least 35%, preferably more than 45%. Nucleotide sequences with low GC content are poorly expressed due to the presence of the ATTTA motif, which can destabilize the message, and the AATAAA motif, which can cause inappropriate polyadenylation. Although the preferred gene sequence can be fully expressed in monocot and dicot species, due to their different codon preferences (Murray et al., Nucleic Acids Res. 17:477-498 (1989)), the sequence should be modified from monocotyledonous to satisfying or the special codon bias and GC content bias of dicots. In addition, nucleotide sequences can be screened for the presence of illegitimate splice sites that cause message truncation. All changes required in the nucleotide sequence as described above were made by well known techniques of site-directed mutagenesis, PCR and synthetic gene construction using the methods of published patent applications EP 0385962, EP 0359472 and WO 93/07278.

为了有效起始翻译,可能要求修饰与起始甲硫氨酸邻近的序列。例如,它们可通过含有已知在植物中有效的序列而被修饰。Joshi已提出了对植物比较合适的共有序列(NAR15:6643-6653(1987)),而Clontech提出了进一步的共有翻译起始区(1993/1994目录,页码210)。这些共有序列可适用于本发明的核苷酸序列。将这些序列掺入含本发明核苷酸序列的结构中,掺入程度直到并包括ATG(同时第二个氨基酸未修饰),或掺入程度直到并包括ATG之后的GTC(具有修饰转基因第二个氨基酸的可能性)。Modification of sequences adjacent to the initial methionine may be required for efficient initiation of translation. For example, they may be modified by containing sequences known to be effective in plants. A consensus sequence suitable for plants has been proposed by Joshi (NAR 15:6643-6653 (1987)), and a further consensus translation initiation region has been proposed by Clontech (Catalogue 1993/1994, p. 210). These consensus sequences are applicable to the nucleotide sequences of the present invention. These sequences are incorporated into structures containing the nucleotide sequences of the present invention up to and including ATG (while the second amino acid is unmodified), or up to and including GTC after ATG (with the modified transgene second amino acid possibilities).

转基因植物中的DNA分子用在植物中显示有功能的启动子驱动。启动子的选择将随表达的时间和空间需求而变化,也可依目标物种而变化。为了保护植物抵抗叶病原体,优选在叶中表达;为了保护植物抵抗谷穗病原体,优选在花序中表达(如穗状花序、圆锥花序、玉米穗轴等);为了保护植物抵抗根病原体,优选在根中表达;为了保护幼苗抵抗土壤传播的病原体,优选在根和/或幼苗中表达。不过在许多情况下,要求保护物种抗一种以上类型的植物病原体,因而期望在多种组织中表达。尽管已显示许多双子叶植物的启动子可用于单子叶植物且反之亦然,理想的还是选择双子叶植物启动子用于双子叶植物中的表达,选择单子叶植物启动子用于单子叶植物中的表达。然而,对于所选启动子的出处无限制;只要它们可驱动DNA分子在预期细胞中的表达就足够了。The DNA molecule in the transgenic plant is driven by a promoter shown to be functional in the plant. The choice of promoter will vary with the temporal and spatial requirements of expression, and may vary with the species of interest. To protect plants against foliar pathogens, preferably expressed in leaves; to protect plants against ear pathogens, preferably expressed in inflorescences (such as spikes, panicles, corn cobs, etc.); to protect plants against root pathogens, preferably expressed in Expression in roots; to protect seedlings against soil-borne pathogens, expression in roots and/or seedlings is preferred. In many cases, however, species will be claimed against more than one type of phytopathogen, and expression in multiple tissues is thus desired. Although many dicot promoters have been shown to be usable in monocots and vice versa, it is desirable to select a dicot promoter for expression in dicots and a monocot promoter for expression in monocots expression. However, there is no restriction on the origin of the selected promoters; it is sufficient that they can drive the expression of the DNA molecule in the intended cell.

组成型表达的优选启动子包括来自农杆菌冠瘿碱合酶基因的启动子,如nos启动子,或来自农杆菌Ti质粒的二元启动子(Velten等(1984)EMBO J.3:2723-2730),或在植物中有功能的病毒启动子,如CaMV 35S和19S启动子,和编码肌动蛋白或遍在蛋白之基因的启动子。另一优选启动子是合成型启动子,如Gelvin Super MAS启动子(Ni等(1995)植物杂志7:661-676)。本发明的DNA分子还可在受化学调控的启动子的调节下表达。这使引起真菌疾病的蛋白质只在用诱导化学剂处理农作物植物时才合成。用于化学诱导基因表达的优选技术详述于公开的申请EP 0332104和美国专利5614395。用于化学诱导的优选启动子是烟草PR-1a启动子。Preferred promoters for constitutive expression include those from the Agrobacterium opine synthase gene, such as the nos promoter, or the binary promoter from the Agrobacterium Ti plasmid (Velten et al. (1984) EMBO J.3:2723- 2730), or viral promoters functional in plants, such as the CaMV 35S and 19S promoters, and the promoters of genes encoding actin or ubiquitin. Another preferred promoter is a synthetic promoter, such as the Gelvin Super MAS promoter (Ni et al. (1995) Plant J. 7:661-676). The DNA molecules of the invention may also be expressed under the regulation of chemically regulated promoters. This allows proteins that cause fungal diseases to be synthesized only when crop plants are treated with inducing chemicals. Preferred techniques for chemically inducing gene expression are detailed in published application EP 0332104 and US Patent 5614395. A preferred promoter for chemical induction is the tobacco PR-1a promoter.

优选的启动子种类是创伤可诱导的。已描述了许多表达于创伤位置及植物病原体感染位置的启动子。理想地,所说的启动子应只在感染位置处局部有活性,从而控制真菌疾病的蛋白质只积聚于需要合成它以杀死侵袭害虫的细胞内。此类优选启动子包括以下文献中所述的启动子:Stanford等,Mol.Gen.Genet.215:200-208(1989);Xu等,植物分子生物学22:573-588(1993);Logemann等,植物细胞1:151-158(1989);Rohrmeier和Lehle,植物分子生物学22:783-792(1993);Firek等,植物分子生物学22:129-142(1993)和Warner等,植物杂志3:191-201(1993)。Preferred promoter species are wound-inducible. A number of promoters have been described for expression at sites of wounding and infection with plant pathogens. Ideally, the promoter should be active only locally at the site of infection, so that the fungal disease-controlling protein accumulates only in the cells that need to synthesize it to kill the invading pest. Such preferred promoters include those described in the following literature: Stanford et al., Mol. Gen. Genet. 215:200-208 (1989); Xu et al., Plant Mol. Biol. 22:573-588 (1993); Logemann et al., Plant Cell 1:151-158 (1989); Rohrmeier and Lehle, Plant Mol. Biol. 22:783 -792 (1993); Firek et al., Plant Mol. Biol. 22: 129-142 (1993) and Warner et al., The Plant Journal 3: 191-201 (1993).

优选的组织特异性表达模式包括绿色组织特异性、根特异性、茎特异性和花特异性。适于在绿色组织中表达的启动子包括许多调节光合成中所涉及基因的启动子,其中的许多已自单子叶植物和双子叶植物克隆。优选的启动子是来自磷酸烯醇羧化酶基因的玉米PEPC启动子(Hudspeth和Grula,植物分子生物学12:579-589(1989))。根特异性表达的优选启动子是de Framond(FEBS290:103-106(1991);EP0452269)所述的启动子,而进一步优选的根特异性启动子是来自本发明提供之T-1基因的启动子。优选的茎特异性启动子如美国专利5625136中所述和驱动玉米trpA基因表达的启动子。Preferred tissue-specific expression patterns include green tissue-specific, root-specific, stem-specific and flower-specific. Promoters suitable for expression in green tissues include many that regulate genes involved in photosynthesis, many of which have been cloned from both monocots and dicots. A preferred promoter is the maize PEPC promoter from the phosphoenol carboxylase gene (Hudspeth and Grula, Plant Mol. Biol. 12:579-589 (1989)). A preferred promoter for root-specific expression is the promoter described by de Framond (FEBS290: 103-106 (1991); EP0452269), and a further preferred root-specific promoter is the promoter from the T-1 gene provided by the present invention son. Preferred stalk-specific promoters are those described in US Patent No. 5,625,136 and those driving expression of the maize trpA gene.

优选的本发明实施方案是以根特异方式表达DNA分子的转基因植物。进一步优选的实施方案是以创伤诱导或病原体感染诱导方式表达DNA分子的转基因植物。A preferred embodiment of the invention is a transgenic plant expressing a DNA molecule in a root-specific manner. A further preferred embodiment is a transgenic plant expressing a DNA molecule in a wound-inducible or pathogen infection-inducible manner.

除了合适启动子的选择外,用于植物中表达蛋白质的结构需要恰当转录终止子附着于异源核苷酸序列下游。数种所说的终止子是本领域中可获得和已知的(如来自CaMV的tml,来自rbcS的E9)。已知在植物中起作用的任何可获得的终止子可用于本发明的上下文中。In addition to the choice of an appropriate promoter, the construct for expressing the protein in plants requires the attachment of an appropriate transcription terminator downstream of the heterologous nucleotide sequence. Several of said terminators are available and known in the art (eg tml from CaMV, E9 from rbcS). Any available terminator known to function in plants may be used in the context of the present invention.

许多其它序列可掺入本发明DNA分子的表达盒中。这些序列包括已显示可增强表达的序列,如内含子序列(如来自Adh1和bronze1)和病毒前导序列(如来自TMV、MCMV和AMV)。Many other sequences can be incorporated into the expression cassettes of the DNA molecules of the invention. These sequences include those that have been shown to enhance expression, such as intronic sequences (eg, from Adhl and bronzel) and viral leader sequences (eg, from TMV, MCMV, and AMV).

优选将DNA分子的表达定位于植物中的不同细胞位置。在某些情况下,定位在胞液中比较理想,而在其他情况下,定位于某些亚细胞器中是优选的。可用本领域众所周知的技术进行转基因所编码酶的亚细胞定位。一般来说,利用编码来自已知靶向细胞器的基因产物的靶肽的DNA并将其融合于核苷酸序列的上游。许多这样的靶序列对于叶绿体是已知的,已显示它们在异源结构中可起作用。The expression of the DNA molecule is preferably localized to different cellular locations in the plant. In some cases, localization in the cytosol is desirable, while in other cases, localization in certain subcellular organelles is preferred. Subcellular localization of the enzyme encoded by the transgene can be performed using techniques well known in the art. In general, DNA encoding a target peptide from a gene product known to target an organelle is utilized and fused upstream of the nucleotide sequence. Many such target sequences are known for chloroplasts and they have been shown to function in heterologous structures.

适于植物转化的载体在本说明书的其它部分也有描述。对于农杆菌介导的转化,双元载体或至少带一个T-DNA边缘序列的载体是合适的,而对于直接的基因转移来说,任何载体都是合适的,优选只含目的结构的线性DNA。在直接基因转移的情况下,可采用单个DNA种类的转化或共转化(Schocher等,生物技术4:1093-1096(1986))。对于直接的基因转移和农杆菌介导的转移,通常(但非必须)与可提供抗生素(卡那霉素、潮霉素或氨甲喋呤)或杀虫剂(glufosinate、草甘磷或原卟啉原氧化酶抑制剂)抗性的选择标记,或可提供转化细胞选择优势的选择标记(如磷酸甘露糖异构酶基因)一起进行转化。不过,选择标记的选择对于本发明来说并不是关键的。Vectors suitable for plant transformation are also described elsewhere in this specification. For Agrobacterium-mediated transformation, binary vectors or vectors with at least one T-DNA border sequence are suitable, while for direct gene transfer any vector is suitable, preferably linear DNA containing only the construct of interest . In the case of direct gene transfer, transformation of individual DNA species or co-transformation can be employed (Schocher et al., Biotechnology 4:1093-1096 (1986)). For direct gene transfer and Agrobacterium-mediated transfer, antibiotics (kanamycin, hygromycin, or methotrexate) or insecticides (glufosinate, glyphosate, or protoporphyrinogen) are usually (but not necessarily) oxidase inhibitors), or a selectable marker that confers a selective advantage on transformed cells (such as the phosphomannose isomerase gene) for transformation. However, the choice of selectable marker is not critical to the invention.

在另一优选实施方案中,本发明的DNA分子被直接转化入质体基因组中。质体转化技术被广泛描述于美国专利5451513、5545817和5545818及PCT申请WO95/16783和McBride等(1994)美国国家科学院院报91,7301-7305中。用于叶绿体转化的基本技术包括如利用biolistics或原生质体转化(如氯化钙或PEG介导的转化)将位于选择标记与目的DNA分子侧翼的克隆质体DNA区域引入合适的靶组织中。被称为打靶序列的1-1.5kb侧翼序列促进了与质体基因组之间的同源重组,因而允许原质体系特异区的置换或修饰。最初,利用赋予植物抗壮观霉素和/或链霉素抗性的叶绿体16S rRNA和rps12基因中的点突变作为转化的选择标记(Svab,Z.、Hajdukiewicz,P.和Maliga,P.(1990)美国国家科学院院报87,8526-8530;Staub,J.M.和Maliga,P.(1992)植物细胞4,39-45)。这导致稳定的同质转化子,频率约为每轰击100次目标叶产生1个转化子。这些标记间克隆位点的存在使得可得到能用于引入外源基因的质体打靶载体(Staub,J.M.和Maliga,P.(1993)EMBO J.12,601-606,在此收编作为参考)。用显性选择标记-编码壮观霉素解毒酶氨基糖苷-3’-腺苷酰转移酶的细菌aadA基因置换隐性rRNA或r-蛋白质抗生素抗性基因能使得转化率实质上提高(Svab,Z.和Maliga,P.(1993)美国国家科学院院报90,913-917)。在此以前,已成功地将此标记用于绿藻莱因哈德衣藻质体基因组高频转化中(Goldschmidt-Clermont,M.(1991)核酸研究19,4083-4089)。其他可用于质体转化的选择标记在本领域中是已知的并包含于本发明范围内。一般地说,转化后需约15-20个细胞分裂周期来达到同质体状态。通过同源重组将基因插入各植物细胞内所有数千拷贝环状质体基因组的质体表达与核表达基因相比具有巨大拷贝数的优势,从而使表达水平可容易地超过总可溶性植物蛋白质的10%。In another preferred embodiment, the DNA molecule of the invention is transformed directly into the plastid genome. Plastid transformation techniques are extensively described in US Patents 5451513, 5545817 and 5545818 and PCT Application WO95/16783 and McBride et al. (1994) Proceedings of the National Academy of Sciences USA 91, 7301-7305. Basic techniques for chloroplast transformation include, for example, introduction of regions of cloned plastid DNA flanking the selectable marker and the DNA molecule of interest using biolistics or protoplast transformation (eg, calcium chloride or PEG-mediated transformation) into the appropriate target tissue. The 1-1.5 kb flanking sequence, called the targeting sequence, promotes homologous recombination with the plastid genome, thus allowing the replacement or modification of specific regions of the plastid system. Initially, point mutations in the chloroplast 16S rRNA and rps12 genes conferring resistance to spectinomycin and/or streptomycin were used as selection markers for transformation (Svab, Z., Hajdukiewicz, P. and Maliga, P. (1990 ) Proceedings of the National Academy of Sciences USA 87, 8526-8530; Staub, J.M. and Maliga, P. (1992) The Plant Cell 4, 39-45). This resulted in stable homogeneous transformants with a frequency of approximately 1 transformant per 100 bombardments of the target leaf. The presence of cloning sites between these markers makes it possible to obtain plastid targeting vectors that can be used to introduce foreign genes (Staub, J. M. and Maliga, P. (1993) EMBO J. 12, 601-606, incorporated herein Reference). Replacement of the recessive rRNA or r-protein antibiotic resistance gene with a dominant selectable marker, the bacterial aadA gene encoding the spectinomycin detoxifying enzyme aminoglycoside-3'-adenylyltransferase, resulted in substantially higher transformation rates (Svab, Z. and Maliga, P. (1993) Proceedings of the National Academy of Sciences USA 90, 913-917). Previously, this marker has been successfully used in the high-frequency transformation of the plastid genome of the green alga Chlamydomonas reinhardtii (Goldschmidt-Clermont, M. (1991) Nucleic Acids Res. 19, 4083-4089). Other selectable markers useful for plastid transformation are known in the art and are included within the scope of the present invention. Generally, about 15-20 cell division cycles are required after transformation to reach a homoplastid state. Plastid expression by homologous recombination inserts genes into all thousands of copies of the circular plastid genome in each plant cell has a huge copy number advantage over nuclear expressed genes such that expression levels can easily exceed 100% of total soluble plant protein 10%.

本发明还包含农产品,其中包括通过本发明任一种DNA分子的表达而具有真菌病原体抗性的转基因植物或用下文所述任一种方法制备的对真菌病原体有抗性的转基因植物。既然这些植物是抗真菌病原体的,病原体在其中的生长被抑制。因此这样的植物及其农产品不太可能包含许多真菌病原体天然产生且可能对人和动物非常有毒的真菌毒素。因而,这样的农产品具有更好的植物检疫特性。在优选的实施方案中,将这样的农产品用做饲料、青贮饲料或食品。The present invention also encompasses agricultural products including transgenic plants that are resistant to fungal pathogens by expression of any of the DNA molecules of the present invention or produced by any of the methods described below. Since the plants are resistant to fungal pathogens, the growth of pathogens in them is inhibited. Such plants and their produce are therefore less likely to contain mycotoxins that are naturally produced by many fungal pathogens and can be very toxic to humans and animals. Thus, such agricultural products have better phytosanitary properties. In preferred embodiments, such agricultural products are used as feed, silage or food.

本发明的另一目的是提供制备对真菌病原体有抗性的植物的方法。由本发明DNA分子编码的Mlo蛋白质赋予植物对真菌病原体的抗性,改变所说蛋白质在其天然宿主环境中的表达是本发明的一优选目的。本发明的另一优选目的是改变所说蛋白质在其天然环境中的稳定性或活性。植物中本发明DNA分子所编码蛋白质的表达、稳定性或活性的这种改变导致植物对真菌病原体的抗性增强。在优选的实施方案中,由本发明DNA分子编码的蛋白质是植物抵抗真菌病原体的负调控剂,因为其可抑制植物中负责植物对真菌病原体的抗性的遗传途径。因此,本发明的一优选目的是降低由本发明DNA分子编码的Mlo蛋白质在其天然宿主环境中的表达,或降低这种蛋白质在其天然宿主环境中的稳定性或活性。Another object of the present invention is to provide a method for producing plants resistant to fungal pathogens. It is a preferred object of the invention to modify the expression of the Mlo protein encoded by the DNA molecule of the invention to confer resistance to fungal pathogens in plants. Another preferred object of the invention is to alter the stability or activity of said protein in its natural environment. Such alterations in the expression, stability or activity of proteins encoded by the DNA molecules of the invention in plants lead to increased resistance of the plants to fungal pathogens. In a preferred embodiment, the protein encoded by the DNA molecule of the invention is a negative regulator of plant resistance to fungal pathogens, since it inhibits a genetic pathway in plants responsible for plant resistance to fungal pathogens. Therefore, a preferred object of the present invention is to reduce the expression of the Mlo protein encoded by the DNA molecule of the present invention in its natural host environment, or to reduce the stability or activity of this protein in its natural host environment.

“有义抑制”"Senseful Inhibition"

在优选的实施方案中,通过“有义抑制”减少本发明DNA分子所编码蛋白质的表达(参见如Jorgensen等(1996),植物分子生物学31,957-973)。在这种情况下,本发明DNA分子的全部或部分包含于将引入宿主细胞的表达盒中,优选植物细胞,其中DNA分子是可表达的。将DNA分子以“有义方向”插入表达盒中,即是指在表达盒中该DNA分子的5’末端邻近启动子处且该DNA分子的编码链可被转录。在优选的实施方案中,DNA分子是完全可翻译的,且DNA分子或部分中所含的所有遗传信息可被翻译成蛋白质。在另一优选的实施方案中,DNA分子是部分可翻译的,翻译成短肽。在优选的实施方案中,这是通过在DNA分子中插入至少一个早熟的终止密码子而完成,导致翻译停止。在另一更优选的实施方案中,DNA分子被转录但无翻译产物。这通常是通过去除DNA分子所编码蛋白质的起始密码子,如“ATG”而完成。在进一步优选的实施方案中,含所说DNA分子或其部分的表达盒被稳定整合入宿主细胞基因组中。在另一优选实施方案中,含所说DNA分子或其部分的表达盒包括于染色体外复制分子中。在含上述表达盒之一的转基因植物中,表达盒内所含DNA分子的相应基因的表达被减少或消除,导致在转基因植物中该蛋白质水平降低或缺乏。结果是该转基因植物可抗真菌病原体。In a preferred embodiment, the expression of the protein encoded by the DNA molecule of the invention is reduced by "sense suppression" (see eg Jorgensen et al. (1996) Plant Mol. Biol. 31, 957-973). In this case, all or part of the DNA molecule of the invention is contained in an expression cassette to be introduced into a host cell, preferably a plant cell, in which the DNA molecule is expressible. The DNA molecule is inserted into the expression cassette in "sense orientation", that is, in the expression cassette where the 5' end of the DNA molecule is adjacent to the promoter and the coding strand of the DNA molecule can be transcribed. In preferred embodiments, the DNA molecule is fully translatable, and all genetic information contained in the DNA molecule or portion can be translated into protein. In another preferred embodiment, the DNA molecule is partially translatable into short peptides. In a preferred embodiment, this is accomplished by inserting at least one premature stop codon into the DNA molecule, causing translation to stop. In another more preferred embodiment, the DNA molecule is transcribed but no translation product is produced. This is usually done by removing the start codon, such as "ATG", of the protein encoded by the DNA molecule. In a further preferred embodiment, the expression cassette comprising said DNA molecule or part thereof is stably integrated into the host cell genome. In another preferred embodiment, the expression cassette comprising said DNA molecule or a portion thereof is included in an extrachromosomally replicating molecule. In transgenic plants containing one of the aforementioned expression cassettes, the expression of the corresponding gene for the DNA molecule contained within the expression cassette is reduced or eliminated, resulting in reduced or absent levels of the protein in the transgenic plants. The result is that the transgenic plants are resistant to fungal pathogens.

“反义”抑制"Antisense" suppression

在另一优选实施方案中,通过“反义”抑制来减少本发明DNA分子所编码蛋白质的表达。本发明DNA分子的全部或部分包含于表达盒中,通过它将DNA分子引入宿主细胞,优选植物细胞,其中该DNA分子是可表达的。DNA分子以“反义方向”插入表达盒中,即是指在表达盒中该DNA分子的3’末端邻近启动子处,并且DNA分子的非编码链可被转录。在优选的实施方案中,含所说DNA分子或其部分的表达盒被稳定整合入宿主细胞基因组中。在另一优选实施方案中,含所说DNA分子或其部分的表达盒包括于染色体外复制分子中。一些描述此方法的文章被引用于此以便进一步地说明(Green,P.J.等,生化年鉴55:569-597(1986);van der Krol,A.R.等,反义核酸及蛋白质,pp.125-141(1991);Abel,P.P.等,美国国家科学院院报86:6949-6952(1989);Ecker,J.R.等,美国国家科学院院报83:5372-5376(1986年8月))。In another preferred embodiment, expression of the protein encoded by the DNA molecule of the invention is reduced by "antisense" inhibition. All or part of the DNA molecule of the invention is contained in an expression cassette by which the DNA molecule is introduced into a host cell, preferably a plant cell, wherein the DNA molecule is expressible. The DNA molecule is inserted into the expression cassette in an "antisense orientation", that is, in the expression cassette where the 3' end of the DNA molecule is adjacent to the promoter and the non-coding strand of the DNA molecule can be transcribed. In a preferred embodiment, the expression cassette comprising said DNA molecule or a portion thereof is stably integrated into the host cell genome. In another preferred embodiment, the expression cassette comprising said DNA molecule or a portion thereof is included in an extrachromosomally replicating molecule. Several articles describing this method are cited here for further illustration (Green, P.J. et al., Annals of Biochemistry 55:569-597 (1986); van der Krol, A.R. et al., Antisense Nucleic Acids and Proteins, pp.125-141 (1991); Abel, P.P. et al., PNAS USA 86:6949-6952 (1989); Ecker, J.R., et al. August 1986)).

同源重组homologous recombination

在另一优选的实施方案中,通过如进一步描述于Paszkowski等,EMBO杂志7:4021-26(1988)中的同源重组,对植物基因组中至少一个相应于本发明DNA分子的基因组拷贝进行修饰。所说技术利用同源序列的特征来相互识别并用本领域已知的方法,如同源重组交换它们之间的核苷酸序列。同源重组可在细胞内核苷酸序列的染色体拷贝与转化进入细胞的引入拷贝核苷酸序列之间发生。从而将特异的修饰准确引入核苷酸序列的染色体拷贝中。在一实施方案中,编码本发明蛋白质之基因的调节元件被修饰。已有的调节元件被不同的调节元件置换,从而减少了蛋白质的表达,或者突变或缺失该调控元件,因而消除蛋白质的表达。在另一实施方案中,通过删除整个编码序列或其部分或者突变来修饰蛋白质的编码区。突变蛋白质的表达还可赋予植物对真菌病原体更强的抗性。In another preferred embodiment, at least one genomic copy of the plant genome corresponding to the DNA molecule of the invention is modified by homologous recombination as further described in Paszkowski et al., EMBO Journal 7: 4021-26 (1988) . Said technique utilizes the characteristics of homologous sequences to recognize each other and exchange nucleotide sequences between them using methods known in the art, such as homologous recombination. Homologous recombination can occur between the chromosomal copy of a nucleotide sequence in a cell and the introduced copy of the nucleotide sequence transformed into the cell. Specific modifications are thus accurately introduced into the chromosomal copy of the nucleotide sequence. In one embodiment, the regulatory elements of the gene encoding the protein of the invention are modified. An existing regulatory element is replaced by a different regulatory element, thereby reducing protein expression, or the regulatory element is mutated or deleted, thereby eliminating protein expression. In another embodiment, the coding region of the protein is modified by deletion of the entire coding sequence or a portion thereof, or mutation. Expression of mutant proteins can also confer greater resistance to fungal pathogens in plants.

在另一优选的实施方案中,用含RNA和DNA残基连续序列的末端有双发夹帽之双螺旋构象的嵌合寡核苷酸转化细胞,从而在DNA分子染色体拷贝中引入突变。寡核苷酸的附加特征是在RNA残基处存在2’-O-甲基化。将RNA/DNA序列设计成与本发明DNA分子的染色体拷贝序列一致并含预期的核苷酸改变。此技术进一步描述于美国专利5501967中。In another preferred embodiment, cells are transformed with a chimeric oligonucleotide comprising a contiguous sequence of RNA and DNA residues terminated in a double helix conformation with double hairpin caps, thereby introducing mutations in the chromosomal copy of the DNA molecule. An additional feature of oligonucleotides is the presence of 2'-O-methylation at RNA residues. The RNA/DNA sequence is designed to be identical to the sequence of the chromosomal copy of the DNA molecule of the invention and to contain expected nucleotide changes. This technique is further described in US Patent 5,501,967.

核酶ribozyme

在另一实施方案中,用特异于所说RNA的催化性RNA或核酶切割编码本发明蛋白质的RNA。核酶表达于转基因植物中,造成植物细胞中编码本发明蛋白质的RNA量减少,从而导致细胞中积累的蛋白质量减少并提高植物对真菌病原体的抗性。该方法进一步描述于美国专利4987071。In another embodiment, the RNA encoding the protein of the invention is cleaved with a catalytic RNA or ribozyme specific for said RNA. The ribozyme is expressed in the transgenic plant, resulting in a reduction in the amount of RNA encoding the protein of the present invention in plant cells, thereby reducing the amount of protein accumulated in the cell and increasing the resistance of the plant to fungal pathogens. This method is further described in US Patent 4,987,071.

显性失活突变dominant negative mutation

在另一优选实施方案中,由本发明核苷酸序列编码的蛋白质的活性被改变。这是通过转基因植物中该蛋白质的显性失活突变体的表达,导致内源蛋白质活性丧失而完成。In another preferred embodiment, the activity of the protein encoded by the nucleotide sequence of the invention is altered. This is accomplished by expression of a dominant negative mutant of the protein in transgenic plants, resulting in loss of activity of the endogenous protein.

AptamerAptamer

在另一实施方案中,通过在转基因植物中表达可特异结合该蛋白质的核酸配体,即所谓的aptamer,从而抑制本发明DNA分子所编码的蛋白质的活性。优选用SELEX(Systematic Evolution of Ligands byExponential Enrichment)方法获得aptamer。在SELEX方法中,将具随机化序列区的单链核酸候选混合物与该蛋白质接触,那些对目标亲和性更高的核酸与候选混合物的残余物公开。为了产生富含配体的混合物,扩增分配出来的核酸。多次反复后得到对蛋白质具最佳亲和性的核酸并用于转基因植物中的表达。此方法进一步描述于美国专利5270163中。In another embodiment, the activity of the protein encoded by the DNA molecule according to the invention is inhibited by expressing in transgenic plants a nucleic acid ligand which binds specifically to the protein, a so-called aptamer. Preferably obtain aptamer with SELEX (Systematic Evolution of Ligands by Exponential Enrichment) method. In the SELEX method, a candidate mixture of single-stranded nucleic acids with randomized sequence regions is contacted with the protein, and those nucleic acids with higher affinity for the target are revealed with the residue of the candidate mixture. To generate a ligand-enriched mixture, the partitioned nucleic acids are amplified. After many iterations, the nucleic acid with the best affinity for the protein was obtained and used for expression in transgenic plants. This method is further described in US Patent 5,270,163.

含保守序列的核苷酸序列的分离方法Method for Separating Nucleotide Sequences Containing Conserved Sequences

本发明DNA分子所编码Mlo蛋白质中包含的保守序列被用于分离编码这种序列的其他DNA分子。在优选的实施方案中,产生简并寡核苷酸的混合物,其中包括至少一种编码SEQ ID NO:1或SEQ IDNO:2中所述序列的可能寡核苷酸。将编码SEQ ID NO:1所述序列的寡核苷酸混合物和与编码SEQ ID NO:2中所述序列的序列互补的寡核苷酸混合物与所选的模板DNA一起进行PCR扩增反应。简并寡核苷酸的混合物在本领域中是众所周知的,简并程度随需要而变化。在优选的实施方案中,模板DNA是来自植物的总DNA样品,其中所说的DNA样品可通过本领域众所周知的方法获得。用本领域众所周知的方法分离上述PCR反应产生的扩增片段,并通过筛选cDNA文库或利用RACE方案将其用于分离相应的全长cDNA,后两种技术也是本领域所共知的。这种方法代表了用于分离赋予植物对真菌病原体的抗性的新基因独特且有用的策略。The conserved sequence contained in the Mlo protein encoded by the DNA molecule of the present invention was used to isolate other DNA molecules encoding this sequence. In a preferred embodiment, a mixture of degenerate oligonucleotides is produced, including at least one possible oligonucleotide encoding the sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 2. The oligonucleotide mixture encoding the sequence described in SEQ ID NO: 1 and the oligonucleotide mixture complementary to the sequence encoding the sequence described in SEQ ID NO: 2 are subjected to a PCR amplification reaction together with the selected template DNA. Mixtures of degenerate oligonucleotides are well known in the art, with the degree of degeneracy varied as desired. In a preferred embodiment, the template DNA is a total DNA sample from a plant, wherein said DNA sample can be obtained by methods well known in the art. The amplified fragments generated by the above PCR reactions were isolated by methods well known in the art and used to isolate the corresponding full-length cDNA by screening cDNA libraries or using the RACE protocol, the latter two techniques also being well known in the art. This approach represents a unique and useful strategy for isolating new genes that confer plant resistance to fungal pathogens.

参考以下详述的实施例,本发明将得到进一步的描述。提供这些实施例的目的只是用于例证,而非试图对其有所局限,除非有另外的说明。The present invention will be further described with reference to the following detailed examples. These examples are provided for the purpose of illustration only and are not intended to be limiting unless otherwise stated.

实施例Example

实施例1:小麦Mlo基因的克隆和测序Embodiment 1: Cloning and sequencing of wheat Mlo gene

用逆转录PCR方法克隆小麦的Mlo基因。RNA制备自小麦栽培品种UC703的叶并利用Stratagene RT-PCR试剂盒将其进行逆转录。产生的cDNA用于PCR反应,引物如下:Mlo gene of wheat was cloned by reverse transcription PCR method. RNA was prepared from leaves of wheat cultivar UC703 and reverse transcribed using the Stratagene RT-PCR kit. The generated cDNA was used in PCR reaction with the following primers:

MLO-26 5’TTC CAG CAC CGG CAC AAG AA 3’(SEQ ID No:25)MLO-26 5'TTC CAG CAC CGG CAC AAG AA 3' (SEQ ID No: 25)

MLO-10 5’AAG AAC TGC CTG AAG AAG GC 3’(SEQ ID No:23)MLO-10 5'AAG AAC TGC CTG AAG AAG GC 3' (SEQ ID No: 23)

MLO-7  5’CAG AAA CTT GTC TCA TCC CTG G 3’(SEQ ID No:22)MLO-7 5'CAG AAA CTT GTC TCA TCC CTG G 3' (SEQ ID No: 22)

MLO-5  5’ACA GAG ACC ACC TCC TTG GAA 3’(SEQ ID No:21)MLO-5 5'ACA GAG ACC ACC TCC TTG GAA 3' (SEQ ID No: 21)

MLO-15 5’CAC CAC CTT CAT GAT GCT CA 3’(SEQ ID No:24)MLO-15 5'CAC CAC CTT CAT GAT GCT CA 3' (SEQ ID No: 24)

用以下所列的引物进行PCR,反应产生了指示大小的扩增片段:PCR with the primers listed below yielded amplified fragments of the indicated sizes:

MLO-26和MLO-10 503bpMLO-26 and MLO-10 503bp

MLO-26和MLO-7 1481bpMLO-26 and MLO-7 1481bp

MLO-5和MLO-15 650bpMLO-5 and MLO-15 650bp

将片段克隆入pCR2.1或pCR2.1-TOPO(Invitrogen)。自转化子制备质粒DNA并进行DNA测序。测序显示了存在三种相互间有很高类似性的不同cDNA序列。这些小麦Mlo基因被称为TrMlo1、TrMlo2和TrMlo3。The fragment was cloned into pCR2.1 or pCR2.1-TOPO (Invitrogen). Plasmid DNA was prepared from the transformants and DNA sequencing was performed. Sequencing revealed the presence of three different cDNA sequences with high similarity to each other. These wheat Mlo genes are called TrMlo1, TrMlo2 and TrMlo3.

通过筛选构建于λ-ZAPⅡ载体中的cDNA文库来分离另外的小麦Mlo克隆。为了筛选该文库,进行大量切除以将文库转变成以Bluescript为基础的cDNA克隆群。将这些克隆以80000个独立克隆的库分别培养并制备质粒DNA。用寡核苷酸引物MLO-5和MLO-15(见上文)在混合DNA中进行PCR反应。三个库产生了650个碱基对预期大小的带。随后通过以越来越低的密度培养细菌克隆将这些库分级分离,每步之后都进行质粒DNA制备并用引物MLO-5和MLO-15进行各亚库的PCR。多次分级分离后得到含带Mlo序列之插入片段的单个克隆。将这些克隆中的两个进行插入片段测序,显示它们包含相同的插入片段。第三个克隆的序列表明其中插入片段除了在5’末端有40个附加碱基外,其余与另两个克隆的相同。Additional wheat Mlo clones were isolated by screening a cDNA library constructed in the lambda-ZAPII vector. To screen the library, extensive excision was performed to convert the library into a Bluescript-based cDNA clonal population. These clones were grown individually as a pool of 80,000 independent clones and plasmid DNA was prepared. PCR reactions were performed in mixed DNA using oligonucleotide primers MLO-5 and MLO-15 (see above). Three pools yielded a band of the expected size of 650 base pairs. These pools were subsequently fractionated by growing bacterial clones at lower and lower densities, each step followed by plasmid DNA preparation and PCR of each subpool with primers MLO-5 and MLO-15. Single clones containing an insert with the Mlo sequence were obtained after multiple fractionations. Insert sequencing of two of these clones showed that they contained the same insert. The sequence of the third clone showed that the insert was identical to the other two clones except for 40 additional bases at the 5' end.

通过cDNA末端的随机扩增(RACE)完成小麦Mlo剩余部分的克隆。用Marathon cDNA扩增试剂盒(Clontech)进行小麦UC703poly-A+RNA的RACE反应。Poly-A+RNA是用oligo-dT纤维素柱(Gibco BRL)制备自小麦总RNA。用于RACE反应的寡核苷酸是:Cloning of the remainder of wheat Mlo was accomplished by random amplification of cDNA ends (RACE). The RACE reaction of wheat UC703 poly-A+ RNA was performed with Marathon cDNA Amplification Kit (Clontech). Poly-A+ RNA was prepared from wheat total RNA using an oligo-dT cellulose column (Gibco BRL). The oligonucleotides used in the RACE reaction were:

MLO-GSP1  5’TGG ACC TCT TCA TGT TCG ATC CCA TCT G 3’(SEQ ID No:26)MLO-GSP1 5' TGG ACC TCT TCA TGT TCG ATC CCA TCT G 3' (SEQ ID No: 26)

MLO-GSP2  5’CCT GAC GCT GTT CCA GAA TGC GTT TCA 3’(SEQ ID No:27)MLO-GSP2 5'CCT GAC GCT GTT CCA GAA TGC GTT TCA 3' (SEQ ID No: 27)

用引物MLO-GSP1和试剂盒中所提供的5’衔接引物进行扩增产生约1300个核苷酸大小的DNA片段。用引物MLO-GSP2和3’衔接物进行扩增产生约600个核苷酸大小的DNA片段。将片段克隆入pCR2.1-TOPO,称为TrMlo1-5、TrMlo2-5和TrMlo3-5,其中分别包含小麦Mlo基因TrMlo1、TrMlo2和TrMlo3的5’端。从所说克隆中制备质粒DNA并将质粒插入片段测序。Amplification with the primer MLO-GSP1 and the 5' adapter primer provided in the kit produces a DNA fragment of about 1300 nucleotides in size. Amplification with primers MLO-GSP2 and a 3' adapter yielded a DNA fragment approximately 600 nucleotides in size. The fragments were cloned into pCR2.1-TOPO, called TrMlo1-5, TrMlo2-5 and TrMlo3-5, which contained the 5' ends of the wheat Mlo genes TrMlo1, TrMlo2 and TrMlo3, respectively. Plasmid DNA was prepared from the clones and the plasmid inserts were sequenced.

实施例2:拟南芥属Mlo cDNA的克隆Example 2: Cloning of Arabidopsis Mlo cDNA

用软件TBLASTN比较Mlo蛋白质序列和数据库中序列的翻译产物,表明其中许多序列与Mlo类似。其中,相应于三种拟南芥属EST序列的全长cDNA均已被克隆,接受号为H76041、N37544和T22146。对于各EST,设计寡核苷酸以自构建于质粒pFL61中的拟南芥属cDNA文库扩增相应于所说EST的序列(Minet等(1992)Gene Nov16;121(2):393-396)。所用的寡核苷酸是:Using the software TBLASTN to compare the translation products of the Mlo protein sequence and the sequences in the database, it was shown that many of the sequences were similar to Mlo. Among them, the full-length cDNAs corresponding to the three Arabidopsis EST sequences have been cloned, and the accession numbers are H76041, N37544 and T22146. For each EST, oligonucleotides were designed to amplify the sequence corresponding to said EST from an Arabidopsis cDNA library constructed in plasmid pFL61 (Minet et al. (1992) Gene Nov 16;121(2):393-396) . The oligonucleotides used were:

N37544-1  5’AAG ATC AAG ATG AGG ACG TGG AAG TCG TGG 3’(SEQ ID No:29)N37544-1 5'AAG ATC AAG ATG AGG ACG TGG AAG TCG TGG 3' (SEQ ID No: 29)

N37544-2  5’AGG CTG AAC CAC TGG GGC GCC TCT CAC CAC 3’(SEQ ID No:30)N37544-2 5' AGG CTG AAC CAC TGG GGC GCC TCT CAC CAC 3' (SEQ ID No: 30)

T22146-1  5’CAA GTA TAT GAT GCG CGC TCT AGA GGA TGA 3’(SEQ ID No:31)T22146-1 5'CAA GTA TAT GAT GCG CGC TCT AGA GGA TGA 3' (SEQ ID No: 31)

T22146-2  5’AGG TTT CAC CAC TAA GTC TCC TTC AAT GGC 3’(SEQ ID No:32)T22146-2 5' AGG TTT CAC CAC TAA GTC TCC TTC AAT GGC 3' (SEQ ID No: 32)

H76041-1  5’GAT CAT TCA AGA CTT AGG CTC ACT CAT GAG 3’(SEQ ID No:33)H76041-1 5'GAT CAT TCA AGA CTT AGG CTC ACT CAT GAG 3'(SEQ ID No: 33)

H76041-2  5’AAC AGC AAG GAA GAT TAC AAA TGA TGC CCA 3’(SEQ ID No:34)H76041-2 5'AAC AGC AAG GAA GAT TAC AAA TGA TGC CCA 3' (SEQ ID No: 34)

用引物N37544-1和N37544-2将制备自cDNA文库的DNA扩增得到约500个碱基对大小的片段,而用引物T22146-1和T22146-2扩增得到约250个碱基对大小的片段。用引物H76041-1和H76041-2扩增得到约350和约300个碱基对大小的两个片段。约300个碱基的片段是按EST序列预计的大小,将其随后用于检测cDNA文库中相应于H76041EST的cDNA的存在。将来自所说文库的DNA转化入大肠杆菌,并将克隆分成各约20000个克隆的库。用不同的引物对通过PCR筛选来自各个库的DNA,随后将阳性库细分,以使克隆数越来越少。通过将此过程进行到底,或在某些情况下,一旦库大小已达到200个克隆或更小时,通过以EST序列为探针进行菌落杂交,完成单个阳性克隆的分离。对于EST N37544和T22146,成功地分离出相应于EST的克隆,将插入片段测序。相应于EST N37544的cDNA序列在质粒pCIB10295中被命名为CIB10295,含相应于EST T22146(CIB10296)之cDNA的质粒被命名为pCIB10296。对于这两个EST,相应基因组序列可以最近保藏入GenBank的拟南芥属BAC克隆序列部分的形式获得。需要指出的是,由GenBank所确定的这些基因组序列预计翻译出的蛋白质序列与cDNA直接测序所确定的序列不相符。因此,ESTN37544和T22146相应基因的氨基酸序列不能由GenBank条目明显得到,而只能通过克隆和cDNA克隆测序阐明。已发现用H76041 EST的引物分离的克隆不包含H76041的基因,而是含有一新的Mlo基因家族成员作为插入片段。将插入片段完整测序,该Mlo基因家族成员在质粒pCIB10259中被命名为CIB10259。The DNA prepared from the cDNA library was amplified with primers N37544-1 and N37544-2 to obtain a fragment of about 500 base pairs in size, and amplified with primers T22146-1 and T22146-2 to obtain a fragment of about 250 base pairs in size. fragment. Amplification with primers H76041-1 and H76041-2 yielded two fragments of about 350 and about 300 base pair sizes. A fragment of approximately 300 bases was the size expected from the EST sequence and was subsequently used to detect the presence of the cDNA corresponding to the H76041 EST in the cDNA library. DNA from the library was transformed into E. coli and clones were divided into pools of approximately 20,000 clones each. DNA from individual pools was screened by PCR with different primer pairs, and the positive pools were subsequently subdivided so that the number of clones became smaller and smaller. Isolation of single positive clones is accomplished by carrying this process through, or in some cases once the pool size has reached 200 clones or less, by colony hybridization probed with the EST sequence. For ESTs N37544 and T22146, clones corresponding to ESTs were successfully isolated and the inserts were sequenced. The cDNA sequence corresponding to EST N37544 was named CIB10295 in plasmid pCIB10295, and the plasmid containing the cDNA corresponding to EST T22146 (CIB10296) was named pCIB10296. For both ESTs, the corresponding genomic sequences are available as part of the sequence of the Arabidopsis BAC clone recently deposited into GenBank. It should be pointed out that the predicted protein sequence of these genome sequences determined by GenBank does not match the sequence determined by direct cDNA sequencing. Therefore, the amino acid sequences of the corresponding genes of ESTN37544 and T22146 were not evident from GenBank entries, but could only be elucidated by cloning and sequencing of cDNA clones. It has been found that clones isolated with primers for the H76041 EST do not contain the gene for H76041, but contain a novel member of the Mlo gene family as an insert. The insert was completely sequenced and this member of the Mlo gene family was named CIB10259 in plasmid pCIB10259.

实施例3:用于小麦中Mlo基因表达的载体的构建Embodiment 3: the construction that is used for the carrier of Mlo gene expression in wheat

构建两个载体用于在小麦中“反义”表达大麦Mlo基因。用大麦cDNA和引物对MLO-5和MLO-7(见上文(1))进行PCR,反应产生1124bp的扩增片段,将其克隆入pGEM-T(Promega)。用SacⅡ和NotⅠ两种酶将此片段从pGEM-T中切出。将此1124bp片段克隆入pBluescript-SK(+)。这次通过BamHI和SacⅠ限制位点将插入片段切出并克隆入BamHⅠ-SacⅠ-消化的pCIB9806(专利申请08/838219中所述),插入方向为其中Mlo编码序列的方向与玉米遍在蛋白启动子相反。质粒被命名为pCK01。Two vectors were constructed for "antisense" expression of the barley Mlo gene in wheat. PCR was performed on MLO-5 and MLO-7 (see (1) above) using barley cDNA and primers, and the reaction generated an amplified fragment of 1124 bp, which was cloned into pGEM-T (Promega). This fragment was excised from pGEM-T with SacII and NotI enzymes. This 1124bp fragment was cloned into pBluescript-SK(+). This time the insert was excised through the BamHI and SacI restriction sites and cloned into BamHI-SacI-digested pCIB9806 (described in patent application 08/838219) with the orientation of the Mlo coding sequence aligned with the maize ubiquitin promoter Son is the opposite. The plasmid was named pCK01.

为了构建载体用于小麦中完整Mlo基因的“反义”表达,用引物对MLO-1(5’ATG TCG GAC AAA AAA GGG GT3’(SEQ ID NO:19))和MLO-10(见上文(1))进行PCR,反应产生635bp的扩增片段,将其克隆入pCR2.1(Invitrogen)。将此片段作为EcoRⅠ片段从pCR2.1中切出,并插入pGEM-9Zf(-)(Promega)。用SacⅠ和BstⅪ将跨越Mlo中天然存在的SacⅠ位点至引物位点MLO-10的320个核苷酸片段切出。用SacⅠ和BstⅪ消化pCK01,将此320个碱基片段插入。为了在单子叶植物表达载体中Mlo基因的完整构建,从pGEM-9Zf(-)衍生物中切出210个核苷酸的SacⅠ片段。该片段包含Mlo编码序列的5’端,从引物位点MLO-1至Mlo基因中天然存在的SacⅠ位点。用SacⅠ消化pCK01衍生物并将此210个碱基的片段插入。用引物MLO-1和MLO-10通过PCR分析克隆以确定此210个碱基的片段在新构建载体中的方向。只有其中210个碱基的片段以相对于遍在蛋白启动子反义的方向插入的克隆产生相应于Mlo编码序列5’末端的530个碱基对的产物。产生的质粒包含相对于遍在蛋白启动子处于“反义”方向的完整Mlo编码序列,并被命名为pCK02。In order to construct a vector for "antisense" expression of the complete Mlo gene in wheat, the primer pair MLO-1 (5'ATG TCG GAC AAA AAA GGG GT3' (SEQ ID NO: 19)) and MLO-10 (see above (1)) Perform PCR to generate a 635bp amplified fragment, which is cloned into pCR2.1 (Invitrogen). This fragment was excised from pCR2.1 as an EcoRI fragment and inserted into pGEM-9Zf(-) (Promega). A 320 nucleotide fragment spanning the naturally occurring SacI site in Mlo to primer site MLO-10 was excised with SacI and BstXI. pCK01 was digested with SacI and BstXI, and the 320-base fragment was inserted. For the complete construction of the Mlo gene in a monocot expression vector, a SacI fragment of 210 nucleotides was excised from the pGEM-9Zf(-) derivative. This fragment contains the 5' end of the Mlo coding sequence from primer site MLO-1 to the naturally occurring SacI site in the Mlo gene. The pCK01 derivative was digested with SacI and the 210 base fragment was inserted. Clones were analyzed by PCR using primers MLO-1 and MLO-10 to determine the orientation of this 210 base fragment in the newly constructed vector. Only the clone in which the 210 base fragment was inserted in an antisense orientation relative to the ubiquitin promoter produced a product corresponding to the 530 base pair 5' end of the Mlo coding sequence. The resulting plasmid contained the entire Mlo coding sequence in an "antisense" orientation relative to the ubiquitin promoter and was designated pCK02.

为了构建载体用于表达“有义”方向的Mlo基因,用BamHⅠ消化质粒pCK02以释放作为插入片段的Mlo编码序列。将BamHⅠ片段再连接回pCK02基本载体。通过SacⅠ消化鉴定所带Mlo编码序列相对于pCK02为反向的菌落,在这样的克隆中产生1.8kb片段,相比之下,与pCK02相同结构的克隆中得到210个碱基片段。选择所带Mlo编码序列相对于玉米遍在蛋白启动子为“有义”方向的克隆,命名为pCK03。To construct a vector for expression of the Mlo gene in "sense" orientation, plasmid pCK02 was digested with BamHI to release the Mlo coding sequence as the insert. The BamHI fragment was ligated back into the pCK02 base vector. Colonies with the Mlo coding sequence reversed relative to pCK02 were identified by SacI digestion, yielding a 1.8 kb fragment in such clones compared to a 210 base fragment in clones of the same structure as pCK02. The clone carrying the Mlo coding sequence in the "sense" orientation relative to the maize ubiquitin promoter was selected and named pCK03.

实施例4:用于在拟南芥中表达Mlo基因的载体的构建Embodiment 4: be used for the construction of the vector expressing Mlo gene in Arabidopsis

将pCIB10259、pCIB10295和pCIB10296以及pCK02(大麦Mlo基因)中的Mlo克隆用于PCR反应中,产生的带含侧翼有BamHⅠ限制位点的全长基因。所用引物序列如下:SAS-1:5’GGA TTA AGA TCT AAT GGC 3’(SEQ ID No:35,用于pCIB10295)SAS-2:5’CAA AGA TCT TCA TTT CTT AAA AG 3’(SEQ ID No:36,用于pCIB10295)SAS-3:5’GCG GAT CCA TGT CGG ACA AAA AAG G 3’(SEQ ID No:37,用于大麦Mlo)SAS-4:5’GCG GAT CCT CAT CCC TGG CTG AAG G 3’(SEQ ID No:38,用于大麦Mlo)SAS-5:5’GGA TCC ACC ATG GCC ACA AGA TG 3’(SEQ ID No:39,用于pCIB10259)SAS-6:5’GGA TCC TTA GTC AAT ATC ATT AGC 3’(SEQ ID No:40,用于pCIB10259)SAS-7:5’GCG GAT CCA TGG GTC ACG GAG GAG AAG 3’(SEQ ID No:41,The Mlo clones in pCIB10259, pCIB10295 and pCIB10296 and pCK02 (barley Mlo gene) were used in PCR reactions and the resulting bands contained the full length gene flanked by BamHI restriction sites. The primer sequences used are as follows: SAS-1: 5' GGA TTA AGA TCT AAT GGC 3' (SEQ ID No: 35, used for pCIB10295) SAS-2: 5' CAA AGA TCT TCA TTT CTT AAA AG 3' (SEQ ID No : 36, for pCIB10295) SAS-3: 5' GCG GAT CCA TGT CGG ACA AAA AAG G 3' (SEQ ID No: 37, for barley Mlo) SAS-4: 5' GCG GAT CCT CAT CCC TGG CTG AAG G 3' (SEQ ID No: 38, for barley Mlo) SAS-5: 5' GGA TCC ACC ATG GCC ACA AGA TG 3' (SEQ ID No: 39, for pCIB10259) SAS-6: 5' GGA TCC TTA GTC AAT ATC ATT AGC 3' (SEQ ID No: 40, for pCIB10259) SAS-7: 5' GCG GAT CCA TGG GTC ACG GAG GAG AAG 3' (SEQ ID No: 41,

用于pCIB10269)SAS-8:5’GCG GAT CCT CAG TTG TTA TGA TCA GGA 3’(SEQ ID No:42,For pCIB10269) SAS-8: 5' GCG GAT CCT CAG TTG TTA TGA TCA GGA 3' (SEQ ID No: 42,

用于pCIB10296)for pCIB10296)

将这些带克隆入pCR2.1-TOPO,测定所产生质粒中插入片段的序列以确认PCR未引入突变。用BamHⅠ消化质粒,纯化插入片段并克隆入BamHⅠ消化的pPEH28,一种含有直接位于BamHⅠ位点下游的一拷贝拟南芥遍在蛋白基因启动子UBQ3(Norris等(1993)植物分子生物学21:895-906)的穿梭载体。鉴定含与UBQ3融合之Mlo序列的克隆,进行限制酶分析,以鉴别含相对于UBQ3为“有义”和“反义”方向之插入片段的克隆。对于各Mlo基因,用XbaⅠ消化含有义方向插入片段的克隆和含反义方向插入片段的克隆,纯化插入片段并克隆入XbaⅠ消化的pCIB200。此方法将UBQ3-Mlo基因融合体置入T-DNA边界之间。These bands were cloned into pCR2.1-TOPO and the insert in the resulting plasmid was sequenced to confirm that no mutations had been introduced by PCR. The plasmid was digested with BamHI, and the insert was purified and cloned into BamHI-digested pPEH28, a gene containing a copy of the Arabidopsis ubiquitin gene promoter UBQ3 located directly downstream of the BamHI site (Norris et al. (1993) Plant Molecular Biology 21: 895-906) shuttle vector. Clones containing the Mlo sequence fused to UBQ3 were identified and restriction enzyme analysis was performed to identify clones containing inserts in the "sense" and "antisense" orientations relative to UBQ3. For each Mlo gene, the clone containing the insert in sense orientation and the clone containing the insert in antisense orientation were digested with XbaI, and the insert was purified and cloned into XbaI digested pCIB200. This approach places the UBQ3-Mlo gene fusion between the T-DNA borders.

实施例5:小麦的转化和表达子的鉴定Example 5: Transformation of wheat and identification of expressors

如专利申请WO94/13822中所详述的,通过未成熟胚胎的粒子轰击转化小麦。将在含Basta的培养基中再生小植株并进行PCR分析。为通过PCR判断Mlo转基因的存在,所用引物如下:Wheat was transformed by particle bombardment of immature embryos as detailed in patent application WO 94/13822. Plantlets will be regenerated in Basta-containing medium and subjected to PCR analysis. To determine the presence of the Mlo transgene by PCR, the primers used were as follows:

MLO-3:    5’ATG CTA CCA CAC GCA GAT CG 3’MLO-3: 5’ATG CTA CCA CAC GCA GAT CG 3’

ST27:     5’ACT TCT GCA GGT CGA CTC TA 3’ST27: 5’ACT TCT GCA GGT CGA CTC TA 3’

引物MLO-3相应于Mlo转基因的区域,而引物ST27位于玉米遍在蛋白启动子序列内。在PCR中同时使用Mlo基因和遍在蛋白启动子引物消除了使用两个Mlo引物可引起的假阳性,这种假阳性可能是从存在于小麦中的Mlo基因染色体拷贝引发的。Primer MLO-3 corresponds to the region of the Mlo transgene, while primer ST27 is located within the maize ubiquitin promoter sequence. The simultaneous use of Mlo gene and ubiquitin promoter primers in PCR eliminates false positives that can arise from the use of two Mlo primers, which could be elicited from the chromosomal copy of the Mlo gene present in wheat.

将确认含Mlo转基因的植物进行RNA凝胶印迹分析,以确定它们是否含改变水平的染色体编码小麦Mlo mRNA。自各转基因系制备Poly-A+RNA,并印迹到Hybond-N+滤膜上。用相应于Mlo基因5’末端的530个碱基片段探测印迹。该区域在pCK01克隆不存在;因此,在含pCK01的转基因系中不存在与自转基因表达之反义RNA的杂交。对于其中转基因包括该5’末端片段的pCK02转基因系,探针与不同大小的两条带杂交。一条相应于反义转基因的约2.5kb的mRNA与来自小麦染色体编码Mlo基因的2.0kb mRNA有明显区分。监测2.0kbmRNA的丰度,作为各株系中转基因所达到的基因抑制效率的衡量。Plants confirmed to contain the Mlo transgene were subjected to RNA gel blot analysis to determine whether they contained altered levels of chromosomally encoded wheat Mlo mRNA. Poly-A + RNA was prepared from each transgenic line and blotted onto Hybond-N+ filters. The blot was probed with a 530 base fragment corresponding to the 5' end of the Mlo gene. This region was absent in the pCK01 clone; therefore, there was no hybridization with antisense RNA expressed from the transgene in the pCK01 -containing transgenic line. For the pCK02 transgenic line in which the transgene included this 5' end fragment, the probe hybridized to two bands of different sizes. An approximately 2.5 kb mRNA corresponding to the antisense transgene was clearly distinguished from the 2.0 kb mRNA from the wheat chromosome encoding the Mlo gene. The abundance of 2.0 kb mRNA was monitored as a measure of the gene suppression efficiency achieved by the transgene in each line.

实施例6:转基因小麦品系的疾病检测Example 6: Disease Detection of Transgenic Wheat Lines

将转基因的和未转化的UC703(对照)小麦品系植物温室培养直到它们两周大。将植物转移入Percival生长腔内(每个循环为:黑暗中8小时,16℃和光亮中16小时,20℃),并通过孢子充分接种禾白粉菌小麦小种。接种两周后评估真菌孢子形成的程度。植物被评估为1(几乎无至根本无菌丝生长和无可见的孢子形成)、2(有一些菌丝生长和孢子形成,但比对照植物少)或3(有可与对照相比的菌丝生长和孢子形成)。表达Mlo-构建体的转基因小麦品系显示出对病原体的抗性增强。用反义大麦Mlo构建体获得的结果例子如下文所列。Transgenic and non-transformed UC703 (control) wheat line plants were grown in the greenhouse until they were two weeks old. Plants were transferred into Percival growth chambers (per cycle: 8 hours in the dark, 16°C and 16 hours in the light, 20°C) and thoroughly inoculated with Erysipha graminearum tritici with spores. The extent of fungal sporulation was assessed two weeks after inoculation. Plants were rated as 1 (little to no mycelial growth and no visible sporulation), 2 (some mycelial growth and sporulation, but less than control plants) or 3 (comparable to control plants). silk growth and spore formation). Transgenic wheat lines expressing Mlo-constructs show increased resistance to pathogens. Examples of results obtained with antisense barley Mlo constructs are listed below.

筛选转基因品系R1和R2姊妹株的疾病抗性Screening of transgenic lines R1 and R2 sister strains for disease resistance

种植Mlo反义转化子的姊妹株(T2种子),接种禾白粉菌,并评估疾病抗性。 R1 R2 UC703(对照) 种植的种子 24 24 24 发芽 14 24 21 疾病评分1 4 2 0 疾病评分2 0 0 0 疾病评分3 10 22 21 A sister strain of the Mlo antisense transformant (T2 seeds) was planted, inoculated with Erysiphe graminis, and assessed for disease resistance. R1 R2 UC703 (control) planted seeds twenty four twenty four twenty four germination 14 twenty four twenty one disease score 1 4 2 0 disease score 2 0 0 0 disease score 3 10 twenty two twenty one

小百分率的R1和R2植物展示有抗性可能是由于被检测的是仍然与转基因分离的T2种群。The small percentage of R1 and R2 plants exhibiting resistance may be due to the fact that the tested T2 population was still segregated from the transgene.

实施例7:表达Mlo基因的拟南芥品系分析Example 7: Analysis of Arabidopsis lines expressing the Mlo gene

通过真空渗透将含Mlo基因的pCIB200衍生物用于转化拟南芥属生态型Ws-O(Bechtold,N.,Ellis,J.和Pelletier,G.(1993)C.R.Acad.Sci。Paris316,1194-1199)。通过卡那霉素选择筛选后代以鉴定转化子。对于Mlo转基因品系,通过RNA凝胶印迹分析鉴定表达Mlo的植物。对于Mlo基因,通过RNA凝胶印迹分析法分析转化子在mRNA积累稳定状态水平上的变化。检测呈现靶基因有义或反义抑制的转化子与致植物病真菌二孢白粉菌和寄生霜霉及细菌病原体丁香假单胞菌番茄致病变种反应的改变。用台盼蓝染色宏观和微观地检查转基因植物的叶,从而检测坏疽的存在。The pCIB200 derivative containing the Mlo gene was used to transform the Arabidopsis ecotype Ws-O by vacuum infiltration (Bechtold, N., Ellis, J. and Pelletier, G. (1993) C. R. Acad. Sci. Paris 316 , 1194-1199). Progeny were screened by kanamycin selection to identify transformants. For Mlo transgenic lines, plants expressing Mlo were identified by RNA gel blot analysis. For the Mlo gene, transformants were analyzed for changes in the steady-state level of mRNA accumulation by RNA gel blot analysis. Transformants exhibiting sense or antisense suppression of the target gene were detected for altered responses to the phytopathogenic fungi Erysipha bisporus and Peronospora parasitica and the bacterial pathogen Pseudomonas syringae pv tomato. The presence of necrosis was detected by macroscopically and microscopically examining the leaves of the transgenic plants with trypan blue staining.

对于白粉菌接种,将孢子充分施用于拟南芥花结,使植物在25℃保持于Percival生长腔中。接种10天后评估真菌孢子形成的程度。实施例8:利用Mlo序列间的相似区分离另外的Mlo基因家族成员For powdery mildew inoculation, spores were applied generously to Arabidopsis rosettes and the plants were maintained at 25°C in Percival growth chambers. The extent of fungal sporulation was assessed 10 days after inoculation. Example 8: Isolation of Additional Mlo Gene Family Members Using Similarity Between Mlo Sequences

预计由Mlo基因编码的氨基酸序列的对比显示在所有的基因产物间有许多短的高度氨基酸类似区。设计这些区域的简并引物,按PCR试剂供应商的推荐方法用这些引物进行PCR反应。扩增片段用作探针分离新Mlo基因的全长cDNA或基因组克隆。在本发明Mlo蛋白质(粗线)与用于另外Mlo基因分离的简并寡核苷酸之间的氨基酸序列保守区如下所示:Alignment of the amino acid sequences predicted to be encoded by the Mlo genes revealed many short regions of high amino acid similarity among all gene products. Design degenerate primers for these regions, and perform PCR reactions with these primers according to the PCR reagent supplier's recommended method. The amplified fragments were used as probes to isolate full-length cDNA or genomic clones of novel Mlo genes. The conserved region of amino acid sequence between the Mlo protein of the invention (bold line) and the degenerate oligonucleotides used for isolation of additional Mlo genes is shown below:

      E   L   M   X1  X2  G   X3   I   S   L   L   L   X4WHEATTrMlo1   GAG CTC ATG CTG GTG GGC TTC ATCTrMlo2   GAG CTG ATG CTG GTG GGG TTC ATCTrMlo3   GAG CTG ATG CTG GTG GGA TTC ATCARABIDOPSISCIB10259 GAG CTG ATG ATT CTA GGA TTC ATTCIB10295 GAG CTT ATG CTG TTG GGA TTC ATACIB10296 GAG CTG ATG TTG TTA GGG TTT ATAF19850   GAG CTG ATG GTT CTT GGA TTC ATCU95973   GAG TTG ATG TTG CTG GGA CTT ATA5’ GAG CTB ATG MTB BTR GGM TTC AT 3’X5   T   X6   P   L   X7    X8   X9   V   X10   Q   M   G   SWHEATTrMlo1                           GCG  CTC  GTC  ACA  CAG ATG GGA TCATrMlo2                           GCG  CTC  GTC  ACA  CAG ATG GGA TCGTrMlo3                           GCG  CTA  GTC  ACA  CAG ATG GGA TCAARABIDOPSISCIB10259                         GCA  CTA  GTT  ACT  CAG ATG GGT TCACIB10295                         GCA  CTT  GTT  ACT  CAG ATG GGT AGTCIB10296                         GCC  ATC  GTC  TCA  CAG ATG GGA AGTF19850                           GCA  CTC  GTA  ACT  CAG ATG GGT TCTU95973                           GTA  ATC  GTT  ACT  CAG ATG GGA TCT5’         WCC  CAT  CTG  AGT  GAC DAG BGC RTA 3’E   L   M   X1  X2  G   X3   I   S   L   L   L   X4WHEATTrMlo1   GAG CTC ATG CTG GTG GGC TTC ATCTrMlo2   GAG CTG ATG CTG GTG GGG TTC ATCTrMlo3   GAG CTG ATG CTG GTG GGA TTC ATCARABIDOPSISCIB10259 GAG CTG ATG ATT CTA GGA TTC ATTCIB10295 GAG CTT ATG CTG TTG GGA TTC ATACIB10296 GAG CTG ATG TTG TTA GGG TTT ATAF19850   GAG CTG ATG GTT CTT GGA TTC ATCU95973   GAG TTG ATG TTG CTG GGA CTT ATA5' GAG CTB ATG MTB BTR GGM TTC AT 3'X5   T   X6   P   L   X7    X8   X9   V   X10   Q   M G   SWHEATTrMlo1                           GCG  CTC  GTC  ACA  CAG ATG GGA TCATrMlo2                           GCG  CTC  GTC  ACA  CAG ATG GGA TCGTrMlo3                           GCG  CTA  GTC  ACA  CAG ATG GGA TCAARABIDOPSISCIB10259                         GCA  CTA  GTT  ACT  CAG ATG GGT TCACIB10295                         GCA  CTT  GTT  ACT  CAG ATG GGT AGTCIB10296                         GCC  ATC  GTC  TCA  CAG ATG GGA AGTF19850 GCA CTC GTA ACT CAG ATG GGT TCTU95973 GTA ATC GTT ACT CAG ATG GGA TCT5' WCC CAT CTG AGT GAC DAG BGC RTA 3

X1=L、V或I,X2=V或L,X3=F或L,X4=T、S或A。X1=L, V or I, X2=V or L, X3=F or L, X4=T, S or A.

X5=I、V、S或G,X6=F、L或V,X7=Y或N,X8=A或V,X9=L或I,X10=T或S。X5=I, V, S or G, X6=F, L or V, X7=Y or N, X8=A or V, X9=L or I, X10=T or S.

R=A、G,Y=C、T,M=A、C,K=G、T,S=C、G,W=A、T,H=A、C、T,B=C、G、T,V=A、C、G,D=A、G、T,N=A、C、G、T。R=A, G, Y=C, T, M=A, C, K=G, T, S=C, G, W=A, T, H=A, C, T, B=C, G, T, V=A, C, G, D=A, G, T, N=A, C, G, T.

实施例9:编码序列和邻近序列的修饰Example 9: Modification of Coding Sequences and Adjacent Sequences

为了在转基因植物宿主中表达,本申请中所述的DNA分子可被修饰以完成和优化或负调控它们的表达。以下问题可能会遇到,可以用本领域众所周知的技术进行这些DNA分子的修饰。For expression in transgenic plant hosts, the DNA molecules described in this application can be modified to accomplish and optimize or negatively regulate their expression. The following problems may be encountered and modifications of these DNA molecules can be carried out using techniques well known in the art.

(1)密码子用法。一些植物中的偏爱密码子用法与另一些植物物种是不一样的。植物进化一般趋向于在单子叶植物第三个碱基位置较偏爱核苷酸C和G,而双子叶植物在此位置通常用核苷酸A或T。通过修饰基因掺入对于特殊目标转基因物种优选的密码子,将克服下述许多关于GC/AT含量和不合理剪接的问题。(1) Codon usage. The preferred codon usage in some plants is different from that in other plant species. Plant evolution generally tends to prefer nucleotides C and G at the third base position of monocots, while dicots usually use nucleotides A or T at this position. By modifying the gene to incorporate codons preferred for the particular transgenic species of interest, many of the problems described below with respect to GC/AT content and irrational splicing will be overcome.

(2)GC/AT含量。植物基因的GC含量通常超过35%。富含A和T核苷酸的DNA分子在植物中会引起许多问题。首先,ATTTA基元被认为会使信使不稳定,已发现于许多短寿mRNA的3’末端。其次,在信使中的不恰当位置存在如AATAAA之类聚腺苷酸化信号被认为会造成转录提前截止。此外,单子叶植物可能会将富含AT序列识别为剪接位点(见下文)。(2) GC/AT content. The GC content of plant genes usually exceeds 35%. DNA molecules rich in A and T nucleotides can cause many problems in plants. First, the ATTTA motif is thought to destabilize the messenger and has been found at the 3' end of many short-lived mRNAs. Second, the presence of polyadenylation signals such as AATAAA at inappropriate positions in the messenger is thought to cause premature termination of transcription. In addition, monocots may recognize AT-rich sequences as splice sites (see below).

(3)邻近起始甲硫氨酸的序列。现认为核糖体附着于信使的5’末端并寻找第一个ATG以起始翻译。不过,我们相信存在对某些ATG邻近核苷酸的偏爱,在ATG处包含一新的共有翻译起始密码子可增强本发明DNA分子的表达。Clontech(1993/1994目录,第210页)已提出一序列可作为共有翻译起始区用于植物中大肠杆菌uidA基因的表达。此外,Joshi(NAR 15:6643-6653(1987))已比较了许多邻近ATG的植物序列并揭示了一共有序列。当植物中DNA分子表达遇到困难时,在起始ATG处含这些序列之一有可能促进翻译。在这样的情况下,由于其第二个氨基酸的修饰,共有区的最后三个核苷酸可能不适合包含于被修饰序列中。不同植物种类之间邻近起始甲硫氨酸的优选序列可能是不同的。位于GenBank数据库中14个玉米基因的调查得到结果如下:(3) Sequence adjacent to the starting methionine. It is thought that the ribosome attaches to the 5' end of the message and seeks the first ATG to initiate translation. However, we believe that there is a preference for certain nucleotides adjacent to the ATG and that inclusion of a novel consensus translation initiation codon at the ATG enhances expression of the DNA molecules of the invention. Clontech (1993/1994 catalog, p. 210) has proposed a sequence as a consensus translation initiation region for expression of the E. coli uidA gene in plants. In addition, Joshi (NAR 15:6643-6653 (1987)) has compared many plant sequences adjacent to the ATG and revealed a consensus sequence. Having one of these sequences at the initiation ATG may facilitate translation when expression of the DNA molecule is difficult in plants. In such cases, the last three nucleotides of the consensus region may not be suitable for inclusion in the modified sequence due to the modification of its second amino acid. The preferred sequence adjacent to the starting methionine may vary between different plant species. A survey of 14 maize genes located in the GenBank database yielded the following results:

14个玉米基因中起始ATG前的位置The position before the start ATG in 14 maize genes

    -10  -9   -8   -7   -6   -5   -4   -3   -2   -1-10 -9 -8 -7 -6 -5 -4 -3 -2 -1

C    3    8    4    6    2    5    6    0    10   7C 3 8 4 6 2 5 6 0 10 7

T    3    0    3    4    3    2    1    1    1    0T 3 0 3 4 3 2 1 1 1 0

A    2    3    1    4    3    2    3    7    2    3A 2 3 1 4 3 2 3 7 2 3

G    6    3    6    0    6    5    4    6    1    5G 6 3 6 0 6 5 4 6 1 5

可对将插入核苷酸序列的预期植物种类进行此项分析,并对邻近ATG的序列进行修饰以掺入偏爱的核苷酸。This analysis can be performed on the expected plant species into which the nucleotide sequence will be inserted, and the sequence adjacent to the ATG modified to incorporate the preferred nucleotide.

(4)不合理剪接位点的去除。本发明DNA分子还可能包含植物中会被识别为5’或3’剪接位点的基元,并被切割,从而产生截短或缺失的信使。用本领域众所周知的技术可去除这些位点。(4) Removal of unreasonable splicing sites. The DNA molecules of the invention may also contain motifs which in plants would be recognized as 5' or 3' splice sites, and be cleaved, resulting in truncated or deleted messages. These sites can be removed using techniques well known in the art.

(5)显性失活突变体的产生(5) Generation of dominant negative mutants

此外,本发明DNA分子还可包括被修饰,从而本发明核苷酸序列编码的蛋白质活性被改变的分子。通过蛋白质显性失活突变体在转基因植物中的表达,导致内源蛋白质的活性丧失,可达到此结果。Mlo核苷酸序列中导致这种显性失活突变体产生的突变位点列举如下。可引入列举如下的单个突变或不同突变的组合。

Figure 9980404100401
In addition, the DNA molecules of the present invention may also include molecules that are modified such that the activity of the protein encoded by the nucleotide sequence of the present invention is altered. This result is achieved by expression of a dominant negative mutant of the protein in transgenic plants, resulting in the loss of activity of the endogenous protein. The mutation sites leading to the generation of this dominant negative mutant in the Mlo nucleotide sequence are listed below. A single mutation or a combination of different mutations listed below can be introduced.
Figure 9980404100401

用于修饰编码序列和邻近序列的技术在本领域中是众所周知的。若本发明DNA分子的起始表达水平低且适于如上所述改变序列,则按本领域众所周知的方法完成合成基因的构建。这些方法已被描述于发表的专利内容中,如EP0385962、EP0359472和WO93/07278。在大部分情况下,优选用瞬时检验方法(本领域众所周知)在基因构建体转移入转基因植物中前检测其表达。Techniques for modifying coding sequences and adjacent sequences are well known in the art. If the initial expression level of the DNA molecule of the invention is low and the sequence is amenable to alteration as described above, construction of the synthetic gene is accomplished by methods well known in the art. These methods have been described in published patent content such as EP0385962, EP0359472 and WO93/07278. In most cases, it is preferred to use transient assay methods (well known in the art) to detect expression of the gene construct prior to its transfer into transgenic plants.

实施例10:植物转化载体的构建Embodiment 10: Construction of plant transformation vector

可获得许多转化载体用于植物转化,本发明DNA分子可与任何这样的载体结合使用。所用载体的选择将取决于优选的转化技术和转化的目标物种。对于某些目标物种,可优选不同的抗生素或杀虫剂选择标记。常规用于转化中的选择标记包括使植物抗卡那霉素、巴龙霉素、遗传霉素和相关抗生素的nptII基因(Vieira和Messing,1982,基因19:259-268;Bevan等,1983,自然304:184-187)、编码氨基糖苷3’-腺苷酰转移酶并赋予对链霉素或壮观霉素的抗性的细菌aadA基因(Goldschmidt-Clermont,1991,核酸研究19:4083-4089)、赋予对抗生素潮霉素的抗性的hph基因(Blochlinger和Diggelmann,1984,分子细胞生物学4:2929-2931)和赋予对氨甲喋呤的抗性的dhfr基因(Bourouis和Jarry,1983,EMBO J.2:1099-1104)。其他可使用的标记包括使植物抗杀虫剂膦丝菌素的膦丝菌素乙酰转移酶基因(White等,1990,核酸研究18:1062;Spencer等1990,理论与应用遗传学79:625-631)、编码草甘磷抗性的突变EPSP合成酶基因(Hinchee等,1988,生物/技术6:915-922)、赋予对咪唑啉酮或磺酰脲的抗性的突变乙酰乳酸合成酶(ALS)基因(Lee等,1988,EMBO J.7:1241-1248)、赋予对莠去净的抗性的突变psbA基因(Smeda等,1993,植物生理学103:911-917)或如美国专利5767373所述的突变原卟啉原氧化酶基因。还可使用产生阳性选择的选择标记,诸如磷酸甘露糖异构酶基因。A number of transformation vectors are available for plant transformation, and the DNA molecules of the invention can be used in conjunction with any such vectors. The choice of vector to be used will depend on the preferred transformation technique and the target species for transformation. For certain target species, different antibiotic or insecticide selectable markers may be preferred. Selectable markers routinely used in transformation include the nptII gene which renders plants resistant to kanamycin, paromomycin, geneticin and related antibiotics (Vieira and Messing, 1982, Gene 19:259-268; Bevan et al., 1983, Nature 304:184-187), a bacterial aadA gene encoding aminoglycoside 3'-adenylyltransferase and conferring resistance to streptomycin or spectinomycin (Goldschmidt-Clermont, 1991, Nucleic Acids Res. 19:4083-4089 ), the hph gene conferring resistance to the antibiotic hygromycin (Blochlinger and Diggelmann, 1984, Molecular Cell Biology 4:2929-2931), and the dhfr gene conferring resistance to methotrexate (Bourouis and Jarry, 1983, EMBO J .2:1099-1104). Other useful markers include the phosphinothricin acetyltransferase gene that renders plants resistant to the insecticide phosphinothricin (White et al., 1990, Nucleic Acids Res. 18:1062; Spencer et al. 1990, Theoretical and Applied Genetics 79:625- 631), a mutant EPSP synthase gene encoding glyphosate resistance (Hinchee et al., 1988, Bio/Technology 6:915-922), a mutant acetolactate synthase conferring resistance to imidazolinones or sulfonylureas ( ALS) gene (Lee et al., 1988, EMBO J.7: 1241-1248), a mutant psbA gene conferring resistance to arazine (Smeda et al., 1993, Plant Physiology 103: 911-917) or as in US Patent 5767373 The mutant protoporphyrinogen oxidase gene. Selectable markers that produce positive selection, such as the phosphomannose isomerase gene, can also be used.

还可通过可筛选标记基因的表达完成转化细胞的鉴定,诸如编码氯霉素乙酰转移酶(CAT)、β-葡糖醛酸糖苷酶(GUS)、萤光素酶和绿色荧光蛋白(GFP)或任何可赋予转化细胞独特表型性状的其他蛋白质。Identification of transformed cells can also be accomplished by the expression of selectable marker genes, such as those encoding chloramphenicol acetyltransferase (CAT), β-glucuronidase (GUS), luciferase, and green fluorescent protein (GFP) Or any other protein that confers a unique phenotypic trait on transformed cells.

(1)适于农杆菌属转化之载体的构建(1) Construction of a vector suitable for Agrobacterium transformation

用于根癌农杆菌转化的许多载体是可获得的。这些载体一般携带至少一种T-DNA边界序列,并包括诸如pBIN19(Bevan,核酸研究(1984))和pXYZ之类的载体。下文描述了两个典型载体的构建。A number of vectors are available for Agrobacterium tumefaciens transformation. These vectors typically carry at least one T-DNA border sequence and include vectors such as pBIN19 (Bevan, Nucleic Acids Res. (1984)) and pXYZ. The construction of two typical vectors is described below.

pCIB200和pCIB2001的构建Construction of pCIB200 and pCIB2001

二元载体pCIB200和pCIB2001被用于构建用于农杆菌的重组载体并以下述方式构建。通过用NarI消化pTJS75(Schmidhauser & Helinski,细菌学杂志164:446-455(1985))切出四环素抗性基因,然后插入来自带NPTII(Vieria & Messing,基因19:259-268(1982);Bevan等,自然304:184-187(1983);McBride等,植物分子生物学14:266-276(1990))之pUC4K的AccI片段,产生pTJS75kan。将XhoⅠ接头与含T-DNA左右边界序列、植物nos/nptⅡ可选择嵌合基因和pUC多接头之pCIB7的EcoRV片段连接(Rothstein等,基因53:153-161(1987)),并将XhoⅠ消化片段克隆入Sall消化的pTJS75kan,产生pCIB200(还可参见EP 0332104,实施例19)。pCIB200包含以下的单一多接头限制位点:EcoRⅠ、SstⅠ、KpnⅠ、BglⅡ、XbaⅠ和SalⅠ。pCIB2001是通过插入附加限制位点多接头而产生的pCIB200的衍生物。pCIB2001多接头中的单一限制位点是EcoRⅠ、SstⅠ、KpnⅠ、BglⅡ、XbaⅠ、SalⅠ、MluⅠ、BcⅡ、AvrⅡ、ApaⅠ、HpaⅠ和StuⅠ。除含这些单一限制位点外,pCIB2001还具有植物和细菌卡那霉素选择基因、用于农杆菌介导转化的左右T-DNA边界序列、用于在大肠杆菌和其他宿主之间转移的RK2-衍生trfA功能和同样来自RK2的OriT和OriV功能。pCIB2001适用于克隆含自身调节信号的植物表达盒。Binary vectors pCIB200 and pCIB2001 were used to construct recombinant vectors for Agrobacterium and were constructed in the following manner. The tetracycline resistance gene was excised by digesting pTJS75 with NarI (Schmidhauser & Helinski, J. Bacteriology 164: 446-455 (1985)), and then inserted into the gene from the band NPTII (Vieria & Messing, Gene 19: 259-268 (1982); Bevan et al., Nature 304:184-187 (1983); McBride et al., Plant Mol. Biol. 14:266-276 (1990)) from the AccI fragment of pUC4K, yielding pTJS75kan. The XhoI linker was ligated to the EcoRV fragment of pCIB7 (Rothstein et al., Gene 53:153-161 (1987)) containing the T-DNA left and right border sequences, the plant nos/nptII selectable chimeric gene and the pUC polylinker, and digested with XhoI The fragment was cloned into Sall digested pTJS75kan, resulting in pCIB200 (see also EP 0332104, Example 19). pCIB200 contains the following single polylinker restriction sites: EcoRI, SstI, KpnI, BglII, XbaI and SalI. pCIB2001 is a derivative of pCIB200 generated by insertion of an additional restriction site polylinker. The single restriction sites in the pCIB2001 polylinker are EcoRI, SstI, KpnI, BglII, XbaI, SalI, MluI, BcII, AvrII, ApaI, HpaI and StuI. In addition to containing these unique restriction sites, pCIB2001 also has plant and bacterial kanamycin selection genes, left and right T-DNA border sequences for Agrobacterium-mediated transformation, RK2 for transfer between E. coli and other hosts - Derived trfA function and OriT and OriV functions also from RK2. pCIB2001 is suitable for cloning plant expression cassettes containing self-regulatory signals.

pCIB200及其潮霉素选择衍生物的构建Construction of pCIB200 and its hygromycin selective derivatives

二元载体pCIB10包含可用于植物中选择的卡那霉素抗性编码基因、T-DNA左右边界序列,并掺入广谱质粒pRK252的序列,使其在大肠杆菌和农杆菌中均能复制。Rothstein等(基因53:153-161(1987))描述了它的构建。已构建了多个pCIB10衍生物,其中掺入了Gritz等(基因25:179-188(1983))所述的潮霉素B磷酸转移酶基因。这些衍生物使得能够仅对潮霉素(pCIB743)或对潮霉素和卡那霉素二者(pCIB715、pCIB717)选择转基因植物细胞。The binary vector pCIB10 contains the kanamycin resistance coding gene that can be used for selection in plants, the left and right border sequences of T-DNA, and incorporates the sequence of the broad-spectrum plasmid pRK252, so that it can be replicated in both Escherichia coli and Agrobacterium. Its construction is described by Rothstein et al. (Gene 53:153-161 (1987)). Several pCIB10 derivatives have been constructed incorporating the hygromycin B phosphotransferase gene described by Gritz et al. (Gene 25: 179-188 (1983)). These derivatives enable the selection of transgenic plant cells for hygromycin only (pCIB743) or for both hygromycin and kanamycin (pCIB715, pCIB717).

(2)构建适于非农杆菌转化的载体(2) Construction of vectors suitable for non-Agrobacterium transformation

非根癌农杆菌转化在所选转化载体内无需T-DNA序列,因此除上述含T-DNA序列的载体外,还可利用缺乏这些序列的载体。不依赖于农杆菌的转化技术包括通过粒子轰击、原生质体摄入(如PEG和电穿孔)和显微注射进行的转化。载体的选择很大程度上取决于所选的转化物种。下文描述了一些典型载体的构建。Non-Agrobacterium tumefaciens transformation does not require T-DNA sequences within the chosen transformation vector, so vectors lacking these sequences may also be utilized in addition to the above-described vectors containing T-DNA sequences. Agrobacterium-independent transformation techniques include transformation by particle bombardment, protoplast uptake (eg, PEG and electroporation), and microinjection. The choice of vector depends largely on the chosen transformation species. The construction of some typical vectors is described below.

pCIB3064构建pCIB3064 build

pCIB3064是适于与杀虫剂Basta(或膦丝菌素)选择结合进行直接转移基因技术的pUC-衍生载体。质粒pCIB246含与大肠杆菌GUS基因可操作性融合的CaMV 35S启动子和CaMV 35S转录终止子,并被描述于PCT公开申请WO93/07278中。该载体的35S启动子在起始位点5’端含两个ATG序列。用标准PCR技术突变这些位点,以去除ATG并产生限制性位点SspⅠ和PvuⅡ。新的限制位点离单一SalⅠ位点96和37bp,离准确的起始位点101和42bp。产生的pCIB246衍生物被称为pCIB3025。然后用SalⅠ和SacⅠ消化从pCIB3025中切下GUS基因,补平末端并再连接产生质粒pCIB3060。质粒pJIT82获自John Innes Centre,Norwich,切出含来自绿色产色链霉素之bar基因的400bp SmaⅠ片段并插入pCIB3060的HpaⅠ位点(Thompson等,EMBO J6:2519-2523(1987)))。这样产生了pCIB3064,其中包括在CaMV 35S启动子和终止子控制下的bar基因用于杀虫剂选择、氨苄青霉素抗性基因(用于在大肠杆菌中选择)和带SphⅠ、PstⅠ、HindⅢ和BamHⅠ单一位点的多接头。该载体适于克隆含自身调节信号的植物表达盒。pCIB3064 is a pUC-derived vector suitable for direct gene transfer in combination with selection for the insecticide Basta (or phosphinothricin). Plasmid pCIB246 contains the CaMV 35S promoter and CaMV 35S transcriptional terminator operably fused to the E. coli GUS gene and is described in PCT Published Application WO 93/07278. The 35S promoter of this vector contains two ATG sequences at the 5' end of the initiation site. These sites were mutated using standard PCR techniques to remove the ATG and generate the restriction sites SspI and PvuII. The new restriction sites were 96 and 37 bp from the unique SalI site and 101 and 42 bp from the exact start site. The resulting pCIB246 derivative was named pCIB3025. The GUS gene was then excised from pCIB3025 by digestion with SalI and SacI, blunt-ended and religated to generate plasmid pCIB3060. Plasmid pJIT82 was obtained from the John Innes Centre, Norwich, and the 400 bp SmaI fragment containing the bar gene from green chromogenic streptomycin was excised and inserted into the HpaI site of pCIB3060 (Thompson et al., EMBO J6:2519-2523 (1987))). This generated pCIB3064, which includes the bar gene under the control of the CaMV 35S promoter and terminator for insecticide selection, the ampicillin resistance gene (for selection in E. Multiple adapters at a single site. This vector is suitable for cloning plant expression cassettes containing self-regulatory signals.

pSOG19和pSOG35的构建Construction of pSOG19 and pSOG35

pSOG35是利用大肠杆菌二氢叶酸还原酶(DHFR)作为赋予氨甲喋呤抗性的选择标记的转化载体。用PCR扩增35S启动子(约800bp)、来自玉米Adh1基因的内含子6(约550bp)和来自pSOG10之18bpGUS非翻译前导序列。还用PCR扩增编码大肠杆菌二氢叶酸还原酶类型Ⅱ基因的250bp片段,将这两个PCR片段与来自pBI221(Clontech)的含pUC19载体骨架和胭脂碱合酶终止子之SacI-PstI片段装配在一起。这些片段的装配产生pSOG19,其中含与内含子6序列、GUS前导序列、DHFR基因和胭脂碱合酶终止子融合的35S启动子。用来自玉米褪绿斑驳病毒(Maize Chlorotic Mottle Virus,MCMV)的前导序列置换pSOG19中的GUS前导序列产生载体pSOG35。pSOG19和pSOG35携带氨苄青霉素抗性pUC基因,并具有可用于克隆外源序列的HindⅢ、SphⅠ、PstⅠ和EcoRⅠ位点。pSOG35 is a transformation vector utilizing E. coli dihydrofolate reductase (DHFR) as a selection marker conferring methotrexate resistance. The 35S promoter (about 800 bp), intron 6 from the maize Adhl gene (about 550 bp) and the 18 bp GUS untranslated leader sequence from pSOG10 were amplified by PCR. A 250 bp fragment encoding the E. coli dihydrofolate reductase type II gene was also PCR amplified and these two PCR fragments were assembled with the SacI-PstI fragment from pBI221 (Clontech) containing the pUC19 vector backbone and the nopaline synthase terminator together. Assembly of these fragments generated pSOG19, which contains the 35S promoter fused to the intron 6 sequence, the GUS leader sequence, the DHFR gene and the nopaline synthase terminator. The GUS leader sequence in pSOG19 was replaced with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) to generate vector pSOG35. pSOG19 and pSOG35 carry the ampicillin resistance pUC gene and have HindⅢ, SphⅠ, PstⅠ and EcoRI sites which can be used to clone foreign sequences.

实施例11:构建植物表达盒的要求Example 11: Requirements for constructing plant expression cassettes

首先将预期在转基因植物中表达的基因序列装配入表达盒中,位于恰当的启动子之后及恰当的转录终止子的上游。The gene sequence intended to be expressed in the transgenic plant is first assembled into an expression cassette, behind the appropriate promoter and upstream of the appropriate transcription terminator.

启动子选择promoter selection

用于表达盒中的启动子选择将决定转基因植物中转基因的空间和时间表达模式。所选择的启动子将在特殊细胞类型(如叶表皮细胞、叶肉细胞、根皮层细胞)或者在特殊组织或器官(例如根、叶或花)中表达转基因,该选择将反映本发明DNA分子生物合成的预期位置。或者,所选启动子可驱动基因在光诱导或其他时间调节启动子控制下的表达。另外的选择是所选启动子被化学调节。这就有可能只在需要时诱发通过化学诱导物处理引起核苷酸序列表达。The choice of promoter used in the expression cassette will determine the spatial and temporal expression pattern of the transgene in the transgenic plant. The selected promoter will express the transgene in a particular cell type (e.g. leaf epidermal cells, mesophyll cells, root cortex cells) or in a particular tissue or organ (e.g. root, leaf or flower) and the selection will reflect the DNA molecular biology of the present invention. The expected location of the composition. Alternatively, the selected promoter may drive expression of the gene under the control of a light-inducible or other temporally regulated promoter. Another option is that the selected promoter is chemically regulated. This makes it possible to induce expression of nucleotide sequences by chemical inducer treatment only when required.

转录终止子transcription terminator

可获得多种转录终止子用于表达盒中。它们负责转基因之外的转录终止及其正确的聚腺苷酸化作用。已知在植物中起作用的合适转录终止子均可使用,包括CaMV 35S终止子、tml终止子、胭脂碱合酶终止子、豌豆rbcS E9终止子。它们在单子叶植物和双子叶植物中均可使用。A variety of transcription terminators are available for use in expression cassettes. They are responsible for the termination of transcription outside of the transgene and its correct polyadenylation. Suitable transcription terminators known to function in plants can be used, including the CaMV 35S terminator, tml terminator, nopaline synthase terminator, pea rbcS E9 terminator. They are available in both monocots and dicots.

用于增强或调节表达的序列Sequences for enhancing or regulating expression

已发现许多序列可增强转录单元内的基因表达,这些序列可与本发明基因结合使用以提高它们在转基因植物中的表达。A number of sequences have been found to enhance gene expression within the transcription unit and these sequences can be used in conjunction with the genes of the invention to increase their expression in transgenic plants.

已显示多种内含子序列可增强表达,尤其是在单子叶植物细胞中。例如,已发现玉米Adh1基因内含子在引入玉米细胞时可显著增强其同源启动子控制下的野生型基因表达。发现内含子1特别有效,可增强与氯霉素乙酰转移酶基因的融合构建体的表达(Callis等,基因发育1:1183-1200(1987))。在相同实验体系中,来自玉米bronze1基因的内含子在增强表达方面具有类似作用(Callis等,见上文)。内含子序列已被常规引入植物转化载体中,一般引入到非翻译前导序列内。Various intronic sequences have been shown to enhance expression, especially in monocot cells. For example, it has been found that the intron of the maize Adhl gene, when introduced into maize cells, can significantly enhance the expression of the wild-type gene under the control of its cognate promoter. Intron 1 was found to be particularly effective in enhancing the expression of fusion constructs with the chloramphenicol acetyltransferase gene (Callis et al., Gene Development 1: 1183-1200 (1987)). In the same experimental system, an intron from the maize bronze1 gene had a similar effect in enhancing expression (Callis et al., supra). Intronic sequences have been routinely introduced into plant transformation vectors, typically within the untranslated leader sequence.

已知许多来自病毒的非翻译前导序列可增强表达,它们在双子叶植物细胞中尤其有效。具体地说,已显示来自烟草花叶病毒(TMV,‘Ω序列’)、玉米褪绿斑驳病毒(MCMV)和苜蓿花叶病毒(AMV)的前导序列在增强表达方面有效(如Gallie等,核酸研究15:8693-8711(1987);Skuzeski等,植物分子生物学15:65-79(1990))。Many untranslated leader sequences from viruses are known to enhance expression, and they are particularly effective in dicot cells. Specifically, leader sequences from tobacco mosaic virus (TMV, 'omega sequence'), maize chlorotic mottle virus (MCMV), and alfalfa mosaic virus (AMV) have been shown to be effective in enhancing expression (eg, Gallie et al., Nucleic Acids Research 15:8693-8711 (1987); Skuzeski et al., Plant Mol. Biol. 15:65-79 (1990)).

细胞中基因产物的定向Orientation of gene products in cells

已知在植物中存在将基因产物定向的多种机制,控制这些机制功能的序列已鉴定到一定程度。例如,将基因产物定向于叶绿体是由发现于多种蛋白质氨基末端的信号序列控制的,该信号序列在叶绿体输入期间被切去从而产生成熟蛋白质(如,Comai等,生物化学杂志263:15104-15109(1988))。这些信号序列可与异源基因产物融合从而使异源产物输入叶绿体(van den Broeck等,自然313:358-363(1985))。编码合适信号序列的DNA可分离自下述蛋白质编码cDNA的5’末端:RUBISCO蛋白质、CAB蛋白质、EPSP合成酶、GS2蛋白质和已知定位于叶绿体的许多其他蛋白质。Various mechanisms for directing gene products are known to exist in plants, and sequences controlling the function of these mechanisms have been identified to some extent. For example, targeting of gene products to chloroplasts is controlled by signal sequences found at the amino termini of many proteins, which are cleaved during chloroplast import to produce mature proteins (e.g., Comai et al., J. Biol. Chem. 263:15104- 15109 (1988)). These signal sequences can be fused to heterologous gene products to allow import of the heterologous products into chloroplasts (van den Broeck et al., Nature 313:358-363 (1985)). DNA encoding a suitable signal sequence can be isolated from the 5' end of cDNA encoding the following proteins: RUBISCO protein, CAB protein, EPSP synthetase, GS2 protein and many other proteins known to localize to chloroplasts.

其他一些基因产物定位于诸如线粒体和过氧化物酶体之类的其他细胞器中(如,Unger等,植物分子生物学13:411-418(1989))。也可操纵编码这些产物的cDNA以将异源基因产物定向于这些细胞器。所说的序列例子是核编码ATPase和线粒体的特异性天冬氨酸氨基转移酶同工型。定向于细胞蛋白质体的工作已由Roger等(美国国家科学院院报82:6512-6516(1985))描述。Other gene products localize to other organelles such as mitochondria and peroxisomes (eg, Unger et al., Plant Mol. Biol. 13:411-418 (1989)). The cDNAs encoding these products can also be manipulated to direct heterologous gene products to these organelles. Examples of such sequences are nuclear encoded ATPase and mitochondrial specific aspartate aminotransferase isoforms. Work targeting cellular proteosomes has been described by Roger et al. (Proc. National Academy of Sciences USA 82:6512-6516 (1985)).

此外,还鉴定出了使基因产物定向于其他细胞区室的序列。氨基末端序列负责定向于ER、质外体和糊粉细胞的胞外分泌(Koehler & Ho,植物细胞2:769-783(1990))。此外,氨基端序列与羧基端序列一起负责基因产物的液泡定向(Shinshi等,植物分子生物学14:357-368(1990))。In addition, sequences were identified that direct gene products to other cellular compartments. The amino-terminal sequence is responsible for extracellular secretion directed to the ER, apoplasts, and aleurone cells (Koehler & Ho, The Plant Cell 2: 769-783 (1990)). Furthermore, the amino-terminal sequence, together with the carboxy-terminal sequence, is responsible for the vacuolar orientation of the gene product (Shinshi et al., Plant Mol. Biol. 14:357-368 (1990)).

通过将上述合适的靶向序列与目的转基因序列融合,有可能指导转基因产物至任何细胞器或细胞区室。例如,对于叶绿体定向,可将来自RUBISCO基因、CAB基因、EPSP合成酶基因或GS2基因的叶绿体信号序列读框一致地与转基因的氨基末端ATG融合。所选择的信号序列应包含已知的切割位点,构建融合体应考虑到切割位点后切割所需的任何氨基酸。在某些情况下,通过在切割位点和转基因ATG之间添加少量氨基酸或置换转基因序列中的一些氨基酸可达到此要求。通过体外转录构建体的体外翻译及随后用文献(Bartlett等,在:Edelmann等(编辑)叶绿体分子生物学中的方法,Elsevier,pp1081-1091(1982);Wasmann等,分子与普通遗传学205:446-453(1986))所述技术进行体外叶绿体摄取,可检测为输入叶绿体而构建的融合体的叶绿体摄入效率。这些构建技术在本领域中是众所周知的,并可同样应用于线粒体和过氧化物酶体。对于杀虫毒素可能要求选择定向。通常是胞质的或叶绿体的,尽管在某些情况下也可能是线粒体的或过氧化物酶体的。核酸序列的表达也可能要求定向于ER、质外体或液泡。By fusing the appropriate targeting sequence described above to the transgene sequence of interest, it is possible to direct the transgene product to any organelle or cell compartment. For example, for chloroplast targeting, the chloroplast signal sequence from the RUBISCO gene, CAB gene, EPSP synthetase gene, or GS2 gene can be fused in frame to the amino-terminal ATG of the transgene. The signal sequence chosen should contain a known cleavage site, and fusion construction should take into account any amino acids required for cleavage after the cleavage site. In some cases, this requirement was achieved by adding a small number of amino acids between the cleavage site and the transgene ATG or by replacing some amino acids in the transgene sequence. In vitro translation of the construct by in vitro transcription and subsequent use of literature (Bartlett et al., In: Edelmann et al. (eds.) Methods in Chloroplast Molecular Biology, Elsevier, pp1081-1091 (1982); Wasmann et al., Molecular and General Genetics 205: 446-453 (1986)) for in vitro chloroplast uptake, allowing the detection of chloroplast uptake efficiency of fusions constructed for import into chloroplasts. These construction techniques are well known in the art and are equally applicable to mitochondria and peroxisomes. Selective orientation may be required for insecticidal toxins. Usually cytosolic or chloroplast, although in some cases it may also be mitochondrial or peroxisome. Expression of nucleic acid sequences may also require targeting to the ER, apoplast or vacuole.

上述细胞定向机制不仅可与其同源启动子联合使用,还可与异源启动子联合使用,从而在与定向信号所来源的启动子具不同表达模式的启动子的转录调节下达到特异细胞定向的目的。The above-mentioned cell-directed mechanism can be used not only in combination with its homologous promoter, but also in combination with a heterologous promoter, so as to achieve specific cell-directed activation under the transcriptional regulation of a promoter with a different expression pattern from the promoter from which the directional signal originates. Purpose.

实施例12:表达盒构建的例子Example 12: Example of expression cassette construction

本发明包含在任何植物中可表达的启动子调节下的DNA分子表达,不管启动子的来源如何。此外,本发明包括将任何植物可表达的启动子与DNA分子表达所要求或选择的任何其他序列联合使用。所说的序列包括,但不局限于,转录终止子、增强表达的外来序列(如内含子[如Adh内含子1]、病毒序列[如TMV-Ω])及将基因产物定向至特异细胞器和细胞区室的序列。The present invention encompasses the expression of a DNA molecule under the regulation of a promoter expressible in any plant, regardless of the origin of the promoter. Furthermore, the present invention encompasses the use of any plant expressible promoter in combination with any other sequence required or selected for expression of the DNA molecule. Such sequences include, but are not limited to, transcription terminators, foreign sequences that enhance expression (such as introns [such as Adh intron 1], viral sequences [such as TMV-Ω]) and direct gene products to specific Sequence of organelles and cellular compartments.

组成型表达:CaMV 35S启动子Constitutive expression: CaMV 35S promoter

质粒pCGN1761的构建描述于公开的专利申请EP 0392225中。pCGN1761包含‘双’35S启动子和tml转录终止子,启动子和终止子之间有单一的EcoRⅠ位点,还包括pUC-型骨架。构建具有修饰多接头的pCGN1761衍生物,该多接头中除存在原有EcoRⅠ位点外还包括NotⅠ和XhoⅠ位点。该衍生物被称为pCGN1761ENX。为了使基因在转基因植物中于35S启动子控制下表达,pCGN1761ENX可用于在其多接头内克隆cDNA序列或基因序列(包括微生物ORF序列)。可通过用启动子5’侧的HindⅢ、SphⅠ、SalⅠ和XbaⅠ位点和终止子3’侧的XbaⅠ、BamHⅠ和BglⅠ位点切出这种结构的完整35S启动子-基因序列-tml终止子盒,将其转移入如实施例35中所述之类的转化载体。此外,为了用别的启动子置换,可用HindⅢ、SphⅠ、SalⅠ、XbaⅠ或PstⅠ在5’切割,用多接头限制性位点(EcoRⅠ、NotⅠ或XhoⅠ)中任一个在3’切割,去除双35S启动子片段。The construction of plasmid pCGN1761 is described in published patent application EP 0392225. pCGN1761 contains a 'double' 35S promoter and a tml transcription terminator with a single EcoRI site between the promoter and terminator, and also includes a pUC-type backbone. A pCGN1761 derivative with a modified polylinker including NotI and XhoI sites in addition to the original EcoRI site was constructed. This derivative was called pCGN1761ENX. For gene expression in transgenic plants under the control of the 35S promoter, pCGN1761ENX can be used to clone cDNA sequences or gene sequences (including microbial ORF sequences) within its polylinker. The complete 35S promoter-gene sequence-tml terminator box of this structure can be excised by using the HindIII, SphI, SalI and XbaI sites on the 5' side of the promoter and the XbaI, BamHI and BglI sites on the 3' side of the terminator , which is transferred into a transformation vector such as described in Example 35. In addition, for replacement with another promoter, HindIII, SphI, SalI, XbaI, or PstI can be used to cut at 5', and any of the polylinker restriction sites (EcoRI, NotI, or XhoI) at 3' to remove the double 35S Promoter fragment.

通过优化翻译起始位点修饰pCGN1761ENXModification of pCGN1761ENX by optimizing the translation initiation site

对于本部分所述的任何构建,通过引入能增强翻译的序列可在克隆位点周围进行修饰。这一点在将来自微生物的基因引入植物表达盒中时尤其有用,因为这些基因可能在邻近其甲硫氨酸处不包含适于植物中翻译起始的序列。倘若来自微生物的基因将在其ATG处克隆入植物表达盒,则修饰其插入位点以优化其表达可能比较有用。作为引入数个植物表达优化序列之一的方式例子,参见pCGN1761ENX的修饰(如Joshi,见上文)。As with any of the constructs described in this section, modifications may be made around the cloning site by introducing sequences that enhance translation. This is especially useful when introducing genes from microorganisms into plant expression cassettes, since these genes may not contain, adjacent to their methionine, sequences suitable for translation initiation in plants. If a gene from a microorganism is to be cloned at its ATG into a plant expression cassette, it may be useful to modify its insertion site to optimize its expression. As an example of a way of introducing one of several plant expression optimized sequences, see the modification of pCGN1761ENX (eg Joshi, supra).

在可化学调控型启动子控制下的表达Expression under the control of a chemically regulatable promoter

本部分描述了用所选的任何启动子置换pCGN1761ENX中的双重35S启动子;描述的一个例子是用化学调节的RP-1a启动子置换。所选启动子优选通过限制酶从其天然来源中切出,但或者也可用带合适末端限制位点的引物进行PCR扩增。若进行PCR扩增,则在扩增启动子克隆入目标载体后应将启动子再测序以检查有无扩增错误。从质粒pCIB1004(见EP 0332104,关于构建的实施例21)中切割出可化学调控的烟草PR-1a启动子并将其转移入质粒pCGN1761ENX。用NcoI切割pCIB1004,并通过T4 DNA聚合酶的处理将产生的线性片段3’突出端补平。然后用HindⅢ切割此片段,产生的含PR-1a启动子的片段凝胶纯化并克隆入双重35S启动子已切除的pCGN1761ENX。这可通过用XhoⅠ切割并用T4聚合酶补平,随后用HindⅢ切割并分离含克隆入pCIB1004启动子片段的含载体一终止子的较大片段完成。这样就产生了含PR-1a启动子和tml终止子及具单一EcoRⅠ和NotⅠ位点之插入多接头的pCGN1761ENX衍生物。本发明的DNA分子可插入该载体中,融合产物(即启动子-基因-终止子)可随后被转移入任何选择的转化载体中,包括本申请中所述的载体。This section describes the replacement of the dual 35S promoter in pCGN1761ENX with any promoter of choice; one example described is replacement with the chemically regulated RP-1a promoter. The selected promoter is preferably excised from its natural source by restriction enzymes, but may alternatively be PCR amplified using primers with appropriate terminal restriction sites. If PCR amplification is performed, after the amplified promoter is cloned into the target vector, the promoter should be re-sequenced to check for amplification errors. The chemically regulatable tobacco PR-1a promoter was excised from plasmid pCIB1004 (see EP 0332104, Example 21 for construction) and transferred into plasmid pCGN1761ENX. pCIB1004 was cut with NcoI, and the 3' overhang of the resulting linear fragment was filled in by T4 DNA polymerase treatment. This fragment was then cleaved with HindIII and the resulting fragment containing the PR-1a promoter was gel purified and cloned into pCGN1761ENX in which the dual 35S promoter had been excised. This was accomplished by cutting with XhoI and filling in with T4 polymerase, followed by cutting with HindIII and isolating the larger fragment containing the vector-terminator cloned into the pCIB1004 promoter fragment. This yielded a pCGN1761ENX derivative containing the PR-1a promoter and tml terminator and an inserted polylinker with unique EcoRI and NotI sites. The DNA molecule of the invention can be inserted into this vector and the fusion product (ie promoter-gene-terminator) can then be transferred into any transformation vector of choice, including the vectors described in this application.

组成型表达:肌动蛋白启动子Constitutive Expression: Actin Promoter

已知数种肌动蛋白同种型表达于大部分细胞类型中,因此肌动蛋白启动子是组成型启动子的好选择。尤其是,来自稻Act1基因的启动子已被克隆和鉴定(McElroy等,植物细胞2:163-171(1990))。已发现该启动子的1.3kb片段,包含稻原生质体中表达所需的所有调节元件。此外,已构建了许多以ActⅠ启动子为基础、专门用于单子叶植物的表达载体(McElroy等,分子与普通遗传学231:150-160(1991))。它们中掺入了Act1-内含子1、Adh1 5’侧翼序列和Adh1-内含子1(来自玉米乙醇脱氢酶基因)和来自CaMV 35S启动子的序列。显示最高表达的载体是35S和Act1内含子的融合体或Act1 5’侧翼序列和Act1内含子的融合体。起始ATG(GUS报道基因的)周围序列的优化也增强了表达。McElroy等(分子与普通遗传学231:150-160(1991))描述的启动子表达盒可容易地被修饰以用于本发明DNA分子的表达,其尤其适用于单子叶植物宿主。例如,含启动子的片段可从McElroy构建体中切出并用于置换pCGN1761ENX中的双35S启动子,该载体然后可用于插入特异基因序列。这样构建的融合基因可随后被转移入合适的转化载体中。在个别报道中还已发现稻Act1启动子与其第一个内含子可指导人工培养大麦细胞中的高表达(Chibbar等,Plant CellRep.12:506-509(1993))。Several actin isoforms are known to be expressed in most cell types, so the actin promoter is a good choice for a constitutive promoter. In particular, the promoter from the rice Act1 gene has been cloned and characterized (McElroy et al., The Plant Cell 2: 163-171 (1990)). A 1.3 kb fragment of this promoter was found to contain all the regulatory elements required for expression in rice protoplasts. In addition, a number of expression vectors based on the ActI promoter have been constructed specifically for monocotyledonous plants (McElroy et al., Molecular and General Genetics 231: 150-160 (1991)). They incorporated Act1-intron 1, Adh1 5' flanking sequences and Adh1-intron 1 (from the maize alcohol dehydrogenase gene) and sequences from the CaMV 35S promoter. The vectors showing the highest expression were fusions of 35S and the Act1 intron or fusions of the Act1 5' flanking sequence and the Act1 intron. Optimization of the sequence surrounding the initial ATG (of the GUS reporter gene) also enhanced expression. The promoter expression cassette described by McElroy et al. (Molecular and General Genetics 231: 150-160 (1991)) can be readily modified for expression of the DNA molecules of the invention, which is particularly suitable for use in monocot hosts. For example, a promoter-containing fragment can be excised from the McElroy construct and used to replace the double 35S promoter in pCGN1761ENX, which can then be used to insert specific gene sequences. The fusion gene thus constructed can then be transferred into a suitable transformation vector. In individual reports, it has also been found that the rice Act1 promoter and its first intron can direct high expression in artificially cultured barley cells (Chibbar et al., Plant Cell Rep. 12: 506-509 (1993)).

组成型表达:遍在蛋白启动子Constitutive expression: ubiquitin promoter

遍在蛋白是另一已知积累于许多细胞类型中的基因产物,其启动子已自许多物种中克隆到,已用于转基因植物(如向日葵-Binet等,植物科学79:87-94(1991),玉米-Christensen等,植物分子生物学12:619-632(1989))。玉米遍在蛋白启动子已被用于转基因单子叶植物系统中,其序列和为单子叶植物转化所构建的载体公开于专利申请EP0342926中。此外,Taylor等(Plant Cell Rep.12:491-495(1993))描述了含玉米遍在蛋白启动子和第一内含子的载体(pAHC25),当其通过粒子轰击引入时在许多单子叶植物细胞悬浮物中有高活性。遍在蛋白启动子完全适于本发明DNA分子在转基因植物中的表达,尤其是单子叶植物。合适的载体是通过引入合适的遍在蛋白启动子和/或内含子序列而加以修饰的pAHC25衍生物或本申请中所述的任一个转化载体。Ubiquitin is another gene product known to accumulate in many cell types, and its promoters have been cloned from many species and used in transgenic plants (such as Sunflower-Binet et al., Plant Science 79:87-94 (1991 ), Maize-Christensen et al., Plant Mol. Biol. 12:619-632 (1989)). The maize ubiquitin promoter has been used in transgenic monocotyledonous plant systems, and its sequence and vectors constructed for monocotyledonous plant transformation are disclosed in patent application EP0342926. In addition, Taylor et al. (Plant Cell Rep. 12:491-495 (1993)) described a vector (pAHC25) containing the maize ubiquitin promoter and first intron that, when introduced by particle bombardment, was found in many monocots. Highly active in plant cell suspensions. The ubiquitin promoter is perfectly suitable for the expression of the DNA molecules according to the invention in transgenic plants, especially monocotyledonous plants. Suitable vectors are derivatives of pAHC25 modified by introducing appropriate ubiquitin promoter and/or intron sequences or any of the transformation vectors described in this application.

根特异性表达root-specific expression

本发明核苷酸序列表达的优选模式是根表达。根表达对于控制土壤传播的真菌病原体尤其有用。合适的根启动子是de Framond所述的启动子(FEBS290:103-106(1991)),在公开的专利申请EP0452269中也有描述。该启动子被转移入诸如pCGN1761ENX之类的合适载体中,以便插入核苷酸序列并随后将完整的启动子-基因-终止子盒转移入目的转化载体中。A preferred mode of expression of the nucleotide sequences of the invention is root expression. Root expression is especially useful for controlling soil-borne fungal pathogens. A suitable root promoter is that described by de Framond (FEBS 290:103-106 (1991)) and also described in published patent application EP0452269. The promoter is transferred into a suitable vector such as pCGN1761ENX for insertion of the nucleotide sequence and subsequent transfer of the complete promoter-gene-terminator cassette into the transformation vector of interest.

创伤诱导型启动子wound inducible promoter

创伤诱导型启动子尤其适用于本发明DNA分子的表达,因为它们一般不仅在创伤诱发部位有活性,在植物病原体感染部位也有活性。许多这样的启动子已被描述(如Xu等,植物分子生物学22:573-588(1993),Logemann等,植物细胞1:151-158(1989),Rohrmeier & Lehle,植物分子生物学22:783-792(1993),Firek等,植物分子生物学22:129-142(1993),Warner等,植物杂志3:191-201(1993))并均适用于本发明。Logemann等(见上文)描述了双子叶马铃薯wun1基因的5’上游序列。Xu等(见上文)显示了来自双子叶马铃薯(pin2)的创伤诱导型启动子在单子叶稻中是有活性的。此外,Rohrmeier & Lehle(见上文)描述了可被创伤诱导并可通过标准技术用于分离同源启动子的玉米Wip1 cDNA的克隆。同样地,Firek等(见上文)和Warner等(见上文)已描述了来自单子叶植物Asparagusofficimalis的创伤诱导型基因,它可表达于局部伤口和病原体侵入位点。用本领域众所周知的克隆技术,可将这些启动子转移入合适载体中,与本发明的DNA分子融合,并用于在植物病原体感染位置表达这些基因。Wound-inducible promoters are particularly suitable for the expression of the DNA molecules according to the invention, since they are generally active not only at the site of wound induction, but also at the site of infection by phytopathogens. Many such promoters have been described (such as Xu et al., Plant Molecular Biology 22: 573-588 (1993), Logemann et al., Plant Cell 1: 151-158 (1989), Rohrmeier & Lehle, Plant Molecular Biology 22: 783-792 (1993), Firek et al., Plant Mol. Biol. 22: 129-142 (1993), Warner et al., Plant Journal 3: 191-201 (1993)) and are applicable to the present invention. Logemann et al. (supra) describe the 5' upstream sequence of the dicotyledonous potato wun1 gene. Xu et al. (supra) showed that a wound-inducible promoter from dicotyledonous potato (pin2) was active in monocotyledonous rice. In addition, Rohrmeier & Lehle (supra) describe the cloning of a maize Wipl cDNA that is inducible by wounding and can be used to isolate a homologous promoter by standard techniques. Likewise, Firek et al. (supra) and Warner et al. (supra) have described a wound-inducible gene from the monocotyledonous plant Asparagus officimalis that is expressed in local wounds and sites of pathogen entry. Using cloning techniques well known in the art, these promoters can be transferred into suitable vectors, fused to the DNA molecules of the invention, and used to express these genes at the site of plant pathogen infection.

髓优选表达Myeloid-preferred expression

专利申请WO93/07278(Ciba-Geigy)描述了优选表达于髓细胞的玉米trpA基因的分离,并给出了从转录起点延伸至-1726位的启动子基因序列。用标准分子生物学技术可将该启动子或其部分转移入诸如pCGN1761之类的载体中,在其中它可置换35S启动子并被用于以髓优选方式驱动本发明DNA分子的表达。事实上,含髓优选启动子或其部分的片段可被转移入任何载体中并被修饰以便在转基因植物中应用。Patent application WO93/07278 (Ciba-Geigy) describes the isolation of the maize trpA gene preferentially expressed in myeloid cells and gives the sequence of the promoter gene extending from the transcription start to position -1726. Using standard molecular biology techniques, this promoter, or a portion thereof, can be transferred into a vector such as pCGN1761, where it can replace the 35S promoter and be used to drive expression of the DNA molecules of the invention in a myeloid-preferred manner. In fact, fragments containing the pith-preferred promoter or parts thereof can be transferred into any vector and modified for use in transgenic plants.

花粉特异性表达Pollen-specific expression

专利申请WO93/07278(Ciba-Geigy)进一步描述了表达于花粉细胞中的玉米钙依赖性蛋白激酶(CDPK)基因的分离。该基因序列和启动子从转录起点延伸长达1400bp。用标准分子生物学技术可将该启动子或其部分转移入诸如pCGN1761之类的载体中,在其中它可置换35S启动子并被用于驱动本发明DNA分子的表达。事实上,含花粉特异性启动子或其部分的片段可被转移入任何载体中并被修饰以便在转基因植物中应用。Patent application WO93/07278 (Ciba-Geigy) further describes the isolation of the maize calcium-dependent protein kinase (CDPK) gene expressed in pollen cells. The gene sequence and promoter extend up to 1400bp from the start of transcription. Using standard molecular biology techniques, this promoter, or a portion thereof, can be transferred into a vector such as pCGN1761, where it can replace the 35S promoter and be used to drive expression of the DNA molecules of the invention. In fact, fragments containing pollen-specific promoters or parts thereof can be transferred into any vector and modified for use in transgenic plants.

叶特异性表达leaf-specific expression

编码磷酸烯醇羧化酶(PEPC)的玉米基因已被Hudspeth & Grula(植物分子生物学12:579-589(1989))描述。该基因的启动子可通过标准分子生物学技术被用于在转基因植物中以叶特异性方式驱动任何基因的表达。The maize gene encoding phosphoenol carboxylase (PEPC) has been described by Hudspeth & Grula (Plant Mol. Biol. 12:579-589 (1989)). The promoter of this gene can be used to drive the expression of any gene in a leaf-specific manner in transgenic plants by standard molecular biology techniques.

叶绿体定向表达Chloroplast-directed expression

Chen & Jagendorf(生物化学杂志268:2363-2367(1993))已描述了成功应用叶绿体转运肽输入异源转基因。所用的这种肽是来自Nicotiana plumbaginifolia的rbcS基因的转运肽(Poulsen等,分子与普通遗传学205:193-200(1986))。用限制性酶DraⅠ和SphⅠ,或Tsp509I和SphⅠ可从质粒prbcS-8B(Poulsen等,见上文)中切出编码该转运肽的DNA序列并与任一上述构建体一起使用。DraⅠ-SphⅠ片段从相对于起始rbcS ATG的-58位点延伸至,并包括紧接于输入切割位点之后的成熟肽的第一个氨基酸(也是一甲硫氨酸),而TspS09Ⅰ-SphⅠ片段从相对于起始rbcS ATG的-8位点延伸至,并包括成熟肽的第一个氨基酸。这样,这些片段可恰当地插入任何所选表达盒的多接头处,产生与所选启动子(如35S、PR-1a、肌动蛋白、遍在蛋白等)非翻译前导序列融合的转录融合体,同时使本发明DNA分子的插入能正确融合于转运肽的下游。此类构建在本领域是常规性的。例如,DraⅠ末端是平端,而5’Tsp509Ⅰ位点可用T4聚合酶处理补平,或可与接头或衔接子序列连接从而促进它与所选启动子的融合。3’SphⅠ位点可同样保持,或可与接头或衔接子序列连接以促进其插入所选载体中,从而为随后插入本发明DNA分子提供合适的限制性位点。理想的状态是保持SphⅠ的ATG并包含本发明DNA分子的第一个ATG。Chen & Jagendorf(见上文)为叶绿体输入的理想切割提供了共有序列,在每个例子中甲硫氨酸都优选位于成熟蛋白质的第一位。随后的位置可有更多的变动,可能氨基酸不是如此关键。任何情况下都可用Bartlett等(在:Edelmann等(编辑)叶绿体分子生物学中的方法,Elsevier,pp1081-1091(1982)中)和Wasmann等(分子与普通遗传学205:446-453(1986))所述方法体外评估融合构建体的输入效率。一般地说,最好的方法可能是用氨基末端无修饰的本发明DNA分子建立融合体,且只有当这种融合体不能高效输入叶绿体时才引入修饰,这种情况下可按提供的文献进行修饰(Chen & Jagendorf,见上文;Wasman等,见上文;Ko&Ko,生物化学杂志267:13910-13916(1992))。Chen & Jagendorf (J. Biol. Chem. 268:2363-2367 (1993)) have described the successful use of chloroplast transit peptides to import heterologous transgenes. The peptide used was the transit peptide from the rbcS gene of Nicotiana plumbaginifolia (Poulsen et al., Molecular and General Genetics 205: 193-200 (1986)). The DNA sequence encoding this transit peptide can be excised from plasmid prbcS-8B (Poulsen et al., supra) using the restriction enzymes DraI and SphI, or Tsp509I and SphI, and used with any of the above constructs. The DraI-SphI fragment extends from position -58 relative to the starting rbcS ATG to and includes the first amino acid (also a methionine) of the mature peptide immediately following the import cleavage site, whereas the TspS09I-SphI The fragment extends from the -8 position relative to the starting rbcS ATG to and includes the first amino acid of the mature peptide. In this way, these fragments can be appropriately inserted into the polylinker of any expression cassette of choice, resulting in transcriptional fusions fused to the untranslated leader sequence of the promoter of choice (e.g., 35S, PR-1a, actin, ubiquitin, etc.) , while allowing the insertion of the DNA molecule of the present invention to be correctly fused downstream of the transit peptide. Such constructions are routine in the art. For example, the DraI end is blunt, while the 5' Tsp509I site can be filled in by treatment with T4 polymerase, or can be ligated with a linker or adapter sequence to facilitate its fusion to the promoter of choice. The 3'SphI site may likewise be maintained, or may be joined to a linker or adapter sequence to facilitate its insertion into the vector of choice, thereby providing a suitable restriction site for subsequent insertion of the DNA molecule of the invention. Ideally, the ATG of SphI is maintained and contains the first ATG of the DNA molecule of the invention. Chen & Jagendorf (supra) provide consensus sequences for ideal cleavage of chloroplast import, in each case methionine is preferred at the first position of the mature protein. Subsequent positions are subject to more variation, perhaps the amino acid is not so critical. In any case Bartlett et al. (in: Edelmann et al. (eds.) Methods in Chloroplast Molecular Biology, Elsevier, pp1081-1091 (1982)) and Wasmann et al. (Molecular and General Genetics 205:446-453 (1986) ) method for assessing the efficiency of import of a fusion construct in vitro. Generally speaking, the best method may be to use the DNA molecule of the present invention with no modification at the amino terminus to create a fusion, and only introduce the modification when the fusion cannot be efficiently imported into the chloroplast. In this case, it can be carried out according to the provided literature. Modifications (Chen & Jagendorf, supra; Wasman et al., supra; Ko & Ko, J. Biol. Chem. 267: 13910-13916 (1992)).

为利用由其他来源(单子叶植物和双子叶植物)或其他基因的其他GS2叶绿体转运肽编码序列,可进行类似的操作。此外,可遵从类似的流程以达到定向于如线粒体之类其他亚细胞区室的目的。Similar manipulations can be performed to utilize other GS2 chloroplast transit peptide coding sequences from other sources (monocots and dicots) or other genes. Furthermore, similar protocols can be followed for targeting to other subcellular compartments such as mitochondria.

序列表<110>Novartis<120>控制疾病的基因<130>S-30431/A<140>CGC1989<141>1998-03-17<150>US 09/042763<151>1998-03-17<160>42<170>PatentIn Ver.2.0<210>1<211>13<212>PRT<213>人工序列<220><223>人工序列描述:保守的氨基酸序列<400>1Glu Leu Met Xaa Xaa Gly Xaa Ile Ser Leu Leu Leu Xaa1          5                    10<210>2<211>14<212>PRT<213>人工序列<220><223>人工序列描述:保守的氨基酸序列<400>2Xaa Thr Xaa Pro Leu Xaa Xaa Xaa Val Xaa Gln Met Gly Ser1               5           10<210>3<211>1868<212>DNA<213>Triticum sp.<220><221>CDS<222>(176)..(1777)<220><221>misc_feature<222>(365)..(403)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1352)..(1393)<223>SEQ ID NO:2所示氨基酸序列的位置<400>3cacgcccaca cttcgccaac acacaacgta cctgcgtacg tacgctttcc atttcctttc 60ttgctccggc cggccggcca cgtagaatag atacccggcc aggtaggtac ctcgttggct 120cagacgaccg gcggctgggt ctccggacaa ggaaagaggt tgcgctcggg gaccg atg  178Met1gcg gac gac gac gag tac ccc cca gcg agg acg ctg ccg gag acg ccg   226Ala Asp Asp Asp Glu Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr Pro5                  10                  15tcc tgg gcg gtg gcc ctc gtc ttc gcc gtc atg atc atc gtg tcc gtc   274Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser Val20                  25                  30ctc ctg gag cac gcg ctc cat aag ctc ggc cat tgg ttc cac aag cgg   322Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys Arg35                  40                  45cac aag aac gcg ctg gcg gag gcg ctg gag aag atc aag gcg gag ctc   370His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu Leu50                  55                  60                  65atg ctg gtg ggc ttc atc tcg ctg ctg ctc gcc gtg acg cag gac ccc   418Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp Pro70              75                      80atc tcc ggg ata tgc atc tcc gag aag gcc gcc agc atc atg cgg ccc   466Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg Pro85                  90                  95tgc aag ctg ccc cct ggc tcc gtc aag agc aag tac aaa gac tac tac   514Cys Lys Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr Tyr100                 105                 110tgc gcc aaa cag ggc aag gtg tcg ctc atg tcc acg ggc agc ttg cac   562Cys Ala Lys Gln Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu His115                 120                 125cag ctg cac ata ttc atc ttc gtg ctc gcc gtc ttc cat gtc acc tac   610Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr Tyr130                     135             140                 145agc gtc atc atc atg gct cta agc cgt ctc aaa atg aga acc tgg aag   658Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp Lys150                 155                 160aaa tgg gag aca gag acc gcc tcc ctg gaa tac cag ttc gca aat gat   706Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn Asp165                 170                 175cct gcg cgg ttc cgc ttc acg cac cag acg tcg ttc gtg aag cgg cac   754Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg His180                 185                 190ctg ggc ctc tcc agc acc ccc ggc gtc aga tgg gtg gtg gcc ttc ttc   802Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe Phe195                 200                 205agg cag ttc ttc agg tcg gtc acc aag gtg gac tac ctc acc ttg agg   850Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu Arg210                 215                 220                 225gca ggc ttc atc aac gcg cat ttg tcg cat aac agc aag ttc gac ttc   898Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp Phe230                 235                 240cac aag tac atc aag agg tcc atg gag gac gac ttc aaa gtc gtc gtt   946His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val Val245                 250                 255ggc atc agc ctc ccg ctg tgg tgt gtg gcg atc ctc acc ctc ttc ctt   994Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe Leu260                 265                 270gac att gac ggg atc ggc acg ctc acc tgg att tct ttc atc cct ctc   1042Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro Leu275                 280                 285gtc atc ctc ttg tgt gtt gga acc aag ctg gag atg atc atc atg gag   1090Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met Glu290                 295                 300                 305atg gcc ctg gag atc cag gac cgg gcg agc gtc atc aag ggg gcg ccc   1138Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala Pro310                 315                 320gtg gtt gag ccc agc aac aag ttc ttc tgg ttc cac cgc ccc gac tgg   1186Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp Trp325                 330                 335gtc ctc ttc ttc ata cac ctg acg cta ttc cag aac gcg ttt cag atg   1234Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln Met340                 345                 350gca cat ttc gtg tgg aca gtg gcc acg ccc ggc ttg aag aaa tgc ttc   1282Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys Phe355                 360                 365cat atg cac atc ggg ctg agc atc atg aag gtc gtg ctg ggg ctg gct   1330His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu Ala370                 375                 380                 385ctt cag ttc ctc tgc agc tat atc acc ttc ccg ctc tac gcg ctc gtc   1378Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu Val390                 395                 400aca cag atg gga tca aac atg aag agg tcc atc ttc gac gag cag acg   1426Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln Thr405                 410                  415gcc aag gcg ctg aca aac tgg cgg aac acg gcc aag gag aag aag aag   1474Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys Lys420                 425                 430gtc cga gac acg gac atg ctg atg gcg cag atg atc ggc gac gcg acg   1522Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala Thr435                 440                 445ccc agc cga ggg gcg tcg ccc atg cct agc cgg ggc tcg tcg cca gtg   1570Pro Ser Arg Gly Ala Ser Pro Met Pro Ser Arg Gly Ser Ser Pro Val450                 455                 460                 465cac ctg ctt cac aag ggc atg gga cgg tcc gac gat ccc cag agc acg   1618His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser Thr470                 475                 480cca acc tcg cca agg gcc atg gag gag gct agg gac atg tac ccg gtt   1666Pro Thr Ser Pro Arg Ala Met Glu Glu Ala Arg Asp Met Tyr Pro Val485                 490                     495gtg gtg gcg cat cca gtg cac aga cta aat cct gct gac agg aga agg   1714Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg Arg500                 505                 510tcg gtc tcg tcg tcg gca ctc gat gtc gac att ccc agc gca gat ttt   1762Ser Val Ser Ser Ser Ala Leu Asp Val Asp Ile Pro Ser Ala Asp Phe515                 520                 525tcc ttc agc cag gga tgagacaagt ttctgtattg atgttagtcc aatgtatagc   1817Ser Phe Ser Gln Gly530caacatagga tgtcatgatt cgtacaataa gaaatacaaa tttttactga g          1868<210>4<211>534<212>PRT<213>Triticum sp.<400>4Met Ala Asp Asp Asp Glu Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr1               5                  10                  15Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser20              25                      30Val Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys35                 40                   45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                   55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70              75                      80Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Lys Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Gln Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115                 120                 125His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                     135                 140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225                 230                 235                 240She His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275                 280                 285Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405                  410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                  425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Ala Ser Pro Met Pro Ser Arg Gly Ser Ser Pro450                 455                 460Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Thr Pro Thr Ser Pro Arg Ala Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg500                 505                 510Arg Ser Val Ser Ser Ser Ala Leu Asp Val Asp Ile Pro Ser Ala Asp515                 520                 525Phe Ser Phe Ser Gln Gly530<210>5<211>1693<212>DNA<213>Triticum sp.<220><221>CDS<222>(1)..(1602)<220><221>misc_feature<222>(190)..(228)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1177)..(1218)<223>SEQ ID NO:2所示氨基酸序列的位置<400>5atg gcg gag gac tac gag tac ccc ccg gcg cgg acg ctg ccg gag acg 48Met Ala Glu Asp Tyr Glu Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr1               5                  10                  15ccg tcc tgg gcg gtg gcg ctc gtc ttc gcc gtc atg atc atc gtg tcc 96Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser20                  25                  30gtc ctc ctg gag cac gcg ctc cac aag ctc ggc cat tgg ttc cac aag 144Val Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys35                  40                  45cgg cac aag aac gcg ctg gcg gag gcg ctg gag aag atc aaa gcg gag 192Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                  55                  60ctg atg ctg gtg ggg ttc atc tcg ctg ctg ctc gcc gtg acg cag gac 240Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80cca atc tcc ggg ata tgc atc tcc gag aag gcc gcc agc atc atg cgg 288Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95ccc tgc agc ctg ccc cct ggt tcc gtc aag agc aag tac aaa gac tac 336Pro Cys Ser Leu Pro Pro Gly Ser Val Lys Sar Lys Tyr Lys Asp Tyr100                 105                 110tac tgc gcc aaa aag ggc aag gtg tcg cta atg tcc acg ggc agc ttg 384Tyr Cys Ala Lys Lys Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115                 120                 125cac cag ctc cac atg ttc atc ttc gtg ctc gcc gtc ttc cat gtc acc 432His Gln Leu His Met Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                 135                 140tac agc gtc atc atc atg gct cta agc cgt ctc aaa atg agg aca tgg 480Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160aag aaa tgg gag aca gag acc gyc tcc ttg gaa tac cag ttc gca aat 528Lys Lys Trp Glu Thr Glu Thr Xaa Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175gat cct gcg cgg ttc cgc ttc acg cac cag acg tcg ttc gtg aag cgt 576Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                  185                 190cac ctg ggc ctc tcc agc acc ccc ggc atc aga tgg gtg gtg gcc ttc 624His Leu Gly Leu Ser Ser Thr Pro Gly Ile Arg Trp Val Val Ala Phe195                 200                 205ttc agg cag ttc ttc agg tcg gtc acc aag gtg gac tac ctc acc ctg 672Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220agg gca ggc ttc atc aac gcg cat ttg tcg cat aac agc aag ttc gac 720Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225                 230                 235                 240ttc cac aag tac atc aag agg tcc atg gag gac gac ttc aaa gtc gtc 768Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255gtt ggc atc agc ctc ccg ctg tgg tgt gtg gcg atc ctc acc ctc ttc 816Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270ctt gat att gac ggg atc ggc acg ctc acc tgg att tct ttc atc cct 864Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275                 280                 285ctc gtc atc ctc ttg tgt gtt gga acc aag ctg gag atg atc atc atg 912Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300gag atg gcc ctg gag atc cag gac cgg gcg agc gtc atc aag ggg gcg 960Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320ccc gtg gtt gag ccc agc aac aag ttc ttc tgg ttc cac cgc ccc gac 1008Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335tgg gtc ctc ttc ttc ata cac ctg acg ctg ttc cag aat gcg ttt cag 1056Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350atg gca cat ttc gtc tgg aca gtg gcc acg ccc ggc ttg aag aaa tgc 1104Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365ttc cat atg cac atc ggt ctg agc atc atg aag gtc gtg ctg ggg ctg 1152Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu    370                 375                 380gct ctt cag ttc ctc tgc agc tat atc acc ttc ccc ctc tac gcg ctc 1200Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400gtc aca cag atg gga tcg aac atg aag agg tcc atc ttc gac gag cag 1248Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405                 410                 415acg gcc aag gcg ctg acc aac tgg cgg aac acg gcc aag gag aag aag 1296Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                 425                 430aag gtc cga gac acg gac atg ctg atg gcg cag atg atc ggc gac gcg 1344Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445acg ccc agc cga ggc acg tcg ccg atg cct agc cgg gct tcg tca ccg 1392Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Ala Ser Ser Pro450                 455                 460gtg cac ctg ctt cac aag ggc atg gga cgg tcc gac gat ccc cag agc 1440Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480gcg ccg acc tcg cca agg acc atg gag gag gct agg gac atg tac ccg 1488Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495gtt gtg gtg gcg cat ccc gtg cac aga cta aat cct gct gac agg cgg 1536Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg500                 505                 510agg tcg gtc tct tcg tcg gca ctc gat gcc gac atc ccc agc gca gat 1584Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp Ile Pro Ser Ala Asp515                 520                 525ttt tcc ttc agc cag gga tgagacaagt ttctgtattg atgttagtcc        1632Phe Ser Phe Ser Gln Gly530aatgtatagc caacatagga tgtcatgatt cgtacaataa gaaatacaaa tttttactga 1692g                                                                 1693<210>6<211>534<212>PRT<213>Triticum sp.<400>6Met Ala Glu Asp Tyr Glu Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr1               5                  10                  15Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser20                  25                  30Val Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys35                  40                  45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                  55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Ser Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Lys Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115             120                     125His Gln Leu His Met Phe Ile Phe Val Leu Ala Val Phe His Val Thr130             135                     140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145             150                      155                 160Lys Lys Trp Glu Thr Glu Thr Xaa Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Ile Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225                 230                 235                 240Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275                 280                 285Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405                 410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                 425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Ala Ser Ser Pro450                 455                 460Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg500                 505                 510Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp Ile Pro Ser Ala Asp515                 520                 525Phe Ser Phe Ser Gln Gly530<210>7<211>1886<212>DNA<213>Triticum sp.<220><221>CDS<222>(198)..(1799)<220><221>misc_feature<222>(387)..(425)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1374)..(1415)<223>SEQ ID NO:2所示氨基酸序列的位置<400>7gcagcaacaa gctagacata cctgcgtgcg tacgtacgtt ttcgttttcc tttcttgctc 60cggccggccg gccggccacg tagaatagat acctgcccag gtacgtacct cgttggctca 120gacgatcggc ggttggactt gggtgcgcgc cctgccctgc tccggccaag gaaagaggtt 180gcgctaaaga cgggcgg atg gca aag gac gac ggg tac ccc ccg gcg cgg    230Met Ala Lys Asp Asp Gly Tyr Pro Pro Ala Arg1               5                  10acg ctg ccg gag acg ccg tcc tgg gcg gtg gcg ctg gtc ttc gcc gtc   278Thr Leu Pro Glu Thr Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val15                      20              25atg atc atc gtc tcc gtc ctc ctg gag cac gcg ctc cac aag ctc ggc   326Met Ile Ile Val Ser Val Leu Leu Glu His Ala Leu His Lys Leu Gly30                      35              40cat tgg ttc cac aag cgg cac aag aac gcg ctg gcg gag gcg ctg gag   374His Trp Phe His Lys Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu45                      50              55aag atg aag gcg gag ctg atg ctg gtg gga ttc atc tcg ctg ctg ctc   422Lys Met Lys Ala Glu Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu60                      65              70                  75gcc gtc acg cag gac cca atc tcc ggg ata tgc atc tcc cag aag gcc   470Ala Val Thr Gln Asp Pro Ile Ser Gly Ile Cys Ile Ser Gln Lys Ala80                  85                  90gcc agc atc atg cgc ccc tgc aag gtg gaa ccc ggt tcc gtc aag agc   518Ala Ser Ile Met Arg Pro Cys Lys Val Glu Pro Gly Ser Val Lys Ser95                 100                 105aag tac aag gac tac tac tgc gcc aaa gag ggc aag gtg gcg ctc atg   566Lys Tyr Lys Asp Tyr Tyr Cys Ala Lys Glu Gly Lys Val Ala Leu Met110                 115                 120tcc acg ggc agc ctg cac cag ctc cac ata ttc atc ttc gtg cta gcc   614Ser Thr Gly Ser Leu His Gln Leu His Ile Phe Ile Phe Val Leu Ala125                 130                 135gtc ttc cat gtc acc tac agc gtc atc a tc atg gct cta agc cgt ctc   662Val Phe His Val Thr Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu140                 145                 150                 155aag atg aga aca tgg aag aaa tgg gag aca gaa acc gcc tcc ttg gaa   710Lys Met Arg Thr Trp Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu160                 165                 170tac cag ttc gca aat gat cct gcg cgg ttc cgc ttc acg cac cag acg   758Tyr Gln Phe Ala Asn Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr175                 180                 185tcg ttc gtg aag cgg cac ctg ggc ctg tcc agc acc ccc ggc gtc aga   806Ser Phe Val Lys Arg His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg190                 195                 200tgg gtg gtg gcc ttc ttc agg cag ttc ttc agg tcg gtc acc aag gtg   854Trp Val Val Ala Phe Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val205                 210                 215gac tac ctc acc ttg agg gca ggc ttc atc aac gcg cac ttg tcg cag   902Asp Tyr Leu Thr Leu Arg Ala Gly Phe Ile Asn Ala His Leu Ser Gln220                 225                 230                 235aac agc aag ttc gac ttc cac aag tac atc aag agg tcc atg gag gac   950Asn Ser Lys Phe Asp Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp240                 245                 250gac ttc aaa gtc gtc gtt ggc atc agc ctc ccg ctg tgg gct gtg gcg   998Asp Phe Lys Val Val Val Gly Ile Ser Leu Pro Leu Trp Ala Val Ala255                 260                 265atc ctc acc ctc ttc ctt gat atc gac ggg atc ggc aca ctc acc tgg   1046Ile Leu Thr Leu Phe Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp270                 275                 280gtt tct ttc atc cct ctc atc atc ctc ttg tgt gtt gga acc aag cta   1094Val Ser Phe Ile Pro Leu Ile Ile Leu Leu Cys Val Gly Thr Lys Leu285                 290                 295gag atg atc atc atg ggg atg gcc ctg gag atc cag gac cgg tcg agc   1142Glu Met Ile Ile Met Gly Met Ala Leu Glu Ile Gln Asp Arg Ser Ser300                 305                 310                 315gtc atc aag ggg gca ccc gtg gtc gag ccc agc aac aag ttc ttc tgg   1190Val Ile Lys Gly Ala Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp320                 325                 330ttc cac cgc ccc gac tgg gtc ctc ttc ttc ata cac ctg acg ctg ttc   1238Phe His Arg Pro Asp Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe335                 340                 345cag aac gcg ttt cag atg gca cat ttc gtg tgg aca gtg gcc acg ccc   1286Gln Asn Ala Phe Gln Met Ala His Phe Val Trp Thr Val Ala Thr Pro350                 355                 360ggc ttg aag gac tgc ttc cat atg aac atc ggg ctg agc atc atg aag   1334Gly Leu Lys Asp Cys Phe His Met Asn Ile Gly Leu Ser Ile Met Lys365                 370                 375gtc gtg ctg ggg ctg gct ctc cag ttc ctg tgc agc tac atc acc ttc   1382Val Val Leu Gly Leu Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe380                 385                 390                 395ccc ctc tac gcg cta gtc aca cag atg gga tca aac atg aag agg tcc   1430Pro Leu Tyr Ala Leu Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser400                 405                 410atc ttc gac gag cag aca gcc aag gcg ctg acc aac tgg cgg aac acg   1478Ile Phe Asp Glu Gln Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr415                 420                 425gcc aag gag aag aag aag gtc cga gac acg gac atg ctg atg gcg cag   1526Ala Lys Glu Lys Lys Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln430                 435                 440atg atc ggc gac gca aca ccc agc cga ggc acg tcc ccg atg cct agc   1574Met Ile Gly Asp Ala Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser445                 450                 455cgg ggc tca tcg ccg gtg cac ctg ctt cag aag ggc atg gga cgg tct    1622Arg Gly Ser Ser Pro Val His Leu Leu Gln Lys Gly Met Gly Arg Ser460                 465                 470                 475gac gat ccc cag agc gca ccg acc tcg cca agg acc atg gag gag gct   1670Asp Asp Pro Gln Ser Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala480                 485                 490agg gac atg tac ccg gtt gtg gtg gcg cat cct gta cac aga cta aat   1718Arg Asp Met Tyr Pro Val Val Val Ala His Pro Val His Arg Leu Asn495                 500                 505cct gct gac agg cgg agg tcg gtc tct tca tca gcc ctc gat gcc gac   1766Pro Ala Asp Arg Arg Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp510                 515                 520atc ccc agc gca gat ttt tcc ttc agc cag gga tgagacaagt ttctgtattg 1819Ile Pro Ser Ala Asp Phe Ser Phe Ser Gln Gly525                 530atgttagtcc aatgtatagc caacatagga tgtgatgatt cgtacaataa gaaatacaat 1879tttttac                                                           1886<210>8<211>534<212>PRT<213>Triticum sp.<400>8Met Ala Lys Asp Asp Gly Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr1               5                  10                  15Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser20                  25                  30Val Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys35                  40                  45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Met Lys Ala Glu50                  55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80Pro Ile Ser Gly Ile Cys Ile Ser Gln Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Lys Val Glu Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Glu Gly Lys Val Ala Leu Met Ser Thr Gly Ser Leu115                 120                 125His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                 135                 140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser Gln Asn Ser Lys Phe Asp225                 230                 235                 240Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Ala Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Val Ser Phe Ile Pro275                 280                 285Leu Ile Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Gly Met Ala Leu Glu Ile Gln Asp Arg Ser Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Asp Cys355                 360                 365Phe His Met Asn Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405                 410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                 425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Gly Ser Ser Pro450                 455                 460Val His Leu Leu Gln Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg500                 505                 510Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp Ile Pro Ser Ala Asp515                 520                 525Phe Ser Phe Ser Gln Gly530<210>9<211>2197<212>DNA<213>Arabidopsis thaliana<220><221>CDS<222>(331)..(2037)<220><221>misc_feature<222>(589)..(627)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1603)..(1644)<223>SEQ ID NO:2所示氨基酸序列的位置<400>9agtaatttag ctgttcttct acccctctga tctctcacag gggatcaaat agttttgata 60catagagcca caacagtgac attagtgtgt tgttgactac tgtaagggtt gggttttgaa 120aagagacatg aaggagtgtt attaggttga ttgtcttcaa gtacctccag tgtcaaacaa 180acattgacga ttgattctct tcccataatt tattgttatg cattacatat cacagtaaac 240ggactttcaa gtcaacaccg catttatttg ccctcttcat tgtttcacgt acgtaatcaa 300ggaccaaggg attttgttct tttggctacc atg gcc aca aga tgc ttt tgg tgt  354Met Ala Thr Arg Cys Phe Trp Cys1               5tgg acc act ttg ctc ttc tgc tct cag ctg ctt acc ggc ttt gcc cga   402Trp Thr Thr Leu Leu Phe Cys Ser Gln Leu Leu Thr Gly Phe Ala Arg10                  15                  20gct tcc tct gca ggc ggc gcc aaa gag aaa gga ctc tcc caa act ccc   450Ala Ser Ser Ala Gly Gly Ala Lys Glu Lys Gly Leu Ser Gln Thr Pro25                  30                  35                  40acc tgg gcc gtt gcc ctc gtc tgt acc ttt ttc att ctt gtc tcc gtc   498Thr Trp Ala Val Ala Leu Val Cys Thr Phe Phe Ile Leu Val Ser Val45                      50                  55ctt ctc gag aag gct ctt cac aga gtt gcc acg tgg ttg tgg gag aaa   546Leu Leu Glu Lys Ala Leu His Arg Val Ala Thr Trp Leu Trp Glu Lys60                  65                      70cat aag aac tct ctg ctt gaa gcc ttg gaa aaa ata aag gcc gag ctg   594His Lys Asn Ser Leu Leu Glu Ala Leu Glu Lys Ile Lys Ala Glu Leu75                  80                      85atg att cta gga ttc att tcc ttg ttg cta acc ttc gga gag cag tac   642Met Ile Leu Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Glu Gln Tyr90                  95                     100att ctc aag att tgt att cct gaa aag gct gca gcc tct atg tta cct   690Ile Leu Lys Ile Cys Ile Pro Glu Lys Ala Ala Ala Ser Met Leu Pro105                 110                 115                 120tgt cca gct cct tct act cat gac caa gac aag acc cac cgc aga cgt   738Cys Pro Ala Pro Ser Thr His Asp Gln Asp Lys Thr His Arg Arg Arg125                 130                 135cta gct gct gct acg acc tct tcc cgc tgc gat gag ggt cat gaa cca   786Leu Ala Ala Ala Thr Thr Ser Ser Arg Cys Asp Glu Gly His Glu Pro140                 145                 150ctc ata cct gcc acg ggt ttg cac cag cta cac att cta ttg ttc ttc   834Leu Ile Pro Ala Thr Gly Leu His Gln Leu His Ile Leu Leu Phe Phe155                 160                 165atg gct gcc ttt cat atc ctc tac agt ttc atc acc atg atg ctt ggc   882Met Ala Ala Phe His Ile Leu Tyr Ser Phe Ile Thr Met Met Leu Gly170                 175                 180aga ctc aag atc cgt ggc tgg aaa aag tgg gag cag gag aca tgt tct   930Arg Leu Lys Ile Arg Gly Trp Lys Lys Trp Glu Gln Glu Thr Cys Ser185                 190                 195                 200cat gat tac gag ttt tca atc gat cca tca aga ttc aga ctc act cat   978His Asp Tyr Glu Phe Ser Ile Asp Pro Ser Arg Phe Arg Leu Thr His205                 210                 215gag acg tcc ttt gtt aga caa cat tcc agt ttc tgg aca aaa atc ccc   1026Glu Thr Ser Phe Val Arg Gln His Ser Ser Phe Trp Thr Lys Ile Pro220                 225                 230ttc ttc ttt tat gct ggg tgc ttc cta cag cag ttt ttc cga tct gtc   1074Phe Phe Phe Tyr Ala Gly Cys Phe Leu Gln Gln Phe Phe Arg Ser Val235                 240                 245ggg agg act gac tac tta act ctg cgc cat ggc ttc atc gct gcc cat   1122Gly Arg Thr Asp Tyr Leu Thr Leu Arg His Gly Phe Ile Ala Ala His250                 255                 260tta gct cca gga aga aag ttc gac ttc cag aag tat atc aaa aga tca   1170Leu Ala Pro Gly Arg Lys Phe Asp Phe Gln Lys Tyr Ile Lys Arg Ser265                 270                 275                    280ttg gaa gac gat ttc aag gtg gta gtt gga ata agt cct ctt ttg tgg   1218Leu Glu Asp Asp Phe Lys Val Val Val Gly Ile Ser Pro Leu Leu Trp285                 290                 295gca tca ttt gta att ttc cta ctt ctg aat gtt aat ggc tgg gaa gca   1266Ala Ser Phe Val Ile Phe Leu Leu Leu Asn Val Asn Gly Trp Glu Ala300                 305                 310ttg ttt tgg gcg tca atc cta cct gta ctt atc att cta gct gtc agt   1314Leu Phe Trp Ala Ser Ile Leu Pro Val Leu Ile Ile Leu Ala Val Ser315                 320                 325acg aag ctt caa gcg atc cta aca aga atg gct ctg gga atc acg gag   1362Thr Lys Leu Gln Ala Ile Leu Thr Arg Met Ala Leu Gly Ile Thr Glu330                 335                 340aga cac gca gtt gtt caa ggg ata cct ctc gtg cat ggt tca gat aag   1410Arg His Ala Val Val Gln Gly Ile Pro Leu Val His Gly Ser Asp Lys345                 350                 355                 360tac ttt tgg ttt aat cgc cct cag ttg cta ctt cat ctt ctt cac ttc   1458Tyr Phe Trp Phe Asn Arg Pro Gln Leu Leu Leu His Leu Leu His Phe365                 370                 375gcc tta ttt cag aat gct ttc cag cta aca tac ttc ttc tgg gtc tgg   1506Ala Leu Phe Gln Asn Ala Phe Gln Leu Thr Tyr Phe Phe Trp Val Trp380                 385                 390tat tcc ttt ggg cta aaa tct tgc ttt cac acg gat ttc aaa cta gtc   1554Tyr Ser Phe Gly Leu Lys Ser Cys Phe His Thr Asp Phe Lys Leu Val395                 400                 405atc gta aaa ctc tct cta ggc gtt gga gct ttg att ttg tgc agc tac   1602Ile Val Lys Leu Ser Leu Gly Val Gly Ala Leu Ile Leu Cys Ser Tyr410                 415                 420atc aca ctt cct ttg tat gca cta gtt act cag atg ggt tca aac atg   1650Ile Thr Leu Pro Leu Tyr Ala Leu Val Thr Gln Met Gly Ser Asn Met425                 430                 435                 440aag aaa gct gtg ttt gat gag caa atg gca aaa gcg ttg aag aaa tgg   1698Lys Lys Ala Val Phe Asp Glu Gln Met Ala Lys Ala Leu Lys Lys Trp445                 450                 455cac atg act gtg aag aag aag aaa ggc aaa gcg aga aag cca cca aca   1746His Met Thr Val Lys Lys Lys Lys Gly Lys Ala Arg Lys Pro Pro Thr460                 465                 470gag acc ctt ggt gtt tct gac act gtc agc acc tct acc tca tcc ttt   1794Glu Thr Leu Gly Val Ser Asp Thr Val Ser Thr Ser Thr Ser Ser Phe475                 480                 485cac gcc tct gga gcc act cta ctc cgc tcc aag acc act ggt cac tcg   1842His Ala Ser Gly Ala Thr Leu Leu Arg Ser Lys Thr Thr Gly His Ser490                 495                 500aca gcc tct tat atg agt aat ttc gag gac caa agc atg tct gat ctt   1890Thr Ala Ser Tyr Met Ser Asn Phe Glu Asp Gln Ser Met Ser Asp Leu505                 510                 515                 520gaa gct gag cca tta tcc cct gaa cca ata gag ggg cac act ctc gtc   1938Glu Ala Glu Pro Leu Ser Pro Glu Pro Ile Glu Gly His Thr Leu Val525                 530                 535agg gtt ggt gat cag aac aca gag ata gaa tat act gga gat att agt   1986Arg Val Gly Asp Gln Asn Thr Glu Ile Glu Tyr Thr Gly Asp Ile Ser540                  545                550cct gga aac caa ttc tcc ttt gtg aag aac gtt cct gct aat gat att   2034Pro Gly Asn Gln Phe Ser Phe Val Lys Asn Val Pro Ala Asn Asp Ile555                 560                 565gac taatattcaa aatgaatgca gaacaaatcc atcatccggt ctttattttc        2087Asptattacatgt atgccaacaa ttgcttcgcc aagtgttacc aactaggttt tctgtataag 2147gctgtatttt agagctaaaa aaaaaaaaaa aaaaaaaaaa ctaaattact            2197<210>10<211>569<212>PRT<213>Arabidopsis thaliana<400>10Met Ala Thr Arg Cys Phe Trp Cys Trp Thr Thr Leu Leu Phe Cys Ser1               5                  10                  15Gln Leu Leu Thr Gly Phe Ala Arg Ala Ser Ser Ala Gly Gly Ala Lys20                  25                  30Glu Lys Gly Leu Ser Gln Thr Pro Thr Trp Ala Val Ala Leu Val Cys35                  40                  45Thr Phe Phe Ile Leu Val Ser Val Leu Leu Glu Lys Ala Leu His Arg50                  55                  60Val Ala Thr Trp Leu Trp Glu Lys His Lys Asn Ser Leu Leu Glu Ala65                  70                  75                  80Leu Glu Lys Ile Lys Ala Glu Leu Met Ile Leu Gly Phe Ile Ser Leu85                  90                  95Leu Leu Thr Phe Gly Glu Gln Tyr Ile Leu Lys Ile Cys Ile Pro Glu100                 105                 110Lys Ala Ala Ala Ser Met Leu Pro Cys Pro Ala Pro Ser Thr His Asp115                 120                 125Gln Asp Lys Thr His Arg Arg Arg Leu Ala Ala Ala Thr Thr Ser Ser130                 135                 140Arg Cys Asp Glu Gly His Glu Pro Leu Ile Pro Ala Thr Gly Leu His145                 150                 155                 160Gln Leu His Ile Leu Leu Phe Phe Met Ala Ala Phe His Ile Leu Tyr165                 170                 175Ser Phe Ile Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly Trp Lys180                 185                 190Lys Trp Glu Gln Glu Thr Cys Ser His Asp Tyr Glu Phe Ser Ile Asp195                 200                 205Pro Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg Gln His210                 215                 220Ser Ser Phe Trp Thr Lys Ile Pro Phe Phe Phe Tyr Ala Gly Cys Phe225                 230                 235                 240Leu Gln Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu Thr Leu245                 250                 255Arg His Gly Phe Ile Ala Ala His Leu Ala Pro Gly Arg Lys Phe Asp260                 265                 270Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys Val Val275                 280                 285Val Gly Ile Ser Pro Leu Leu Trp Ala Ser Phe Val Ile Phe Leu Leu290                 295                 300Leu Asn Val Asn Gly Trp Glu Ala Leu Phe Trp Ala Ser Ile Leu Pro305                 310                 315                 320Val Leu Ile Ile Leu Ala Val Ser Thr Lys Leu Gln Ala Ile Leu Thr325                 330                 335Arg Met Ala Leu Gly Ile Thr Glu Arg His Ala Val Val Gln Gly Ile340                 345                 350Pro Leu Val His Gly Ser Asp Lys Tyr Phe Trp Phe Asn Arg Pro Gln355                 360                 365Leu Leu Leu His Leu Leu His Phe Ala Leu Phe Gln Asn Ala Phe Gln370                 375                 380Leu Thr Tyr Phe Phe Trp Val Trp Tyr Ser Phe Gly Leu Lys Ser Cys385                 390                 395                 400Phe His Thr Asp Phe Lys Leu Val Ile Val Lys Leu Ser Leu Gly Val405                 410                 415Gly Ala Leu Ile Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr Ala Leu420                 425                 430Val Thr Gln Met Gly Ser Asn Met Lys Lys Ala Val Phe Asp Glu Gln435                 440                 445Met Ala Lys Ala Leu Lys Lys Trp His Met Thr Val Lys Lys Lys Lys450                 455                 460Gly Lys Ala Arg Lys Pro Pro Thr Glu Thr Leu Gly Val Ser Asp Thr465                 470                 475                 480Val Ser Thr Ser Thr Ser Ser Phe His Ala Ser Gly Ala Thr Leu Leu                485                 490                 495Arg Ser Lys Thr Thr Gly His Ser Thr Ala Ser Tyr Met Ser Asn Phe500                 505                 510Glu Asp Gln Ser Met Ser Asp Leu Glu Ala Glu Pro Leu Ser Pro Glu515                 520                 525Pro Ile Glu Gly His Thr Leu Val Arg Val Gly Asp Gln Asn Thr Glu530                 535                 540Ile Glu Tyr Thr Gly Asp Ile Ser Pro Gly Asn Gln Phe Ser Phe Val545                 550                 555                 560Lys Asn Val Pro Ala Asn Asp Ile Asp565<210>11<211>1935<212>DNA<213>Arabidopsis thaliana<220><221>CDS<222>(89)..(1807)<220><221>misc_feature<222>(269)..(307)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1370)..(1411)<223>SEQ ID NO:2所示氨基酸序列的位置<400>11cagtgtgagt aatttagtaa aaagacaaga tctctggtct ggaattagaa gaatcttatt 60tgggtttttt tcttaggatt aagctcta atg gca gat caa gta aaa gag cgg    112Met Ala Asp Gln Val Lys Glu Arg1              5act tta gag gag acc tct acg tgg gca gtt gct gtt gtt tgc ttt gtc   160Thr Leu Glu Glu Thr Ser Thr Trp Ala Val Ala Val Val Cys Phe Val10                  15                  20tta ctc ttt att tcg att gtc ctc gaa cat tct att cac aaa att gga   208Leu Leu Phe Ile Ser Ile Val Leu Glu His Ser Ile His Lys Ile Gly25                  30                  35                  40acc tgg ttt aaa aag aag cac aag cag gct ctt ttt gaa gct ctt gaa   256Thr Trp Phe Lys Lys Lys His Lys Gln Ala Leu Phe Glu Ala Leu Glu45                  50                  55aag gtc aaa gca gag ctt atg ctg ttg gga ttc ata tca cta cta ctc   304Lys Val Lys Ala Glu Leu Met Leu Leu Gly Phe Ile Ser Leu Leu Leu60                  65                  70aca att gga caa aca cca atc tca aat atc tgc atc tcc cag aaa gtt   352Thr Ile Gly Gln Thr Pro Ile Ser Asn Ile Cys Ile Ser Gln Lys Val75                  80                  85gcg tca aca atg cac cct tgc agt gct gct gaa gaa gct aaa aaa tac   400Ala Ser Thr Met His Pro Cys Ser Ala Ala Glu Glu Ala Lys Lys Tyr90                  95                 100ggc aag aaa gac gcc gga aag aaa gat gat gga gat gga gat aaa ccc   448Gly Lys Lys Asp Ala Gly Lys Lys Asp Asp Gly Asp Gly Asp Lys Pro105                 110                 115                 120ggt cga aga ctt ctt ctt gag tta gct gaa tct tat atc cat aga aga   496Gly Arg Arg Leu Leu Leu Glu Leu Ala Glu Ser Tyr Ile His Arg Arg125                 130                 135agt tta gcc acc aaa ggc tat gac aaa tgt gca gag aag ggg aaa gtg   544Ser Leu Ala Thr Lys Gly Tyr Asp Lys Cys Ala Glu Lys Gly Lys Val140                 145                 150gct ttt gta tct gct tat gga atc cac cag ctg cat ata ttc atc ttc   592Ala Phe Val Ser Ala Tyr Gly Ile His Gln Leu His Ile Phe Ile Phe155                 160                 165gtg ctc gcg gtt gtt cat gtt gtt tac tgc att gtt act tat gct ttc   640Val Leu Ala Val Val His Val Val Tyr Cys Ile Val Thr Tyr Ala Phe170                 175                 180gga aag atc aag atg agg acg tgg aag tcg tgg gag gaa gag aca aag   688Gly Lys Ile Lys Met Arg Thr Trp Lys Ser Trp Glu Glu Glu Thr Lys185                 190                 195                 200aca ata gag tat cag tat tcc aac gat cct gag agg ttc agg ttt gcg   736Thr Ile Glu Tyr Gln Tyr Ser Asn Asp Pro Glu Arg Phe Arg Phe Ala205                 210                 215agg gac aca tct ttt ggg aga aga cat ctc aat ttc tgg agc aag acg   784Arg Asp Thr Ser Phe Gly Arg Arg His Leu Asn Phe Trp Ser Lys Thr220                 225                 230aga gtc aca cta tgg att gtt tgt ttt ttt aga cag ttc ttt gga tct   832Arg Val Thr Leu Trp Ile Val Cys Phe Phe Arg Gln Phe Phe Gly Ser235                 240                 245gtc acc aaa gtt gat tac tta gca cta aga cat ggt ttc atc atg gcg   880Val Thr Lys Val Asp Tyr Leu Ala Leu Arg His Gly Phe Ile Met Ala250                 255                 260cat ttt gct ccc ggt aac gaa tca agattc gat ttc cgc aag tat att   928His Phe Ala Pro Gly Asn Glu Ser Arg Phe Asp Phe Arg Lys Tyr Ile265                 270                 275                 280cag aga tca tta gag aaa gac ttc aaa acc gtt gtt gaa atc agt ccg   976Gln Arg Ser Leu Glu Lys Asp Phe Lys Thr Val Val Glu Ile Ser Pro285                 290                 295gtt atc tgg ttt gtc gct gtg cta ttc ctc ttg acc aat tca tat gga   1024Val Ile Trp Phe Val Ala Val Leu Phe Leu Leu Thr Asn Ser Tyr Gly300                 305                 310tta cgt tct tac ctc tgg tta cca ttc att cca cta gtc gta att cta   1072Leu Arg Ser Tyr Leu Trp Leu Pro Phe Ile Pro Leu Val Val Ile Leu315                 320                 325ata gtt gga aca aag ctt gaa gtc ata ata aca aaa ttg ggt cta aga   1120Ile Val Gly Thr Lys Leu Glu Val Ile Ile Thr Lys Leu Gly Leu Arg330                 335                 340atc caa gag aaa ggt gat gtg gtg aga ggc gcc cca gtg gtt cag cct   1168Ile Gln Glu Lys Gly Asp Val Val Arg Gly Ala Pro Val Val Gln Pro345                 350                 355                 360ggt gat gac ctc ttc tgg ttt ggc aag cca cgc ttc att ctt ttc ctt   1216Gly Asp Asp Leu Phe Trp Phe Gly Lys Pro Arg Phe Ile Leu Phe Leu365                 370                 375att cac ttg gtc ctt ttt acg aat gca ttt caa ctt gca ttc ttt gcc   1264Ile His Leu Val Leu Phe Thr Asn Ala Phe Gln Leu Ala Phe Phe Ala380                 385                 390tgg agt acg tat gaa ttc aat ctc aat aat tgt ttc cat gaa agc act   1312Trp Ser Thr Tyr Glu Phe Asn Leu Asn Asn Cys Phe His Glu Ser Thr395                 400                 405gca gat gtg gtc att aga ctt gta gtt gga gct gtt gtg cag ata ctt   1360Ala Asp Val Val Ile Arg Leu Val Val Gly Ala Val Val Gln Ile Leu410                 415                 420tgc agc tat gtg act ctt cca ctc tat gca ctt gtt act cag atg ggt   1408Cys Ser Tyr Val Thr Leu Pro Leu Tyr Ala Leu Val Thr Gln Met Gly425                 430                 435                 440agt aaa atg aag cca aca gta ttc aac gat aga gta gcc acg gca tta   1456Ser Lys Met Lys Pro Thr Val Phe Asn Asp Arg Val Ala Thr Ala Leu445                 450                 455aag aag tgg cat cac act gca aag aac gag acg aaa cac gga aga cac   1504Lys Lys Trp His His Thr Ala Lys Asn Glu Thr Lys His Gly Arg His460                 465                 470tcg gga tcc aat aca cct ttc tct agc cgt cca act aca cca aca cat   1552Ser Gly Ser Asn Thr Pro Phe Ser Ser Arg Pro Thr Thr Pro Thr His475                 480                 485ggc tca tct cca atc cat ctc ctt cac aat ttc aat aac cgg agc gtt   1600Gly Ser Ser Pro Ile His Leu Leu His Asn Phe Asn Asn Arg Ser Val490                 495                 500gaa aat tac cca agt tct cct tct cct aga tac tct ggt cat ggt cat   1648Glu Asn Tyr Pro Ser Ser Pro Ser Pro Arg Tyr Ser Gly His Gly His505                 510                 515                 520cat gaa cac caa ttt tgg gat cct gag tct caa cac caa gaa gct gaa   1696His Glu His Gln Phe Trp Asp Pro Glu Ser Gln His Gln Glu Ala Glu525                 530                 535act tcc aca cat cat tct ctt gcg cat gaa agc tca gaa cct gtt ctt   1744Thr Ser Thr His His Ser Leu Ala His Glu Ser Ser Glu Pro Val Leu540                 545                 550gca tct gtg gaa ctt cct cct ata agg act agc aaa agc tta aga gat   1792Ala Ser Val Glu Leu Pro Pro Ile Arg Thr Ser Lys Ser Leu Arg Asp555                 560                 565ttt tct ttt aag aaa tgatgattct tgtttgctat atttgatttc gtacagtggg   1847Phe Ser Phe Lys Lys570aattttgtca tatgaaaata atttcttgta cattactagt ttgataagaa ataaccatat 1907ctatatggat acaaaaaaaa aaaaaaaa                                    1935<210>12<211>573<212>PRT<213>Arabidopsis thaliana<400>12Met Ala Asp Gln Val Lys Glu Arg Thr Leu Glu Glu Thr Ser Thr Trp1               5                  10                  15Ala Val Ala Val Val Cys Phe Val Leu Leu Phe Ile Ser Ile Val Leu20                  25                  30Glu His Ser Ile His Lys Ile Gly Thr Trp Phe Lys Lys Lys His Lys35                  40                  45Gln Ala Leu Phe Glu Ala Leu Glu Lys Val Lys Ala Glu Leu Met Leu50                  55                  60Leu Gly Phe Ile Ser Leu Leu Leu Thr Ile Gly Gln Thr Pro Ile Ser65                  70                  75                  80Asn Ile Cys Ile Ser Gln Lys Val Ala Ser Thr Met His Pro Cys Ser85                  90                  95Ala Ala Glu Glu Ala Lys Lys Tyr Gly Lys Lys Asp Ala Gly Lys Lys            100                 105                 110Asp Asp Gly Asp Gly Asp Lys Pro Gly Arg Arg Leu Leu Leu Glu Leu115                 120                 125Ala Glu Ser Tyr Ile His Arg Arg Ser Leu Ala Thr Lys Gly Tyr Asp130                 135                 140Lys Cys Ala Glu Lys Gly Lys Val Ala Phe Val Ser Ala Tyr Gly Ile145                 150                 155                 160His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Val His Val Val165                 170                 175Tyr Cys Ile Val Thr Tyr Ala Phe Gly Lys Ile Lys Met Arg Thr Trp180                 185                 190Lys Ser Trp Glu Glu Glu Thr Lys Thr Ile Glu Tyr Gln Tyr Ser Asn195                 200                 205Asp Pro Glu Arg Phe Arg Phe Ala Arg Asp Thr Ser Phe Gly Arg Arg210                 215                 220His Leu Asn Phe Trp Ser Lys Thr Arg Val Thr Leu Trp Ile Val Cys225                 230                 235                 240Phe Phe Arg Gln Phe Phe Gly Ser Val Thr Lys Val Asp Tyr Leu Ala245                 250                 255Leu Arg His Gly Phe Ile Met Ala His Phe Ala Pro Gly Asn Glu Ser260                 265                 270Arg Phe Asp Phe Arg Lys Tyr Ile Gln Arg Ser Leu Glu Lys Asp Phe275                 280                 285Lys Thr Val Val Glu Ile Ser Pro Val Ile Trp Phe Val Ala Val Leu290                 295                 300Phe Leu Leu Thr Asn Ser Tyr Gly Leu Arg Ser Tyr Leu Trp Leu Pro305                 310                 315                 320Phe Ile Pro Leu Val Val Ile Leu Ile Val Gly Thr Lys Leu Glu Val325                 330                 335Ile Ile Thr Lys Leu Gly Leu Arg Ile Gln Glu Lys Gly Asp Val Val340                 345                 350Arg Gly Ala Pro Val Val Gln Pro Gly Asp Asp Leu Phe Trp Phe Gly355                 360                 365Lys Pro Arg Phe Ile Leu Phe Leu Ile His Leu Val Leu Phe Thr Asn370                 375                 380Ala Phe Gln Leu Ala Phe Phe Ala Trp Ser Thr Tyr Glu Phe Asn Leu385                 390                 395                 400Asn Asn Cys Phe His Glu Ser Thr Ala Asp Val Val Ile Arg Leu Val405                 410                 415Val Gly Ala Val Val Gln Ile Leu Cys Ser Tyr Val Thr Leu Pro Leu420                 425                 430Tyr Ala Leu Val Thr Gln Met Gly Ser Lys Met Lys Pro Thr Val Phe435                 440                 445Asn Asp Arg Val Ala Thr Ala Leu Lys Lys Trp His His Thr Ala Lys450                 455                 460Asn Glu Thr Lys His Gly Arg His Ser Gly Ser Asn Thr Pro Phe Ser465                 470                 475                 480Ser Arg Pro Thr Thr Pro Thr His Gly Ser Ser Pro Ile His Leu Leu485                 490                 495His Asn Phe Asn Asn Arg Ser Val Glu Asn Tyr Pro Ser Ser Pro Ser500                 505                 510Pro Arg Tyr Ser Gly His Gly His His Glu His Gln Phe Trp Asp Pro515                 520                 525Glu Ser Gln His Gln Glu Ala Glu Thr Ser Thr His His Ser Leu Ala530                 535                 540His Glu Ser Ser Glu Pro Val Leu Ala Ser Val Glu Leu Pro Pro Ile545                 550                 555                 560Arg Thr Ser Lys Ser Leu Arg Asp Phe Ser Phe Lys Lys565                 570<210>13<211>1811<212>DNA<213>Arabidopsis thaliana<220><221>CDS<222>(56)..(1633)<220><221>misc_feature<222>(236)..(274)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1328)..(1369)<223>SEQ ID NO:2所示氨基酸序列的位置<400>13gttcccagat tcatctttac ttattgtcta aattctctct ggtgtgagaa gtaaa atg  58Met1ggt cac gga gga gaa ggg atg tcg ctt gaa ttc act ccg acg tgg gtc   106Gly His Gly Gly Glu Gly Met Ser Leu Glu Phe Thr Pro Thr Trp Val5                  10                  15gtc gcc gga gtt tgt acg gtc atc gtc gcg att tca ctg gcg gtg gag   154Val Ala Gly Val Cys Thr Val Ile Val Ala Ile Ser Leu Ala Val Glu20                  25                  30cgt ttg ctt cac tat ttc ggt act gtt ctt aag aag aag aag caa aaa   202Arg Leu Leu His Tyr Phe Gly Thr Val Leu Lys Lys Lys Lys Gln Lys35                  40                  45ccc ctt tac gaa gcc ctt caa aag gtt aaa gaa gag ctg atg ttg tta   250Pro Leu Tyr Glu Ala Leu Gln Lys Val Lys Glu Glu Leu Met Leu Leu50                  55                  60                  65ggg ttt ata tcg ctg tta ctg acg gta ttc caa ggg ctc att tcc aaa   298Gly Phe Ile Ser Leu Leu Leu Thr Val Phe Gln Gly Leu Ile Ser Lys70                  75                  80ttc tgt gtg aaa gaa aat gtg ctt atg cat atg ctt cca tgt tct ctc   346Phe Cys Val Lys Glu Asn Val Leu Met His Met Leu Pro Cys Ser Leu85                  90                  95gat tca aga cga gaa gct ggg gca agt gaa cat aaa aac gtt aca gca   394Asp Ser Arg Arg Glu Ala Gly Ala Ser Glu His Lys Asn Val Thr Ala100                 105                 110aaa gaa cat ttt cag act ttt tta cct att gtt gga acc act agg cgt   442Lys Glu His Phe Gln Thr Phe Leu Pro Ile Val Gly Thr Thr Arg Arg115                 120                 125cta ctt gct gaa cat gct gct gtg caa gtt ggt tac tgt agc gaa aag   490Leu Leu Ala Glu His Ala Ala Val Gln Val Gly Tyr Cys Ser Glu Lys130                 135                 140                 145ggt aaa gta cca ttg ctt tcg ctt gag gca ttg cac cat cta cat att   538Gly Lys Val Pro Leu Leu Ser Leu Glu Ala Leu His His Leu His Ile150                 155                 160ttc atc ttc gtc ctc gcc ata tcc cat gtg aca ttc tgt gtc ctt acc   586Phe Ile Phe Val Leu Ala Ile Ser His Val Thr Phe Cys Val Leu Thr165                 170                 175gtg att ttt gga agc aca agg att cac caa tgg aag aaa tgg gag gat   634Val Ile Phe Gly Ser Thr Arg Ile His Gln Trp Lys Lys Trp Glu Asp180                 185                 190tcg atc gca gat gag aag ttt gac ccc gaa aca gct ctc agg aaa aga   682Ser Ile Ala Asp Glu Lys Phe Asp Pro Glu Thr Ala Leu Arg Lys Arg195                 200                 205agg gtc act cat gta cac aac cat gct ttt att aaa gag cat ttt ctt   730Arg Val Thr His Val His Asn His Ala Phe Ile Lys Glu His Phe Leu210                 215                 220                 225ggt att ggc aaa gat tca gtc atc ctc gga tgg acg caa tcc ttt ctc   778Gly Ile Gly Lys Asp Ser Val Ile Leu Gly Trp Thr Gln Ser Phe Leu230                 235                 240aag caa ttc tat gat tct gtg acg aaa tca gat tac gtg act tta cgt   826Lys Gln Phe Tyr Asp Ser Val Thr Lys Ser Asp Tyr Val Thr Leu Arg245                 250                 255ctt ggt ttc att atg aca cat tgt aag gga aac ccc aag ctt aat ttc   874Leu Gly Phe Ile Met Thr His Cys Lys Gly Asn Pro Lys Leu Asn Phe260                 265                 270cac aag tat atg atg cgc gct yta gag gat gat ttc aaa caa gtt gtt   922His Lys Tyr Met Met Arg Ala Xaa Glu Asp Asp Phe Lys Gln Val Val275                 280                 285ggt att agt tgg tat ctt tgg atc ttt gtc gtc atc ttt ttg ctg cta   970Gly Ile Ser Trp Tyr Leu Trp Ile Phe Val Val Ile Phe Leu Leu Leu290                 295                 300                 305aat gtt aac gga tgg cac aca tat ttc tgg ata gca ttt att ccc ttt   1018Asn Val Asn Gly Trp His Thr Tyr Phe Trp Ile Ala Phe Ile Pro Phe310                 315                 320gct ttg ctt ctt gct gtg gga aca aag ttg gag cat gtg att gca cag   1066Ala Leu Leu Leu Ala Val Gly Thr Lys Leu Glu His Val Ile Ala Gln325                 330                 335tta gct cat gaa gtt gca gag aaa cat gta gcc att gaa gga gac tta   1114Leu Ala His Glu Val Ala Glu Lys His Val Ala Ile Glu Gly Asp Leu340                 345                 350gtg gtg aaa ccc tca gat gag cat ttc tgg ttc agc aaa cct caa att   1162Val Val Lys Pro Ser Asp Glu His Phe Trp Phe Ser Lys Pro Gln Ile355                 360                 365gtt ctc tac ttg atc cat ttt atc ctc ttc cag aat gct ttt gag att   1210Val Leu Tyr Leu Ile His Phe Ile Leu Phe Gln Asn Ala Phe Glu Ile370                 375                 380                 385gcg ttt ttc ttt tgg att tgg gtt aca tac ggc ttc gac tcg tgc att   1258Ala Phe Phe Phe Trp Ile Trp Val Thr Tyr Gly Phe Asp Ser Cys Ile390                 395                 400atg gga cag gtg aga tac att gtt cca aga ttg gtt atc ggg gtc ttc   1306Met Gly Gln Val Arg Tyr Ile Val Pro Arg Leu Val Ile Gly Val Phe405                 410                 415att caa gtg ctt tgc agt tac agt aca ctg cct ctt tac gcc atc gtc   1354Ile Gln Val Leu Cys Ser Tyr Ser Thr Leu Pro Leu Tyr Ala Ile Val420                 425                 430tca cag atg gga agt agc ttc aag aaa gct ata ttc gag gag aat gtg   1402Ser Gln Met Gly Ser Ser Phe Lys Lys Ala Ile Phe Glu Glu Asn Val435                 440                 445cag gtt ggt ctt gtt ggt tgg gca cag aaa gtg aaa caa aag aga gac   1450Gln Val Gly Leu Val Gly Trp Ala Gln Lys Val Lys Gln Lys Arg Asp450                 455                 460                 465cta aaa gct gca gct agt aat gga aac gaa gga agc tct cag gct ggt 1498Leu Lys Ala Ala Ala Ser Asn Gly Asn Glu Gly Ser Ser Gln Ala Gly470                 475                 480cct ggt cct gat tct ggt tct ggt tct gct cct gct gct ggt cct ggt   1546Pro Gly Pro Asp Ser Gly Ser Gly Ser Ala Pro Ala Ala Gly Pro Gly485                 490                 495gca ggt ttt gca gga att cag ctc agc aga gta aca aga aac aac gca   1594Ala Gly Phe Ala Gly Ile Gln Leu Ser Arg Val Thr Arg Asn Asn Ala500                 505                 510ggg gac aca aac aat gag att aca cct gat cat aac aac tgagcagaga    1643Gly Asp Thr Asn Asn Glu Ile Thr Pro Asp His Asn Asn515                 520                 525tattatcttt tccatttaga ggatcatcat cagattttag cttcaaggtc cggttttgtg 1703gtttatacat aagttatagt gacttgattt ttttgttttg ttacaaagtt accatctttg 1763gattagaatt gggaaattga atctgtttgt aaaaaaaaaa aaaaaaaa              1811<210>14<211>526<212>PRT<213>Arabidopsis thaliana<400>14Met Gly His Gly Gly Glu Gly Met Ser Leu Glu Phe Thr Pro Thr Trp1               5                  10                  15Val Val Ala Gly Val Cys Thr Val Ile Val Ala Ile Ser Leu Ala Val20                  25                  30Glu Arg Leu Leu His Tyr Phe Gly Thr Val Leu Lys Lys Lys Lys Gln35                  40                  45Lys Pro Leu Tyr Glu Ala Leu Gln Lys Val Lys Glu Glu Leu Met Leu50                  55                  60Leu Gly Phe Ile Ser Leu Leu Leu Thr Val Phe Gln Gly Leu Ile Ser65                  70                  75                  80Lys Phe Cys Val Lys Glu Asn Val Leu Met His Met Leu Pro Cys Ser85                  90                  95Leu Asp Ser Arg Arg Glu Ala Gly Ala Ser Glu His Lys Asn Val Thr100                 105                 110Ala Lys Glu His Phe Gln Thr Phe Leu Pro Ile Val Gly Thr Thr Arg115                 120                 125Arg Leu Leu Ala Glu His Ala Ala Val Gln Val Gly Tyr Cys Ser Glu130                 135                 140Lys Gly Lys Val Pro Leu Leu Ser Leu Glu Ala Leu His His Leu His145                 150                 155                 160Ile Phe Ile Phe Val Leu Ala Ile Ser His Val Thr Phe Cys Val Leu165                 170                 175Thr Val Ile Phe Gly Ser Thr Arg Ile His Gln Trp Lys Lys Trp Glu180                 185                 190Asp Ser Ile Ala Asp Glu Lys Phe Asp Pro Glu Thr Ala Leu Arg Lys195                 200                 205Arg Arg Val Thr His Val His Asn His Ala Phe Ile Lys Glu His Phe210                 215                 220Leu Gly Ile Gly Lys Asp Ser Val Ile Leu Gly Trp Thr Gln Ser Phe225                 230                 235                 240Leu Lys Gln Phe Tyr Asp Ser Val Thr Lys Ser Asp Tyr Val Thr Leu245                 250                 255Arg Leu Gly Phe Ile Met Thr His Cys Lys Gly Asn Pro Lys Leu Asn260                 265                 270Phe His Lys Tyr Met Met Arg Ala Xaa Glu Asp Asp Phe Lys Gln Val275                 280                 285Val Gly Ile Ser Trp Tyr Leu Trp Ile Phe Val Val Ile Phe Leu Leu290                 295                 300Leu Asn Val Asn Gly Trp His Thr Tyr Phe Trp Ile Ala Phe Ile Pro305                 310                 315                 320Phe Ala Leu Leu Leu Ala Val Gly Thr Lys Leu Glu His Val Ile Ala325                 330                 335Gln Leu Ala His Glu Val Ala Glu Lys His Val Ala Ile Glu Gly Asp340                 345                 350Leu Val Val Lys Pro Ser Asp Glu His Phe Trp Phe Ser Lys Pro Gln355                 360                 365Ile Val Leu Tyr Leu Ile His Phe Ile Leu Phe Gln Asn Ala Phe Glu370                 375                 380Ile Ala Phe Phe Phe Trp Ile Trp Val Thr Tyr Gly Phe Asp Ser Cys385                 390                 395                 400Ile Met Gly Gln Val Arg Tyr Ile Val Pro Arg Leu Val Ile Gly Val405                 410                 415Phe Ile Gln Val Leu Cys Ser Tyr Ser Thr Leu Pro Leu Tyr Ala Ile420                 425                 430Val Ser Gln Met Gly Ser Ser Phe Lys Lys Ala Ile Phe Glu Glu Asn435                 440                 445Val Gln Val Gly Leu Val Gly Trp Ala Gln Lys Val Lys Gln Lys Arg450                 455                 460Asp Leu Lys Ala Ala Ala Ser Asn Gly Asn Glu Gly Ser Ser Gln Ala465                 470                 475                 480Gly Pro Gly Pro Asp Ser Gly Ser Gly Ser Ala Pro Ala Ala Gly Pro485                 490                 495Gly Ala Gly Phe Ala Gly Ile Gln Leu Ser Arg Val Thr Arg Asn Asn500                 505                 510Ala Gly Asp Thr Asn Asn Glu Ile Thr Pro Asp His Asn Asn515                 520                 525<210>15<211>1782<212>DNA<213>Arabidopsis thaliana<220><221>CDS<222>(1)..(1779)<220><221>misc_feature<222>(274)..(313)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1327)..(1370)<223>SEQ ID NO:2所示氨基酸序列的位置<400>15atg gga atc atc gac ggt tct ttg ctt cgg cgg ttg att tgt ctc tgt   48Met Gly Ile Ile Asp Gly Ser Leu Leu Arg Arg Leu Ile Cys Leu Cys  1               5                  10                  15ctc tgg tgt ctt ctc ggt gga gga gtg acg gtg gtt acg gcg gag gat   96Leu Trp Cys Leu Leu Gly Gly Gly Val Thr Val Val Thr Ala Glu Asp20                  25                  30gag aag aaa gtg gta cat aaa cag ctt aat caa act ccg act tgg gct   144Glu Lys Lys Val Val His Lys Gln Leu Asn Gln Thr Pro Thr Trp Ala35                  40                  45gtt gct gct gtt tgt act ttc ttc atc gtt gtt tct gtt ctt ctt gaa   192Val Ala Ala Val Cys Thr Phe Phe Ile Val Val Ser Val Leu Leu Glu50                  55                  60aaa ctt ctt cac aaa gtt gga aag gtt cta tgg gat cgg cac aag aca   240Lys Leu Leu His Lys Val Gly Lys Val Leu Trp Asp Arg His Lys Thr65                  70                  75                  80gct ctt ctt gac gct ttg gag aag atc aaa gca gag ctg atg gtt ctt   288Ala Leu Leu Asp Ala Leu Glu Lys Ile Lys Ala Glu Leu Met Val Leu85                  90                  95gga ttc atc tct ttg ctt ctg aca ttt gga caa acc tac att ttg gat   336Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Gln Thr Tyr Ile Leu Asp100                 105                 110att tgt atc cct tca cat gtt gct cgt acg atg ctc ccg tgt cct gct   384Ile Cys Ile Pro Ser His Val Ala Arg Thr Met Leu Pro Cys Pro Ala115                 120                 125cct aac ttg aaa aag gag gat gat gac aat ggt gaa agt cac agg aga   432Pro Asn Leu Lys Lys Glu Asp Asp Asp Asn Gly Glu Ser His Arg Arg130                 135                 140ctc ttg tcg ttt gag cac aga ttt tta tct gga ggt gaa gca tct ccc   480Leu Leu Ser Phe Glu His Arg Phe Leu Ser Gly Gly Glu Ala Ser Pro145                 150                 155                 160act aaa tgc acg aag gag ggt tat gta gag ctt atc tct gcc gag gca   528Thr Lys Cys Thr Lys Glu Gly Tyr Val Glu Leu Ile Ser Ala Glu Ala165                 170                 175ctc cat cag ttg cac atc ctt ata ttc ttc tta gcc att ttc cac gtt   576Leu His Gln Leu His Ile Leu Ile Phe Phe Leu Ala Ile Phe His Val180                 185                 190ctt tac agc ttc tta act atg atg ctt gga agg ttg aag att cgc gga   624Leu Tyr Ser Phe Leu Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly195                 200                 205tgg aag cat tgg gag aat gag aca tca tcc cat aat tac gag ttt tca   672Trp Lys His Trp Glu Asn Glu Thr Ser Ser His Asn Tyr Glu Phe Ser210                 215                 220aca gac act tcc agattc agg cta act cat gaa aca tct ttt gtg aga   720Thr Asp Thr Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg225                 230                 235                 240gcg cac acc agt ttc tgg acc cgg att cca ttc ttt ttc tat gtt gga   768Ala His Thr Ser Phe Trp Thr Arg Ile Pro Phe Phe Phe Tyr Val Gly245                 250                  255tgc ttt ttc aga cag ttt ttc aga tcc gtt ggg aga act gac tat ttg   816Cys Phe Phe Arg Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu260                 265                 270aca ttg aga aat ggt ttc atc gct gtt cat tta gct cca gga agt caa   864Thr Leu Arg Asn Gly Phe Ile Ala Val His Leu Ala Pro Gly Ser Gln275                 280                 285ttt aac ttc caa aaa tac att aaa aga tcg ttg gag gat gat ttc aag   912Phe Asn Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys290                 295                 300gta gtc gtt gga gtc agc cct gtc ttg tgg gga tct ttt gtg cta ttc   960Val Val Val Gly Val Ser Pro Val Leu Trp Gly Ser Phe Val Leu Phe305                 310                 315                 320ctc ctc cta aat att gac ggt gag tat atg atg ttc atc ggc act gca   1008Leu Leu Leu Asn Ile Asp Gly Glu Tyr Met Met Phe Ile Gly Thr Ala325                 330                 335ata ccg gtt att atc att tta gct gta ggg aca aag ctt caa gcg att   1056Ile Pro Val Ile Ile Ile Leu Ala Val Gly Thr Lys Leu Gln Ala Ile340                 345                 350atg aca agg atg gct ctt ggt atc aca gat aga cat gcg gta gtt caa   1104Met Thr Arg Met Ala Leu Gly Ile Thr Asp Arg His Ala Val Val Gln355                 360                 365gga atg ccg ctt gta caa ggc aac gat gag tat ttc tgg ttc ggt cgt   1152Gly Met Pro Leu Val Gln Gly Asn Asp Glu Tyr Phe Trp Phe Gly Arg370                 375                 380ccc cat ttg att ctc cat ctc atg cat ttc gcc ttg ttt cag aac gca   1200Pro His Leu Ile Leu His Leu Met His Phe Ala Leu Phe Gln Asn Ala385                 390                 395                 400ttt cag atc act tat ttc ttc tgg ata tgg tat tcc ttt gga tca gat   1248Phe Gln Ile Thr Tyr Phe Phe Trp Ile Trp Tyr Ser Phe Gly Ser Asp405                 410                 415tct tgc tac cat cct aat ttc aag att gca ctt gta aaa gta gcg att   1296Ser Cys Tyr His Pro Asn Phe Lys Ile Ala Leu Val Lys Val Ala Ile420                 425                 430gct tta gga gta ttg tgt ctt tgc agc tac atc aca ctt cct ctt tac   1344Ala Leu Gly Val Leu Cys Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr435                 440                 445gca ctc gta act cag atg ggt tct cgg atg aaa aaa tcg gta ttc gat    1392Ala Leu Val Thr Gln Met Gly Ser Arg Met Lys Lys Ser Val Phe Asp450                 455                 460gaa caa acg tca aaa gca ctc aag aaa tgg aga atg gca gtg aag aag   1440Glu Gln Thr Ser Lys Ala Leu Lys Lys Trp Arg Met Ala Val Lys Lys465                 470                 475                 480aag aaa ggt gtg aaa gcc act act aag aga cta ggt gga gat gga agt   1488Lys Lys Gly Val Lys Ala Thr Thr Lys Arg Leu Gly Gly Asp Gly Ser485                 490                 495gcg agc cct acg gca tcg aca gtt agg tct act tcg tct gta cgt tca   1536Ala Ser Pro Thr Ala Ser Thr Val Arg Ser Thr Ser Ser Val Arg Ser500                 505                 510ttg cag cgt tac aaa acc aca cca cat tcg atg aga tac gaa gga ctt   1584Leu Gln Arg Tyr Lys Thr Thr Pro His Ser Met Arg Tyr Glu Gly Leu515                 520                 525gac cct gaa aca tcg gat ctc gac aca gat aat gaa gct ttg act cct   1632Asp Pro Glu Thr Ser Asp Leu Asp Thr Asp Asn Glu Ala Leu Thr Pro530                 535                 540ccc aaa tct cct cca agc ttc gag ctt gtt gtg aaa gtt gaa cca aat   1680Pro Lys Ser Pro Pro Ser Phe Glu Leu Val Val Lys Val Glu Pro Asn545                 550                 555                 560aag acc aat acc ggt gag act agc cgt gac act gaa act gat tct aaa   1728Lys Thr Asn Thr Gly Glu Thr Ser Arg Asp Thr Glu Thr Asp Ser Lys565                 570                 575gag ttc tct ttc gtc aag cct gct ccg agt aat gaa tca tct caa gac   1776Glu Phe Ser Phe Val Lys Pro Ala Pro Ser Asn Glu Ser Ser Gln Asp580                 585                 590cgg tga                                                           1782Arg<210>16<211>593<212>PRT<213>Arabidopsis thaliana<400>16Met Gly Ile Ile Asp Gly Ser Leu Leu Arg Arg Leu Ile Cys Leu Cys1               5                  10                  15Leu Trp Cys Leu Leu Gly Gly Gly Val Thr Val Val Thr Ala Glu Asp20                  25                  30Glu Lys Lys Val Val His Lys Gln Leu Asn Gln Thr Pro Thr Trp Ala35                  40                  45Val Ala Ala Val Cys Thr Phe Phe Ile Val Val Ser Val Leu Leu Glu50                  55                  60Lys Leu Leu His Lys Val Gly Lys Val Leu Trp Asp Arg His Lys Thr65                  70                  75                  80Ala Leu Leu Asp Ala Leu Glu Lys Ile Lys Ala Glu Leu Met Val Leu85                  90                  95Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Gln Thr Tyr Ile Leu Asp100                 105                 110Ile Cys Ile Pro Ser His Val Ala Arg Thr Met Leu Pro Cys Pro Ala115                 120                 125Pro Asn Leu Lys Lys Glu Asp Asp Asp Asn Gly Glu Ser His Arg Arg130                 135                 140Leu Leu Ser Phe Glu His Arg Phe Leu Ser Gly Gly Glu Ala Ser Pro145                 150                 155                 160Thr Lys Cys Thr Lys Glu Gly Tyr Val Glu Leu Ile Ser Ala Glu Ala165                 170                 175Leu His Gln Leu His Ile Leu Ile Phe Phe Leu Ala Ile Phe His Val180                 185                 190Leu Tyr Ser Phe Leu Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly195                 200                 205Trp Lys His Trp Glu Asn Glu Thr Ser Ser His Asn Tyr Glu Phe Ser210                 215                 220Thr Asp Thr Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg225                 230                 235                 240Ala His Thr Ser Phe Trp Thr Arg Ile Pro Phe Phe Phe Tyr Val Gly245                 250                 255Cys Phe Phe Arg Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu260                 265                 270Thr Leu Arg Asn Gly Phe Ile Ala Val His Leu Ala Pro Gly Ser Gln275                 280                 285Phe Asn Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys290                 295                 300Val Val Val Gly Val Ser Pro Val Leu Trp Gly Ser Phe Val Leu Phe305                 310                 315                 320Leu Leu Leu Asn Ile Asp Gly Glu Tyr Met Met Phe Ile Gly Thr Ala325                 330                 335Ile Pro Val Ile Ile Ile Leu Ala Val Gly Thr Lys Leu Gln Ala Ile            340                 345                 350Met Thr Arg Met Ala Leu Gly Ile Thr Asp Arg His Ala Val Val Gln355                 360                 365Gly Met Pro Leu Val Gln Gly Asn Asp Glu Tyr Phe Trp Phe Gly Arg370                 375                 380Pro His Leu Ile Leu His Leu Met His Phe Ala Leu Phe Gln Asn Ala385                 390                 395                 400Phe Gln Ile Thr Tyr Phe Phe Trp Ile Trp Tyr Ser Phe Gly Ser Asp405                 410                 415Ser Cys Tyr His Pro Asn Phe Lys Ile Ala Leu Val Lys Val Ala Ile420                 425                 430Ala Leu Gly Val Leu Cys Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr435                 440                 445Ala Leu Val Thr Gln Met Gly Ser Arg Met Lys Lys Ser Val Phe Asp450                 455                 460Glu Gln Thr Ser Lys Ala Leu Lys Lys Trp Arg Met Ala Val Lys Lys465                 470                 475                 480Lys Lys Gly Val Lys Ala Thr Thr Lys Arg Leu Gly Gly Asp Gly Ser485                 490                 495Ala Ser Pro Thr Ala Ser Thr Val Arg Ser Thr Ser Ser Val Arg Ser500                 505                 510Leu Gln Arg Tyr Lys Thr Thr Pro His Ser Met Arg Tyr Glu Gly Leu515                 520                 525Asp Pro Glu Thr Ser Asp Leu Asp Thr Asp Asn Glu Ala Leu Thr Pro530                 535                 540Pro Lys Ser Pro Pro Ser Phe Glu Leu Val Val Lys Val Glu Pro Asn545                 550                 555                 560Lys Thr Asn Thr Gly Glu Thr Ser Arg Asp Thr Glu Thr Asp Ser Lys565                 570                 575Glu Phe Ser Phe Val Lys Pro Ala Pro Ser Asn Glu Ser Ser Gln Asp580                 585                 590Arg<210>17<211>1629<212>DNA<213>Arabidopsis thaliana<220><221>CDS<222>(1)..(1626)<220><221>misc_feature<222>(184)..(223)<223>SEQ ID NO:1所示氨基酸序列的位置<220><221>misc_feature<222>(1141)..(1183)<223>SEQ ID NO:2所示氨基酸序列的位置<400>17atg gag cat atg atg aaa gaa gga agg tct ctt gca gag acg ccg act    48Met Glu His Met Met Lys Glu Gly Arg Ser Leu Ala Glu Thr Pro Thr1               5                  10                  15tac tct gtt gct tcg gtt gtt act gtt ttg gtc ttt gtt tgc ttt ctc   96Tyr Ser Val Ala Ser Val Val Thr Val Leu Val Phe Val Cys Phe Leu20                  25                  30gtt gaa cgc gcc att tac aga ttt gga aag tgg tta aag aag act aga   144Val Glu Arg Ala Ile Tyr Arg Phe Gly Lys Trp Leu Lys Lys Thr Arg35                  40                  45aga aag gca ctt ttt act tca ctt gag aaa atg aaa gag gag ttg atg   192Arg Lys Ala Leu Phe Thr Ser Leu Glu Lys Met Lys Glu Glu Leu Met50                  55                  60ttg ctg gga ctt ata tca ctt ctg ttg tca caa agc gcg aga tgg att   240Leu Leu Gly Leu Ile Ser Leu Leu Leu Ser Gln Ser Ala Arg Trp Ile65                  70                  75                  80tca gaa atc tgt gtt aac tct tcc ctt ttc aat agt aaa ttc tac att   288Ser Glu Ile Cys Val Asn Ser Ser Leu Phe Asn Ser Lys Phe Tyr Ile85                  90                  95tgc tct gaa gag gac tat gga atc cat aag aaa gtt ctt ctg gaa cac   336Cys Ser Glu Glu Asp Tyr Gly Ile His Lys Lys Val Leu Leu Glu His100                 105                 110acc tct tct aca aac cag agc tcc tta cct cat cat gga ata cat gaa   384Thr Ser Ser Thr Asn Gln Ser Ser Leu Pro His His Gly Ile His Glu115                 120                 125gcc tct cat caa tgt ggt cat ggc cgt gaa cca ttt gtg tcg tat gag   432Ala Ser His Gln Cys Gly His Gly Arg Glu Pro Phe Val Ser Tyr Glu130                 135                 140gga ctc gag caa ctc cta agattc tta ttc gtc ctg ggt atc act cat   480Gly Leu Glu Gln Leu Leu Arg Phe Leu Phe Val Leu Gly Ile Thr His145                 150                 155                160gtt cta tac agt ggc att gcc att ggt tta gcc atg agc aag att tac   528Val Leu Tyr Ser Gly Ile Ala Ile Gly Leu Ala Met Ser Lys Ile Tyr165                 170                 175agt tgg aga aaa tgg gaa gcc caa gcg atc ata atg gct gaa tca gat   576Ser Trp Arg Lys Trp Glu Ala Gln Ala Ile Ile Met Ala Glu Ser Asp180                 185                 190atc cac ctt tgt ttc ctg cgg caa ttt aga ggc tcc ata cga aag tct    624Ile His Leu Cys Phe Leu Arg Gln Phe Arg Gly Ser Ile Arg Lys Ser195                 200                 205gac tac ttc gca ctt cgg tta ggt ttc ctc act aaa cat aat ttg cca   672Asp Tyr Phe Ala Leu Arg Leu Gly Phe Leu Thr Lys His Asn Leu Pro210                 215                 220ttt aca tac aac ttc cat atg tat atg gta cgg acg atg gaa gat gag   720Phe Thr Tyr Asn Phe His Met Tyr Met Val Arg Thr Met Glu Asp Glu225                 230                 235                 240ttt cat ggc att gtt gga att agc tgg cca ctt tgg gtt tac gct ata   768Phe His Gly Ile Val Gly Ile Ser Trp Pro Leu Trp Val Tyr Ala Ile245                 250                 255gta tgc atc tgc ata aat gtt cat ggc ctg aat atg tac ttt tgg ata   816Val Cys Ile Cys Ile Asn Val His Gly Leu Asn Met Tyr Phe Trp Ile260                 265                 270tca ttc gtt cct gcc att ctt gtc atg ttg gtt gga acc aaa ctt gag   864Ser Phe Val Pro Ala Ile Leu Val Met Leu Val Gly Thr Lys Leu Glu275                 280                 285cat gtt gtc tcc aag ctt gct ctc gag gtt aag gag cag cag aca ggc   912His Val Val Ser Lys Leu Ala Leu Glu Val Lys Glu Gln Gln Thr Gly290                 295                 300aca tct aat ggg gct caa gtc aaa cca cgt gat ggg ctc ttc tgg ttt   960Thr Ser Asn Gly Ala Gln Val Lys Pro Arg Asp Gly Leu Phe Trp Phe305                 310                 315                 320ggg aaa cca gaa att ctg cta cgg ttg ata caa ttt atc att ttt cag   1008Gly Lys Pro Glu Ile Leu Leu Arg Leu Ile Gln Phe Ile Ile Phe Gln325                 330                 335aat gca ttt gaa atg gca aca ttc atc tgg ttc ttg tgg gga atc aag   1056Asn Ala Phe Glu Met Ala Thr Phe Ile Trp Phe Leu Trp Gly Ile Lys340                 345                 350gaa aga tct tgc ttc atg aag aac cat gtg atg ata tca agc cgg cta   1104Glu Arg Ser Cys Phe Met Lys Asn His Val Met Ile Ser Ser Arg Leu355                 360                 365att tct ggg gtt ctc gtt cag ttc tgg tgt agt tat ggc act gtg cct   1152Ile Ser Gly Val Leu Val Gln Phe Trp Cys Ser Tyr Gly Thr Val Pro370                 375                  380ctc aat gta atc gtt act cag atg gga tct cgg cat aag aaa gct gtg   1200Leu Asn Val Ile Val Thr Gln Met Gly Ser Arg His Lys Lys Ala Val385                 390                 395                 400ata gca gag agc gta aga gac tca ctt cac agt tgg tgc aag aga gtg   1248Ile Ala Glu Ser Val Arg Asp Ser Leu His Ser Trp Cys Lys Arg Val405                 410                 415aaa gag agg tct aag cac acg aga tca gtg tgt tcc ctt gac aca gca   1296Lys Glu Arg Ser Lys His Thr Arg Ser Val Cys Ser Leu Asp Thr Ala420                 425                 430aca ata gac gag aga gac gag atg aca gtg ggg aca ttg tct agg agc   1344Thr Ile Asp Glu Arg Asp Glu Met Thr Val Gly Thr Leu Ser Arg Ser435                 440                 445tca tcg atg act tca ctg aat cag att acc ata aac tcc ata gac caa   1392Ser Ser Met Thr Ser Leu Asn Gln Ile Thr Ile Asn Ser Ile Asp Gln450                 455                 460gca gag tct ata ttc gga gca gca gct tca tcc agc agt cct caa gat    1440Ala Glu Ser Ile Phe Gly Ala Ala Ala Ser Ser Ser Ser Pro Gln Asp465                 470                 475                 480gga tac acg tcg agg gtg gaa gaa tat ctg tct gaa aca tac aat aac   1488Gly Tyr Thr Ser Arg Val Glu Glu Tyr Leu Ser Glu Thr Tyr Asn Asn485                 490                 495atc ggt tcg ata ccg cct tta aac gat gag att gag att gag att gaa   1536Ile Gly Ser Ile Pro Pro Leu Asn Asp Glu Ile Glu Ile Glu Ile Glu500                 505                 510ggt gaa gaa gat aat gga ggg aga gga agt ggg agt gat gag aat aac   1584Gly Glu Glu Asp Asn Gly Gly Arg Gly Ser Gly Ser Asp Glu Asn Asn515                 520                 525ggt gat gct gga gaa aca ctt ctt gag ttg ttt agg agg act tga        1629Gly Asp Ala Gly Glu Thr Leu Leu Glu Leu Phe Arg Arg Thr530                 535                 540<210>18<211>542<212>PRT<213>Arabidopsis thaliana<400>18Met Glu His Met Met Lys Glu Gly Arg Ser Leu Ala Glu Thr Pro Thr1               5                  10                  15Tyr Ser Val Ala Ser Val Val Thr Val Leu Val Phe Val Cys Phe Leu             20                  25                  30Val Glu Arg Ala Ile Tyr Arg Phe Gly Lys Trp Leu Lys Lys Thr Arg35                  40                  45Arg Lys Ala Leu Phe Thr Ser Leu Glu Lys Met Lys Glu Glu Leu Met50                  55                  60Leu Leu Gly Leu Ile Ser Leu Leu Leu Ser Gln Ser Ala Arg Trp Ile65                  70                  75                  80Ser Glu Ile Cys Val Asn Ser Ser Leu Phe Asn Ser Lys Phe Tyr Ile85                  90                  95Cys Ser Glu Glu Asp Tyr Gly Ile His Lys Lys Val Leu Leu Glu His100                 105                 110Thr Ser Ser Thr Asn Gln Ser Ser Leu Pro His His Gly Ile His Glu115                 120                 125Ala Ser His Gln Cys Gly His Gly Arg Glu Pro Phe Val Ser Tyr Glu130                 135                 140Gly Leu Glu Gln Leu Leu Arg Phe Leu Phe Val Leu Gly Ile Thr His145                 150                 155                 160Val Leu Tyr Ser Gly Ile Ala Ile Gly Leu Ala Met Ser Lys Ile Tyr165                 170                 175Ser Trp Arg Lys Trp Glu Ala Gln Ala Ile Ile Met Ala Glu Ser Asp180                 185                 190Ile His Leu Cys Phe Leu Arg Gln Phe Arg Gly Ser Ile Arg Lys Ser195                 200                 205Asp Tyr Phe Ala Leu Arg Leu Gly Phe Leu Thr Lys His Asn Leu Pro210                 215                 220Phe Thr Tyr Asn Phe His Met Tyr Met Val Arg Thr Met Glu Asp Glu225                 230                 235                 240Phe His Gly Ile Val Gly Ile Ser Trp Pro Leu Trp Val Tyr Ala Ile245                 250                 255Val Cys Ile Cys Ile Asn Val His Gly Leu Asn Met Tyr Phe Trp Ile260                 265                 270Ser Phe Val Pro Ala Ile Leu Val Met Leu Val Gly Thr Lys Leu Glu275                 280                 285His Val Val Ser Lys Leu Ala Leu Glu Val Lys Glu Gln Gln Thr Gly290                 295                 300Thr Ser Asn Gly Ala Gln Val Lys Pro Arg Asp Gly Leu Phe Trp Phe305                 310                 315                 320Gly Lys Pro Glu Ile Leu Leu Arg Leu Ile Gln Phe Ile Ile Phe Gln325                 330                 335Asn Ala Phe Glu Met Ala Thr Phe Ile Trp Phe Leu Trp Gly Ile Lys340                 345                 350Glu Arg Ser Cys Phe Met Lys Asn His Val Met Ile Ser Ser Arg Leu355                 360                 365Ile Ser Gly Val Leu Val Gln Phe Trp Cys Ser Tyr Gly Thr Val Pro370                 375                 380Leu Asn Val Ile Val Thr Gln Met Gly Ser Arg His Lys Lys Ala Val385                 390                 395                 400Ile Ala Glu Ser Val Arg Asp Ser Leu His Ser Trp Cys Lys Arg Val405                 410                 415Lys Glu Arg Ser Lys His Thr Arg Ser Val Cys Ser Leu Asp Thr Ala420                 425                 430Thr Ile Asp Glu Arg Asp Glu Met Thr Val Gly Thr Leu Ser Arg Ser435                 440                 445Ser Ser Met Thr Ser Leu Asn Gln Ile Thr Ile Asn Ser Ile Asp Gln450                 455                 460Ala Glu Ser Ile Phe Gly Ala Ala Ala Ser Ser Ser Ser Pro Gln Asp465                 470                 475                 480Gly Tyr Thr Ser Arg Val Glu Glu Tyr Leu Ser Glu Thr Tyr Asn Asn485                 490                 495Ile Gly Ser Ile Pro Pro Leu Asn Asp Glu Ile Glu Ile Glu Ile Glu500                 505                 510Gly Glu Glu Asp Asn Gly Gly Arg Gly Ser Gly Ser Asp Glu Asn Asn515                 520                 525Gly Asp Ala Gly Glu Thr Leu Leu Glu Leu Phe Arg Arg Thr530                 535                 540<210>19<211>20<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>19atgtcggaca aaaaaggggt<210>0<211>20<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>20atgctaccac acgcagatcg    20<210>21<211>21<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>21acagagacca cctccttgga a    21<210>22<211>22<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>22cagaaacttg tctcatccct gg    22<210>23<211>20<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>23aagaactgcc tgaagaaggc                20<210>24<211>20<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>24caccaccttc atgatgctca    20<210>25<211>20<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>25ttccagcacc ggcacaagaa    20<210>26<211>28<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>26tggacctctt catgttcgat cccatctg    28<210>27<211>27<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>27cctgacgctg ttccagaatg cgtttca           27<210>28<211>20<212>DNA<213>人工序列<223>人工序列描述:寡核苷酸<400>28acttctgcag gtcgactcta    20<210>29<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>29aagatcaaga tgaggacgtg gaagtcgtgg    30<210>30<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>30aggctgaacc actggggcgc ctctcaccac    30<210>31<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>31caagtatatg atgcgcgctc tagaggatga    30<210>32<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸aggtttcacc actaagtctc cttcaatggc    30<210>33<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>33gatcattcaa gacttaggct cactcatgag    30<210>34<211>30<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>34aacagcaagg aagattacaa atgatgccca    30<210>35<211>18<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>35ggattaagat ctaatggc    18<210>36<211>23<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>36caaagatctt catttcttaa aag    23<210>37<211>25<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>37gcggatccat gtcggacaaa aaagg    25<210>38<211>25<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>38gcggatcctc atccctggct gaagg    25<210>39<211>23<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>39ggatccacca tggccacaag atg    23<210>40<211>24<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>40ggatccttag tcaatatcat tagc              24<210>41<211>27<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>41gcggatccat gggtcacgga ggagaag    27<210>42<211>27<212>DNA<213>人工序列<220><223>人工序列描述:寡核苷酸<400>42gcggatcctc agttgttatg atcagga    27sequence listing <110> Novartis <120> Genes controlling disease <130>S-30431/A <140>CGC1989 <141>1998-03-17 <150> US 09/042763 <151>1998-03-17 <160>42 <170>Patent In Ver. 2.0 <210>1 <211>13 <212>PRT <213> Artificial sequence <220> <223> Artificial sequence description: Conserved amino acid sequence <400>1Glu Leu Met Xaa Xaa Gly Xaa Ile Ser Leu Leu Leu Xaa1 5 10 <210>2 <211>14 <212>PRT <213> Artificial sequence <220> <223> Artificial sequence description: Conserved amino acid sequence <400>2Xaa Thr Xaa Pro Leu Xaa Xaa Xaa Val Xaa Gln Met Gly Ser1 5 10 <210>3 <211>1868 <212>DNA <213>Triticum sp. <220> <221> CDS <222>(176). . (1777) <220> <221>misc_feature <222>(365). . (403) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1352). . (1393) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400>3cacgcccaca cttcgccaac acacaacgta cctgcgtacg tacgctttcc atttcctttc 60ttgctccggc cggccggcca cgtagaatag atacccggcc aggtaggtac ctcgttggct 120cagacgaccg gcggctgggt ctccggacaa ggaaagaggt tgcgctcggg gaccg atg  178Met1gcg gac gac gac gag tac ccc cca gcg agg acg ctg ccg gag acg ccg   226Ala Asp Asp Asp Glu Tyr Pro Pro Ala Arg Thr Leu Pro Glu Thr Pro5                  10                  15tcc tgg gcg gtg gcc ctc gtc ttc gcc gtc atg atc atc gtg tcc gtc   274Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser Val20                  25                  30ctc ctg gag cac gcg ctc cat aag ctc ggc cat tgg ttc cac aag cgg   322Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys Arg35                  40                  45cac aag aac gcg ctg gcg gag gcg ctg gag aag atc aag gcg gag ctc   370His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu Leu50                  55                  60                  65atg ctg gtg ggc ttc atc tcg ctg ctg ctc gcc gtg acg cag gac ccc   418Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp Pro70              75                      80atc tcc ggg ata tgc atc tcc gag aag gcc gcc agc atc atg cgg ccc   466Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg Pro85                  90                  95tgc aag ctg ccc cct ggc tcc gtc aag agc aag tac aaa gac tac tac   514Cys Lys Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr Tyr100                 105                 110tgc gcc aaa cag ggc aag gtg tcg ctc atg tcc acg ggc agc ttg cac   562Cys Ala Lys Gln Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu His115                 120                 125cag ctg cac ata ttc atc ttc gtg ctc gcc gtc ttc cat gtc acc tac   610Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr Tyr130                     135             140                 145agc gtc atc atc atg gct cta agc cgt ctc aaa atg aga acc tgg aag   658Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp Lys150                 155                 160aaa tgg gag aca gag acc gcc tcc ctg gaa tac cag ttc gca aat gat   706Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn Asp165                 170                 175cct gcg cgg ttc cgc ttc acg cac cag acg tcg ttc gtg aag cgg cac   754Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg His180                 185                 190ctg ggc ctc tcc agc acc ccc ggc gtc aga tgg gtg gtg gcc ttc ttc   802Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe Phe195                 200                 205agg cag ttc ttc agg tcg gtc acc aag gtg gac tac ctc acc ttg agg   850Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu Arg210                 215                 220                 225gca ggc ttc atc aac gcg cat ttg tcg cat aac agc aag ttc gac ttc   898Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp Phe230                 235                 240cac aag tac atc aag agg tcc atg gag gac gac ttc aaa gtc gtc gtt   946His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val Val245                 250                 255ggc atc agc ctc ccg ctg tgg tgt gtg gcg atc ctc acc ctc ttc ctt   994Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe Leu260                 265                 270gac att gac ggg atc ggc acg ctc acc tgg att tct ttc atc cct ctc   1042Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro Leu275                 280                 285gtc atc ctc ttg tgt gtt gga acc aag ctg gag atg atc atc atg gag   1090Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met Glu290                 295                 300                 305atg gcc ctg gag atc cag gac cgg gcg agc gtc atc aag ggg gcg ccc   1138Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala Pro310                 315                 320gtg gtt gag ccc agc aac aag ttc ttc tgg ttc cac cgc ccc gac tgg   1186Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp Trp325                 330                 335gtc ctc ttc ttc ata cac ctg acg cta ttc cag aac gcg ttt cag atg   1234Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln Met340                 345                 350gca cat ttc gtg tgg aca gtg gcc acg ccc ggc ttg aag aaa tgc ttc   1282Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys Phe355                 360                 365cat atg cac atc ggg ctg agc atc atg aag gtc gtg ctg ggg ctg gct   1330His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu Ala370                 375                 380                 385ctt cag ttc ctc tgc agc tat atc acc ttc ccg ctc tac gcg ctc gtc   1378Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu Val390                 395                 400aca cag atg gga tca aac atg aag agg tcc atc ttc gac gag cag acg   1426Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln Thr405                 410                  415gcc aag gcg ctg aca aac tgg cgg aac acg gcc aag gag aag aag aag   1474Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys Lys420                 425                 430gtc cga gac acg gac atg ctg atg gcg cag atg atc ggc gac gcg acg   1522Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala Thr435 440                 445ccc agc cga ggg gcg tcg ccc atg cct agc cgg ggc tcg tcg cca gtg   1570Pro Ser Arg Gly Ala Ser Pro Met Pro Ser Arg Gly Ser Ser Pro Val450                 455                 460                 465cac ctg ctt cac aag ggc atg gga cgg tcc gac gat ccc cag agc acg   1618His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser Thr470                 475                 480cca acc tcg cca agg gcc atg gag gag gct agg gac atg tac ccg gtt   1666Pro Thr Ser Pro Arg Ala Met Glu Glu Ala Arg Asp Met Tyr Pro Val485 490                     495gtg gtg gcg cat cca gtg cac aga cta aat cct gct gac agg aga agg   1714Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg Arg500                 505                 510tcg gtc tcg tcg tcg gca ctc gat gtc gac att ccc agc gca gat ttt 1762Ser Val Ser Ser Ser Ala Leu Asp Val Asp Ile Pro Ser Ala Asp Phe515                 520                 525tcc ttc agc cag gga tgagacaagt ttctgtattg atgttagtcc aatgtatagc   1817Ser Phe Ser Gln Gly530caacatagga tgtcatgatt cgtacaataa gaaatacaaa tttttactga g          1868 <210>4 <211>534 <212>PRT <213>Triticum sp. <400> 4MET ALA ALA ASP ASP ASP GLU TYR Pro Pro Ala Ala ARG THR Leu Glu THR1 5 10 15Pro Serp Ala Val Phe Ala Val Met Ile Val SELE Val SELE VAL GLU HIS ALA Leu His Lys Trp Phe His Lys35                 40                   45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                   55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70              75                      80Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Lys Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Gln Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115                 120                 125His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                     135                 140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225                 230                 235                 240She His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275                 280                 285Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405 410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                  425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Ala Ser Pro Met Pro Ser Arg Gly Ser Ser Pro450                 455                 460Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Thr Pro Thr Ser Pro Arg Ala Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala ASP ARG ARG500 505 510ARG Ser Val Ser Ser Ala Leu ASP Val Asp Ile Pro Sering <210>5 <211>1693 <212>DNA <213>Triticum sp. <220> <221> CDS <222> (1). . (1602) <220> <221>misc_feature <222>(190). . (228) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1177). . (1218) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400> 5ATG GCG GAG GAC TAC GAG TAG TAC CCC CCC CCG GCG CGG CTG CTG CCG GAG 48MET ALA Glu ASP Tyr Pro Pro Ala Ala GCCG GCG GCG G GCG G GCG G GCG G GCG G TC -C -C -C -C -TC's trTC -m thatces -m -tr that atc gtg tcc 96Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val Met Ile Ile Val Ser20                  25                  30gtc ctc ctg gag cac gcg ctc cac aag ctc ggc cat tgg ttc cac aag 144Val Leu Leu Glu His Ala Leu His Lys Leu Gly His Trp Phe His Lys35                  40                  45cgg cac aag aac gcg ctg gcg gag gcg ctg gag aag atc aaa gcg gag 192Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                  55                  60ctg atg ctg gtg ggg ttc atc tcg ctg ctg ctc gcc gtg acg cag gac 240Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80cca atc tcc ggg ata tgc atc tcc gag aag gcc gcc agc atc atg cgg 288Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95ccc tgc agc ctg ccc cct ggt tcc gtc aag agc aag tac aaa gac tac 336Pro Cys Ser Leu Pro Pro Gly Ser Val Lys Sar Lys Tyr Lys Asp Tyr100                 105                 110tac tgc gcc aaa aag ggc aag gtg tcg cta atg tcc acg ggc agc ttg 384Tyr Cys Ala Lys Lys Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115                 120                 125cac cag ctc cac atg ttc atc ttc gtg ctc gcc gtc ttc cat gtc acc 432His Gln Leu His Met Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                 135                 140tac agc gtc atc atc atg gct cta agc cgt ctc aaa atg agg aca tgg 480Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160aag aaa tgg gag aca gag acc gyc tcc ttg gaa tac cag ttc gca aat 528Lys Lys Trp Glu Thr Glu Thr Xaa Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175gat cct gcg cgg ttc cgc ttc acg cac cag acg tcg ttc gtg aag cgt 576Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                  185                 190cac ctg ggc ctc tcc agc acc ccc ggc atc aga tgg gtg gtg gcc ttc 624His Leu Gly Leu Ser Ser Thr Pro Gly Ile Arg Trp Val Val Ala Phe195                 200                 205ttc agg cag ttc ttc agg tcg gtc acc aag gtg gac tac ctc acc ctg 672Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220agg gca ggc ttc atc aac gcg cat ttg tcg cat aac agc aag ttc gac 720Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225 230                 235                 240ttc cac aag tac atc aag agg tcc atg gag gac gac ttc aaa gtc gtc 768Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255gtt ggc atc agc ctc ccg ctg tgg tgt gtg gcg atc ctc acc ctc ttc 816Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270ctt gat att gac ggg atc ggc acg ctc acc tgg att tct ttc atc cct 864Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275 280                 285ctc gtc atc ctc ttg tgt gtt gga acc aag ctg gag atg atc atc atg 912Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300gag atg gcc ctg gag atc cag gac cgg gcg agc gtc atc aag ggg gcg 960Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320ccc gtg gtt gag ccc agc aac aag ttc ttc tgg ttc cac cgc ccc gac 1008Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325 330                 335tgg gtc ctc ttc ttc ata cac ctg acg ctg ttc cag aat gcg ttt cag 1056Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350atg gca cat ttc gtc tgg aca gtg gcc acg ccc ggc ttg aag aaa tgc 1104Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365ttc cat atg cac atc ggt ctg agc atc atg aag gtc gtg ctg ggg ctg 1152Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu    370 375                 380gct ctt cag ttc ctc tgc agc tat atc acc ttc ccc ctc tac gcg ctc 1200Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400gtc aca cag atg gga tcg aac atg aag agg tcc atc ttc gac gag cag 1248Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405                 410                 415acg gcc aag gcg ctg acc aac tgg cgg aac acg gcc aag gag aag aag 1296Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420 425                 430aag gtc cga gac acg gac atg ctg atg gcg cag atg atc ggc gac gcg 1344Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445acg ccc agc cga ggc acg tcg ccg atg cct agc cgg gct tcg tca ccg 1392Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Ala Ser Ser Pro450                 455                 460gtg cac ctg ctt cac aag ggc atg gga cgg tcc gac gat ccc cag agc 1440Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470 475                 480gcg ccg acc tcg cca agg acc atg gag gag gct agg gac atg tac ccg 1488Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495gtt gtg gtg gcg cat ccc gtg cac aga cta aat cct gct gac agg cgg 1536Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala Asp Arg Arg500                 505                 510agg tcg gtc tct tcg tcg gca ctc gat gcc gac atc ccc agc gca gat 1584Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp Ile Pro Ser Ala Asp515                 520 525ttt TCC TTC ATC CAG GGA TGAGACAGT TTCTGTAGTAGTCC 1632PHE Ser Gln Gly530AatgtataGA TGTCATAAAAAAAAATTTTTTGA 1692G 169939939993GA <210>6 <211>534 <212>PRT <213>Triticum sp. <400> 6MET ALA Glu ASP Tyr Glu Tyr Pro Pro Ala Ala ARG THR Leu Glu THR1 5 10 15Pro Serp Ala Val Phe Ala Val Met Ile Val SELE Val Glu Glu His Lys Leu His Lys Lys Trp Phe His Lys35                  40                  45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Ile Lys Ala Glu50                  55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80Pro Ile Ser Gly Ile Cys Ile Ser Glu Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Ser Leu Pro Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Lys Gly Lys Val Ser Leu Met Ser Thr Gly Ser Leu115             120                     125His Gln Leu His Met Phe Ile Phe Val Leu Ala Val Phe His Val Thr130             135                     140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145             150                      155                 160Lys Lys Trp Glu Thr Glu Thr Xaa Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Ile Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser His Asn Ser Lys Phe Asp225                 230                 235                 240Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Cys Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Ile Ser Phe Ile Pro275                 280                 285Leu Val Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Glu Met Ala Leu Glu Ile Gln Asp Arg Ala Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Lys Cys355                 360                 365Phe His Met His Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405 410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                 425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Ala Ser Ser Pro450                 455                 460Val His Leu Leu His Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala ASP ARG ARG500 505 510ARG Ser Val Ser Sera Leu ASP ALA Asp Ile Pro Sero Sera Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala ALA ALA ALA ALE515 <210>7 <211>1886 <212>DNA <213>Triticum sp. <220> <221> CDS <222>(198). . (1799) <220> <221>misc_feature <222>(387). . (425) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1374). . (1415) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400>7gcagcaacaa gctagacata cctgcgtgcg tacgtacgtt ttcgttttcc tttcttgctc 60cggccggccg gccggccacg tagaatagat acctgcccag gtacgtacct cgttggctca 120gacgatcggc ggttggactt gggtgcgcgc cctgccctgc tccggccaag gaaagaggtt 180gcgctaaaga cgggcgg atg gca aag gac gac ggg tac ccc ccg gcg cgg    230Met Ala Lys Asp Asp Gly Tyr Pro Pro Ala Arg1               5                  10acg ctg ccg gag acg ccg tcc tgg gcg gtg gcg ctg gtc ttc gcc gtc   278Thr Leu Pro Glu Thr Pro Ser Trp Ala Val Ala Leu Val Phe Ala Val15                      20              25atg atc atc gtc tcc gtc ctc ctg gag cac gcg ctc cac aag ctc ggc   326Met Ile Ile Val Ser Val Leu Leu Glu His Ala Leu His Lys Leu Gly30                      35              40cat tgg ttc cac aag cgg cac aag aac gcg ctg gcg gag gcg ctg gag   374His Trp Phe His Lys Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu45                      50              55aag atg aag gcg gag ctg atg ctg gtg gga ttc atc tcg ctg ctg ctc   422Lys Met Lys Ala Glu Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu60                      65              70                  75gcc gtc acg cag gac cca atc tcc ggg ata tgc atc tcc cag aag gcc   470Ala Val Thr Gln Asp Pro Ile Ser Gly Ile Cys Ile Ser Gln Lys Ala80                  85                  90gcc agc atc atg cgc ccc tgc aag gtg gaa ccc ggt tcc gtc aag agc   518Ala Ser Ile Met Arg Pro Cys Lys Val Glu Pro Gly Ser Val Lys Ser95                 100                 105aag tac aag gac tac tac tgc gcc aaa gag ggc aag gtg gcg ctc atg   566Lys Tyr Lys Asp Tyr Tyr Cys Ala Lys Glu Gly Lys Val Ala Leu Met110                 115                 120tcc acg ggc agc ctg cac cag ctc cac ata ttc atc ttc gtg cta gcc   614Ser Thr Gly Ser Leu His Gln Leu His Ile Phe Ile Phe Val Leu Ala125                 130                 135gtc ttc cat gtc acc tac agc gtc atc a tc atg gct cta agc cgt ctc   662Val Phe His Val Thr Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu140                 145                 150                 155aag atg aga aca tgg aag aaa tgg gag aca gaa acc gcc tcc ttg gaa   710Lys Met Arg Thr Trp Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu160                 165                 170tac cag ttc gca aat gat cct gcg cgg ttc cgc ttc acg cac cag acg   758Tyr Gln Phe Ala Asn Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr175                 180                 185tcg ttc gtg aag cgg cac ctg ggc ctg tcc agc acc ccc ggc gtc aga   806Ser Phe Val Lys Arg His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg190                 195                 200tgg gtg gtg gcc ttc ttc agg cag ttc ttc agg tcg gtc acc aag gtg   854Trp Val Val Ala Phe Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val205                 210                 215gac tac ctc acc ttg agg gca ggc ttc atc aac gcg cac ttg tcg cag   902Asp Tyr Leu Thr Leu Arg Ala Gly Phe Ile Asn Ala His Leu Ser Gln220                 225                 230                 235aac agc aag ttc gac ttc cac aag tac atc aag agg tcc atg gag gac   950Asn Ser Lys Phe Asp Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp240                 245                 250gac ttc aaa gtc gtc gtt ggc atc agc ctc ccg ctg tgg gct gtg gcg   998Asp Phe Lys Val Val Val Gly Ile Ser Leu Pro Leu Trp Ala Val Ala255                 260                 265atc ctc acc ctc ttc ctt gat atc gac ggg atc ggc aca ctc acc tgg   1046Ile Leu Thr Leu Phe Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp270                 275                 280gtt tct ttc atc cct ctc atc atc ctc ttg tgt gtt gga acc aag cta   1094Val Ser Phe Ile Pro Leu Ile Ile Leu Leu Cys Val Gly Thr Lys Leu285                 290                 295gag atg atc atc atg ggg atg gcc ctg gag atc cag gac cgg tcg agc   1142Glu Met Ile Ile Met Gly Met Ala Leu Glu Ile Gln Asp Arg Ser Ser300                 305                 310                 315gtc atc aag ggg gca ccc gtg gtc gag ccc agc aac aag ttc ttc tgg   1190Val Ile Lys Gly Ala Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp320                 325                 330ttc cac cgc ccc gac tgg gtc ctc ttc ttc ata cac ctg acg ctg ttc   1238Phe His Arg Pro Asp Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe335                 340                 345cag aac gcg ttt cag atg gca cat ttc gtg tgg aca gtg gcc acg ccc   1286Gln Asn Ala Phe Gln Met Ala His Phe Val Trp Thr Val Ala Thr Pro350                 355                 360ggc ttg aag gac tgc ttc cat atg aac atc ggg ctg agc atc atg aag   1334Gly Leu Lys Asp Cys Phe His Met Asn Ile Gly Leu Ser Ile Met Lys365                 370                 375gtc gtg ctg ggg ctg gct ctc cag ttc ctg tgc agc tac atc acc ttc   1382Val Val Leu Gly Leu Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe380                 385                 390                 395ccc ctc tac gcg cta gtc aca cag atg gga tca aac atg aag agg tcc   1430Pro Leu Tyr Ala Leu Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser400                 405                 410atc ttc gac gag cag aca gcc aag gcg ctg acc aac tgg cgg aac acg   1478Ile Phe Asp Glu Gln Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr415                 420                 425gcc aag gag aag aag aag gtc cga gac acg gac atg ctg atg gcg cag   1526Ala Lys Glu Lys Lys Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln430                 435                 440atg atc ggc gac gca aca ccc agc cga ggc acg tcc ccg atg cct agc   1574Met Ile Gly Asp Ala Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser445                 450                 455cgg ggc tca tcg ccg gtg cac ctg ctt cag aag ggc atg gga cgg tct    1622Arg Gly Ser Ser Pro Val His Leu Leu Gln Lys Gly Met Gly Arg Ser460                 465                 470                 475gac gat ccc cag agc gca ccg acc tcg cca agg acc atg gag gag gct   1670Asp Asp Pro Gln Ser Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala480                 485                 490agg gac atg tac ccg gtt gtg gtg gcg cat cct gta cac aga cta aat   1718Arg Asp Met Tyr Pro Val Val Val Ala His Pro Val His Arg Leu Asn495                 500                 505cct gct gac agg cgg agg tcg gtc tct tca tca gcc ctc gat gcc gac   1766Pro Ala Asp Arg Arg Arg Ser Val Ser Ser Ser Ala Leu Asp Ala Asp510                 515                 520atc ccc agc gca gat ttt tcc ttc agc cag gga tgagacaagt ttctgtattg 1819Ile Pro Ser Ala Asp Phe Ser Phe Ser Gln Gly525                 530atgttagtcc aatgtatagc caacatagga tgtgatgatt cgtacaataa gaaatacaat 1879tttttac                                                           1886 <210>8 <211>534 <212>PRT <213>Triticum sp. <400> 8MET ALA LY LY ASP ASP GLY TYR Pro Pro Ala Ala ARG THR Leu Glu THR1 5 10 15Pro Serp Ala Val Ala Val Met Ile Ile Val SELE Val SELE VAL GLU HIS ALA Leu His Lys Trp Phe His Lys35                  40                  45Arg His Lys Asn Ala Leu Ala Glu Ala Leu Glu Lys Met Lys Ala Glu50                  55                  60Leu Met Leu Val Gly Phe Ile Ser Leu Leu Leu Ala Val Thr Gln Asp65                  70                  75                  80Pro Ile Ser Gly Ile Cys Ile Ser Gln Lys Ala Ala Ser Ile Met Arg85                  90                  95Pro Cys Lys Val Glu Pro Gly Ser Val Lys Ser Lys Tyr Lys Asp Tyr100                 105                 110Tyr Cys Ala Lys Glu Gly Lys Val Ala Leu Met Ser Thr Gly Ser Leu115                 120                 125His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Phe His Val Thr130                 135                 140Tyr Ser Val Ile Ile Met Ala Leu Ser Arg Leu Lys Met Arg Thr Trp145                 150                 155                 160Lys Lys Trp Glu Thr Glu Thr Ala Ser Leu Glu Tyr Gln Phe Ala Asn165                 170                 175Asp Pro Ala Arg Phe Arg Phe Thr His Gln Thr Ser Phe Val Lys Arg180                 185                 190His Leu Gly Leu Ser Ser Thr Pro Gly Val Arg Trp Val Val Ala Phe195                 200                 205Phe Arg Gln Phe Phe Arg Ser Val Thr Lys Val Asp Tyr Leu Thr Leu210                 215                 220Arg Ala Gly Phe Ile Asn Ala His Leu Ser Gln Asn Ser Lys Phe Asp225                 230                 235                 240Phe His Lys Tyr Ile Lys Arg Ser Met Glu Asp Asp Phe Lys Val Val245                 250                 255Val Gly Ile Ser Leu Pro Leu Trp Ala Val Ala Ile Leu Thr Leu Phe260                 265                 270Leu Asp Ile Asp Gly Ile Gly Thr Leu Thr Trp Val Ser Phe Ile Pro275                 280                 285Leu Ile Ile Leu Leu Cys Val Gly Thr Lys Leu Glu Met Ile Ile Met290                 295                 300Gly Met Ala Leu Glu Ile Gln Asp Arg Ser Ser Val Ile Lys Gly Ala305                 310                 315                 320Pro Val Val Glu Pro Ser Asn Lys Phe Phe Trp Phe His Arg Pro Asp325                 330                 335Trp Val Leu Phe Phe Ile His Leu Thr Leu Phe Gln Asn Ala Phe Gln340                 345                 350Met Ala His Phe Val Trp Thr Val Ala Thr Pro Gly Leu Lys Asp Cys355                 360                 365Phe His Met Asn Ile Gly Leu Ser Ile Met Lys Val Val Leu Gly Leu370                 375                 380Ala Leu Gln Phe Leu Cys Ser Tyr Ile Thr Phe Pro Leu Tyr Ala Leu385                 390                 395                 400Val Thr Gln Met Gly Ser Asn Met Lys Arg Ser Ile Phe Asp Glu Gln405 410                 415Thr Ala Lys Ala Leu Thr Asn Trp Arg Asn Thr Ala Lys Glu Lys Lys420                 425                 430Lys Val Arg Asp Thr Asp Met Leu Met Ala Gln Met Ile Gly Asp Ala435                 440                 445Thr Pro Ser Arg Gly Thr Ser Pro Met Pro Ser Arg Gly Ser Ser Pro450                 455                 460Val His Leu Leu Gln Lys Gly Met Gly Arg Ser Asp Asp Pro Gln Ser465                 470                 475                 480Ala Pro Thr Ser Pro Arg Thr Met Glu Glu Ala Arg Asp Met Tyr Pro485                 490                 495Val Val Val Ala His Pro Val His Arg Leu Asn Pro Ala ASP ARG ARG500 505 510ARG Ser Val Ser Sera Leu ASP ALA Asp Ile Pro Sero Sera Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala ALA ALA ALA ALE515 <210>9 <211>2197 <212>DNA <213> Arabidopsis thaliana <220> <221> CDS <222>(331). . (2037) <220> <221>misc_feature <222>(589). . (627) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1603). . (1644) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400>9agtaatttag ctgttcttct acccctctga tctctcacag gggatcaaat agttttgata 60catagagcca caacagtgac attagtgtgt tgttgactac tgtaagggtt gggttttgaa 120aagagacatg aaggagtgtt attaggttga ttgtcttcaa gtacctccag tgtcaaacaa 180acattgacga ttgattctct tcccataatt tattgttatg cattacatat cacagtaaac 240ggactttcaa gtcaacaccg catttatttg ccctcttcat tgtttcacgt acgtaatcaa 300ggaccaaggg attttgttct tttggctacc atg gcc aca aga tgc ttt tgg tgt  354Met Ala Thr Arg Cys Phe Trp Cys1               5tgg acc act ttg ctc ttc tgc tct cag ctg ctt acc ggc ttt gcc cga   402Trp Thr Thr Leu Leu Phe Cys Ser Gln Leu Leu Thr Gly Phe Ala Arg10                  15                  20gct tcc tct gca ggc ggc gcc aaa gag aaa gga ctc tcc caa act ccc   450Ala Ser Ser Ala Gly Gly Ala Lys Glu Lys Gly Leu Ser Gln Thr Pro25                  30                  35                  40acc tgg gcc gtt gcc ctc gtc tgt acc ttt ttc att ctt gtc tcc gtc   498Thr Trp Ala Val Ala Leu Val Cys Thr Phe Phe Ile Leu Val Ser Val45                      50                  55ctt ctc gag aag gct ctt cac aga gtt gcc acg tgg ttg tgg gag aaa   546Leu Leu Glu Lys Ala Leu His Arg Val Ala Thr Trp Leu Trp Glu Lys60                  65                      70cat aag aac tct ctg ctt gaa gcc ttg gaa aaa ata aag gcc gag ctg   594His Lys Asn Ser Leu Leu Glu Ala Leu Glu Lys Ile Lys Ala Glu Leu75                  80                      85atg att cta gga ttc att tcc ttg ttg cta acc ttc gga gag cag tac   642Met Ile Leu Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Glu Gln Tyr90 95                     100att ctc aag att tgt att cct gaa aag gct gca gcc tct atg tta cct   690Ile Leu Lys Ile Cys Ile Pro Glu Lys Ala Ala Ala Ser Met Leu Pro105                 110                 115                 120tgt cca gct cct tct act cat gac caa gac aag acc cac cgc aga cgt   738Cys Pro Ala Pro Ser Thr His Asp Gln Asp Lys Thr His Arg Arg Arg125                 130                 135cta gct gct gct acg acc tct tcc cgc tgc gat gag ggt cat gaa cca   786Leu Ala Ala Ala Thr Thr Ser Ser Arg Cys Asp Glu Gly His Glu Pro140 145                 150ctc ata cct gcc acg ggt ttg cac cag cta cac att cta ttg ttc ttc   834Leu Ile Pro Ala Thr Gly Leu His Gln Leu His Ile Leu Leu Phe Phe155                 160                 165atg gct gcc ttt cat atc ctc tac agt ttc atc acc atg atg ctt ggc 882Met Ala Ala Phe His Ile Leu Tyr Ser Phe Ile Thr Met Met Leu Gly170                 175                 180aga ctc aag atc cgt ggc tgg aaa aag tgg gag cag gag aca tgt tct   930Arg Leu Lys Ile Arg Gly Trp Lys Lys Trp Glu Gln Glu Thr Cys Ser185                 190 195                 200cat gat tac gag ttt tca atc gat cca tca aga ttc aga ctc act cat   978His Asp Tyr Glu Phe Ser Ile Asp Pro Ser Arg Phe Arg Leu Thr His205                 210                 215gag acg tcc ttt gtt aga caa cat tcc agt ttc tgg aca aaa atc ccc 1026Glu Thr Ser Phe Val Arg Gln His Ser Ser Phe Trp Thr Lys Ile Pro220                 225                 230ttc ttc ttt tat gct ggg tgc ttc cta cag cag ttt ttc cga tct gtc   1074Phe Phe Phe Tyr Ala Gly Cys Phe Leu Gln Gln Phe Phe Arg Ser Val235                 240 245ggg agg act gac tac tta act ctg cgc cat ggc ttc atc gct gcc cat   1122Gly Arg Thr Asp Tyr Leu Thr Leu Arg His Gly Phe Ile Ala Ala His250                 255                 260tta gct cca gga aga aag ttc gac ttc cag aag tat atc aaa aga tca   1170Leu Ala Pro Gly Arg Lys Phe Asp Phe Gln Lys Tyr Ile Lys Arg Ser265                 270                 275                    280ttg gaa gac gat ttc aag gtg gta gtt gga ata agt cct ctt ttg tgg   1218Leu Glu Asp Asp Phe Lys Val Val Val Gly Ile Ser Pro Leu Leu Trp285                 290 295gca tca ttt gta att ttc cta ctt ctg aat gtt aat ggc tgg gaa gca   1266Ala Ser Phe Val Ile Phe Leu Leu Leu Asn Val Asn Gly Trp Glu Ala300                 305                 310ttg ttt tgg gcg tca atc cta cct gta ctt atc att cta gct gtc agt   1314Leu Phe Trp Ala Ser Ile Leu Pro Val Leu Ile Ile Leu Ala Val Ser315                 320                 325acg aag ctt caa gcg atc cta aca aga atg gct ctg gga atc acg gag   1362Thr Lys Leu Gln Ala Ile Leu Thr Arg Met Ala Leu Gly Ile Thr Glu330                 335                 340aga cac gca gtt gtt caa ggg ata cct ctc gtg cat ggt tca gat aag   1410Arg His Ala Val Val Gln Gly Ile Pro Leu Val His Gly Ser Asp Lys345                 350                 355                 360tac ttt tgg ttt aat cgc cct cag ttg cta ctt cat ctt ctt cac ttc   1458Tyr Phe Trp Phe Asn Arg Pro Gln Leu Leu Leu His Leu Leu His Phe365                 370                 375gcc tta ttt cag aat gct ttc cag cta aca tac ttc ttc tgg gtc tgg   1506Ala Leu Phe Gln Asn Ala Phe Gln Leu Thr Tyr Phe Phe Trp Val Trp380                 385                 390tat tcc ttt ggg cta aaa tct tgc ttt cac acg gat ttc aaa cta gtc   1554Tyr Ser Phe Gly Leu Lys Ser Cys Phe His Thr Asp Phe Lys Leu Val395                 400                 405atc gta aaa ctc tct cta ggc gtt gga gct ttg att ttg tgc agc tac   1602Ile Val Lys Leu Ser Leu Gly Val Gly Ala Leu Ile Leu Cys Ser Tyr410                 415                 420atc aca ctt cct ttg tat gca cta gtt act cag atg ggt tca aac atg   1650Ile Thr Leu Pro Leu Tyr Ala Leu Val Thr Gln Met Gly Ser Asn Met425                 430                 435                 440aag aaa gct gtg ttt gat gag caa atg gca aaa gcg ttg aag aaa tgg   1698Lys Lys Ala Val Phe Asp Glu Gln Met Ala Lys Ala Leu Lys Lys Trp445                 450                 455cac atg act gtg aag aag aag aaa ggc aaa gcg aga aag cca cca aca   1746His Met Thr Val Lys Lys Lys Lys Gly Lys Ala Arg Lys Pro Pro Thr460                 465                 470gag acc ctt ggt gtt tct gac act gtc agc acc tct acc tca tcc ttt   1794Glu Thr Leu Gly Val Ser Asp Thr Val Ser Thr Ser Thr Ser Ser Phe475                 480                 485cac gcc tct gga gcc act cta ctc cgc tcc aag acc act ggt cac tcg   1842His Ala Ser Gly Ala Thr Leu Leu Arg Ser Lys Thr Thr Gly His Ser490                 495                 500aca gcc tct tat atg agt aat ttc gag gac caa agc atg tct gat ctt   1890Thr Ala Ser Tyr Met Ser Asn Phe Glu Asp Gln Ser Met Ser Asp Leu505                 510                 515                 520gaa gct gag cca tta tcc cct gaa cca ata gag ggg cac act ctc gtc   1938Glu Ala Glu Pro Leu Ser Pro Glu Pro Ile Glu Gly His Thr Leu Val525                 530                 535agg gtt ggt gat cag aac aca gag ata gaa tat act gga gat att agt   1986Arg Val Gly Asp Gln Asn Thr Glu Ile Glu Tyr Thr Gly Asp Ile Ser540                  545                550cct gga aac caa ttc tcc ttt gtg aag aac gtt cct gct aat gat att   2034Pro Gly Asn Gln Phe Ser Phe Val Lys Asn Val Pro Ala Asn Asp Ile555                 560                 565gac taatattcaa aatgaatgca gaacaaatcc atcatccggt ctttattttc        2087Asptattacatgt atgccaacaa ttgcttcgcc aagtgttacc aactaggttt tctgtataag 2147gctgtatttt agagctaaaa aaaaaaaaaa aaaaaaaaaa ctaaattact            2197 <210>10 <211>569 <212>PRT <213> Arabidopsis thaliana <400>10Met Ala Thr Arg Cys Phe Trp Cys Trp Thr Thr Leu Leu Phe Cys Ser1               5                  10                  15Gln Leu Leu Thr Gly Phe Ala Arg Ala Ser Ser Ala Gly Gly Ala Lys20                  25                  30Glu Lys Gly Leu Ser Gln Thr Pro Thr Trp Ala Val Ala Leu Val Cys35                  40                  45Thr Phe Phe Ile Leu Val Ser Val Leu Leu Glu Lys Ala Leu His Arg50                  55                  60Val Ala Thr Trp Leu Trp Glu Lys His Lys Asn Ser Leu Leu Glu Ala65                  70                  75                  80Leu Glu Lys Ile Lys Ala Glu Leu Met Ile Leu Gly Phe Ile Ser Leu85                  90                  95Leu Leu Thr Phe Gly Glu Gln Tyr Ile Leu Lys Ile Cys Ile Pro Glu100                 105                 110Lys Ala Ala Ala Ser Met Leu Pro Cys Pro Ala Pro Ser Thr His Asp115                 120                 125Gln Asp Lys Thr His Arg Arg Arg Leu Ala Ala Ala Thr Thr Ser Ser130                 135                 140Arg Cys Asp Glu Gly His Glu Pro Leu Ile Pro Ala Thr Gly Leu His145                 150                 155                 160Gln Leu His Ile Leu Leu Phe Phe Met Ala Ala Phe His Ile Leu Tyr165                 170                 175Ser Phe Ile Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly Trp Lys180                 185                 190Lys Trp Glu Gln Glu Thr Cys Ser His Asp Tyr Glu Phe Ser Ile Asp195                 200                 205Pro Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg Gln His210                 215                 220Ser Ser Phe Trp Thr Lys Ile Pro Phe Phe Phe Tyr Ala Gly Cys Phe225                 230                 235                 240Leu Gln Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu Thr Leu245                 250                 255Arg His Gly Phe Ile Ala Ala His Leu Ala Pro Gly Arg Lys Phe Asp260                 265                 270Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys Val Val275                 280                 285Val Gly Ile Ser Pro Leu Leu Trp Ala Ser Phe Val Ile Phe Leu Leu290                 295                 300Leu Asn Val Asn Gly Trp Glu Ala Leu Phe Trp Ala Ser Ile Leu Pro305                 310                 315                 320Val Leu Ile Ile Leu Ala Val Ser Thr Lys Leu Gln Ala Ile Leu Thr325                 330                 335Arg Met Ala Leu Gly Ile Thr Glu Arg His Ala Val Val Gln Gly Ile340                 345                 350Pro Leu Val His Gly Ser Asp Lys Tyr Phe Trp Phe Asn Arg Pro Gln355                 360                 365Leu Leu Leu His Leu Leu His Phe Ala Leu Phe Gln Asn Ala Phe Gln370                 375                 380Leu Thr Tyr Phe Phe Trp Val Trp Tyr Ser Phe Gly Leu Lys Ser Cys385                 390                 395                 400Phe His Thr Asp Phe Lys Leu Val Ile Val Lys Leu Ser Leu Gly Val405 410                 415Gly Ala Leu Ile Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr Ala Leu420                 425                 430Val Thr Gln Met Gly Ser Asn Met Lys Lys Ala Val Phe Asp Glu Gln435                 440                 445Met Ala Lys Ala Leu Lys Lys Trp His Met Thr Val Lys Lys Lys Lys450                 455                 460Gly Lys Ala Arg Lys Pro Pro Thr Glu Thr Leu Gly Val Ser Asp Thr465                 470                 475                 480Val Ser Thr Ser Thr Ser Ser Phe His Ala Ser Gly Ala Thr Leu Leu                485                 490                 495Arg Ser Lys Thr Thr Gly His Ser Thr Ala Ser Tyr Met Ser Asn Phe500                 505                 510Glu Asp Gln Ser Met Ser Asp Leu Glu Ala Glu Pro Leu Ser Pro Glu515                 520                 525Pro Ile Glu Gly His Thr Leu Val Arg Val Gly Asp Gln Asn Thr Glu530                 535                 540Ile Glu Tyr Thr Gly Asp Ile Ser Pro Gly Asn Gln Phe Ser Phe Val545 550 555 560Lys Asn Val Pro Ala Asn Asp Ile Asp565 <210>11 <211>1935 <212>DNA <213> Arabidopsis thaliana <220> <221> CDS <222>(89). . (1807) <220> <221>misc_feature <222>(269). . (307) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1370). . (1411) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400>11cagtgtgagt aatttagtaa aaagacaaga tctctggtct ggaattagaa gaatcttatt 60tgggtttttt tcttaggatt aagctcta atg gca gat caa gta aaa gag cgg    112Met Ala Asp Gln Val Lys Glu Arg1              5act tta gag gag acc tct acg tgg gca gtt gct gtt gtt tgc ttt gtc   160Thr Leu Glu Glu Thr Ser THR TRP ALA Val Ala Val Val Cys PHE VAL10 15 20TTA CTC TTT ATT TCG ATT GTC GAA CAA CAA CAA CAC AAA ATT GGA 208leu PHE Ile Val Leu His LYS Ile Gly25 35 40 40 40 40 40 40 40Acly 25 40 40 40 40 40 40 40 40Acly 25 40 40 40 40 40 40 40 40 40 40Acly 25 40 40 40 40 40 40 40 40 40 40 40 40 40 40 40 40 40 40Acly 25 aag cac aag cag gct ctt ttt gaa gct ctt gaa   256Thr Trp Phe Lys Lys Lys His Lys Gln Ala Leu Phe Glu Ala Leu Glu45                  50                  55aag gtc aaa gca gag ctt atg ctg ttg gga ttc ata tca cta cta ctc   304Lys Val Lys Ala Glu Leu Met Leu Leu Gly Phe Ile Ser Leu Leu Leu60                  65                  70aca att gga caa aca cca atc tca aat atc tgc atc tcc cag aaa gtt   352Thr Ile Gly Gln Thr Pro Ile Ser Asn Ile Cys Ile Ser Gln Lys Val75                  80                  85gcg tca aca atg cac cct tgc agt gct gct gaa gaa gct aaa aaa tac   400Ala Ser Thr Met His Pro Cys Ser Ala Ala Glu Glu Ala Lys Lys Tyr90                  95                 100ggc aag aaa gac gcc gga aag aaa gat gat gga gat gga gat aaa ccc   448Gly Lys Lys Asp Ala Gly Lys Lys Asp Asp Gly Asp Gly Asp Lys Pro105                 110                 115                 120ggt cga aga ctt ctt ctt gag tta gct gaa tct tat atc cat aga aga   496Gly Arg Arg Leu Leu Leu Glu Leu Ala Glu Ser Tyr Ile His Arg Arg125                 130                 135agt tta gcc acc aaa ggc tat gac aaa tgt gca gag aag ggg aaa gtg   544Ser Leu Ala Thr Lys Gly Tyr Asp Lys Cys Ala Glu Lys Gly Lys Val140                 145                 150gct ttt gta tct gct tat gga atc cac cag ctg cat ata ttc atc ttc   592Ala Phe Val Ser Ala Tyr Gly Ile His Gln Leu His Ile Phe Ile Phe155                 160                 165gtg ctc gcg gtt gtt cat gtt gtt tac tgc att gtt act tat gct ttc   640Val Leu Ala Val Val His Val Val Tyr Cys Ile Val Thr Tyr Ala Phe170                 175                 180gga aag atc aag atg agg acg tgg aag tcg tgg gag gaa gag aca aag   688Gly Lys Ile Lys Met Arg Thr Trp Lys Ser Trp Glu Glu Glu Thr Lys185                 190                 195                 200aca ata gag tat cag tat tcc aac gat cct gag agg ttc agg ttt gcg   736Thr Ile Glu Tyr Gln Tyr Ser Asn Asp Pro Glu Arg Phe Arg Phe Ala205                 210                 215agg gac aca tct ttt ggg aga aga cat ctc aat ttc tgg agc aag acg   784Arg Asp Thr Ser Phe Gly Arg Arg His Leu Asn Phe Trp Ser Lys Thr220                 225                 230aga gtc aca cta tgg att gtt tgt ttt ttt aga cag ttc ttt gga tct   832Arg Val Thr Leu Trp Ile Val Cys Phe Phe Arg Gln Phe Phe Gly Ser235                 240                 245gtc acc aaa gtt gat tac tta gca cta aga cat ggt ttc atc atg gcg   880Val Thr Lys Val Asp Tyr Leu Ala Leu Arg His Gly Phe Ile Met Ala250                 255                 260cat ttt gct ccc ggt aac gaa tca agattc gat ttc cgc aag tat att   928His Phe Ala Pro Gly Asn Glu Ser Arg Phe Asp Phe Arg Lys Tyr Ile265                 270                 275                 280cag aga tca tta gag aaa gac ttc aaa acc gtt gtt gaa atc agt ccg   976Gln Arg Ser Leu Glu Lys Asp Phe Lys Thr Val Val Glu Ile Ser Pro285                 290                 295gtt atc tgg ttt gtc gct gtg cta ttc ctc ttg acc aat tca tat gga   1024Val Ile Trp Phe Val Ala Val Leu Phe Leu Leu Thr Asn Ser Tyr Gly300                 305                 310tta cgt tct tac ctc tgg tta cca ttc att cca cta gtc gta att cta   1072Leu Arg Ser Tyr Leu Trp Leu Pro Phe Ile Pro Leu Val Val Ile Leu315                 320                 325ata gtt gga aca aag ctt gaa gtc ata ata aca aaa ttg ggt cta aga   1120Ile Val Gly Thr Lys Leu Glu Val Ile Ile Thr Lys Leu Gly Leu Arg330                 335                 340atc caa gag aaa ggt gat gtg gtg aga ggc gcc cca gtg gtt cag cct   1168Ile Gln Glu Lys Gly Asp Val Val Arg Gly Ala Pro Val Val Gln Pro345                 350                 355                 360ggt gat gac ctc ttc tgg ttt ggc aag cca cgc ttc att ctt ttc ctt   1216Gly Asp Asp Leu Phe Trp Phe Gly Lys Pro Arg Phe Ile Leu Phe Leu365                 370                 375att cac ttg gtc ctt ttt acg aat gca ttt caa ctt gca ttc ttt gcc   1264Ile His Leu Val Leu Phe Thr Asn Ala Phe Gln Leu Ala Phe Phe Ala380                 385                 390tgg agt acg tat gaa ttc aat ctc aat aat tgt ttc cat gaa agc act   1312Trp Ser Thr Tyr Glu Phe Asn Leu Asn Asn Cys Phe His Glu Ser Thr395                 400                 405gca gat gtg gtc att aga ctt gta gtt gga gct gtt gtg cag ata ctt   1360Ala Asp Val Val Ile Arg Leu Val Val Gly Ala Val Val Gln Ile Leu410                 415                 420tgc agc tat gtg act ctt cca ctc tat gca ctt gtt act cag atg ggt   1408Cys Ser Tyr Val Thr Leu Pro Leu Tyr Ala Leu Val Thr Gln Met Gly425                 430                 435                 440agt aaa atg aag cca aca gta ttc aac gat aga gta gcc acg gca tta   1456Ser Lys Met Lys Pro Thr Val Phe Asn Asp Arg Val Ala Thr Ala Leu445                 450                 455aag aag tgg cat cac act gca aag aac gag acg aaa cac gga aga cac   1504Lys Lys Trp His His Thr Ala Lys Asn Glu Thr Lys His Gly Arg His460                 465                 470tcg gga tcc aat aca cct ttc tct agc cgt cca act aca cca aca cat   1552Ser Gly Ser Asn Thr Pro Phe Ser Ser Arg Pro Thr Thr Pro Thr His475                 480                 485ggc tca tct cca atc cat ctc ctt cac aat ttc aat aac cgg agc gtt   1600Gly Ser Ser Pro Ile His Leu Leu His Asn Phe Asn Asn Arg Ser Val490                 495                 500gaa aat tac cca agt tct cct tct cct aga tac tct ggt cat ggt cat   1648Glu Asn Tyr Pro Ser Ser Pro Ser Pro Arg Tyr Ser Gly His Gly His505                 510                 515                 520cat gaa cac caa ttt tgg gat cct gag tct caa cac caa gaa gct gaa   1696His Glu His Gln Phe Trp Asp Pro Glu Ser Gln His Gln Glu Ala Glu525                 530                 535act tcc aca cat cat tct ctt gcg cat gaa agc tca gaa cct gtt ctt   1744Thr Ser Thr His His Ser Leu Ala His Glu Ser Ser Glu Pro Val Leu540                 545                 550gca tct gtg gaa ctt cct cct ata agg act agc aaa agc tta aga gat   1792Ala Ser Val Glu Leu Pro Pro Ile Arg Thr Ser Lys Ser Leu Arg Asp555                 560                 565ttt tct ttt aag aaa tgatgattct tgtttgctat atttgatttc gtacagtggg   1847Phe Ser Phe Lys Lys570aattttgtca tatgaaaata atttcttgta cattactagt ttgataagaa ataaccatat 1907ctatatggat acaaaaaaaa aaaaaaaa <210>12 <211>573 <212>PRT <213> Arabidopsis thaliana <400> 12MET ALA ASLN VAL LYS GLU ARG THR Leu Glu THR Serp1 5 10 15ALA Val Ala Val Val Leu Phe Ile Val Leu20 25 30glu see pHRS LYS LYS ILE GLLT THR THR TRT LYS LYS HIS LYS30 45GLN ALA Leu Phe Glu Ala Leu Lys Val Lys Ala Glu Leu Met Leu 50 60leu Gly PHE ILEU Leu Thr Ile Gln THRLN THRLLLLLSN Ile Cys Ile Cys Ile Cys Thr Met His Pro Cys Ser85                  90                  95Ala Ala Glu Glu Ala Lys Lys Tyr Gly Lys Lys Asp Ala Gly Lys Lys            100                 105                 110Asp Asp Gly Asp Gly Asp Lys Pro Gly Arg Arg Leu Leu Leu Glu Leu115                 120                 125Ala Glu Ser Tyr Ile His Arg Arg Ser Leu Ala Thr Lys Gly Tyr Asp130                 135                 140Lys Cys Ala Glu Lys Gly Lys Val Ala Phe Val Ser Ala Tyr Gly Ile145                 150                 155                 160His Gln Leu His Ile Phe Ile Phe Val Leu Ala Val Val His Val Val165                 170                 175Tyr Cys Ile Val Thr Tyr Ala Phe Gly Lys Ile Lys Met Arg Thr Trp180                 185                 190Lys Ser Trp Glu Glu Glu Thr Lys Thr Ile Glu Tyr Gln Tyr Ser Asn195                 200                 205Asp Pro Glu Arg Phe Arg Phe Ala Arg Asp Thr Ser Phe Gly Arg Arg210                 215                 220His Leu Asn Phe Trp Ser Lys Thr Arg Val Thr Leu Trp Ile Val Cys225                 230                 235                 240Phe Phe Arg Gln Phe Phe Gly Ser Val Thr Lys Val Asp Tyr Leu Ala245                 250                 255Leu Arg His Gly Phe Ile Met Ala His Phe Ala Pro Gly Asn Glu Ser260                 265                 270Arg Phe Asp Phe Arg Lys Tyr Ile Gln Arg Ser Leu Glu Lys Asp Phe275                 280                 285Lys Thr Val Val Glu Ile Ser Pro Val Ile Trp Phe Val Ala Val Leu290                 295                 300Phe Leu Leu Thr Asn Ser Tyr Gly Leu Arg Ser Tyr Leu Trp Leu Pro305                 310                 315                 320Phe Ile Pro Leu Val Val Ile Leu Ile Val Gly Thr Lys Leu Glu Val325                 330                 335Ile Ile Thr Lys Leu Gly Leu Arg Ile Gln Glu Lys Gly Asp Val Val340                 345                 350Arg Gly Ala Pro Val Val Gln Pro Gly Asp Asp Leu Phe Trp Phe Gly355                 360 365Lys Pro Arg Phe Ile Leu Phe Leu Ile His Leu Val Leu Phe Thr Asn370                 375                 380Ala Phe Gln Leu Ala Phe Phe Ala Trp Ser Thr Tyr Glu Phe Asn Leu385                 390                 395                 400Asn Asn Cys Phe His Glu Ser Thr Ala Asp Val Val Ile Arg Leu Val405                 410                 415Val Gly Ala Val Val Gln Ile Leu Cys Ser Tyr Val Thr Leu Pro Leu420                 425                 430Tyr Ala Leu Val Thr Gln Met Gly Ser Lys Met Lys Pro Thr Val Phe435                 440                 445Asn Asp Arg Val Ala Thr Ala Leu Lys Lys Trp His His Thr Ala Lys450                 455                 460Asn Glu Thr Lys His Gly Arg His Ser Gly Ser Asn Thr Pro Phe Ser465                 470                 475                 480Ser Arg Pro Thr Thr Pro Thr His Gly Ser Ser Pro Ile His Leu Leu485                 490                 495His Asn Phe Asn Asn Arg Ser Val Glu Asn Tyr Pro Ser Ser Pro Ser500                 505                 510Pro Arg Tyr Ser Gly His Gly His His Glu His Gln Phe Trp Asp Pro515                 520                 525Glu Ser Gln His Gln Glu Ala Glu Thr Ser Thr His His Ser Leu Ala530                 535                 540His Glu Ser Ser Glu Pro Val Leu Ala Ser Val Glu Leu Pro Pro Ile545 550 555 560Arg Thr Ser Lys Ser Leu Arg Asp Phe Ser Phe Lys Lys565 5 7   <210>13 <211>1811 <212>DNA <213> Arabidopsis thaliana <220> <221> CDS <222>(56). . (1633) <220> <221>misc_feature <222>(236). . (274) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1328). . (1369) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400>13gttcccagat tcatctttac ttattgtcta aattctctct ggtgtgagaa gtaaa atg  58Met1ggt cac gga gga gaa ggg atg tcg ctt gaa ttc act ccg acg tgg gtc   106Gly His Gly Gly Glu Gly Met Ser Leu Glu Phe Thr Pro Thr Trp Val5                  10                  15gtc gcc gga gtt tgt acg gtc atc gtc gcg att tca ctg gcg gtg gag   154Val Ala Gly Val Cys Thr Val Ile Val Ala Ile Ser Leu Ala Val Glu20                  25                  30cgt ttg ctt cac tat ttc ggt act gtt ctt aag aag aag aag caa aaa   202Arg Leu Leu His Tyr Phe Gly Thr Val Leu Lys Lys Lys Lys Gln Lys35                  40                  45ccc ctt tac gaa gcc ctt caa aag gtt aaa gaa gag ctg atg ttg tta   250Pro Leu Tyr Glu Ala Leu Gln Lys Val Lys Glu Glu Leu Met Leu Leu50                  55                  60                  65ggg ttt ata tcg ctg tta ctg acg gta ttc caa ggg ctc att tcc aaa   298Gly Phe Ile Ser Leu Leu Leu Thr Val Phe Gln Gly Leu Ile Ser Lys70                  75                  80ttc tgt gtg aaa gaa aat gtg ctt atg cat atg ctt cca tgt tct ctc   346Phe Cys Val Lys Glu Asn Val Leu Met His Met Leu Pro Cys Ser Leu85                  90                  95gat tca aga cga gaa gct ggg gca agt gaa cat aaa aac gtt aca gca   394Asp Ser Arg Arg Glu Ala Gly Ala Ser Glu His Lys Asn Val Thr Ala100                 105                 110aaa gaa cat ttt cag act ttt tta cct att gtt gga acc act agg cgt   442Lys Glu His Phe Gln Thr Phe Leu Pro Ile Val Gly Thr Thr Arg Arg115                 120                 125cta ctt gct gaa cat gct gct gtg caa gtt ggt tac tgt agc gaa aag   490Leu Leu Ala Glu His Ala Ala Val Gln Val Gly Tyr Cys Ser Glu Lys130                 135                 140                 145ggt aaa gta cca ttg ctt tcg ctt gag gca ttg cac cat cta cat att   538Gly Lys Val Pro Leu Leu Ser Leu Glu Ala Leu His His Leu His Ile150                 155                 160ttc atc ttc gtc ctc gcc ata tcc cat gtg aca ttc tgt gtc ctt acc   586Phe Ile Phe Val Leu Ala Ile Ser His Val Thr Phe Cys Val Leu Thr165                 170                 175gtg att ttt gga agc aca agg att cac caa tgg aag aaa tgg gag gat   634Val Ile Phe Gly Ser Thr Arg Ile His Gln Trp Lys Lys Trp Glu Asp180                 185                 190tcg atc gca gat gag aag ttt gac ccc gaa aca gct ctc agg aaa aga   682Ser Ile Ala Asp Glu Lys Phe Asp Pro Glu Thr Ala Leu Arg Lys Arg195                 200                 205agg gtc act cat gta cac aac cat gct ttt att aaa gag cat ttt ctt   730Arg Val Thr His Val His Asn His Ala Phe Ile Lys Glu His Phe Leu210                 215                 220                 225ggt att ggc aaa gat tca gtc atc ctc gga tgg acg caa tcc ttt ctc   778Gly Ile Gly Lys Asp Ser Val Ile Leu Gly Trp Thr Gln Ser Phe Leu230                 235                 240aag caa ttc tat gat tct gtg acg aaa tca gat tac gtg act tta cgt   826Lys Gln Phe Tyr Asp Ser Val Thr Lys Ser Asp Tyr Val Thr Leu Arg245                 250                 255ctt ggt ttc att atg aca cat tgt aag gga aac ccc aag ctt aat ttc   874Leu Gly Phe Ile Met Thr His Cys Lys Gly Asn Pro Lys Leu Asn Phe260                 265                 270cac aag tat atg atg cgc gct yta gag gat gat ttc aaa caa gtt gtt   922His Lys Tyr Met Met Arg Ala Xaa Glu Asp Asp Phe Lys Gln Val Val275                 280                 285ggt att agt tgg tat ctt tgg atc ttt gtc gtc atc ttt ttg ctg cta   970Gly Ile Ser Trp Tyr Leu Trp Ile Phe Val Val Ile Phe Leu Leu Leu290                 295                 300                 305aat gtt aac gga tgg cac aca tat ttc tgg ata gca ttt att ccc ttt   1018Asn Val Asn Gly Trp His Thr Tyr Phe Trp Ile Ala Phe Ile Pro Phe310                 315                 320gct ttg ctt ctt gct gtg gga aca aag ttg gag cat gtg att gca cag   1066Ala Leu Leu Leu Ala Val Gly Thr Lys Leu Glu His Val Ile Ala Gln325                 330                 335tta gct cat gaa gtt gca gag aaa cat gta gcc att gaa gga gac tta   1114Leu Ala His Glu Val Ala Glu Lys His Val Ala Ile Glu Gly Asp Leu340                 345                 350gtg gtg aaa ccc tca gat gag cat ttc tgg ttc agc aaa cct caa att   1162Val Val Lys Pro Ser Asp Glu His Phe Trp Phe Ser Lys Pro Gln Ile355                 360                 365gtt ctc tac ttg atc cat ttt atc ctc ttc cag aat gct ttt gag att   1210Val Leu Tyr Leu Ile His Phe Ile Leu Phe Gln Asn Ala Phe Glu Ile370                 375                 380                 385gcg ttt ttc ttt tgg att tgg gtt aca tac ggc ttc gac tcg tgc att   1258Ala Phe Phe Phe Trp Ile Trp Val Thr Tyr Gly Phe Asp Ser Cys Ile390                 395                 400atg gga cag gtg aga tac att gtt cca aga ttg gtt atc ggg gtc ttc   1306Met Gly Gln Val Arg Tyr Ile Val Pro Arg Leu Val Ile Gly Val Phe405                 410                 415att caa gtg ctt tgc agt tac agt aca ctg cct ctt tac gcc atc gtc   1354Ile Gln Val Leu Cys Ser Tyr Ser Thr Leu Pro Leu Tyr Ala Ile Val420                 425                 430tca cag atg gga agt agc ttc aag aaa gct ata ttc gag gag aat gtg   1402Ser Gln Met Gly Ser Ser Phe Lys Lys Ala Ile Phe Glu Glu Asn Val435                 440                 445cag gtt ggt ctt gtt ggt tgg gca cag aaa gtg aaa caa aag aga gac   1450Gln Val Gly Leu Val Gly Trp Ala Gln Lys Val Lys Gln Lys Arg Asp450                 455                 460                 465cta aaa gct gca gct agt aat gga aac gaa gga agc tct cag gct ggt 1498Leu Lys Ala Ala Ala Ser Asn Gly Asn Glu Gly Ser Ser Gln Ala Gly470                 475                 480cct ggt cct gat tct ggt tct ggt tct gct cct gct gct ggt cct ggt   1546Pro Gly Pro Asp Ser Gly Ser Gly Ser Ala Pro Ala Ala Gly Pro Gly485                 490                 495gca ggt ttt gca gga att cag ctc agc aga gta aca aga aac aac gca   1594Ala Gly Phe Ala Gly Ile Gln Leu Ser Arg Val Thr Arg Asn Asn Ala500                 505                 510ggg gac aca aac aat gag att aca cct gat cat aac aac tgagcagaga    1643Gly Asp Thr Asn Asn Glu Ile Thr Pro Asp His Asn Asn515                 520 525tattatcttt tccatttaga ggatcatcat cagattttag cttcaaggtc cggttttgtg 1703gtttatacat aagttatagt gacttgattt ttttgttttg ttacaaagtt accatctttg 1763gattagaatt gggaaattga atctgtttgt aaaaaaaaaa aaaaaaaa              1811 <210>14 <211>526 <212>PRT <213> Arabidopsis thaliana <400>14Met Gly His Gly Gly Glu Gly Met Ser Leu Glu Phe Thr Pro Thr Trp1               5                  10                  15Val Val Ala Gly Val Cys Thr Val Ile Val Ala Ile Ser Leu Ala Val20                  25                  30Glu Arg Leu Leu His Tyr Phe Gly Thr Val Leu Lys Lys Lys Lys Gln35                  40                  45Lys Pro Leu Tyr Glu Ala Leu Gln Lys Val Lys Glu Glu Leu Met Leu50                  55                  60Leu Gly Phe Ile Ser Leu Leu Leu Thr Val Phe Gln Gly Leu Ile Ser65                  70                  75                  80Lys Phe Cys Val Lys Glu Asn Val Leu Met His Met Leu Pro Cys Ser85                  90                  95Leu Asp Ser Arg Arg Glu Ala Gly Ala Ser Glu His Lys Asn Val Thr100                 105                 110Ala Lys Glu His Phe Gln Thr Phe Leu Pro Ile Val Gly Thr Thr Arg115                 120                 125Arg Leu Leu Ala Glu His Ala Ala Val Gln Val Gly Tyr Cys Ser Glu130                 135                 140Lys Gly Lys Val Pro Leu Leu Ser Leu Glu Ala Leu His His Leu His145                 150                 155                 160Ile Phe Ile Phe Val Leu Ala Ile Ser His Val Thr Phe Cys Val Leu165                 170                 175Thr Val Ile Phe Gly Ser Thr Arg Ile His Gln Trp Lys Lys Trp Glu180                 185                 190Asp Ser Ile Ala Asp Glu Lys Phe Asp Pro Glu Thr Ala Leu Arg Lys195                 200                 205Arg Arg Val Thr His Val His Asn His Ala Phe Ile Lys Glu His Phe210                 215                 220Leu Gly Ile Gly Lys Asp Ser Val Ile Leu Gly Trp Thr Gln Ser Phe225                 230                 235                 240Leu Lys Gln Phe Tyr Asp Ser Val Thr Lys Ser Asp Tyr Val Thr Leu245                 250                 255Arg Leu Gly Phe Ile Met Thr His Cys Lys Gly Asn Pro Lys Leu Asn260                 265                 270Phe His Lys Tyr Met Met Arg Ala Xaa Glu Asp Asp Phe Lys Gln Val275                 280                 285Val Gly Ile Ser Trp Tyr Leu Trp Ile Phe Val Val Ile Phe Leu Leu290                 295                 300Leu Asn Val Asn Gly Trp His Thr Tyr Phe Trp Ile Ala Phe Ile Pro305                 310                 315                 320Phe Ala Leu Leu Leu Ala Val Gly Thr Lys Leu Glu His Val Ile Ala325                 330                 335Gln Leu Ala His Glu Val Ala Glu Lys His Val Ala Ile Glu Gly Asp340                 345                 350Leu Val Val Lys Pro Ser Asp Glu His Phe Trp Phe Ser Lys Pro Gln355                 360                 365Ile Val Leu Tyr Leu Ile His Phe Ile Leu Phe Gln Asn Ala Phe Glu370                 375                 380Ile Ala Phe Phe Phe Trp Ile Trp Val Thr Tyr Gly Phe Asp Ser Cys385                 390                 395                 400Ile Met Gly Gln Val Arg Tyr Ile Val Pro Arg Leu Val Ile Gly Val405 410                 415Phe Ile Gln Val Leu Cys Ser Tyr Ser Thr Leu Pro Leu Tyr Ala Ile420                 425                 430Val Ser Gln Met Gly Ser Ser Phe Lys Lys Ala Ile Phe Glu Glu Asn435                 440                 445Val Gln Val Gly Leu Val Gly Trp Ala Gln Lys Val Lys Gln Lys Arg450                 455                 460Asp Leu Lys Ala Ala Ala Ser Asn Gly Asn Glu Gly Ser Ser Gln Ala465                 470                 475                 480Gly Pro Gly Pro Asp Ser Gly Ser Gly Ser Ala Pro Ala Ala Gly Pro485                 490                 495Gly Ala Gly Phe Ala Gly Ile Gln Leu Ser Arg Val Thr Arg Asn Asn500 505 510Ala Gly Asp Thr Asn Asn Glu Ile Thr Pro Asp His Asn Asn515 520 5 5 <210>15 <211>1782 <212>DNA <213> Arabidopsis thaliana <220> <221> CDS <222> (1). . (1779) <220> <221>misc_feature <222>(274). . (313) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1327). . (1370) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400> 15ATG GGA ATC ATC GAC GAC GGT TCT TTG CTT CGG CGG TTG ATT TGT CTC TGT 48MET GLY Ile ASP GLY Serg acg gcg gag gat   96Leu Trp Cys Leu Leu Gly Gly Gly Val Thr Val Val Thr Ala Glu Asp20                  25                  30gag aag aaa gtg gta cat aaa cag ctt aat caa act ccg act tgg gct   144Glu Lys Lys Val Val His Lys Gln Leu Asn Gln Thr Pro Thr Trp Ala35                  40                  45gtt gct gct gtt tgt act ttc ttc atc gtt gtt tct gtt ctt ctt gaa   192Val Ala Ala Val Cys Thr Phe Phe Ile Val Val Ser Val Leu Leu Glu50                  55                  60aaa ctt ctt cac aaa gtt gga aag gtt cta tgg gat cgg cac aag aca   240Lys Leu Leu His Lys Val Gly Lys Val Leu Trp Asp Arg His Lys Thr65                  70                  75                  80gct ctt ctt gac gct ttg gag aag atc aaa gca gag ctg atg gtt ctt   288Ala Leu Leu Asp Ala Leu Glu Lys Ile Lys Ala Glu Leu Met Val Leu85                  90                  95gga ttc atc tct ttg ctt ctg aca ttt gga caa acc tac att ttg gat   336Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Gln Thr Tyr Ile Leu Asp100                 105                 110att tgt atc cct tca cat gtt gct cgt acg atg ctc ccg tgt cct gct   384Ile Cys Ile Pro Ser His Val Ala Arg Thr Met Leu Pro Cys Pro Ala115                 120                 125cct aac ttg aaa aag gag gat gat gac aat ggt gaa agt cac agg aga   432Pro Asn Leu Lys Lys Glu Asp Asp Asp Asn Gly Glu Ser His Arg Arg130                 135                 140ctc ttg tcg ttt gag cac aga ttt tta tct gga ggt gaa gca tct ccc   480Leu Leu Ser Phe Glu His Arg Phe Leu Ser Gly Gly Glu Ala Ser Pro145                 150                 155                 160act aaa tgc acg aag gag ggt tat gta gag ctt atc tct gcc gag gca   528Thr Lys Cys Thr Lys Glu Gly Tyr Val Glu Leu Ile Ser Ala Glu Ala165                 170                 175ctc cat cag ttg cac atc ctt ata ttc ttc tta gcc att ttc cac gtt   576Leu His Gln Leu His Ile Leu Ile Phe Phe Leu Ala Ile Phe His Val180                 185                 190ctt tac agc ttc tta act atg atg ctt gga agg ttg aag att cgc gga   624Leu Tyr Ser Phe Leu Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly195                 200                 205tgg aag cat tgg gag aat gag aca tca tcc cat aat tac gag ttt tca   672Trp Lys His Trp Glu Asn Glu Thr Ser Ser His Asn Tyr Glu Phe Ser210                 215                 220aca gac act tcc agattc agg cta act cat gaa aca tct ttt gtg aga   720Thr Asp Thr Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg225 230                 235                 240gcg cac acc agt ttc tgg acc cgg att cca ttc ttt ttc tat gtt gga   768Ala His Thr Ser Phe Trp Thr Arg Ile Pro Phe Phe Phe Tyr Val Gly245                 250                  255tgc ttt ttc aga cag ttt ttc aga tcc gtt ggg aga act gac tat ttg   816Cys Phe Phe Arg Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu260                 265                 270aca ttg aga aat ggt ttc atc gct gtt cat tta gct cca gga agt caa   864Thr Leu Arg Asn Gly Phe Ile Ala Val His Leu Ala Pro Gly Ser Gln275 280                 285ttt aac ttc caa aaa tac att aaa aga tcg ttg gag gat gat ttc aag   912Phe Asn Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys290                 295                 300gta gtc gtt gga gtc agc cct gtc ttg tgg gga tct ttt gtg cta ttc 960Val Val Val Gly Val Ser Pro Val Leu Trp Gly Ser Phe Val Leu Phe305                 310                 315                 320ctc ctc cta aat att gac ggt gag tat atg atg ttc atc ggc act gca   1008Leu Leu Leu Asn Ile Asp Gly Glu Tyr Met Met Phe Ile Gly Thr Ala325 330                 335ata ccg gtt att atc att tta gct gta ggg aca aag ctt caa gcg att   1056Ile Pro Val Ile Ile Ile Leu Ala Val Gly Thr Lys Leu Gln Ala Ile340                 345                 350atg aca agg atg gct ctt ggt atc aca gat aga cat gcg gta gtt caa 1104Met Thr Arg Met Ala Leu Gly Ile Thr Asp Arg His Ala Val Val Gln355                 360                 365gga atg ccg ctt gta caa ggc aac gat gag tat ttc tgg ttc ggt cgt   1152Gly Met Pro Leu Val Gln Gly Asn Asp Glu Tyr Phe Trp Phe Gly Arg370                 375 380ccc cat ttg att ctc cat ctc atg cat ttc gcc ttg ttt cag aac gca   1200Pro His Leu Ile Leu His Leu Met His Phe Ala Leu Phe Gln Asn Ala385                 390                 395                 400ttt cag atc act tat ttc ttc tgg ata tgg tat tcc ttt gga tca gat 1248Phe Gln Ile Thr Tyr Phe Phe Trp Ile Trp Tyr Ser Phe Gly Ser Asp405                 410                 415tct tgc tac cat cct aat ttc aag att gca ctt gta aaa gta gcg att   1296Ser Cys Tyr His Pro Asn Phe Lys Ile Ala Leu Val Lys Val Ala Ile420                 425 430gct tta gga gta ttg tgt ctt tgc agc tac atc aca ctt cct ctt tac   1344Ala Leu Gly Val Leu Cys Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr435                 440                 445gca ctc gta act cag atg ggt tct cgg atg aaa aaa tcg gta ttc gat    1392Ala Leu Val Thr Gln Met Gly Ser Arg Met Lys Lys Ser Val Phe Asp450                 455                 460gaa caa acg tca aaa gca ctc aag aaa tgg aga atg gca gtg aag aag   1440Glu Gln Thr Ser Lys Ala Leu Lys Lys Trp Arg Met Ala Val Lys Lys465                 470                 475 480aag aaa ggt gtg aaa gcc act act aag aga cta ggt gga gat gga agt   1488Lys Lys Gly Val Lys Ala Thr Thr Lys Arg Leu Gly Gly Asp Gly Ser485                 490                 495gcg agc cct acg gca tcg aca gtt agg tct act tcg tct gta cgt tca   1536Ala Ser Pro Thr Ala Ser Thr Val Arg Ser Thr Ser Ser Val Arg Ser500                 505                 510ttg cag cgt tac aaa acc aca cca cat tcg atg aga tac gaa gga ctt   1584Leu Gln Arg Tyr Lys Thr Thr Pro His Ser Met Arg Tyr Glu Gly Leu515                 520                 525gac cct gaa aca tcg gat ctc gac aca gat aat gaa gct ttg act cct   1632Asp Pro Glu Thr Ser Asp Leu Asp Thr Asp Asn Glu Ala Leu Thr Pro530                 535                 540ccc aaa tct cct cca agc ttc gag ctt gtt gtg aaa gtt gaa cca aat   1680Pro Lys Ser Pro Pro Ser Phe Glu Leu Val Val Lys Val Glu Pro Asn545                 550                 555                 560aag acc aat acc ggt gag act agc cgt gac act gaa act gat tct aaa   1728Lys Thr Asn Thr Gly Glu Thr Ser Arg Asp Thr Glu Thr Asp Ser Lys565                 570                 575gag TTC TCT TTC GTC Aag CCT GCT CCG AGT AAT GAA TCA TCT CAA GAC 1776GLU PHE VAL LYS PRO Ala Pro Ser Asn Gln ASP585 590CGG TGA 1782arg <210>16 <211>593 <212>PRT <213> Arabidopsis thaliana <400> 16MET GLY Ile ASP GLY Serg ARG Leu Ile Cys1 5 10 15LEU Leu Leu Leu Gly Gly Val Val THR Ala Glu Alas LYS VAl His LEU ALN Leu Alas LYS LEU Pro Thr Trp Ala35                  40                  45Val Ala Ala Val Cys Thr Phe Phe Ile Val Val Ser Val Leu Leu Glu50                  55                  60Lys Leu Leu His Lys Val Gly Lys Val Leu Trp Asp Arg His Lys Thr65                  70                  75                  80Ala Leu Leu Asp Ala Leu Glu Lys Ile Lys Ala Glu Leu Met Val Leu85                  90                  95Gly Phe Ile Ser Leu Leu Leu Thr Phe Gly Gln Thr Tyr Ile Leu Asp100                 105                 110Ile Cys Ile Pro Ser His Val Ala Arg Thr Met Leu Pro Cys Pro Ala115                 120                 125Pro Asn Leu Lys Lys Glu Asp Asp Asp Asn Gly Glu Ser His Arg Arg130                 135                 140Leu Leu Ser Phe Glu His Arg Phe Leu Ser Gly Gly Glu Ala Ser Pro145                 150                 155                 160Thr Lys Cys Thr Lys Glu Gly Tyr Val Glu Leu Ile Ser Ala Glu Ala165                 170                 175Leu His Gln Leu His Ile Leu Ile Phe Phe Leu Ala Ile Phe His Val180                 185                 190Leu Tyr Ser Phe Leu Thr Met Met Leu Gly Arg Leu Lys Ile Arg Gly195                 200                 205Trp Lys His Trp Glu Asn Glu Thr Ser Ser His Asn Tyr Glu Phe Ser210                 215                 220Thr Asp Thr Ser Arg Phe Arg Leu Thr His Glu Thr Ser Phe Val Arg225                 230                 235                 240Ala His Thr Ser Phe Trp Thr Arg Ile Pro Phe Phe Phe Tyr Val Gly245                 250                 255Cys Phe Phe Arg Gln Phe Phe Arg Ser Val Gly Arg Thr Asp Tyr Leu260                 265                 270Thr Leu Arg Asn Gly Phe Ile Ala Val His Leu Ala Pro Gly Ser Gln275                 280                 285Phe Asn Phe Gln Lys Tyr Ile Lys Arg Ser Leu Glu Asp Asp Phe Lys290                 295                 300Val Val Val Gly Val Ser Pro Val Leu Trp Gly Ser Phe Val Leu Phe305                 310                 315                 320Leu Leu Leu Asn Ile Asp Gly Glu Tyr Met Met Phe Ile Gly Thr Ala325                 330                 335Ile Pro Val Ile Ile Ile Leu Ala Val Gly Thr Lys Leu Gln Ala Ile            340                 345                 350Met Thr Arg Met Ala Leu Gly Ile Thr Asp Arg His Ala Val Val Gln355                 360 365Gly Met Pro Leu Val Gln Gly Asn Asp Glu Tyr Phe Trp Phe Gly Arg370                 375                 380Pro His Leu Ile Leu His Leu Met His Phe Ala Leu Phe Gln Asn Ala385                 390                 395                 400Phe Gln Ile Thr Tyr Phe Phe Trp Ile Trp Tyr Ser Phe Gly Ser Asp405                 410                 415Ser Cys Tyr His Pro Asn Phe Lys Ile Ala Leu Val Lys Val Ala Ile420                 425                 430Ala Leu Gly Val Leu Cys Leu Cys Ser Tyr Ile Thr Leu Pro Leu Tyr435                 440                 445Ala Leu Val Thr Gln Met Gly Ser Arg Met Lys Lys Ser Val Phe Asp450                 455                 460Glu Gln Thr Ser Lys Ala Leu Lys Lys Trp Arg Met Ala Val Lys Lys465                 470                 475                 480Lys Lys Gly Val Lys Ala Thr Thr Lys Arg Leu Gly Gly Asp Gly Ser485                 490                 495Ala Ser Pro Thr Ala Ser Thr Val Arg Ser Thr Ser Ser Val Arg Ser500                 505                 510Leu Gln Arg Tyr Lys Thr Thr Pro His Ser Met Arg Tyr Glu Gly Leu515                 520                 525Asp Pro Glu Thr Ser Asp Leu Asp Thr Asp Asn Glu Ala Leu Thr Pro530                 535                 540Pro Lys Ser Pro Pro Ser Phe Glu Leu Val Val LYS VAL GLU Pro Asn545 550 560LYS THR Asn Thr GLY GLU THR Serg ARG ARG ARG ASP THR ASP Ser Lys565 575Glu Phe Val Lysn Gln Gln ASP585 595590 595 595585585585 590 <210>17 <211>1629 <212>DNA <213> Arabidopsis thaliana <220> <221> CDS <222> (1). . (1626) <220> <221>misc_feature <222>(184). . (223) <223>The position of the amino acid sequence shown in SEQ ID NO: 1 <220> <221>misc_feature <222>(1141). . (1183) <223> The position of the amino acid sequence shown in SEQ ID NO: 2 <400> 17atg Cat atg AAA GAA GGA GGA AGG TCT CTT GCA GAG ACG ACG Act 48MET GLU HIS MET MET MET MET LYS GLU GLY Serg Ser Leu THR PRO THR1 5 10 15TAC TCG GTT GTT GTT GTT TGT GTT TGT GTT TGTGTA tgc ttt ctc   96Tyr Ser Val Ala Ser Val Val Thr Val Leu Val Phe Val Cys Phe Leu20                  25                  30gtt gaa cgc gcc att tac aga ttt gga aag tgg tta aag aag act aga   144Val Glu Arg Ala Ile Tyr Arg Phe Gly Lys Trp Leu Lys Lys Thr Arg35                  40                  45aga aag gca ctt ttt act tca ctt gag aaa atg aaa gag gag ttg atg   192Arg Lys Ala Leu Phe Thr Ser Leu Glu Lys Met Lys Glu Glu Leu Met50                  55                  60ttg ctg gga ctt ata tca ctt ctg ttg tca caa agc gcg aga TGG Att 240leu Leu Ile Seru Leu LEU Leu Serg Trg TRP ILE65 80TCA GAA ATC TGT GTT AAC CTT AGT AAAA TAES SET 288ser Glu Ile Cys Val aser Cys Val Tyr Ile85                  90                  95tgc tct gaa gag gac tat gga atc cat aag aaa gtt ctt ctg gaa cac   336Cys Ser Glu Glu Asp Tyr Gly Ile His Lys Lys Val Leu Leu Glu His100                 105                 110acc tct tct aca aac cag agc tcc tta cct cat cat gga ata Cat Gaa 384thr Serte THR ASN GLN SER Leu PRO HIS HIS GLY Ile His Glu115 125GCC TCT CAA TGT Cat GGC CGT GAA CCA TAT GAG TAG TAG TAT Hisl SERN CYS GLY HIS GLL ARG GLY ARG GLY ARG GLY ARG GLY ARG GLY ARG GLY ARG GLY ARG GLY ARG GLY GLY ARG GLY GLY GLY GLY GLY GLY GLY ARG Glu130                 135                 140gga ctc gag caa ctc cta agattc tta ttc gtc ctg ggt atc act cat   480Gly Leu Glu Gln Leu Leu Arg Phe Leu Phe Val Leu Gly Ile Thr His145                 150                 155                160gtt cta tac agt ggc att gcc att ggt tta gcc atg agc aag att tac   528Val Leu Tyr Ser Gly Ile Ala Ile Gly Leu Ala Met Ser Lys Ile Tyr165                 170                 175agt tgg aga aaa tgg gaa gcc caa gcg atc ata atg gct gaa tca gat   576Ser Trp Arg Lys Trp Glu Ala Gln Ala Ile Ile Met Ala Glu Ser Asp180 185                 190atc cac ctt tgt ttc ctg cgg caa ttt aga ggc tcc ata cga aag tct    624Ile His Leu Cys Phe Leu Arg Gln Phe Arg Gly Ser Ile Arg Lys Ser195                 200                 205gac tac ttc gca ctt cgg tta ggt ttc ctc act aaa cat aat ttg cca 672Asp Tyr Phe Ala Leu Arg Leu Gly Phe Leu Thr Lys His Asn Leu Pro210                 215                 220ttt aca tac aac ttc cat atg tat atg gta cgg acg atg gaa gat gag   720Phe Thr Tyr Asn Phe His Met Tyr Met Val Arg Thr Met Glu Asp Glu225                 230 235                 240ttt cat ggc att gtt gga att agc tgg cca ctt tgg gtt tac gct ata   768Phe His Gly Ile Val Gly Ile Ser Trp Pro Leu Trp Val Tyr Ala Ile245                 250                 255gta tgc atc tgc ata aat gtt cat ggc ctg aat atg tac ttt tgg ata 816Val Cys Ile Cys Ile Asn Val His Gly Leu Asn Met Tyr Phe Trp Ile260                 265                 270tca ttc gtt cct gcc att ctt gtc atg ttg gtt gga acc aaa ctt gag   864Ser Phe Val Pro Ala Ile Leu Val Met Leu Val Gly Thr Lys Leu Glu275                 280 285cat gtt gtc tcc aag ctt gct ctc gag gtt aag gag cag cag aca ggc   912His Val Val Ser Lys Leu Ala Leu Glu Val Lys Glu Gln Gln Thr Gly290                 295                 300aca tct aat ggg gct caa gtc aaa cca cgt gat ggg ctc ttc tgg ttt   960Thr Ser Asn Gly Ala Gln Val Lys Pro Arg Asp Gly Leu Phe Trp Phe305                 310                 315                 320ggg aaa cca gaa att ctg cta cgg ttg ata caa ttt atc att ttt cag   1008Gly Lys Pro Glu Ile Leu Leu Arg Leu Ile Gln Phe Ile Ile Phe Gln325                 330 335aat gca ttt gaa atg gca aca ttc atc tgg ttc ttg tgg gga atc aag   1056Asn Ala Phe Glu Met Ala Thr Phe Ile Trp Phe Leu Trp Gly Ile Lys340                 345                 350gaa aga tct tgc ttc atg aag aac cat gtg atg ata tca agc cgg cta   1104Glu Arg Ser Cys Phe Met Lys Asn His Val Met Ile Ser Ser Arg Leu355                 360                 365att tct ggg gtt ctc gtt cag ttc tgg tgt agt tat ggc act gtg cct   1152Ile Ser Gly Val Leu Val Gln Phe Trp Cys Ser Tyr Gly Thr Val Pro370                 375                  380ctc aat gta atc gtt act cag atg gga tct cgg cat aag aaa gct gtg   1200Leu Asn Val Ile Val Thr Gln Met Gly Ser Arg His Lys Lys Ala Val385                 390                 395                 400ata gca gag agc gta aga gac tca ctt cac agt tgg tgc aag aga gtg   1248Ile Ala Glu Ser Val Arg Asp Ser Leu His Ser Trp Cys Lys Arg Val405                 410                 415aaa gag agg tct aag cac acg aga tca gtg tgt tcc ctt gac aca gca   1296Lys Glu Arg Ser Lys His Thr Arg Ser Val Cys Ser Leu Asp Thr Ala420                 425                 430aca ata gac gag aga gac gag atg aca gtg ggg aca ttg tct agg agc   1344Thr Ile Asp Glu Arg Asp Glu Met Thr Val Gly Thr Leu Ser Arg Ser435                 440                 445tca tcg atg act tca ctg aat cag att acc ata aac tcc ata gac caa   1392Ser Ser Met Thr Ser Leu Asn Gln Ile Thr Ile Asn Ser Ile Asp Gln450                 455                 460gca gag tct ata ttc gga gca gca gct tca tcc agc agt cct caa gat    1440Ala Glu Ser Ile Phe Gly Ala Ala Ala Ser Ser Ser Ser Pro Gln Asp465                 470                 475                 480gga tac acg tcg agg gtg gaa gaa tat ctg tct gaa aca tac aat aac   1488Gly Tyr Thr Ser Arg Val Glu Glu Tyr Leu Ser Glu Thr Tyr Asn Asn485                 490                 495atc ggt tcg ata ccg cct tta aac gat gag att gag att gag att gaa   1536Ile Gly Ser Ile Pro Pro Leu Asn Asp Glu Ile Glu Ile Glu Ile Glu500                 505                 510ggt gaa gaa gat aat gga ggg aga gga agt ggg agt gat gag aat aac   1584Gly Glu Glu Asp Asn Gly Gly Arg Gly Ser Gly Ser Asp Glu Asn Asn515                 520                 525ggt gat gct gga gaa aca ctt ctt gag ttg ttt agg agg act tga 1629Gly Asp Ala Gly Glu Thr Leu Leu Glu Leu Phe Arg Arg Thr530 535 0 4 5 <210>18 <211>542 <212>PRT <213> Arabidopsis thaliana <400> 18MET GLU HIS MET MET LYS GLU GLY ARG Serite Leu lys lys thr arg35 45arg LYS ALA Leu Phe Thr Seru Lys Met Lys Glu Leu Met50 55 60leu Leu Leu Leu Serg Trle65 75 80SER SALS VALS VALS VALS VALS VALS VALS VALS VALS VALS Vals Val Phe Asn Ser Lys Phe Tyr Ile85                  90                  95Cys Ser Glu Glu Asp Tyr Gly Ile His Lys Lys Val Leu Leu Glu His100                 105                 110Thr Ser Ser Thr Asn Gln Ser Ser Leu Pro His His Gly Ile His Glu115                 120                 125Ala Ser His Gln Cys Gly His Gly Arg Glu Pro Phe Val Ser Tyr Glu130                 135                 140Gly Leu Glu Gln Leu Leu Arg Phe Leu Phe Val Leu Gly Ile Thr His145                 150                 155                 160Val Leu Tyr Ser Gly Ile Ala Ile Gly Leu Ala Met Ser Lys Ile Tyr165                 170                 175Ser Trp Arg Lys Trp Glu Ala Gln Ala Ile Ile Met Ala Glu Ser Asp180                 185                 190Ile His Leu Cys Phe Leu Arg Gln Phe Arg Gly Ser Ile Arg Lys Ser195                 200                 205Asp Tyr Phe Ala Leu Arg Leu Gly Phe Leu Thr Lys His Asn Leu Pro210                 215                 220Phe Thr Tyr Asn Phe His Met Tyr Met Val Arg Thr Met Glu Asp Glu225                 230                 235                 240Phe His Gly Ile Val Gly Ile Ser Trp Pro Leu Trp Val Tyr Ala Ile245                 250                 255Val Cys Ile Cys Ile Asn Val His Gly Leu Asn Met Tyr Phe Trp Ile260                 265                 270Ser Phe Val Pro Ala Ile Leu Val Met Leu Val Gly Thr Lys Leu Glu275                 280                 285His Val Val Ser Lys Leu Ala Leu Glu Val Lys Glu Gln Gln Thr Gly290                 295                 300Thr Ser Asn Gly Ala Gln Val Lys Pro Arg Asp Gly Leu Phe Trp Phe305                 310                 315                 320Gly Lys Pro Glu Ile Leu Leu Arg Leu Ile Gln Phe Ile Ile Phe Gln325                 330                 335Asn Ala Phe Glu Met Ala Thr Phe Ile Trp Phe Leu Trp Gly Ile Lys340                 345                 350Glu Arg Ser Cys Phe Met Lys Asn His Val Met Ile Ser Ser Arg Leu355                 360 365Ile Ser Gly Val Leu Val Gln Phe Trp Cys Ser Tyr Gly Thr Val Pro370                 375                 380Leu Asn Val Ile Val Thr Gln Met Gly Ser Arg His Lys Lys Ala Val385                 390                 395                 400Ile Ala Glu Ser Val Arg Asp Ser Leu His Ser Trp Cys Lys Arg Val405                 410                 415Lys Glu Arg Ser Lys His Thr Arg Ser Val Cys Ser Leu Asp Thr Ala420                 425                 430Thr Ile Asp Glu Arg Asp Glu Met Thr Val Gly Thr Leu Ser Arg Ser435                 440                 445Ser Ser Met Thr Ser Leu Asn Gln Ile Thr Ile Asn Ser Ile Asp Gln450                 455                 460Ala Glu Ser Ile Phe Gly Ala Ala Ala Ser Ser Ser Ser Pro Gln Asp465                 470                 475                 480Gly Tyr Thr Ser Arg Val Glu Glu Tyr Leu Ser Glu Thr Tyr Asn Asn485                 490                 495Ile Gly Ser Ile Pro Pro Leu Asn Asp Glu Ile Glu Ile Glu Ile Glu500 505 510GLY GLU GLU ASN GLY GLY GLY ARG GLY Serg GLY Ser ASP GLU ASN515 520 525GLY ALA GLY GLU Leu Leu Leu PHE ARG THR530 535 540 540 <210>19 <211>20 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>19atgtcggacaaaaaaggggt <210>0 <211>20 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>20atgctaccac acgcagatcg 20 <210>21 <211>21 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>21acagagacca cctccttgga a 21 <210>22 <211>22 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>22cagaaacttg tctcatccct gg 22 <210>23 <211>20 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>23aagaactgcc tgaagaaggc 20 <210>24 <211>20 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>24caccaccttc atgatgctca 20 <210>25 <211>20 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>25ttccagcacc ggcacaagaa 20 <210>26 <211>28 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>26tggacctctt catgttcgat cccatctg 28 <210>27 <211>27 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>27cctgacgctg ttccagaatg cgtttca 27 <210>28 <211>20 <212>DNA <213> Artificial sequence <223> Artificial sequence description: oligonucleotides <400>28acttctgcag gtcgactcta 20 <210>29 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>29aagatcaaga tgaggacgtg gaagtcgtgg 30 <210>30 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>30aggctgaacc actggggcgc ctctcaccac 30 <210>31 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>31caagtatatg atgcgcgctc tagaggatga 30 <210>32 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotide aggtttcacc actaagtctc cttcaatggc 30 <210>33 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>33gatcattcaa gacttaggct cactcatgag 30 <210>34 <211>30 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>34aacagcaagg aagattacaa atgatgccca 30 <210>35 <211>18 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>35ggattaagat ctaatggc 18 <210>36 <211>23 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>36caaagatctt catttcttaa aag 23 <210>37 <211>25 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>37gcggatccat gtcggacaaa aaagg 25 <210>38 <211>25 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>38gcggatcctc atccctggct gaagg 25 <210>39 <211>23 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>39ggatccacca tggccacaag atg 23 <210>40 <211>24 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>40ggatccttag tcaatatcat tagc 24 <210>41 <211>27 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>41gcggatccat gggtcacgga ggagaag 27 <210>42 <211>27 <212>DNA <213> Artificial sequence <220> <223> Artificial sequence description: oligonucleotides <400>42gcggatcctc agttgttatg atcagga 27

Claims (28)

1. coding Mlo protein DNA molecule, wherein said Mlo protein comprises listed at least a sequence among SEQ ID NO:1 or the SEQ ID NO:2, and said Mlo protein is given the resistance of plant to fungal pathogens.
2. the dna molecular of claim 1, any listed among wherein said dna molecular and SEQ IDNO:3, SEQ ID NO:5 or SEQ ID NO:7 nucleotide sequence is identical or similar substantially, perhaps identical the or similar substantially Mlo protein of listed Mlo protein among coding and SEQ ID NO:4, SEQ ID NO:6 or the SEQ IDNO:8.
3. the dna molecular of claim 1, any listed among wherein said dna molecular and SEQ IDNO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15 or SEQ IDNO:17 nucleotide sequence is identical or similar substantially, perhaps identical with listed Mlo protein among SEQID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16 or the SEQ IDNO:18 or the basic similar Mlo protein of coding.
4. each dna molecular among the claim 1-3, wherein said dna molecular is not to derive from barley.
5. each dna molecular among the claim 1-4, wherein said DNA are modified so that the loss of activity of endogenous protein.
6. the dna molecular of claim 5, wherein said dna modification causes that one of following change, whole or various combination are arranged in the respective egg white matter aminoacid sequence:
-tryptophane (163) is changed into arginine
-proline(Pro) (396) back frameshit
-tryptophane (160) back frameshit
-methionine(Met) (1) is changed into Isoleucine
-glycine (227) is changed into aspartic acid
-methionine(Met) (1) is changed into Xie Ansuan
-arginine (11) is changed into tryptophane
-lose phenylalanine (183), Threonine (184)
-Xie Ansuan (31) is changed into L-glutamic acid
-Serine (32) is changed into phenylalanine
-leucine (271) is changed into Histidine.
With claim 1-6 in the dna molecular of each dna molecular antisense.
8. the protein that comprises one of listed sequence among SEQ ID NO:1 or the SEQ ID NO:2 at least, wherein said protein are Mlo protein and give the resistance of plant to fungal pathogens.
9. the protein of claim 8, wherein said protein is by identical or similar substantially with listed any sequence among SEQ ID NO:3, SEQ ID NO:5 or the SEQ ID NO:7 nucleotide sequence coded, and is perhaps identical or similar substantially with listed any protein among SEQ ID NO:4, SEQ ID NO:6 or the SEQ IDNO:8.
10. the protein of claim 8, wherein said protein is by identical or similar substantially with listed any sequence among SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15 or the SEQ ID NO:17 nucleotide sequence coded, and is perhaps identical or similar substantially with listed any protein among SEQ IDNO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16 or the SEQ IDNO:18.
11. each protein among the claim 8-10, wherein said protein is not to derive from barley.
12. contain the expression cassette that right requires each dna molecular among the 1-7.
13. contain the carrier of the expression cassette of the dna molecular that comprises claim 12.
14. comprise that containing right requires the expression cassette of each dna molecular among the 1-7 or the cell of its part, wherein the said dna molecular in this expression cassette is effable in said cell.
15. the cell of claim 14, wherein said dna molecular is stabilized in the genome that is integrated into said cell.
16. each cell in claim 14 or 15, wherein said cell is a vegetable cell.
17. comprise that containing right requires the expression cassette of each dna molecular among the 1-7 or the plant of its part, wherein the said dna molecular in this expression cassette is effable in said plant.
18. the plant of claim 17, wherein said dna molecular is stabilized in the genome that is integrated into this plant.
19. comprise containing the agricultural-food that right requires the plant of each DNA isolation molecule among the 1-7, wherein said agricultural-food have the Plant Quarantine characteristic of improvement.
20. the method for the plant of preparation resistant to fungal pathogens may further comprise the steps:
A) in said plant, express among the claim 1-6 each dna molecular with " justice is arranged " direction; Or
B) in said plant, express among the claim 1-6 each dna molecular with " antisense " direction; Or
C) in said plant, express the special cutting of energy by ribozyme corresponding to the coded messenger RNA(mRNA) transcript of the native gene of each dna molecular among the claim 1-6; Or
D) expression is specific to coded protein or its a part of aptamer of each dna molecular among the claim 1-6 in plant; Or
E) in plant, express the sudden change or the clipped form of each dna molecular among the claim 1-6; Or
F) in plant, modify at least one chromosome copies corresponding to the gene of each dna molecular among the claim 1-6 by homologous recombination.
21. plant with the resistant to fungal pathogens of the method for claim 20 preparation.
22. the plant of claim 21, wherein said fungal pathogens can infect epidermic cell alive.
23. the plant of claim 21, wherein said fungal pathogens is from Erysiphales.
24. the plant of claim 21, wherein said fungal pathogens is from Erysiphe.
25. the plant of claim 21, wherein said fungal pathogens is the standing grain powdery mildew.
26. the tool that obtains with the method for claim 20 improves the agricultural-food of Plant Quarantine characteristic.
27. a method of separating the dna molecular of coding Mlo protein may further comprise the steps:
A) will encode at least 6 amino acid whose degenerate oligonucleotides among the SEQ ID NO:1 and be complementary to the degenerate oligonucleotide and the DNA that extracts from plant of at least 6 amino acid whose sequences among the coding SEQ ID NO:2 mix under said degenerate oligonucleotide and the condition that described DNA is hybridized allowing;
B) amplification said DNA of plants dna fragmentation, wherein said dna fragmentation its a left side and right end contain can with said degenerate oligonucleotide annealed nucleotide sequence in the step a); With
C) acquisition contains the full length cDNA clone of the dna fragmentation of step b).
28. the method for the dna molecular of mutagenesis claim 1, wherein said dna molecular have been cut into the double-stranded random fragment of expection size, this method may further comprise the steps:
A) add one or more strand or double chain oligonucleotide in the double-stranded random fragment group of gained, wherein said oligonucleotide comprises zone identical with the double-stranded template polynucleotide and allogenic zone;
B) the mixture sex change with double-stranded random fragment of gained and oligonucleotide becomes single-chain fragment;
C) causing said single-chain fragment to form under the annealing fragment paired condition in said same area annealing, with gained single-chain fragment group and polysaccharase insulation, wherein said same area is enough to make the member in the pairing to cause duplicating of another member, thereby forms the double-stranded polynucleotide through mutagenesis; With
D) repeat second and at least two circulations of third step again, wherein the mixture that produces of second step in next circulation comprise from last circulation third step through the double-stranded polynucleotide of mutagenesis, and this next circulation has formed the double-stranded polynucleotide of further mutagenesis.
CN99804041A 1998-03-17 1999-03-17 Genes encoding MLO proteins and conferring fungal resistance upon plants Pending CN1293711A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US4276398A 1998-03-17 1998-03-17
US09/042,763 1998-03-17

Publications (1)

Publication Number Publication Date
CN1293711A true CN1293711A (en) 2001-05-02

Family

ID=21923620

Family Applications (1)

Application Number Title Priority Date Filing Date
CN99804041A Pending CN1293711A (en) 1998-03-17 1999-03-17 Genes encoding MLO proteins and conferring fungal resistance upon plants

Country Status (14)

Country Link
EP (1) EP1064304A2 (en)
JP (1) JP2002512775A (en)
KR (1) KR20010041943A (en)
CN (1) CN1293711A (en)
AU (1) AU3331199A (en)
BR (1) BR9908851A (en)
CA (1) CA2322016A1 (en)
EA (1) EA200000952A1 (en)
HU (1) HUP0102051A2 (en)
ID (1) ID25781A (en)
IL (1) IL138341A0 (en)
PL (1) PL343599A1 (en)
TR (1) TR200002537T2 (en)
WO (1) WO1999047552A2 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104561031A (en) * 2013-10-28 2015-04-29 常熟市董浜镇里睦蔬菜专业合作社 Quick identification of MLO powdery mildew resistant gene for citrus
CN104593379A (en) * 2013-10-30 2015-05-06 江苏省常熟现代农业产业园区发展有限公司 Rapid identification of powdery mildew resistant gene of Chinese cabbage by using comparative genomics
CN106164272A (en) * 2014-01-21 2016-11-23 中国科学院遗传与发育生物学研究所 The plant modified
CN108291235A (en) * 2015-10-07 2018-07-17 埃德蒙马赫基金会 Resistance conferring genes for byssus simplex capsula albuginea in apples
CN111826454A (en) * 2019-04-23 2020-10-27 江苏省农业科学院 Molecular marker VrMLO_Indel2 for identification of mung bean powdery mildew resistance phenotype and its primers and applications
CN115942867A (en) * 2019-02-23 2023-04-07 优良种子有限责任公司 Powdery mildew resistant cannabis plant

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000078799A2 (en) * 1999-06-18 2000-12-28 Syngenta Participations Ag Mlo-genes controlling diseases
US6864404B1 (en) * 1999-10-15 2005-03-08 Carnegie Institution Of Washington Engineering disease resistance with pectate lyase-like genes
GB0015122D0 (en) * 2000-06-20 2000-08-09 Plant Bioscience Ltd Plant resistance gene
EP1925672A1 (en) 2001-06-22 2008-05-28 Syngeta Participations AG Abiotic stress responsive polynucleotides and polypeptides
US7456335B2 (en) 2001-09-03 2008-11-25 Basf Plant Science Gmbh Nucleic acid sequences and their use in methods for achieving pathogen resistance in plants
US7572950B2 (en) 2002-07-04 2009-08-11 Sungene Gmbh & Co. Kgaa Methods for obtaining pathogen resistance in plants
DE10233327A1 (en) 2002-07-22 2004-02-05 Basf Ag Process for achieving pathogen resistance in plants
US20070169227A1 (en) 2003-12-16 2007-07-19 Pioneer Hi-Bred International Inc. Dominant Gene Suppression Transgenes and Methods of Using Same
WO2005059121A2 (en) 2003-12-16 2005-06-30 Pioneer Hi-Bred International, Inc. Dominant gene suppression transgenes and methods of using same
AU2006311089C1 (en) 2005-11-08 2012-11-08 Basf Plant Science Gmbh Use of armadillo repeat (ARM1) polynucleotides for obtaining pathogen resistance in plants
EP2380986A1 (en) 2006-01-12 2011-10-26 BASF Plant Science GmbH Use of stomatin (STM1) polynucleotides for achieving a pathogen resistance in plants
US8222486B2 (en) 2006-08-30 2012-07-17 Basf Plant Science Gmbh Method for increasing resistance to pathogens in transgenic plants
EP2487245A3 (en) 2006-10-12 2012-10-24 BASF Plant Science GmbH Method for increasing pathogen resistance in transgenic plants
CA2660439C (en) 2006-10-24 2016-02-16 Basf Plant Science Gmbh Methods for increasing the resistance in plants to biotropic fungi
AU2008206975B2 (en) 2007-01-15 2013-05-02 Basf Plant Science Gmbh Use of subtilisin (RNR9) polynucleotides for achieving a pathogen resistance in plants
ES2702562T3 (en) * 2011-03-01 2019-03-01 Enza Zaden Beheer Bv Genes that provide resistance to powdery mildew in Cucumis melo

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4009990A1 (en) * 1990-03-28 1991-10-02 Btc Biotech Int Phytosanitary agent for prevention and treatment of plant diseases - contains natural carboxylic acids and/or aminoacid(s), purine(s) and/or pyrimidine(s) and vitamin(s)
US5605793A (en) * 1994-02-17 1997-02-25 Affymax Technologies N.V. Methods for in vitro recombination
US5773696A (en) * 1996-03-29 1998-06-30 Monsanto Company Antifungal polypeptide and methods for controlling plant pathogenic fungi
CA2260363C (en) * 1996-07-29 2011-02-08 Plant Bioscience Limited Polynucleotide and its use for modulating a defence response in plants
AU1450199A (en) * 1997-11-05 1999-05-24 E.I. Du Pont De Nemours And Company Corn cdna encoding southern leaf blight resistance protein

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104561031A (en) * 2013-10-28 2015-04-29 常熟市董浜镇里睦蔬菜专业合作社 Quick identification of MLO powdery mildew resistant gene for citrus
CN104593379A (en) * 2013-10-30 2015-05-06 江苏省常熟现代农业产业园区发展有限公司 Rapid identification of powdery mildew resistant gene of Chinese cabbage by using comparative genomics
CN106164272A (en) * 2014-01-21 2016-11-23 中国科学院遗传与发育生物学研究所 The plant modified
US10557146B2 (en) 2014-01-21 2020-02-11 The Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences Modified plants
CN106164272B (en) * 2014-01-21 2020-12-29 中国科学院遗传与发育生物学研究所 Modified plants
CN108291235A (en) * 2015-10-07 2018-07-17 埃德蒙马赫基金会 Resistance conferring genes for byssus simplex capsula albuginea in apples
CN108291235B (en) * 2015-10-07 2021-11-30 埃德蒙马赫基金会 Resistance conferring genes for byssus simplex capsula albuginea in apples
CN115942867A (en) * 2019-02-23 2023-04-07 优良种子有限责任公司 Powdery mildew resistant cannabis plant
CN111826454A (en) * 2019-04-23 2020-10-27 江苏省农业科学院 Molecular marker VrMLO_Indel2 for identification of mung bean powdery mildew resistance phenotype and its primers and applications

Also Published As

Publication number Publication date
EA200000952A1 (en) 2001-06-25
BR9908851A (en) 2000-12-12
WO1999047552A3 (en) 1999-11-18
TR200002537T2 (en) 2000-11-21
WO1999047552A2 (en) 1999-09-23
CA2322016A1 (en) 1999-09-23
AU3331199A (en) 1999-10-11
EP1064304A2 (en) 2001-01-03
ID25781A (en) 2000-11-02
JP2002512775A (en) 2002-05-08
IL138341A0 (en) 2001-10-31
HUP0102051A2 (en) 2001-10-28
PL343599A1 (en) 2001-08-27
KR20010041943A (en) 2001-05-25

Similar Documents

Publication Publication Date Title
CN1293711A (en) Genes encoding MLO proteins and conferring fungal resistance upon plants
CN1056880C (en) Synthetic insecticidal crystal protein gene
CN1176577C (en) Nucleotide sequence for coding insecticidal crystal protein and application thereof
CN1267557C (en) Genes that synthesize anti-pathogenic substances
CN1332800A (en) Methods for transforming plants to express bacillus thuringiensis delta-endotoxins
CN1350587A (en) Acquired resistance genes in plants
CN1334874A (en) Increased expression of Cry3B insecticidal protein in plants
CN1414973A (en) Insecticidal protein of Bacillus thuringiensis (BACILLUS THUR_INGIENSIS)
CN1126423A (en) Methods of Controlling Insects
CN1402790A (en) DNA encoding plant deoxyheplysine synthase, plant eukaryotic initiation factor 5A, transgenic plants and methods for controlling plant senescence and programmed cell death
CN1309184A (en) Operation of enzymatic vitality of protoporphyrin primary oxidase in eukaryote
CN1251136A (en) Plant with modified growth
CN1292824A (en) Expression of trehalose biosynthetic genes in plants
CN1333833A (en) New method of identifying non-host plant disease resistance genes
CN1341151A (en) Herbicide target genes and methods
CN101062943A (en) Rice stress tolerance related DREB transfer factor and its coding gene and application
CN1190433A (en) PAP mutants that exhibit anti-viral and/or anti-fungal acitivity in plants
CN1155714C (en) Antifungal protein, DNA encoding the same and host incorporating the same
CN1832961A (en) Fungus resistant plants and their uses
HK1054248A1 (en) Apomixis conferred by expression of serk interacting proteins
CN1481391A (en) A novel gene encoding a F-box protein that regulates Arabidopsis leaf lifespan and its mutant genes
CN1321191A (en) Pi-ta gene conferring disease resistance to plants
CN1355847A (en) Novel plant genes and their uses
CN1316007A (en) Polynucleotide sequences
CN1150329C (en) Nematode-induced regulatory DNA sequences

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication