CN1768141B - Pantolactone hydrolase - Google Patents
Pantolactone hydrolase Download PDFInfo
- Publication number
- CN1768141B CN1768141B CN2004800085042A CN200480008504A CN1768141B CN 1768141 B CN1768141 B CN 1768141B CN 2004800085042 A CN2004800085042 A CN 2004800085042A CN 200480008504 A CN200480008504 A CN 200480008504A CN 1768141 B CN1768141 B CN 1768141B
- Authority
- CN
- China
- Prior art keywords
- leu
- gly
- ala
- asp
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/02—Amides, e.g. chloramphenicol or polyamides; Imides or polyimides; Urethanes, i.e. compounds comprising N-C=O structural element or polyurethanes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P41/00—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture
- C12P41/003—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture by ester formation, lactone formation or the inverse reactions
- C12P41/005—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture by ester formation, lactone formation or the inverse reactions by esterification of carboxylic acid groups in the enantiomers or the inverse reaction
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Analytical Chemistry (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
本发明涉及可用于泛酸内酯(pantolactone)对映异构体的光拆分(optical resolution)的酶,涉及编码所述酶的基因,还涉及所述酶在制备R-泛酸或其盐或R-泛酰醇(R-panthenol)的方法中的用途。The present invention relates to the enzyme that can be used for the photoresolution (optical resolution) of pantolactone (pantolactone) enantiomer, relates to the gene of encoding said enzyme, also relates to said enzyme in the preparation of R-pantothenic acid or its salt or R - Use of R-panthenol in a method.
对映异构体在其生理效应,即毒理和药理效应,与酶的反应以及其感知特征(sensorial characteristics)方面是不同的。已知仅有R-泛酸内酯是R-泛酸制备中的中间产物,R-泛酸是维生素,其对人类和动物的生长、繁殖以及正常生理功能来说都是必要的。泛酸作为辅酶A的前体以及作为脂肪酸合成酶的酰基载体,涉及到超过一百种的不同代谢途径,其中包括碳水化合物、蛋白质和脂类的能量代谢,以及脂类、神经递质、甾类激素、卟啉和激素的合成。直到近来,广泛用于R-泛酸内酯制备的方法是对化学合成的R-和S-泛酸内酯进行的光拆分。此种方法非常昂贵,因为其需要使用昂贵的光拆分试剂。此外,对泛酸内酯的R-对映异构体的回收也相当困难。因此,提供可用于泛酸内酯的R-和S-对映异构体的光拆分的泛酸内酯水解酶是人们所需要的。使用这种酶,尤其是水解泛酸内酯的S-或R-形式,是分离泛酸内酯的两种对映异构体的更为经济且更容易操作的方法。R-泛酸的盐的例子包括R-泛酸钙。Enantiomers differ in their physiological effects, ie toxicological and pharmacological effects, reactions with enzymes, and in their sensorial characteristics. Only R-pantothenic acid lactone is known as an intermediate in the preparation of R-pantothenic acid, a vitamin which is essential for growth, reproduction and normal physiological functions of humans and animals. As the precursor of coenzyme A and as the acyl carrier of fatty acid synthase, pantothenic acid is involved in more than one hundred different metabolic pathways, including energy metabolism of carbohydrates, proteins and lipids, as well as lipids, neurotransmitters, steroids Hormones, porphyrins, and hormone synthesis. Until recently, the widely used method for the preparation of R-pantolactones was the photoresolution of chemically synthesized R- and S-pantolactones. This method is very expensive as it requires the use of expensive photoresolution reagents. In addition, the recovery of the R-enantiomer of pantolactone is also quite difficult. Accordingly, it would be desirable to provide pantolactone hydrolases useful for the photoresolution of the R- and S-enantiomers of pantolactone. The use of this enzyme, especially to hydrolyze the S- or R-form of pantothenolactone, is a more economical and more manipulative method for the separation of the two enantiomers of pantothenolactone. Examples of salts of R-pantothenic acid include calcium R-pantothenate.
在一种实施方式中,本发明提供了编码泛酸内酯水解酶的基因的核苷酸序列,所述泛酸内酯水解酶是从马肝脏、A.niger ATCC 9142、A.nigerawamori ATCC 38854或A.niger MacRae ATCC 46951中获得的。公众可从American Type Culture Collection(ATCC),10801 University Boulvard,Manassas,VA 20110-2209USA获得上述菌株。In one embodiment, the present invention provides the nucleotide sequence of the gene encoding pantothenate lactone hydrolase, which is obtained from horse liver, A. niger ATCC 9142, A. nigerawamori ATCC 38854 or A. Obtained in .niger MacRae ATCC 46951. The public can obtain the above strains from American Type Culture Collection (ATCC), 10801 University Boulvard, Manassas, VA 20110-2209USA.
在另一种实施方式中,本发明提供了编码泛酸内酯水解酶的核苷酸序列,所述序列包含:当泛酸内酯水解酶来自马肝脏时,如SEQ ID NO:1所示的核苷酸序列;当泛酸内酯水解酶来自A.niger MacRae ATCC 46951时,如SEQ ID NO:5所示的核苷酸序列;当泛酸内酯水解酶来自A.nigerATCC9142时,如SEQ ID NO:7所示的序列;以及当泛酸内酯水解酶来自A.niger awamori ATCC38854时,如SEQ ID NO:9所示的序列。In another embodiment, the present invention provides a nucleotide sequence encoding pantothenate hydrolase, said sequence comprising: when the pantothenate hydrolase is from horse liver, the nucleus shown in SEQ ID NO: 1 Nucleotide sequence; When the pantothenate lactone hydrolase is from A.niger MacRae ATCC 46951, the nucleotide sequence shown in SEQ ID NO: 5; when the pantothenate lactone hydrolase is from A.nigerATCC9142, such as SEQ ID NO: The sequence shown in 7; and when the pantothenate lactone hydrolase is from A. niger awamori ATCC38854, the sequence shown in SEQ ID NO: 9.
所述核苷酸序列可以包含上述每种泛酸内酯水解酶的编码序列以及调控序列。The nucleotide sequence may comprise the coding sequence and the regulatory sequence of each pantothenate hydrolase mentioned above.
“调控序列”在此被定义为:指导可操作地连接的核酸转录的核酸控制序列的排列。一个此类表达控制序列的例子是启动子。“启动子”包括在转录起始位点附近的必要核酸序列。启动子还可选地包括远端增强子或阻遏元件,其可定位于转录起始位点之后的多达数千个碱基对上。“组成型”启动子是在大多数环境及发育条件下都有活性的启动子。“诱导型”启动子是在环境或发育调控中具有活性的启动子。术语“可操作地连接”指核酸表达控制序列(例如启动子或转录因子结合位点的排列)以及第二种核酸序列之间的功能性连接,其中所述表达控制序列指导对应于第二种序列的核酸的转录。A "regulatory sequence" is defined herein as an arrangement of nucleic acid control sequences that direct the transcription of an operably linked nucleic acid. An example of such an expression control sequence is a promoter. A "promoter" includes the necessary nucleic acid sequences near the site where transcription begins. A promoter also optionally includes distal enhancer or repressor elements, which can be positioned as many as several thousand base pairs after the start site of transcription. A "constitutive" promoter is one that is active under most environmental and developmental conditions. An "inducible" promoter is a promoter that is active under environmental or developmental regulation. The term "operably linked" refers to a functional linkage between a nucleic acid expression control sequence (such as a promoter or an arrangement of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence directs a sequence corresponding to the second Transcription of a sequence of nucleic acids.
可以使用核酸序列的片段,例如,在杂交试验中作为探针。Fragments of nucleic acid sequences can be used, for example, as probes in hybridization assays.
在宿主生物中插入核苷酸序列,令其转录及翻译,以制造功能性多肽的情况下,技术人员将认识到,由于密码简并性,很多种多核苷酸序列都将编码同样的多肽。这些变异体也明确地包括在本发明的范围之内。此外,本发明还明确地包括下述序列,所述序列彼此之间实质上相同(按照下文所述来确定),它们编码是野生型多肽的突变体的多肽或保留有多肽功能的多肽(例如由对多肽氨基酸的保守取代得到的)。此外,变异体可以是编码下述显性失活突变体的。Where a nucleotide sequence is inserted into a host organism, transcribed and translated to produce a functional polypeptide, the skilled artisan will recognize that, due to codon degeneracy, many polynucleotide sequences will encode the same polypeptide. Such variants are also expressly included within the scope of the present invention. In addition, the present invention expressly includes sequences that are substantially identical to one another (determined as described below) that encode polypeptides that are mutants of wild-type polypeptides or polypeptides that retain polypeptide function (e.g. resulting from conservative substitutions of polypeptide amino acids). In addition, variants may be those encoding dominant negative mutants described below.
如果按照下文所述进行最大一致性比对时,两条核酸序列或多肽各自的核苷酸序列或氨基酸残基是同样的,这两条核苷酸序列或多肽会被认为是“相同的”。在关于两条或多条核酸或多肽序列的上下文中,术语“相同的”或百分比“相同性”指:在比较窗(comparison window)上进行最大一致性比较及比对时,用下述序列比较算法中的一种或通过手工比对及眼睛观察来测量的情况下,两条或多条序列或亚序列是同样的,或其中同样的氨基酸残基或核苷酸具有特定的百分比。当序列相同性的百分比针对蛋白质或肽来使用时,应认识到,不相同的残基位置通常是由于保守的氨基酸取代而导致不同的,其中氨基酸残基被取代为其它具有相似化学性质(例如,电荷或疏水性)的氨基酸残基,因此并不改变分子的功能属性。当序列因保守取代而不同时,可因取代的保守本质,将百分比序列相同性上调以进行校正。进行这种调节的方法是本领域技术人员公知的。典型地,这包括将保守取代作为部分而非完全的错配进行评分,因而提高百分比序列相同性。因此,例如,在给相同的氨基酸1分,给非保守取代0分的情况下,给保守取代0和1之间的分数。按照,例如,Meyers&Miller,Computer Applic.Biol.Sci.4:11-17(1988)的算法,例如,在程序PC/GENE(Intelligenetics,Mountain View,Calif.,USA)中实现,来计算对保守取代的给分。Two nucleic acid sequences or polypeptides are said to be "identical" if their respective nucleotide sequences or amino acid residues are identical when aligned for maximum identity as described below . The term "identical" or percent "identity" in the context of two or more nucleic acid or polypeptide sequences refers to: when comparing and aligning for maximum identity over a comparison window, the sequence Two or more sequences or subsequences are identical, or have a specified percentage of amino acid residues or nucleotides that are identical, as measured by one of the comparison algorithms or by manual alignment and visual inspection. When percentages of sequence identity are used with respect to proteins or peptides, it is recognized that residue positions that are not identical typically differ by conservative amino acid substitutions, in which amino acid residues are replaced by others with similar chemical properties (e.g. , charge or hydrophobicity) amino acid residues and thus do not alter the functional properties of the molecule. When sequences differ by conservative substitutions, the percent sequence identity can be adjusted upwards to correct for the conservative nature of the substitutions. Methods for making such adjustments are well known to those skilled in the art. Typically, this involves scoring conservative substitutions as partial rather than complete mismatches, thus increasing the percent sequence identity. So, for example, give a score between 0 and 1 for conservative substitutions where identical amino acids are given 1 point and non-conservative substitutions are given 0 points. Conservative substitutions are calculated according to, for example, the algorithm of Meyers & Miller, Computer Applic. Biol. Sci. 4: 11-17 (1988), for example, implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif., USA). give points.
在关于两条核酸或多肽的上下文中,短语“实质上相同”指:在比较窗上进行最大一致性比对时,用下述序列比较算法中的一种或通过手工比对及眼睛观察来测量的情况下,具有至少60%,优选80%,最优选90-95%的核苷酸或氨基酸残基相同性的序列或亚序列。该定义还指能与待测序列杂交的序列的互补序列。The phrase "substantially identical" in the context of two nucleic acids or polypeptides means that when aligned for maximum identity over the comparison window, they are identified using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Where measured, a sequence or subsequence having at least 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity. This definition also refers to the complement of a sequence that hybridizes to the test sequence.
对序列比较而言,典型地,一条序列用作参照序列,将待测序列与其进行比较。使用序列比较算法时,将待测和参照序列都输入到计算机中,如果需要的话,指定亚序列坐标,以及指定序列算法程序参数。可以使用缺省的程序参数,或者指定其它参数。然后序列比较算法会基于程序参数,对待测序列相对于参照序列的百分比序列相同性进行计算。For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or other parameters can be specified. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.
本文中使用的“比较窗”包括:指一种片断,其连续位置的数量选自由20至600,通常大约50至大约200,更通常为大约100至大约150所构成的组,其中,序列可与具有同样数量的连续位置的参照序列进行比较,这是在两条序列被最优排列之后进行的。对序列进行排列以进行比较的方法是本领域内公知的。As used herein, "comparison window" includes: refers to a fragment whose number of consecutive positions is selected from the group consisting of 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, wherein the sequence can be Comparisons are made to a reference sequence having the same number of consecutive positions, after the two sequences have been optimally aligned. Methods for aligning sequences for comparison are well known in the art.
“经保守修饰的变异体”用于氨基酸及核酸序列。就具体的核酸序列而言,经保守修饰的变异体指编码相同或基本相同的氨基酸序列的核酸,或者,当核酸不编码氨基酸序列时,指基本相同的序列。由于遗传密码的简并性,有多种功能上相同的核酸密码子编码任何给定的蛋白质。例如,密码子GCA、GCC、GCG和GCU都编码丙氨酸这种氨基酸。因此,在丙氨酸被密码子确定的每个位置上,密码子可变为上述相应密码子中的任何一个,而不会改变编码出的多肽。此类核酸变异是“沉默变异”,其是经保守修饰的变异中的一种。本文中每条编码多肽的核酸序列也还描述该核酸的每种可能的沉默变异。技术人员将认识到,核酸中的每个密码子(AUG除外,其通常是编码甲硫氨酸的唯一的密码子)都可被修饰,以产生功能上相同的分子。因此,编码多肽的核酸的每种沉默变异都隐含在每条被描述过的序列中。"Conservatively modified variants" applies to amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to nucleic acids that encode identical or essentially identical amino acid sequences, or, where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Due to the degeneracy of the genetic code, there are multiple functionally identical nucleic acid codons that encode any given protein. For example, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Therefore, at each position where alanine is defined by a codon, the codon can be changed to any of the corresponding codons described above without changing the encoded polypeptide. Such nucleic acid variations are "silent variations," which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. The skilled artisan will recognize that every codon in a nucleic acid (except AUG, which is usually the only codon encoding methionine) can be modified to produce a functionally identical molecule. Accordingly, every silent variation of a nucleic acid which encodes a polypeptide is implicit in every described sequence.
对氨基酸序列而言,技术人员将认识到,对核酸的个别取代、缺失或增加,或对肽、多肽或蛋白质序列进行的在被编码的序列中改变单个氨基酸或小百分比(即,小于20%,例如15%、10%、5%、4%、3%、2%或1%)的氨基酸的取代,是“经保守修饰的变异体”,其中的改变是氨基酸被化学上相似的氨基酸所取代。提供功能上相似的氨基酸的保守取代表在本领域内是公知的。With respect to amino acid sequences, the skilled artisan will recognize that individual substitutions, deletions or additions to nucleic acids, or to peptide, polypeptide or protein sequences, change single amino acids or small percentages (i.e., less than 20%) in the encoded sequence , such as 15%, 10%, 5%, 4%, 3%, 2%, or 1%) of amino acid substitutions are "conservatively modified variants" in which the amino acid is replaced by a chemically similar amino acid replace. Conservative substitution tables providing functionally similar amino acids are well known in the art.
下述六组中的每一组都含有可以互相保守取代的氨基酸:Each of the following six groups contains amino acids that may be conservatively substituted for each other:
丙氨酸(A)、丝氨酸(S)、苏氨酸(T);Alanine (A), Serine (S), Threonine (T);
天冬氨酸(D)、谷氨酸(E);Aspartic acid (D), glutamic acid (E);
天冬酰胺(N)、谷氨酰胺(Q);Asparagine (N), Glutamine (Q);
精氨酸(R)、赖氨酸(K);Arginine (R), Lysine (K);
异亮氨酸(I)、亮氨酸(L)、甲硫氨酸(M)、缬氨酸(V);以及Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and
苯丙氨酸(F)、酪氨酸(Y)、色氨酸(W)。(见,例如,Creighton,Proteins(1984))。Phenylalanine (F), Tyrosine (Y), Tryptophan (W). (See, eg, Creighton, Proteins (1984)).
两条核酸序列或多肽实质上相同的标志是:第一条核酸编码的多肽可与因对抗第二条核酸编码的多肽而产生的抗体进行免疫交叉反应。因此,例如,当两条肽仅仅只由于保守取代而有不同时,则典型地,其中一条多肽与第二条多肽实质上相同。两条核酸序列实质上相同的另一种标志是:这两个分子或其互补体在如下文所述的严谨条件下能相互杂交。The sign that two nucleic acid sequences or polypeptides are substantially identical is that the polypeptide encoded by the first nucleic acid can perform immunological cross-reaction with the antibody raised against the polypeptide encoded by the second nucleic acid. Thus, for example, when two peptides differ only by conservative substitutions, then typically one of the polypeptides is substantially identical to the second polypeptide. Another indication that two nucleic acid sequences are substantially identical is that the two molecules, or their complements, hybridize to each other under stringent conditions as described below.
短语“与……特异性杂交”指:在严谨杂交条件下,当序列存在于复杂混合物(例如,细胞总DNA或RNA,或文库DNA或RNA)中时,分子仅与特定的核苷酸序列结合、形成双链或杂交。The phrase "hybridizes specifically to" means that, under stringent hybridization conditions, a molecule only binds to a specific nucleotide sequence when the sequence is present in a complex mixture (e.g., total cellular DNA or RNA, or library DNA or RNA) Combine, form double strands or hybridize.
短语“严谨杂交条件”指探针将与其目标序列而不与其它序列杂交的条件,典型地,其目标序列处于核酸序列的复杂混合物中。严谨条件是取决于序列的,在不同的情况下也有所不同。较长的序列会在较高的温度下特异性杂交。通常,高严谨度条件被选择为:比在给定的离子强度和pH下特异性序列的热融温度(Tm)低大约5-10℃。低严谨度条件通常被选择为比Tm低大约15-30℃。Tm是:(在给定的离子强度、pH和核酸浓度下,)平衡时,互补于目标的探针中的50%与目标序列杂交的温度(目标序列过量存在时,Tm处,平衡时50%的探针被结合)。严谨条件将是下述条件:盐浓度低于大约1.0M的钠离子,典型地,大约0.01至1.0M的钠离子浓度(或其它盐),pH7.0至8.3,对短探针(例如,10至50个核苷酸)而言温度为至少大约30℃,对长探针(例如,多于50个核苷酸)而言,温度为至少大约60℃。严谨条件还可以通过加入去稳定试剂,例如甲酰胺来获得。对选择性或特异性杂交而言,阳性信号是背景的至少两倍,优选10倍于背景杂交。The phrase "stringent hybridization conditions" refers to conditions under which a probe will hybridize to its target sequence, typically in a complex mixture of nucleic acid sequences, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. Generally, high stringency conditions are selected to be about 5-10°C lower than the melting temperature ( Tm ) for the specific sequence at a given ionic strength and pH. Low stringency conditions are typically selected to be about 15-30°C lower than the Tm . Tm is: (at a given ionic strength, pH and nucleic acid concentration,) at equilibrium, the temperature at which 50% of the probes complementary to the target hybridize to the target sequence (when the target sequence is present in excess, at Tm, at equilibrium 50% of the probes were bound). Stringent conditions will be the following: a salt concentration of less than about 1.0 M sodium ion, typically a sodium ion concentration (or other salt) of about 0.01 to 1.0 M, pH 7.0 to 8.3, for short probes (e.g., 10 to 50 nucleotides), the temperature is at least about 30°C, and for long probes (eg, greater than 50 nucleotides), the temperature is at least about 60°C. Stringent conditions can also be achieved by the addition of destabilizing agents, such as formamide. For selective or specific hybridization, a positive signal is at least two times background, preferably 10 times background hybridization.
如果在严谨条件下不能相互杂交的核酸所编码的多肽是实质上相同的,那么这样的核酸也仍然是实质上相同的。这会发生于,例如,用由遗传密码允许的最大密码子简并度产生核酸拷贝的情况下。在此类情况下,典型地,核酸在中度严谨杂交条件下杂交。Nucleic acids that do not hybridize to each other under stringent conditions are also substantially identical if the polypeptides encoded by such nucleic acids are substantially identical. This would occur, for example, if a copy of the nucleic acid was created with the maximum codon degeneracy permitted by the genetic code. In such cases, typically, the nucleic acids hybridize under moderately stringent hybridization conditions.
在本发明中,用本文中公开的核酸序列,在严谨条件下进行标准的Southern印迹实验,可鉴定出含有本发明核酸的基因组DNA(gDNA)或cDNA。就此种公开的目的而言,适合于此类杂交的严谨条件包括:在37℃下在40%甲酰胺、1M NaCl、1%十二烷基硫酸钠(SDS)缓冲液中杂交,在至少大约50℃的温度下于0.2 X SSC中洗至少一次,20分钟,温度通常是在大约55℃至大约60℃;或是相当的条件。阳性杂交至少是背景的两倍。普通技术人员很容易认识到,其它杂交及洗涤条件可用于提供具有相似严谨度的条件。In the present invention, the genomic DNA (gDNA) or cDNA containing the nucleic acid of the present invention can be identified by performing standard Southern blot experiments under stringent conditions using the nucleic acid sequences disclosed herein. For the purposes of this disclosure, stringent conditions suitable for such hybridization include hybridization at 37°C in 40% formamide, 1M NaCl, 1% sodium dodecyl sulfate (SDS) buffer at least about Wash at least once in 0.2X SSC for 20 minutes at a temperature of 50°C, usually between about 55°C and about 60°C; or equivalent conditions. Positive hybridization is at least twice background. One of ordinary skill will readily recognize that other hybridization and wash conditions can be used to provide conditions of similar stringency.
关于两条多核苷酸相同的另一种标志是:通过一对寡核苷酸引物扩增出来的参照序列,在严谨杂交条件下可被用作为探针,以从cDNA或基因组文库中分离出待测序列,或在northern或Southern印迹实验中鉴定出待测序列。Another indication that two polynucleotides are identical is that a reference sequence amplified by a pair of oligonucleotide primers can be used as a probe under stringent hybridization conditions to isolate cDNA or genomic libraries. The sequence to be tested, or the sequence to be tested was identified in a northern or Southern blot.
本发明还包括本文所定义的表达载体。表达载体包括上文列出的多核苷酸序列中任一种的一个或多个拷贝。本发明的表达载体可以含有本文定义的任何多核苷酸序列。The invention also includes expression vectors as defined herein. Expression vectors include one or more copies of any of the polynucleotide sequences listed above. An expression vector of the invention may contain any polynucleotide sequence as defined herein.
因此,在另一种实施方式中,本发明提供了一种表达载体,所述载体包含编码泛酸内酯水解酶的基因的核苷酸序列,所述泛酸内酯水解酶来自马肝脏、Bacillus subtilis、A.niger ATCC 9142、A.niger awamori ATCC38854或A.niger MacRae ATCC46951;或者所述载体包含编码泛酸内酯水解酶的基因的核苷酸序列,所述泛酸内酯水解酶来自马肝脏,如SEQID NO:2所示,来自Bacillus subtilis,如SEQ ID NO:4所示,来自A.niger ATCC9142,如SEQ ID NO:8所示,来自A.niger awamori ATCC38854,如SEQ ID NO:10所示,以及来自A.niger MacRae ATCC46951,部分如SEQ ID NO:6所示。Therefore, in another embodiment, the present invention provides an expression vector comprising the nucleotide sequence of a gene encoding pantothenate hydrolase from horse liver, Bacillus subtilis , A.niger ATCC 9142, A.niger awamori ATCC38854 or A.niger MacRae ATCC46951; or the vector contains the nucleotide sequence of a gene encoding pantothenate hydrolase derived from horse liver, such as As shown in SEQ ID NO: 2, from Bacillus subtilis, as shown in SEQ ID NO: 4, from A. niger ATCC9142, as shown in SEQ ID NO: 8, from A. niger awamori ATCC38854, as shown in SEQ ID NO: 10 , and from A. niger MacRae ATCC46951, shown in part in SEQ ID NO:6.
可选地,表达载体中的多核苷酸序列可与上文定义的表达控制序列可操作地连接。Alternatively, the polynucleotide sequence in the expression vector may be operably linked to the expression control sequences defined above.
在本文中使用的短语“表达载体”是可复制的(replicatable)载体,其带有编码本文列出的多核苷酸序列的DNA序列,并能调控这些DNA序列的表达。在本文上下文中,术语“可复制的”指,在引入载体的给定类型的宿主细胞中,载体可被复制。在感兴趣的多核苷酸序列上游接近该多核苷酸的地方,可能会提供有编码信号肽的序列,其存在确保了带有载体的宿主细胞所表达的被编码的多肽得以分泌。信号序列可以是与被选出的多核苷酸序列天然相连接的,或者可以来自另一种来源。The phrase "expression vector" as used herein is a replicatable vector carrying DNA sequences encoding the polynucleotide sequences set forth herein and capable of regulating the expression of these DNA sequences. In the present context, the term "replicable" means that the vector can be replicated in a given type of host cell into which the vector is introduced. Near the polynucleotide sequence upstream of the polynucleotide sequence of interest, a sequence encoding a signal peptide may be provided, the presence of which ensures secretion of the encoded polypeptide expressed by the host cell carrying the vector. The signal sequence may be naturally associated with the selected polynucleotide sequence, or may be derived from another source.
载体可以是传统用于重组DNA工艺的任何载体,对载体的选择将通常取决于其将被引入的宿主细胞。因此,载体可以是自主复制载体,即,作为染色体外物质存在的载体,其复制独立于染色体复制;此种载体的例子是质粒、噬菌体、粘粒或迷你染色体。另外,载体可以是下述载体,当其被引入到宿主细胞中时,会整合到宿主细胞基因组中,并与其被整合到的宿主细胞基因组一起复制。本发明的表达载体可以带有本发明的任何DNA序列,可被用于表达本发明的任何多肽。The vector can be any vector conventionally used in recombinant DNA techniques, the choice of which will generally depend on the host cell into which it is to be introduced. Thus, a vector may be an autonomously replicating vector, ie a vector that exists as extrachromosomal material that replicates independently of chromosomal replication; examples of such vectors are plasmids, phages, cosmids or minichromosomes. In addition, the vector may be a vector which, when introduced into a host cell, is integrated into the host cell genome and replicated together with the host cell genome into which it has been integrated. The expression vector of the present invention can carry any DNA sequence of the present invention, and can be used to express any polypeptide of the present invention.
本发明还包括经培养的细胞或宿主细胞,其中含有一条或多条本文公开的多核苷酸序列和/或一个或多个本文公开的表达载体。在本文中使用的“经培养的细胞”包括,任何能够在给定条件下生长,并能表达本发明的多核苷酸编码的一种或多种多肽的细胞。优选地,经培养的细胞是酵母、真菌、细菌或藻类。更优选地,经培养的细胞是Escherichia coli、Saccharomyces cerevisiae、Pichia pastoris、Aspergillus niger或Bacillussubtilis。The invention also includes cultured cells or host cells containing one or more polynucleotide sequences disclosed herein and/or one or more expression vectors disclosed herein. "Cultured cells" as used herein include any cells capable of growing under given conditions and expressing one or more polypeptides encoded by the polynucleotides of the present invention. Preferably, the cultured cells are yeast, fungi, bacteria or algae. More preferably, the cultured cells are Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris, Aspergillus niger or Bacillus subtilis.
在另一种实施方式中,本发明提供了从马肝脏、A.niger ATCC 9142、A.niger awamori ATCC 38854或A.niger MacRae ATCC 46951中获得的泛酸内酯水解酶。In another embodiment, the present invention provides pantothenate lactone hydrolase obtained from horse liver, A. niger ATCC 9142, A. niger awamori ATCC 38854 or A. niger MacRae ATCC 46951.
在又一种实施方式中,本发明提供了一种泛酸内酯水解酶,其氨基酸序列包含:当来自马肝脏时,SEQ ID NO:2所示的氨基酸序列;当来自A.niger MacRae ATCC 46951时,SEQ ID NO:6所示的氨基酸序列;当来自A.niger ATCC 9142时,SEQ ID NO:8所示的氨基酸序列;当来自A.niger awamori ATCC 38854时,SEQ ID NO:10所示的氨基酸序列。In yet another embodiment, the present invention provides a pantothenate lactone hydrolase, the amino acid sequence of which comprises: when derived from horse liver, the amino acid sequence shown in SEQ ID NO: 2; when derived from A. niger MacRae ATCC 46951 When from A. niger ATCC 9142, the amino acid sequence shown in SEQ ID NO: 8; when from A.niger awamori ATCC 38854, the amino acid sequence shown in SEQ ID NO: 10 amino acid sequence.
SEQ ID NO:2所示的来自马肝脏的和SEQ ID NO:4所示的来自Bacillus subtilis的泛酸内酯水解酶显示S-泛酸内酯水解酶活性。SEQ IDNO:8所示的来自A.niger ATCC 9142的、SEQ ID NO:10所示的来自A.niger awamori ATCC 38854的和部分如SEQ ID NO:6所示的来自A.nigerMacRae ATCC 46951的泛酸内酯水解酶显示R-泛酸内酯水解酶活性。Pantolactone hydrolase from horse liver as shown in SEQ ID NO: 2 and from Bacillus subtilis as shown in SEQ ID NO: 4 exhibits S-pantolactone hydrolase activity. Pantothenic acid from A. niger ATCC 9142 as shown in SEQ ID NO: 8, from A. niger awamori ATCC 38854 as shown in SEQ ID NO: 10 and partially from A. nigerMacRae ATCC 46951 as shown in SEQ ID NO: 6 The lactone hydrolase exhibits R-pantolactone hydrolase activity.
在另一种实施方式中,本发明包含上述氨基酸序列的片段,所述片段具有完整蛋白质的酶活性。In another embodiment, the present invention comprises a fragment of the above amino acid sequence, which fragment has the enzymatic activity of the complete protein.
根据本发明的“氨基酸序列的片段”可被定义为缺少一段氨基酸的序列,该段氨基酸对蛋白质正确折叠来说可能是必要的。然而,一旦蛋白质被折叠,该段氨基酸就可被删除,而不会破坏单个蛋白质的功能。这些段可以存在于被折叠的蛋白质中,例如环,或者可以是与被折叠的蛋白质的活性不直接相关的N-或C-末端序列。A "fragment of an amino acid sequence" according to the present invention may be defined as a sequence lacking a stretch of amino acids which may be essential for the correct folding of a protein. However, once the protein is folded, this stretch of amino acids can be deleted without disrupting the individual protein's function. These segments may be present in the folded protein, such as loops, or may be N- or C-terminal sequences not directly related to the activity of the folded protein.
此外,本发明包括包含所述R-和S-泛酸内酯水解酶活性的蛋白质的衍生物。所述衍生物包括被固定的酶,例如通过内含(inclusion)被固定,其中所述的酶被保留于膜装置上,例如中空纤维、聚合网状结构或微囊体。还可能通过交联或通过产生酶晶体来对酶进行固定。另一种对酶进行固定的可能方法是将其结合到正在生长的多聚物上,例如二氧化硅溶胶凝胶或二氧化硅石墨溶胶凝胶,或者通过将酶结合到预制载体上,例如EupergitC、Eupergit 250L、PEG、Celite和Amberlite XAD-7。对本发明的R-和S-泛酸内酯水解酶适用的另一种固定酶的技术是将其分散在与水无法混溶的有机溶剂中。通常,通过本领域内已知的几乎所有固定技术,可对具有酶活性的蛋白质,例如本发明的酶进行固定。Furthermore, the present invention includes derivatives of proteins comprising said R- and S-pantolactone hydrolase activity. Said derivatives include immobilized enzymes, for example by inclusion, wherein said enzymes are retained on membrane devices, such as hollow fibers, polymeric networks or microcapsules. It is also possible to immobilize the enzyme by cross-linking or by creating enzyme crystals. Another possibility for the immobilization of enzymes is to bind them to growing polymers, such as silica sol-gels or silica-graphite sol-gels, or by binding enzymes to prefabricated supports such as Eupergit C, Eupergit 250L, PEG, Celite, and Amberlite XAD-7. Another enzyme immobilization technique suitable for the R- and S-pantolactone hydrolases of the present invention is to disperse them in a water-immiscible organic solvent. In general, proteins with enzymatic activity, such as the enzymes of the invention, can be immobilized by almost any immobilization technique known in the art.
在又一种实施方式中,本发明涉及用经分离的泛酸内酯水解酶选择性水解R-或S-泛酸内酯水解酶的用途,这可用于对泛酸内酯的R-对映异构体的工业生产,泛酸内酯的R-对映异构体是生产R-泛酸和R-泛酰醇的前体。In yet another embodiment, the present invention relates to the use of the isolated pantothenolactone hydrolase for the selective hydrolysis of R- or S-pantolactone hydrolase, which can be used for the R-enantiomerization of pantolactone The R-enantiomer of pantolactone is the precursor for the production of R-pantothenic acid and R-panthenol.
因此,本发明的一个目的是提供用本文所述的经分离的泛酸内酯水解酶来对R-或S-泛酸内酯进行选择性水解的方法或途径。Accordingly, it is an object of the present invention to provide a method or route for the selective hydrolysis of R- or S-pantolactone using the isolated pantolactone hydrolase described herein.
在另一种实施方式中,本发明提供了生产R-泛酸或其盐或R-泛酰醇的方法,其中包括在本发明的泛酸内酯水解酶存在下对R-或S-泛酸内酯进行选择性水解的步骤。In another embodiment, the present invention provides a method for producing R-pantothenic acid or a salt thereof or R-panthenol, which comprises degrading R- or S-pantothenolactone in the presence of the pantothenolactone hydrolase of the present invention A step of selective hydrolysis is carried out.
在另一种实施方式中,本发明提供了克隆与已知序列直接相邻的核苷酸序列的方法,例如编码新颖的泛酸内酯水解酶或其部分的核苷酸序列。In another embodiment, the present invention provides a method for cloning a nucleotide sequence immediately adjacent to a known sequence, such as a nucleotide sequence encoding a novel pantolactone hydrolase or a portion thereof.
具体而言,本发明包括能扩增未知序列的新的PCR策略。所述新的PCR策略包括在第一条和第二条DNA链的合成中使用同一种引物。该引物能与将被扩增的gDNA的已经已知的区域杂交。该引物显示与已知序列100%的同源性。在该引物的帮助下,进行第一次PCR,提供基因组DNA的第一条单链拷贝,到达未知序列。用一部分第一次PCR反应产物作为模板来进行第二次PCR,其中使用的(嵌套)引物能与定位于第一次PCR中所用的引物的3’方向上的DNA片段杂交,但逻辑上仍处于序列的已知部分。该嵌套引物与第一次聚合酶反应产生的单链DNA进行随机杂交之后,产生该单链DNA的反义链,其自身作为用于第二次PCR、产生第二条链的同样的引物的模板。在最后,起始于已知序列内部,延伸于邻近的基因组DNA或延伸于缺失部分序列的任何其它DNA的DNA片段通过与新片段两端都能杂交的一种引物被制造出来。Specifically, the present invention includes novel PCR strategies capable of amplifying unknown sequences. The new PCR strategy involves using the same primers for the synthesis of the first and second DNA strands. The primers hybridize to already known regions of the gDNA to be amplified. This primer showed 100% homology to the known sequence. With the help of this primer, the first PCR is performed, providing the first single-stranded copy of the genomic DNA, reaching the unknown sequence. Using a portion of the product of the first PCR reaction as a template to perform a second PCR using (nested) primers that hybridize to DNA fragments positioned 3' to the primers used in the first PCR, but logically Still in a known part of the sequence. After random hybridization of this nested primer with the single-stranded DNA produced by the first polymerase reaction, the antisense strand of this single-stranded DNA is generated, which itself serves as the same primer used in the second PCR to generate the second strand template. In the end, DNA fragments starting within the known sequence, extending to adjacent genomic DNA or to any other DNA with missing parts of the sequence, are produced with a primer that hybridizes to both ends of the new fragment.
因此,在另一种实施方式中,本发明提供了克隆与已知序列直接相邻的核苷酸序列的方法,所述方法包括PCR扩增,所述扩增包括:Accordingly, in another embodiment, the present invention provides a method of cloning a nucleotide sequence immediately adjacent to a known sequence, said method comprising PCR amplification comprising:
a)用第一次PCR的单链DNA作为模板进行第二次PCR,在所述第二次PCR中用能与第一次PCR产生的单链DNA随机杂交的引物;以及a) using the single-stranded DNA of the first PCR as a template to perform a second PCR in which primers capable of randomly hybridizing to the single-stranded DNA produced by the first PCR are used; and
b)由此,仍用步骤a)中的随机杂交引物,产生自身作为第二条链模板的反义链。b) Thus, still using the random hybridization primers from step a), an antisense strand is generated which itself serves as a template for the second strand.
下述实施例对编码泛酸内酯水解酶基因的鉴定、表征和表达进行了详细的描述。这些实施例仅是示例性的,而非欲以任何方式对本发明的范围加以限制。The following examples describe in detail the identification, characterization and expression of genes encoding pantolactone hydrolase. These examples are illustrative only and are not intended to limit the scope of the invention in any way.
实施例1从可商购的马肝脏酯酶丙酮粉末来纯化(S)-泛酸内酯水解活性Example 1 Purification of (S)-pantolactone hydrolytic activity from commercially available horse liver esterase acetone powder
将1g马肝脏酯酶丙酮粉末(Fluka Holding AG,Industriestrasse 25,P.O.Box 260,CH-9471 Buchs,瑞士)溶于40ml 50mM的Tris/HCl中,pH7.5,其中含有2M硫酸铵、1mM CaCl2、1mM MgCl2和10μM EDTA(缓冲液A)。通过离心分离出未溶解的物质,之后,用0.45μM过滤器进行过滤灭菌。将蛋白质溶液上样到用缓冲液A平衡过的丁基琼脂糖柱HR26/10(Amersham Pharmacia Biotech Europe GmbH,Dübendorf,瑞士)中。用100ml缓冲液A洗该柱。之后应用400ml从0至100%线性梯度的缓冲液B(没有硫酸铵的缓冲液A)。内酯酶在梯度中间被洗脱出来。收集显示泛酸内酯水解活性的片断,将缓冲液换为pH8.0的20mMTris/HCl、1mM CaCl2、1mM MgCl2以及10μM EDTA。收集的片断上样到MonoQ-预填(pre-packed)柱HR 26/10(Amersham Pharmacia BiotechEurope GmbH,Dübendorf,瑞士)中,应用0至350mM NaCl的线性梯度。蛋白质在该梯度的开始即被洗脱出来。再次收集活性片断,浓缩到1ml,上样至Superdex 200柱(Amersham Pharmacia Biotech Europe GmbH,Dübendorf,瑞士)上。用pH8.0的50mM Tris/HCl、1mM CaCl2、1mMMgCl2以及10μM EDTA作为洗脱缓冲液,内酯酶作为最后一个峰被洗脱出来,其显示大约31kDa的分子大小,与SDS-PAGE所示的尺寸相符。1 g of horse liver esterase acetone powder (Fluka Holding AG, Industriestrasse 25, POBox 260, CH-9471 Buchs, Switzerland) was dissolved in 40 ml of 50 mM Tris/HCl, pH 7.5, containing 2 M ammonium sulfate, 1 mM CaCl 2 , 1 mM MgCl 2 and 10 μM EDTA (buffer A). Undissolved material was separated by centrifugation, after which filter sterilization was performed with a 0.45 μM filter. The protein solution was loaded onto a Butyl Sepharose column HR26/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) equilibrated with buffer A. The column was washed with 100 ml of buffer A. Then 400 ml of a linear gradient from 0 to 100% of buffer B (buffer A without ammonium sulfate) was applied. Lactonase was eluted in the middle of the gradient. Fragments showing pantolactone hydrolytic activity were collected, and the buffer was exchanged with 20 mM Tris/HCl, pH 8.0, 1 mM CaCl 2 , 1 mM MgCl 2 , and 10 μM EDTA. The pooled fragments were loaded onto a MonoQ-pre-packed column HR 26/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) applying a linear gradient of 0 to 350 mM NaCl. Proteins were eluted at the beginning of the gradient. The active fragments were collected again, concentrated to 1 ml, and loaded onto a Superdex 200 column (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). Using 50 mM Tris/HCl, pH 8.0, 1 mM CaCl 2 , 1 mM MgCl 2 , and 10 μM EDTA as the elution buffer, lactonase was eluted as the last peak, which showed a molecular size of about 31 kDa, which was consistent with that of SDS-PAGE. match the dimensions shown.
通过在37℃将酶于pH9.0的200mM Tris/乙酸缓冲液、5mM MgCl2中培养60分钟进行测定,发现最高比活性为13.3U/(mg蛋白质)。通过HPLC来测定(S)-和(R)-泛酸内酯的浓度。The highest specific activity was found to be 13.3 U/(mg protein) as determined by incubating the enzyme in 200 mM Tris/acetate buffer, pH 9.0, 5 mM MgCl 2 at 37° C. for 60 minutes. (S)- and (R)-pantolactone concentrations were determined by HPLC.
实施例2对来自马肝脏的(S)-泛酸内酯水解酶进行初步的表征Example 2 Preliminary characterization of (S)-pantolactone hydrolase from horse liver
(a)分子大小、结构以及等电点:(a) Molecular size, structure and isoelectric point:
根据SDA-PAGE,变性蛋白具有大约35kDa的分子量。用经校正的凝胶过滤柱的洗脱时间,天然蛋白质的分子量被计算为大约31kDa。这意味着来自马肝脏的内酯酶是30至35kDa的单体。用2D凝胶,等电点被测定为pH5.5的范围内。According to SDA-PAGE, the denatured protein has a molecular weight of approximately 35 kDa. Using corrected elution times from the gel filtration column, the molecular weight of the native protein was calculated to be approximately 31 kDa. This means that lactonase from horse liver is a 30 to 35 kDa monomer. With 2D gels, the isoelectric point was determined to be in the range of pH 5.5.
(b)N-末端和内部氨基酸测序:(b) N-terminal and internal amino acid sequencing:
用胰蛋白酶对2D凝胶电泳之后富集的蛋白质进行消化。通过质谱对产生的肽进行分析。SEQ NO:11中示出了肽No.66的N-末端序列。Proteins enriched after 2D gel electrophoresis were digested with trypsin. The resulting peptides were analyzed by mass spectrometry. The N-terminal sequence of peptide No. 66 is shown in SEQ NO: 11.
该序列与MS-谱一起显示,该蛋白质属于已知为衰老标记蛋白-30或钙调蛋白(regucalcins)的蛋白质家族,其主要被发现于脊椎动物中。来自大鼠、小鼠、人、鸡、兔、牛、Xenopus laevis的钙调蛋白序列,以及来自Drosophila melanogaster的两条较少同源性的序列已被测定。然而,来自马的该基因还未被分离及测序。The sequence, together with the MS-spectrum, revealed that the protein belongs to a family of proteins known as senescence marker protein-30 or regucalcins, which are mainly found in vertebrates. Calmodulin sequences from rat, mouse, human, chicken, rabbit, bovine, Xenopus laevis, and two less homologous sequences from Drosophila melanogaster have been determined. However, this gene from horses has not been isolated and sequenced.
(c)金属依赖性:(c) Metal dependence:
为测定不同二价金属对马肝脏酶的泛酸内酯酶活性的影响,将1.3、2.6、5.2或10.4mM的Ca2+、Mg2+或Mn2+加入到由pH9.0的200mM Tris/乙酸盐和50mM(R,S)-泛酸内酯组成的反应混合物中。在37℃对反应混合物进行60分钟的培养,用等体积的含有2mM EDTA的甲醇来终止反应。通过HPLC测定(S)-泛酸内酯的减少。钙令酶活性降低,而镁和锰则使酶活性增高。To determine the effect of different divalent metals on the pantothenate lactonase activity of horse liver enzymes, 1.3, 2.6, 5.2 or 10.4 mM Ca 2+ , Mg 2+ or Mn 2+ were added to 200 mM Tris/ Acetate and 50mM (R, S)-pantothenolactone in the reaction mixture. The reaction mixture was incubated at 37°C for 60 minutes and terminated with an equal volume of methanol containing 2 mM EDTA. The reduction of (S)-pantolactone was determined by HPLC. Calcium reduces enzyme activity, while magnesium and manganese increase enzyme activity.
(d)热稳定性:(d) thermal stability:
在30至80℃的温度下,将经纯化的酶在5.2mM MgCl2和pH9.0的200mM Tris/乙酸盐中培养15分钟。冰上30分钟之后,用标准试验方法来测定活性(60分钟,37℃)。直到60℃酶依然是稳定的。其在65℃开始失活,在70℃丢失超过50%的活性。The purified enzyme was incubated in 5.2 mM MgCl 2 and 200 mM Tris/acetate pH 9.0 at a temperature of 30 to 80° C. for 15 minutes. After 30 minutes on ice, activity was determined using standard assay methods (60 minutes, 37°C). The enzyme remains stable up to 60°C. It starts to inactivate at 65°C and loses more than 50% of its activity at 70°C.
实施例3克隆来自马肝脏的(S)-泛酸内酯酶Example 3 Cloning of (S)-pantothenate lactonase from horse liver
用来自所有已知哺乳动物基因(见下文)的序列信息来设计引物,用于针对从来自Bavaria的一年龄“Hafflinger”的马肝脏制得的cDNA进行的PCR,所述cDNA由Roche Molecular Biochemicals,Penzberg,德国友情提供。用QIAGEN RNeasy Maxi Kit(Qiagen,Hilden,德国)来制备来自肝脏组织的总RNA。Sequence information from all known mammalian genes (see below) was used to design primers for PCR against cDNA prepared from a one-year-old "Hafflinger" horse liver from Bavaria supplied by Roche Molecular Biochemicals, Courtesy of Penzberg, Germany. Total RNA from liver tissue was prepared with the QIAGEN RNeasy Maxi Kit (Qiagen, Hilden, Germany).
马肝脏内酯酶与钙调蛋白家族其它成员之间的同源性显示于表1中。测定是通过GCG程序包中的GAP程序,使用标准参数来测定的。相似性的数字显示于表中对角线的右边,而相同性的数字显示于对角线左边。The homology between horse liver lactonase and other members of the calmodulin family is shown in Table 1. Assays were performed by the GAP program in the GCG package, using standard parameters. Numbers for similarity are shown to the right of the diagonal in the table, and numbers for identity are shown to the left of the diagonal.
表1:钙调蛋白家族中的氨基酸序列相似性和相同性Table 1: Amino acid sequence similarities and identities within the calmodulin family
鸡:鸡;X.laev:Xenopus laevis;D.mela:Drosophila melanogasterChicken: Chicken; X.laev: Xenopus laevis; D.mela: Drosophila melanogaster
用Titan One Tube RT-PCT Kit(Roche Molecular Biochemicals,Penzberg,德国)和覆盖了起始密码子及终止密码子上游50bp序列的同源引物,利用RT-PCT扩增出885bp的片段(SEQ ID NO:1的1-885位核苷酸)。反应按照厂商推荐的方法进行。Using Titan One Tube RT-PCT Kit (Roche Molecular Biochemicals, Penzberg, Germany) and homologous primers covering the upstream 50bp sequence of the start codon and stop codon, a fragment of 885bp was amplified by RT-PCT (SEQ ID NO : 1-885 nucleotides of 1). The reaction was carried out according to the method recommended by the manufacturer.
用于第一次PCR的N-末端5’引物(SEQ ID NO:12)和C-末端3’-引物(3’-CAGTTTCCTTAAGGGGGGATA-5’)(SEQ ID NO:13)的构建基于来自牛(SEQ ID NO:14)、兔(SEQ ID NO:15)、人(SEQ IDNO:16)、大鼠(SEQ ID NO:17)、小鼠(SEQ ID NO:18)、鸡(SEQ ID NO:19)和Xenopus(SEQ ID NO:20)的N-末端序列以及来自牛(SEQ ID NO:21)、兔(SEQ ID NO:22)、人(SEQ ID NO:23)、大鼠(SEQ ID NO:24)、小鼠(SEQ ID NO:25)、鸡(SEQ IDNO:26)和Xenopus(SEQ ID NO:27)的C-末端序列。The N-terminal 5' primer (SEQ ID NO: 12) and the C-terminal 3'-primer (3'-CAGTTTCCTTAAGGGGGGATA-5') (SEQ ID NO: 13) for the first PCR were constructed based on SEQ ID NO: 14), rabbit (SEQ ID NO: 15), human (SEQ ID NO: 16), rat (SEQ ID NO: 17), mouse (SEQ ID NO: 18), chicken (SEQ ID NO: 19) and the N-terminal sequence of Xenopus (SEQ ID NO: 20) and from bovine (SEQ ID NO: 21), rabbit (SEQ ID NO: 22), human (SEQ ID NO: 23), rat (SEQ ID NO: 24), mouse (SEQ ID NO: 25), chicken (SEQ ID NO: 26) and the C-terminal sequence of Xenopus (SEQ ID NO: 27).
将该片段克隆到TA克隆载体(Invitrogen,Carlsberg,CA,USA)中。测序证明其来自马肝脏钙调蛋白。通过被称为5’/3’RACE的方法,用Roche Molecular Biochemicals,Penzberg,德国提供的单独的试剂盒,分离出缺失的5’-末端:用完美匹配的引物(SEQ ID NO:28)从经分离的马肝脏cDNA生产单链DNA。纯化该单链DNA,加上polyC尾巴,用第一条引物5’端的嵌套引物、polyG引物来进行另一次PCR。代表着基因的缺失的5’-末端的DNA片段被扩增出来。用根据已经已知的序列设计的正向引物(SEQ ID NO:29)和主要根据牛和兔基因设计的起始于基因最后一个氨基酸的引物(SEQ ID NO:30)来克隆出3’-末端。在最后一次PCR中,用获得的序列信息来设计覆盖基因起始和终止密码子的5’-和3’-引物,按照上文所述从总RNA中分离出内酯酶的全长cDNA。该cDNA和氨基酸序列展示于SEQ ID NO:1和SEQ ID NO:2中。该基因由900bp/299个氨基酸构成。This fragment was cloned into the TA cloning vector (Invitrogen, Carlsberg, CA, USA). Sequencing proved it to be from horse liver calmodulin. The missing 5'-end was isolated by a method called 5'/3'RACE with a separate kit provided by Roche Molecular Biochemicals, Penzberg, Germany: with a perfectly matched primer (SEQ ID NO: 28) from Single-stranded DNA was produced from isolated horse liver cDNA. The single-stranded DNA is purified, polyC tails are added, and another PCR is performed using the nested primer at the 5' end of the first primer, the polyG primer. A DNA fragment representing the deleted 5'-end of the gene is amplified. The 3'- end. In the final PCR, the obtained sequence information was used to design 5'- and 3'-primers covering the start and stop codons of the gene, and the full-length cDNA of lactonase was isolated from the total RNA as described above. The cDNA and amino acid sequences are shown in SEQ ID NO: 1 and SEQ ID NO: 2. The gene consists of 900bp/299 amino acids.
用一种商业途径可获得的表达载体,例如来自Qiagen的pQE60,在E.coli中表达完整的基因,其C-末端与His标签融合。按照标准程序对加上His标签的蛋白质进行表达和纯化,产生出显示(S)-泛酸内酯酶活性的33kDa的蛋白质。A commercially available expression vector, eg pQE60 from Qiagen, was used to express the complete gene in E. coli with a His-tag fused at its C-terminus. Expression and purification of the His-tagged protein following standard procedures yielded a 33 kDa protein exhibiting (S)-pantolactonase activity.
实施例4由可商购的A.niger脂肪酶A粗制剂来纯化(R)-泛酸内酯水解活Example 4 Purification of (R)-pantolactone hydrolysis activity by commercially available A.niger lipase A crude preparation 性sex
在可从商业途径获得的不同种的由A.niger发酵获得的酶产品中,鉴定出了特异性水解(R)-泛酸内酯的活性。为纯化具体的酶,使用了来自Amano Pharmaceuticals Co.Ltd.,Nagoya,Japan的被称为Lipase A“Amano”的商业制剂。根据从与内酯酶共同纯化的两个酶获得的序列数据,该制剂来自经冻干和破碎的过量表达来自A.tubigensis的脂肪酶的A.awamori细胞。将该材料溶于pH7.5的20mM Tris/HCl、2.5mM的MgCl2(标准缓冲液)中。加入5%的乙醇,以增加材料的溶解度。通过离心(30分钟,15000rpm,4℃)以及接着的过滤(用0.45μm的过滤器),分离出无法进入到溶液中的材料。先在Amicon cell超滤器(50kDa分离点)中通过超滤对已溶解的部分进行第一次纯化,以去除任何盐类和小分子化合物,它们会扰乱后续的阴离子交换色谱步骤。然后将蛋白质上样至HR26/10预填Q-琼脂糖凝胶柱(Amersham Pharmacia Biotech EuropeGmbH,Dübendorf,瑞士)。用200ml标准缓冲液洗柱,然后通过0至350mM NaCl的线性梯度(400ml)洗脱出蛋白质。含有活性的片断被收集起来。加入硫酸铵,至终浓度为1.5M,将经过处理的蛋白质溶液上样至HR 26/10丁基琼脂糖凝胶预填柱(Amersham Pharmacia Biotech EuropeGmbH,Dübendorf,瑞士)中。用200ml含有1.5M硫酸铵的标准缓冲液洗涤,通过1.5至0M递减的硫酸铵梯度(400ml)分离出蛋白质。此时活性在两个峰被洗脱出,并被分别收集。两次收集物的体积被减少到小于2ml。用标准缓冲液,通过在Superdex 200柱上进行凝胶过滤分离出两种浓缩物。每种分离物的活性片段被收集和浓缩。收集物II的活性趋向于稍早于收集物I被洗脱出来,这表明,相应蛋白质的分子量可能稍高。Among the different commercially available enzyme products obtained from the fermentation of A. niger, specific hydrolysis of (R)-pantothenolactone was identified. For purification of specific enzymes, a commercial preparation known as Lipase A "Amano" from Amano Pharmaceuticals Co. Ltd., Nagoya, Japan was used. The preparation was obtained from lyophilized and disrupted A. awamori cells overexpressing lipase from A. tubigensis based on sequence data obtained from two enzymes co-purified with lactonase. This material was dissolved in 20 mM Tris/HCl, pH 7.5, 2.5 mM MgCl2 (standard buffer). Add 5% ethanol to increase the solubility of the material. Material that did not go into solution was separated by centrifugation (30 min, 15000 rpm, 4°C) followed by filtration (with a 0.45 μm filter). The dissolved fraction was first purified by ultrafiltration on an Amicon cell ultrafilter (50kDa cut point) to remove any salts and small molecules that would disturb the subsequent anion exchange chromatography step. Proteins were then loaded onto HR26/10 prepacked Q-Sepharose columns (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). The column was washed with 200 ml of standard buffer, and then the protein was eluted by a linear gradient (400 ml) of 0 to 350 mM NaCl. Fragments containing activity are pooled. Ammonium sulfate was added to a final concentration of 1.5M, and the treated protein solution was loaded onto a HR 26/10 butyl sepharose prepacked column (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). Proteins were separated by washing with 200 ml of standard buffer containing 1.5 M ammonium sulfate and a gradient of decreasing ammonium sulfate from 1.5 to 0 M (400 ml). At this point the activity elutes in two peaks which are collected separately. The volume of the two pools was reduced to less than 2ml. The two concentrates were separated by gel filtration on a Superdex 200 column with standard buffer. Active fragments from each isolate were pooled and concentrated. The activity of pool II tended to elute slightly earlier than pool I, suggesting that the corresponding protein may be of slightly higher molecular weight.
80U/ml(40℃)是针对收集物I和收集物II分别已测出的最高活性。如2D凝胶所示,蛋白质制剂不完全是同源的,这表示真正的比活性甚至会更高。然而,因为内酯酶活性对应于凝胶过滤洗脱曲线中38kDa的蛋白质条带,也是最终制剂中占优势的蛋白质,该条带最有可能是内酯酶。根据凝胶过滤的结果,天然内酯酶具有75kDa的分子量。这暗示着该酶的同型二聚体结构。80 U/ml (40°C) is the highest activity that has been measured for pool I and pool II respectively. As shown by the 2D gels, the protein preparations were not completely homogeneous, suggesting that the true specific activity could be even higher. However, since lactonase activity corresponds to the 38 kDa protein band in the gel filtration elution profile and is the predominant protein in the final preparation, this band is most likely lactonase. According to the results of gel filtration, the natural lactonase has a molecular weight of 75 kDa. This suggests a homodimeric structure of the enzyme.
实施例5对(R)-泛酸内酯水解酶进行初步的表征Example 5 Preliminary characterization of (R)-pantolactone hydrolase
(a)天然形式的结构和分子量:(a) Structure and molecular weight of native form:
用如实施例4所述的条件,在72ml处,用Superdex 200(AmershamPharmacia Biotech Europe GmbH,Dübendorf,瑞士)洗脱出酶。从标准曲线推断,这反映了75kDa的分子量。SDS-PAGE表明分子量为38kDa。上述数据的组合暗示来自A.niger的(R)-泛酸内酯酶是由两个相同的38kDa亚基组成的同型二聚体。The enzyme was eluted with Superdex 200 (AmershamPharmacia Biotech Europe GmbH, Dübendorf, Switzerland) at 72 ml using conditions as described in Example 4. Extrapolated from the standard curve, this reflects a molecular weight of 75 kDa. SDS-PAGE showed a molecular weight of 38 kDa. The combination of the above data suggests that the (R)-pantolactonase from A. niger is a homodimer composed of two identical 38 kDa subunits.
(b)热稳定性:(b) thermal stability:
在pH 7.5的20mM Tris/HCl中于0、30、40、50、55、60、70℃对50μl经稀释的酶(0.1U)进行一小时的培养。冰上15分钟之后,用标准试验方法测定残余活性。直到50℃酶都是稳定的。在更高的温度下,其开始失活。50 μl of diluted enzyme (0.1 U) were incubated for one hour at 0, 30, 40, 50, 55, 60, 70°C in 20 mM Tris/HCl, pH 7.5. After 15 minutes on ice, residual activity was determined using standard assay methods. The enzyme is stable up to 50°C. At higher temperatures, it begins to deactivate.
(c)最佳温度:(c) Optimum temperature:
采用下述条件,于20、30、40、50、55和60℃,对来自商业来源(见实施例5)的强烈富集的(R)-泛酸内酯酶进行30分钟的培养:Strongly enriched (R)-pantolactonase from a commercial source (see Example 5) was incubated for 30 minutes at 20, 30, 40, 50, 55 and 60°C using the following conditions:
100mM(R,S)-泛酸内酯100mM (R, S)-pantothenate
100mM MgCl2 100mM MgCl2
200mM Tris/HCl,pH 8.0200mM Tris/HCl, pH 8.0
0.05U(R)-泛酸内酯酶,最终体积为10μl。0.05 U(R)-pantolactonase in a final volume of 10 μl.
在所用的条件下,来自商业化A.niger制剂的(R)-泛酸内酯酶的最佳温度为大约50℃。The temperature optimum for (R)-pantolactonase from a commercial A. niger preparation was about 50°C under the conditions used.
(d)辅助因子依赖性和抑制:(d) Cofactor dependence and inhibition:
来自A.niger的(R)-泛酸内酯酶的活性需要Mg2+或Mn2+。加入1mMEDTA会完全抑制其活性。The activity of (R)-pantolactonase from A. niger requires Mg 2+ or Mn 2+ . Adding 1mM EDTA will completely inhibit its activity.
(e)最佳pH:(e) optimal pH:
当pH在7.0至11.0之间变化时,该酶并未显示出“正常的”pH-活性曲线。在40℃,对200mM TEA-乙酸盐缓冲液(pH7.0至11.0)中的10mg Amano Lipase A进行20分钟的培养,所述缓冲液含有0.1%Span 80,20mM乙酸镁。加入等体积的500mM MES缓冲液来终止样品的反应,所述缓冲液pH为5.5,含有50mM EDTA;对样品进行离心(10,000g,10分钟),然后通过HPLC进行分析,其中,用0.46×15cm CHIRADEX柱,用70:30的水-甲醇来洗脱,1ml/分钟,20℃。最高活性在pH为大约10时达到。从这个位置到pH11.0之间,活性并无可观的改变。最高选择性在pH大约9.0时达到。The enzyme did not show a "normal" pH-activity profile when the pH was varied between 7.0 and 11.0. 10 mg Amano Lipase A in 200 mM TEA-acetate buffer (pH 7.0 to 11.0) containing 0.1% Span 80, 20 mM magnesium acetate was incubated for 20 minutes at 40°C. Add an equal volume of 500mM MES buffer to stop the reaction of the sample, the pH of the buffer is 5.5, containing 50mM EDTA; the sample is centrifuged (10,000g, 10 minutes), and then analyzed by HPLC, wherein, with 0.46 × 15cm CHIRADEX column, eluted with 70:30 water-methanol, 1ml/min, 20°C. The highest activity is reached at a pH of about 10. There was no appreciable change in activity from this position to pH 11.0. The highest selectivity is achieved at about pH 9.0.
实施例6培养A.niger ATCC9142、9029、26875、62863和10864,并对Embodiment 6 cultivates A.niger ATCC9142, 9029, 26875, 62863 and 10864, and to 其(R)-泛酸内酯酶活性进行评测Its (R)-pantothenic acid lactonase activity was evaluated
为获知内酯酶活性是否在Aspergillus属中广泛存在,在下述条件下培养五个A.niger菌株:To see if lactonase activity is widespread in the Aspergillus genus, five A. niger strains were grown under the following conditions:
培养基(相对于每升的量):1.2g NaNO3、0.5g KH2PO4、0.2gMgSO4 x 7H2O、0.5g酵母提取物、5%葡萄糖和0.04ml的痕量金属溶液,所述溶液如Vishniac和Santer,Bacteriol.Rev.21:195-213(1957)所述。Medium (relative to volume per liter): 1.2g NaNO 3 , 0.5g KH 2 PO 4 , 0.2g MgSO 4 x 7H 2 O, 0.5g yeast extract, 5% glucose and 0.04ml trace metal solution, all Such solutions are described by Vishniac and Santer, Bacteriol. Rev. 21:195-213 (1957).
用每ml 5×106个孢子于30℃去接种11瓶中的300ml预培养物。8小时后,将预培养物加入到21发酵罐中的1.71培养基中,发酵罐带有pH和溶解氧控制。通过自动加入5M NaOH将pH保持在5.5。在低通气条件(5-10%的空气饱和度)下对细胞进行14小时的培养。之后将溶解氧水平增加至30-50%空气饱和度。5小时后,收获菌丝体,对其进行过滤,用去矿物质水洗两次,冷冻于液氮中。用French Press对其中的一部分直接进行裂解。用标准试验方法对培养基和细胞裂解液进行针对泛酸内酯酶活性的研究。A 300 ml preculture in 11 flasks was inoculated at 30°C with 5 x 10 6 spores per ml. After 8 hours, the preculture was added to 1.71 medium in a 21 fermenter with pH and dissolved oxygen control. The pH was maintained at 5.5 by automatic addition of 5M NaOH. Cells were incubated for 14 hours under hypoaerobic conditions (5-10% air saturation). The dissolved oxygen level was then increased to 30-50% air saturation. After 5 hours, the mycelia were harvested, filtered, washed twice with demineralized water, and frozen in liquid nitrogen. A part of them was directly lysed with a French Press. Media and cell lysates were investigated for pantothenate activity using standard assay methods.
(R)-泛酸内酯酶活性主要被发现于所有被测试的A.niger菌株的裂解液中。在培养基中仅发现了非常小的活性。(R)-Pantolactonase activity was mainly found in the lysates of all tested A. niger strains. Only very little activity was found in the medium.
实施例7对来自脂肪酶制剂的经富集的(R)-泛酸内酯酶进行胰蛋白酶消化Example 7 Trypsin Digestion of Enriched (R)-Pantolactonase from a Lipase Preparation 产生的小肽的氨基酸序列进行测定The amino acid sequence of the resulting small peptide was determined
在2D凝胶上对实施例5的经过富集的制剂进行分析。用胰蛋白酶对两个主要的点进行消化,其分别是33和40kDa,4.7和5.6的pI。通过HPLC来分离产生的肽(设备:Hewlett Packard 1090,柱:Vydac C18(0.21×25cm),210nm处吸收,流速:0.20ml/分钟,缓冲液A:0.075%TFA,缓冲液B:80%乙腈中的0.065%TFA,梯度:2%B(0-10分钟),2-75%B(10-120分钟))。通过Edman降解法来对选出的肽进行测序,点C[pI 5.6,39kDa(Poll C)]:Fxn 42(肽1)(SEQ ID NO:81)、Fxn 57(肽3)(SEQ IDNO:82)、Fx 73(肽2)(SEQ ID NO:83)、Fx 74(SEQ ID NO:84)和Fx 81(SEQ ID NO:85),点C[pI 4.7,33kDa(Poll A/B)]:N末端(SEQ ID NO:86)、Fx 58(SEQ ID NO:87)、Fx 64(SEQ ID NO:88)、Fx 65(SEQ IDNO:89)、Fx 73(SEQ ID NO:90)和Fx 68(SEQ ID NO:91)。X代表未确定的氨基酸,圆括号中的氨基酸是无法被明确确定的。The enriched formulation of Example 5 was analyzed on a 2D gel. Two major spots, 33 and 40 kDa, pi of 4.7 and 5.6, respectively, were digested with trypsin. The resulting peptides were separated by HPLC (equipment: Hewlett Packard 1090, column: Vydac C18 (0.21×25 cm), absorption at 210 nm, flow rate: 0.20 ml/min, buffer A: 0.075% TFA, buffer B: 80% acetonitrile 0.065% TFA in , gradient: 2% B (0-10 min), 2-75% B (10-120 min)). The selected peptides were sequenced by Edman degradation method, point C [pI 5.6, 39kDa (Poll C)]: Fxn 42 (peptide 1) (SEQ ID NO: 81), Fxn 57 (peptide 3) (SEQ ID NO: 82), Fx 73 (peptide 2) (SEQ ID NO:83), Fx 74 (SEQ ID NO:84) and Fx 81 (SEQ ID NO:85), point C [pI 4.7, 33kDa (Poll A/B) ]: N-terminal (SEQ ID NO: 86), Fx 58 (SEQ ID NO: 87), Fx 64 (SEQ ID NO: 88), Fx 65 (SEQ ID NO: 89), Fx 73 (SEQ ID NO: 90) and Fx 68 (SEQ ID NO: 91). X represents an unidentified amino acid, and the amino acid in parentheses cannot be clearly identified.
用所有短肽序列作为目标进行数据库搜索,没有得到可靠的信息能帮助确定两个主要的点中的哪一个代表着内酯酶。因此,我们利用一套阴离子交换色谱对实施例5的内酯酶制剂进行进一步的富集。在预填MonoQ柱HR 16/10(Amersham Pharmacia Biotech Europe GmbH,Dübendorf,瑞士)上,0至350mM NaCl梯度下,对蛋白质溶液(在pH 7.5的20mMTris/HCl、2.5mM MgCl2中)进行分离。对所有片断的活性进行测定。按照厂商(Invitrogen,Carlsbad,CA,USA)所述进行等电聚焦以及SDS-PAGE,对活性片断进行分析。比较SDS-PAGE凝胶和IEF得出的片断活性数据,暗示内酯酶是38至40kDa的蛋白质,pI在pH 5.6左右。该数据与点PollC吻合得非常好。因此,该点的氨基酸序列就被用于设计PCR引物和用于Southern印迹试验的寡核苷酸。收集最具有活性的片断,再在2D凝胶上进行分析,分析显示,较之之前的2D凝胶,PollC相对于PollA/B更加清楚地富集。A database search using all short peptide sequences as targets yielded no reliable information to help determine which of the two major spots represented the lactonase. Therefore, we used a set of anion exchange chromatography to further enrich the lactonase preparation of Example 5. Protein solutions (in 20 mM Tris/HCl, pH 7.5, 2.5 mM MgCl 2 ) were separated on a prepacked MonoQ column HR 16/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) with a gradient of 0 to 350 mM NaCl. The activity of all fragments was assayed. Active fragments were analyzed by isoelectric focusing and SDS-PAGE as described by the manufacturer (Invitrogen, Carlsbad, CA, USA). Comparison of fragment activity data from SDS-PAGE gels and IEF suggested that the lactonase is a 38 to 40 kDa protein with a pi around pH 5.6. This data fits very well with point PollC. Therefore, the amino acid sequence of this spot was used to design PCR primers and oligonucleotides for Southern blot experiments. The most active fragments were collected and analyzed on a 2D gel, which showed that PollC was more clearly enriched relative to PollA/B than the previous 2D gel.
实施例8设计用于Southern印迹试验的简并寡核苷酸以及用于从A.nigerEmbodiment 8 is designed to be used for the degenerate oligonucleotide of Southern blotting experiment and be used for from A.niger 克隆(R)-泛酸内酯酶的PCR引物PCR primers for cloning (R)-pantolactonase
使用来自不同Aspergillus种的12条不同的植酸酶序列,通过GCG程序包版本10.2的CODONFREQUENCY程序,得到Aspergillus的密码子使用情况,将其用于减少由于遗传密码简并性所造成的可能的不同DNA序列组合的数目。用Aspergillus密码子使用情况作为决定性因素,引物3’-末端都考虑到了每种可能的组合,但引物5’-末端就忽略了一些稀有的密码子。在用寡核苷酸进行杂交实验的情况下,用同样的方式对3’-和5’-末端进行处理。名称中含有“S(sense)”或“AS(anti-sense)”的寡核苷酸是用于PCR的。名称中不含S或AS的寡核苷酸是用于杂交实验的。Using 12 different phytase sequences from different Aspergillus species, the codon usage of Aspergillus was obtained by the CODONFRENCY program of GCG package version 10.2, which was used to reduce possible differences due to the degeneracy of the genetic code The number of DNA sequence combinations. Using Aspergillus codon usage as the determining factor, every possible combination was considered at the 3'-end of the primers, but some rare codons were ignored at the 5'-end of the primers. In the case of hybridization experiments with oligonucleotides, the 3'- and 5'-ends were treated in the same way. Oligonucleotides with "S (sense)" or "AS (anti-sense)" in their names are intended for PCR. Oligonucleotides without S or AS in the name are for hybridization experiments.
用于从A.niger中克隆(R)-泛酸内酯酶的寡核苷酸是Fxn42(SEQ IDNO:31)、Fxn42S(SEQ ID NO:32)、Fxn42AS(SEQ ID NO:33)、Fx57(SEQ ID NO:34)、Fxn57S(SEQ ID NO:35)、Fxn57AS(SEQ ID NO:36)、Fx73(SEQ ID NO:37)、Fx73S(SEQ ID NO:38)、Fx73AS(SEQ IDNO:39)、Fx74(SEQ ID NO:40)、Fx74S(SEQ ID NO:41)和Fx74AS(SEQ ID NO:42)。The oligonucleotides used to clone (R)-pantolactonase from A. niger are Fxn42 (SEQ ID NO: 31), Fxn42S (SEQ ID NO: 32), Fxn42AS (SEQ ID NO: 33), Fx57 ( SEQ ID NO: 34), Fxn57S (SEQ ID NO: 35), Fxn57AS (SEQ ID NO: 36), Fx73 (SEQ ID NO: 37), Fx73S (SEQ ID NO: 38), Fx73AS (SEQ ID NO: 39) , Fx74 (SEQ ID NO: 40), Fx74S (SEQ ID NO: 41) and Fx74AS (SEQ ID NO: 42).
实施例9从A.niger ATCC9142、A.niger NRRL3135、A.niger ssp.Embodiment 9 is from A.niger ATCC9142, A.niger NRRL3135, A.niger ssp. awamori(ATCC38854)或A.niger MacRae(ATCC46951)中分离基因Genes isolated from awamori (ATCC38854) or A. niger MacRae (ATCC46951) 组DNADNA
用1×106孢子/ml去接种300ml的YPD培养基[Sherman et al.,Laboratory course manual for methods in yeast genetics,Cold Spring HarborLaoratory,Cold Spring Harbor,New York(1986)]。在猛烈振荡下(200rpm),于30℃对培养物进行过夜培养。用Whatman滤纸通过过滤收集菌丝体,用PBS(等渗磷酸缓冲液)洗,在液氮中冷冻。在冰冷的研钵中,将菌丝体磨成细粉末。将该粉末重新悬浮于1/3体积的pH8.0的50mM Tris/HCl、1.0%SDS、50mM EDTA中,于65℃频繁颠倒条件下培养15分钟。将溶液冷却至50℃。加入蛋白酶K(终浓度为100μg/ml)。将溶液在50℃培养1小时。另外加入100mg/ml的蛋白酶K,继续培养3小时。加入1/3体积的苯酚/氯仿/异戊醇(50/49/1)。彻底且轻柔的混合之后,对乳化液进行离心(12000g,20分钟,4℃)。移出水相,再次萃取。另一个离心步骤(12000g,10分钟,4℃)之后,用氯仿对水相进行萃取。向水相中加入0.7倍体积的异丙醇,轻柔混合溶液15分钟。通过离心(10000g,30分钟,4℃)收集沉淀DNA。向残留的上清液中加入1/10体积的pH5.2的3M KCl,在轻柔振荡下培养15分钟。再一次离心步骤之后,用70%的乙醇对两份沉淀洗两次,空气中干燥,溶于0.5ml水中。最后,用RNase处理DNA以去除残留的RNA。Use 1×10 6 spores/ml to inoculate 300 ml of YPD medium [Sherman et al., Laboratory course manual for methods in yeast genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1986)]. The cultures were grown overnight at 30°C with vigorous shaking (200 rpm). Mycelia were collected by filtration with Whatman filter paper, washed with PBS (isotonic phosphate buffered saline), and frozen in liquid nitrogen. In an ice-cold mortar, grind the mycelium to a fine powder. The powder was resuspended in 1/3 volume of 50 mM Tris/HCl, pH 8.0, 1.0% SDS, 50 mM EDTA and incubated at 65°C for 15 minutes with frequent inversions. The solution was cooled to 50 °C. Proteinase K was added (final concentration 100 μg/ml). The solution was incubated at 50°C for 1 hour. In addition, 100 mg/ml of proteinase K was added, and the culture was continued for 3 hours. Add 1/3 volume of phenol/chloroform/isoamyl alcohol (50/49/1). After thorough and gentle mixing, the emulsion was centrifuged (12000g, 20 minutes, 4°C). The aqueous phase was removed and extracted again. After another centrifugation step (12000 g, 10 min, 4° C.), the aqueous phase was extracted with chloroform. Add 0.7 volumes of isopropanol to the aqueous phase and mix the solution gently for 15 min. Precipitated DNA was collected by centrifugation (10000 g, 30 min, 4°C). To the remaining supernatant was added 1/10 volume of 3M KCl, pH 5.2, and incubated for 15 minutes with gentle shaking. After another centrifugation step, the two pellets were washed twice with 70% ethanol, air dried and dissolved in 0.5 ml of water. Finally, the DNA is treated with RNase to remove residual RNA.
实施例10克隆来自A.niger ATCC9142的(R)-泛酸内酯酶Example 10 Cloning of (R)-pantothenate lactonase from A. niger ATCC9142
按照实施例6所述来制备A.niger ATCC9142的基因组DNA(gDNA)。用来自Stratagene(Stratagene,La Jolla,CA,USA)的Robocycler Infinity作为PCR仪,用来自Roche Molecular Biochemicals(Penzberg,德国)的TAQ聚合酶试剂盒来进行PCR:Genomic DNA (gDNA) of A. niger ATCC9142 was prepared as described in Example 6. The Robocycler Infinity from Stratagene (Stratagene, La Jolla, CA, USA) was used as PCR machine, and the TAQ polymerase kit from Roche Molecular Biochemicals (Penzberg, Germany) was used to carry out PCR:
1μl gDNA(1/10稀释) 步骤1:5分钟-95℃1μl gDNA (1/10 dilution) Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-50℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -50°C
1μl引物S(100pM/μl) 步骤4:1分钟-72℃1 μl Primer S (100pM/μl) Step 4: 1 minute -72°C
1μl引物AS(100pM/μl)1μl Primer AS (100pM/μl)
1μl来自Roche Molecular Biochemicals的TAQ聚合酶1 μl TAQ polymerase from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.
引物组合:Primer combination:
对所有引物都会进行关于自身锚定(self-priming)的检查。Fxn57AS是唯一会产生相当数量自身结合产物的引物,自身锚定产物为大约300至320bp。用55℃的杂交温度,PCR 5、6、8、10和11仍会产生一条或多条产物。60℃下采用30秒扩增和30个循环,PCR 5、6、8、10和11仍会产生一条或多条与较低杂交温度下产生的条带相对应的PCR产物。All primers are checked for self-priming. Fxn57AS was the only primer that produced a significant amount of self-anchored product, with a self-anchored product of approximately 300 to 320 bp. With a hybridization temperature of 55°C, PCR 5, 6, 8, 10 and 11 still produced one or more products. Using 30-second amplification and 30 cycles at 60°C, PCRs 5, 6, 8, 10, and 11 still produced one or more PCR products corresponding to the bands produced at the lower hybridization temperature.
用来自Qiagen(Qiagen,Hilden,德国)的QIAEX II凝胶抽提试剂盒从琼脂糖凝胶中分离出PCR反应12的840bp的产物,将其溶于40μl H2O中。The 840 bp product of PCR reaction 12 was isolated from the agarose gel with the QIAEX II gel extraction kit from Qiagen (Qiagen, Hilden, Germany) and dissolved in 40 μl H2O .
1μl 840bp的片段(1/10稀释) 步骤1:5分钟-95℃1μl 840bp fragment (1/10 dilution) Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl nucleotide mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl引物S(10pMol/μl) 步骤4:30秒-72℃1μl Primer S (10pMol/μl) Step 4: 30 seconds -72°C
1μl引物AS(10pMol/μl)1μl Primer AS (10pMol/μl)
1μl来自Roche Molecular Biochemicals的TAQ聚合酶1 μl TAQ polymerase from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
使用了反应2、5、10、11和12的引物组合。除反应2之外,其它所有反应都有PCR产物产生。这意味着引物42S/AS、57S/AS、73S/AS和74S/AS的序列是840bp片段的部分。这四对引物的高度接近性强烈预示,840bp的片段是感兴趣的内酯酶基因的一部分。这种假设得到了测序的支持。肽Fx57(SEQ ID NO:90)的氨基酸序列在840bp的片段中得到了证实,而肽Fxn42(SEQ ID NO:45)的序列仅得到了部分证实。Primer combinations for reactions 2, 5, 10, 11 and 12 were used. Except reaction 2, all other reactions produced PCR products. This means that the sequences of primers 42S/AS, 57S/AS, 73S/AS and 74S/AS are part of a 840 bp fragment. The high proximity of these four pairs of primers strongly indicates that the 840bp fragment is part of the lactonase gene of interest. This hypothesis was supported by sequencing. The amino acid sequence of peptide Fx57 (SEQ ID NO: 90) was confirmed in a fragment of 840 bp, whereas the sequence of peptide Fxn42 (SEQ ID NO: 45) was only partially confirmed.
还可以分离出A.niger MacRae的相应DNA片段。关于A.awamori的情况,通过该方法仅能分离出引物Fxn57S和Fx73AS的产物。上述Aspergillus的序列在DNA水平上与A.niger ATCC9142的序列具有大约90%的相同性。The corresponding DNA fragment of A. niger MacRae can also be isolated. In the case of A. awamori, only the products of primers Fxn57S and Fx73AS could be isolated by this method. The above sequence of Aspergillus has about 90% identity with the sequence of A. niger ATCC9142 at the DNA level.
为分离内酯酶基因的5’-和3’-末端,使用下述方法:To isolate the 5'- and 3'-ends of the lactonase gene, the following method was used:
(a)分离3’-末端:(a) Isolation of the 3'-end:
1μl A.niger ATCC9142的gDNA 步骤1:5分钟-95℃1 μl gDNA of A. niger ATCC9142 Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl引物Oal2AS或Oal3S(10pMol/μl) 步骤4:30秒-72℃1μl primer Oal2AS or Oal3S (10pMol/μl) Step 4: 30 seconds -72°C
1μl来自Roche Molecular Biochemicals的TAQ聚合酶1 μl TAQ polymerase from Roche Molecular Biochemicals
41μl H2O41 μl H2O
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
用等体积的苯酚/氯仿对PCR混合进行两次萃取,再用氯仿单独进行一次。然后通过加入1/10体积的pH5.2的3M乙酸钾和2倍体积的乙醇沉淀出DNA。15分钟的离心(Eppendorf板式离心管中,14000rpm)之后,用70%的乙醇洗沉淀,空气干燥,最终溶于20μl灭菌水中。The PCR mix was extracted twice with equal volumes of phenol/chloroform and once with chloroform alone. The DNA was then precipitated by adding 1/10 volume of 3M potassium acetate pH 5.2 and 2 volumes of ethanol. After 15 minutes of centrifugation (14000 rpm in Eppendorf plate centrifuge tubes), the pellet was washed with 70% ethanol, air-dried and finally dissolved in 20 μl sterile water.
第二次PCR:Second PCR:
4μl来自第一次PCR的DNA 步骤1:5分钟-95℃4 μl of DNA from the first PCR Step 1: 5 min -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl nucleotide mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-58℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -58°C
1μl引物Oal4S或Oal1AS(10pMol/μl) 步骤4:1分钟-72℃1 μl primer Oal4S or Oal1AS (10pMol/μl) Step 4: 1 minute -72°C
1μl来自Roche Molecular Biochemicals的High Fidelity Mix1 μl High Fidelity Mix from Roche Molecular Biochemicals
38μl H2O38 μl H2O
步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.
用Oal4S引物,产生了一段900bp的DNA分子。测序显示其编码基因的3’-末端。Using the Oal4S primer, a 900 bp DNA molecule was generated. Sequencing revealed the 3'-end of its encoding gene.
(b)分离5’-末端:(b) Isolation of the 5'-end:
用DIG标记过的寡核苷酸Fxn42、Fxn57、Fx73和Fx74作为探针,使用带有X-光胶片化学荧光探测的DIG系统(Roche MolecularBiochemicals),对A.niger ATCC9142的基因组DNA进行杂交实验。杂交和探测按照厂商的推荐来进行。Using DIG-labeled oligonucleotides Fxn42, Fxn57, Fx73 and Fx74 as probes, the genomic DNA of A. niger ATCC9142 was hybridized using the DIG system (Roche Molecular Biochemicals) with X-ray film chemiluminescence detection. Hybridization and probing were performed according to the manufacturer's recommendations.
用BamHI、EcoRI、HindIII、PstI、SacI和XhoI来对10μg的基因组DNA进行消化。用Roche Molecular Biochemicals杂交溶液在42℃进行过夜杂交。室温用2x SSC、0.1%SDS进行两次洗涤步骤之后,接着在50℃用0.5x SSC、0.1%SDS再进行两次洗涤步骤。用CSPD(Disodium 2-chloro-5-(4-methoxyspiro{1,2-dioxetane-3,2’-(5’-chloro)tricyclo[3.3.1.13,7]decan}-4-yl)-1-phenyl phospate)作为底物,对碱性磷酸酶进行探测,之后将X-光胶片曝光于滤光器(filter)1小时。胶片上没有合用的信号。10 μg of genomic DNA was digested with BamHI, EcoRI, HindIII, PstI, SacI and XhoI. Hybridization was performed overnight at 42°C using Roche Molecular Biochemicals hybridization solution. Two wash steps with 2x SSC, 0.1% SDS at room temperature were followed by two more wash steps with 0.5x SSC, 0.1% SDS at 50°C. With CSPD (Disodium 2-chloro-5-(4-methoxyspiro{1,2-dioxetane-3,2'-(5'-chloro)tricyclo[3.3.1.1 3,7 ]decan}-4-yl)-1 -phenyl phospate) was used as a substrate to detect alkaline phosphatase, and then the X-ray film was exposed to a filter (filter) for 1 hour. No usable signal on film.
因此,使用反应5和11的随机锚定的DIG标记过的PCR产物(见实施例11)作为探针。按照同样的方法来进行杂交,只是洗涤条件稍微改变为室温下2×5分钟,55℃下2×15分钟。胶片上显示了下述条带:Therefore, randomly anchored DIG-labeled PCR products from reactions 5 and 11 (see Example 11) were used as probes. Hybridization was performed in the same manner, except that the washing conditions were slightly changed to 2 x 5 minutes at room temperature and 2 x 15 minutes at 55°C. The following bands were shown on the film:
为克隆整个基因或至少是其缺失的5’-末端,根据杂交实验,用HindIII对A.niger ATCC9142的染色体DNA进行消化,产生出大约4kb的含有内酯酶基因的片段,实验方法如下:In order to clone the entire gene or at least its missing 5'-end, according to the hybridization experiment, the chromosomal DNA of A. niger ATCC9142 was digested with HindIII to produce a fragment containing the lactonase gene of about 4 kb. The experimental method is as follows:
20μl染色体DNA20 μl chromosomal DNA
20μl10x缓冲液20 μl 10x buffer
5μl HindIII(10U/μl)5μl HindIII (10U/μl)
175μl H2O175 μl H2O
在37℃培养16小时。Incubate at 37°C for 16 hours.
用来自Qiagen(Qiagen,Hilden,德国)的PCR纯化试剂盒对经过消化的DNA进行纯化,并洗脱为100μl。下述连接反应在16℃过夜进行。将经过纯化的DNA用水稀释到2ml。按照厂商(Roche MolecularBiochemicals)推荐加入连接酶和连接酶缓冲液。连接完成的溶液被用作模板,进行不同的PCR,其中使用经部分分离的内酯酶基因的内部引物:The digested DNA was purified with a PCR purification kit from Qiagen (Qiagen, Hilden, Germany) and eluted in 100 μl. The following ligation reactions were performed overnight at 16°C. Dilute the purified DNA to 2ml with water. Ligase and ligase buffer were added as recommended by the manufacturer (Roche Molecular Biochemicals). The ligated solution was used as a template for a different PCR using internal primers for the partially isolated lactonase gene:
5μl连接反应溶液 步骤1:5分钟-95℃5 μl ligation reaction solution Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl sense引物(10pMol/μl) 步骤4:3分钟-72℃1μl sense primer (10pMol/μl) Step 4: 3 minutes -72°C
1μl anti-sense引物(10pMol/μl)1μl anti-sense primer (10pMol/μl)
1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤2至4重复32次。Steps 2 to 4 are repeated 32 times.
四种引物组合,Oal3S/Fxn57AS、Oal4S/Fxn57AS、Oal3S/Oal8AS和Oal4S/Oal8AS,都能产生单条4kb的DNA片段。所有引物对都定向于基因的外部。因此,内酯酶基因应该位于作为自连接HindIII片段的环状DNA分子上。Four primer combinations, Oal3S/Fxn57AS, Oal4S/Fxn57AS, Oal3S/Oal8AS and Oal4S/Oal8AS, can all generate a single 4kb DNA fragment. All primer pairs are directed to the outside of the gene. Therefore, the lactonase gene should be located on a circular DNA molecule as a self-ligating HindIII fragment.
用来自Qiagen的QIAEX II凝胶抽提试剂盒从凝胶中分离出片段,按照厂商(Applied Biosystems,Foster City,CA,USA)推荐的二去氧方法,用ABI 310Genetic Analyzer来进行测序。获得的序列证实,分离出的片段含有来自A.niger ATCC9142的(R)-泛酸内酯酶基因的两个末端。Fragments were isolated from the gel with the QIAEX II gel extraction kit from Qiagen, and sequenced with the ABI 310 Genetic Analyzer according to the dideoxy method recommended by the manufacturer (Applied Biosystems, Foster City, CA, USA). The sequence obtained confirmed that the isolated fragment contained both ends of the (R)-pantolactonase gene from A. niger ATCC9142.
用引物Oall0S和Oal12AS,也可能分离出A.niger ssp.awamori的同源基因,其中含有与所有被测出的肽完全一样的氨基酸序列。唯一的区别仅在于肽Fxn42的序列。此时Fxn42来自对两个肽的测序。最可能的情况是,当进行HPLC来分离经过消化的酶片段时,同时洗脱出了两个不同的肽。测序首先仅揭示了主要的肽的序列,但其却带有很强的背景。对第一个肽的测序完成后,剩下的就是第二个肽的序列,其具有另外四个氨基酸:Using the primers Oall0S and Oal12AS, it was also possible to isolate the homologous gene of A. niger ssp.awamori, which contains the exact same amino acid sequence as all the peptides tested. The only difference is only in the sequence of the peptide Fxn42. At this point Fxn42 comes from the sequencing of two peptides. Most likely, when HPLC was performed to separate the digested enzyme fragments, two different peptides eluted simultaneously. Sequencing first revealed only the sequence of the main peptide, but with a strong background. Once the sequencing of the first peptide is complete, all that remains is the sequence of the second peptide, which has four more amino acids:
a)K-P-F-A-H-Q-V-K(SEQ ID NO:43)和a) K-P-F-A-H-Q-V-K (SEQ ID NO: 43) and
b)R-H-H-N-A-P-A-P-T-P-E-D-P-E-R-R(SEQ ID NO:44)给出b) R-H-H-N-A-P-A-P-T-P-E-D-P-E-R-R (SEQ ID NO: 44) gives
K-P-F-A-H-Q-V-K-T-x-E-D(SEQ ID NO:45),其为Fxn 42。K-P-F-A-H-Q-V-K-T-x-E-D (SEQ ID NO: 45), which is Fxn 42.
这还解释了为何基于肽Fxn42的寡核苷酸Fxn42S和Fxn42AS无法给出合用的结果。根据氨基酸序列,来自菌株ATCC9142的蛋白质具有36573Da的分子量,280nm处估计的吸光率为1.83(1mg/ml蛋白质溶液),而来自A.niger ssp.awamori的氨基酸序列的分子量被计算为36547Da,对1mg/ml蛋白质溶液而言,估计的E280nm为1.831(Pace et al.,1995)。This also explains why the oligonucleotides Fxn42S and Fxn42AS based on the peptide Fxn42 failed to give combined results. Based on the amino acid sequence, the protein from strain ATCC9142 has a molecular weight of 36573 Da with an estimated absorbance at 280 nm of 1.83 (1 mg/ml protein solution), while the molecular weight of the amino acid sequence from A. niger ssp. awamori is calculated to be 36547 Da for 1 mg /ml protein solution, the estimated E280nm is 1.831 (Pace et al., 1995).
实施例11对来自A.niger ATCC 9142的(R)-泛酸内酯酶基因进行克隆和Embodiment 11 is cloned from the (R)-pantothenate lactonase gene of A.niger ATCC 9142 and 表达Express
对按照实施例10中所述的全部三种方法获得的序列进行比较,将其进行装配,产生一条连续的覆盖整个内酯酶基因的2443bp的序列。为测定该基因的起始密码子、终止密码子和可能的内含子,将从来自商业制剂的纯化基因得到的肽的可获得的氨基酸序列,与GCG程序包版本10.2的两个程序TESTCODE和CODONPREFERENCE获得的结果联合使用。虽然按照同样的方式两个程序都测定出了终止密码子和一个内含子,而起始密码子却并不明显,具体而言,因为没有肽Fx74(SEQ ID NO:84)的其它可获得的N-末端已被测序的肽。对数据库搜索期间发现的向同源氨基酸序列的所有可能的翻译进行的比较也没有帮助,因为蛋白质的N-末端部分是完全异源的。因此,在肽Fx74的位置上游但仍在同一个阅读框内发现的两个甲硫氨酸残基被选出,将其用于构建不同的表达盒。在SEQ IDNO:7的666-742号碱基对之间发现了富含CT的区域。然而,该富含CT的区域的位置并非像起始密码子那样真正地偏好Met而不是其它氨基酸,因为在真菌中该区域与起始密码子之间的距离通常有相当大的变化。此外,在基因的下游鉴定出了可能的聚腺苷化位点。The sequences obtained by all three methods described in Example 10 were compared and assembled to generate a contiguous 2443 bp sequence covering the entire lactonase gene. To determine the start codon, stop codon and possible introns of the gene, the available amino acid sequences of peptides obtained from purified genes from commercial preparations were compared with the two programs TESTCODE and The results obtained with CODONPREFERENCE are used in conjunction. While both programs determined the stop codon and an intron in the same way, the start codon was not evident, specifically because no other variants of the peptide Fx74 (SEQ ID NO:84) were available. The N-terminus of the peptide has been sequenced. A comparison of all possible translations to homologous amino acid sequences found during database searches was also not helpful, since the N-terminal part of the protein is completely heterologous. Therefore, two methionine residues found upstream of the position of peptide Fx74 but still in the same reading frame were selected and used to construct different expression cassettes. A CT-rich region was found between base pairs 666-742 of SEQ ID NO:7. However, the location of this CT-rich region does not really favor Met over other amino acids like the start codon, since the distance between this region and the start codon often varies considerably in fungi. In addition, possible polyadenylation sites were identified downstream of the gene.
对来自A.niger ATCC9142的基因的72bp长的内含子而言,从第976至981位核苷酸的DNA段最可能代表5’-剪切位点,从第1029至1035位或从1016至1022位核苷酸的段最可能代表推测出的内部保守序列(consensus sequence),从第1045至1047位核苷酸的DNA段最可能代表3’-剪切位点(SEQ ID NO:7)。A.niger MacRae基因的相应序列在51bp长的内含子(SEQ ID NO:79)中定位于bp 32至37、66至72、以及80至82。对A.niger ssp.awamori的oal基因的50bp长的内含子而言,5’-剪切位点始自第203位核苷酸(GTGCCC),内部保守区域在第236位核苷酸处(AACTAAC),3’-剪切位点在第250位核苷酸处(CAG)(SEQID NO:9)。For the 72 bp long intron of the gene from A. niger ATCC9142, the DNA segment from 976 to 981 nucleotides most likely represents the 5'-splice site, from 1029 to 1035 or from 1016 The segment to 1022 nucleotides is most likely to represent a deduced internal conservative sequence (consensus sequence), and the DNA segment from 1045 to 1047 nucleotides is most likely to represent a 3'-cleavage site (SEQ ID NO: 7 ). The corresponding sequences of the A. niger MacRae gene are located at bp 32 to 37, 66 to 72, and 80 to 82 in a 51 bp long intron (SEQ ID NO: 79). For the 50bp long intron of the oal gene of A. niger ssp.awamori, the 5'-cleavage site starts from the 203rd nucleotide (GTGCCC), and the internal conserved region is at the 236th nucleotide (AACTAAC), the 3'-cleavage site is at the 250th nucleotide (CAG) (SEQID NO: 9).
5’-剪切位点的保守序列是GTPuNGPy。列出的位点中没有一个在第5位有G。用于套索(lariat)形成的内部位点的酵母的保守序列是TACTAAC。3’-剪切位点具有PyAG的保守序列[Unkles,Gene organizationin industrial filamentous fungi.In Applied molecular genetics of filamentousfungi(Kinghoun,ed.),pp.28-53.Blackie Academic&Professional,WesterCleddens Road,Bishopbriggs,Glasgow G64 2NZ,UK,Glasgow(1992)]。The conserved sequence for the 5'-cleavage site is GTPuNGPy. None of the sites listed have a G at position 5. The conserved sequence in yeast for the internal site of lariat formation is TACTAAC. The 3'-cleavage site has the conserved sequence of PyAG [Unkles, Gene organization in industrial filamentous fungi. In Applied molecular genetics of filamentous fungi (Kinghoun, ed.), pp.28-53. Blackie Academic & Professional, WesterCleddens Road, Bishopbriggs, Glasgow G64 2NZ, UK, Glasgow (1992)].
产生了下述cDNA:The following cDNA was produced:
-用于在E.coli中表达的oal(SEQ ID NO:98)- oal (SEQ ID NO: 98) for expression in E.coli
-用于在Saccharomyces cerevisiae中表达的oalEco(SEQ ID NO:94)- oalEco (SEQ ID NO: 94) for expression in Saccharomyces cerevisiae
-用于在S.cerevisiae中表达的oalEShort(用SEQ ID NO:95的第32位处的第二个可能的Met的31个氨基酸的较短的版本)- oalEShort for expression in S.cerevisiae (with a shorter version of 31 amino acids of the second possible Met at position 32 of SEQ ID NO:95)
-用于在E.coli中表达的oalS(oalS被制成是因为Fxn42肽的令人误解的序列,其最终被鉴定为人工产物)- oalS for expression in E. coli (oalS was made because of the misleading sequence of the Fxn42 peptide, which was eventually identified as an artifact)
-用于在S.cerevisiae中表达的oalSEco-oalSEco for expression in S.cerevisiae
-用于在E.coli中表达的含有C-末端his-标签的oalhis- oalhis containing C-terminal his-tag for expression in E.coli
-用于在S.cerevisiae中表达的oalEhis- oalEhis for expression in S. cerevisiae
-用于在E.coli中表达的含有N-末端his-标签的oalNhis(SEQ IDNO:92)- oalNhis (SEQ ID NO: 92) containing the N-terminal his-tag for expression in E.coli
-用于在S.cerevisiae中表达的oalENhis(SEQ ID NO:96)(构建体oalsec,其含有A.terreus cbs的植酸酶的信号肽,该信号肽用于在S.cerevisiae中的表达和分泌)- oalENhis (SEQ ID NO: 96) for expression in S.cerevisiae (construct oalsec, it contains the signal peptide of the phytase of A.terreus cbs, and this signal peptide is used for the expression in S.cerevisiae and secretion)
通常的工序:Usual procedure:
首先,在两次PCR中分离出oal基因的假设编码序列(cDNA)。通过第三次PCR将两种PCR产物装配起来。使用下述引物:First, the putative coding sequence (cDNA) of the oal gene was isolated in two PCRs. The two PCR products were assembled by a third PCR. Use the following primers:
Oal1AS(SEQ ID NO:46)、Oal2AS(SEQ ID NO:47)、Oal3S(SEQ ID NO:48)、Oal4S(SEQ ID NO:49)、Oal5S(SEQ ID NO:50)、Oal6AS(SEQ ID NO:51)、Oal7AS(SEQ ID NO:52)、Oal8AS(SEQ ID NO:53)、Oal9AS(SEQ ID NO:54)、Oall 0S(SEQID NO:55)、Oal10SEco(SEQ ID NO:56)、OalENhis(SEQ ID NO:57)、Oal10SShis(SEQ ID NO:58)、Oal11S(SEQ ID NO:59)、Oal12AS(SEQ ID NO:60)、Oal12ASEco(SEQ ID NO:61)、Oal12AShis(SEQ ID NO:62)、Oal12ASHisEco(SEQ ID NO:63)、Oal13S(SEQ ID NO:64)、Oal13AS(SEQ ID NO:65)、Oal14S(SEQID NO:66)、Oal14AS(SEQ ID NO:67)、Oal15S(SEQ ID NO:68)、Oal15AS(SEQ ID NO:69)、Oal16S(SEQ ID NO:70)(第二个Met)、OalsecS(SEQ ID NO:71)、OalsecAS(SEQ ID NO:72)、pQE80EcoS(SEQ ID NO:73)、pQE80EcoNhisS(SEQ ID NO:74)、pQE80BamAS(SEQ ID NO:75)、pQE80BamhisAS(SEQ ID NO:76)、pQE80EcoSshort(SEQ ID NO:77)和CP-a(SEQ ID NO:78)。Oal1AS (SEQ ID NO: 46), Oal2AS (SEQ ID NO: 47), Oal3S (SEQ ID NO: 48), Oal4S (SEQ ID NO: 49), Oal5S (SEQ ID NO: 50), Oal6AS (SEQ ID NO : 51), Oal7AS (SEQ ID NO: 52), Oal8AS (SEQ ID NO: 53), Oal9AS (SEQ ID NO: 54), Oall0S (SEQ ID NO: 55), Oal10SEco (SEQ ID NO: 56), OalENhis (SEQ ID NO: 57), Oal10SShis (SEQ ID NO: 58), Oal11S (SEQ ID NO: 59), Oal12AS (SEQ ID NO: 60), Oal12ASEco (SEQ ID NO: 61), Oal12AShis (SEQ ID NO: 62), Oal12ASHisEco (SEQ ID NO: 63), Oal13S (SEQ ID NO: 64), Oal13AS (SEQ ID NO: 65), Oal14S (SEQ ID NO: 66), Oal14AS (SEQ ID NO: 67), Oal15S (SEQ ID NO: 68), Oal15AS (SEQ ID NO: 69), Oal16S (SEQ ID NO: 70) (second Met), OalsecS (SEQ ID NO: 71), OalsecAS (SEQ ID NO: 72), pQE80EcoS ( SEQ ID NO: 73), pQE80EcoNhisS (SEQ ID NO: 74), pQE80BamAS (SEQ ID NO: 75), pQE80BamhisAS (SEQ ID NO: 76), pQE80EcoSshort (SEQ ID NO: 77) and CP-a (SEQ ID NO :78).
Oal11S直接始自起始密码子的更为上游的位点前面,而Oal9AS始自假设的终止密码子下游若干核苷酸处。为能在第三次PCR中不用内含子将得到的两种PCR产物装配起来,Oal14S和Oal14AS是互补的,其含有从外显子I的终点到外显子II的起点之间的序列。Oal11S begins directly in front of the site more upstream of the start codon, while Oal9AS begins several nucleotides downstream of the putative stop codon. In order to assemble the two resulting PCR products without introns in the third PCR, Oal14S and Oal14AS are complementary, containing the sequence from the end of exon I to the beginning of exon II.
1μl gDNA(1/10稀释) 步骤1:5分钟-95℃1μl gDNA (1/10 dilution) Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl Oal11S(10pMol/μl) 步骤4:1分钟-72℃1 μl Oal11S (10pMol/μl) Step 4: 1 minute -72°C
1μl Oal14AS(10pMol/μl)1μl Oal14AS (10pMol/μl)
1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液40μl H2O1 μl High Fidelity Polymerase Mixture from Roche Molecular Biochemicals 40 μl H2O
步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.
1μl gDNA(1/10稀释) 步骤1:5分钟-95℃1μl gDNA (1/10 dilution) Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl Oal14S(10pMol/μl) 步骤4:1分钟-72℃1 μl Oal14S (10pMol/μl) Step 4: 1 minute -72°C
1μl Oal9AS(10pMol/μl)1μl Oal9AS (10pMol/μl)
1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液40μl H2O1 μl High Fidelity Polymerase Mixture from Roche Molecular Biochemicals 40 μl H2O
步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.
对A.niger ATCC9142的gDNA进行的PCR给出了具有期望大小的PCR产物。通过琼脂糖凝胶电泳对PCR产物进行纯化。用来自Qiagen(Qiagen,Hilden,德国)的QIAEX II试剂盒从凝胶中提取出PCR产物,将其用于第三次PCR:PCR on gDNA of A. niger ATCC9142 gave a PCR product of the expected size. PCR products were purified by agarose gel electrophoresis. The PCR product was extracted from the gel with the QIAEX II kit from Qiagen (Qiagen, Hilden, Germany) and used for the third PCR:
PCR产物1和2各0.5μl 步骤1:5分钟-95℃0.5 μl each of PCR products 1 and 2 Step 1: 5 minutes -95°C
1μl核苷酸混合物 步骤2:30秒-95℃1 μl Nucleotide Mixture Step 2: 30 sec -95°C
5μl带Mg2+的PCR标准缓冲液 步骤3:30秒-55℃5 μl PCR standard buffer with Mg 2+ Step 3: 30 sec -55°C
1μl Oal12S(10pMol/μl) 步骤4:1分钟-72℃1 μl Oal12S (10pMol/μl) Step 4: 1 minute -72°C
1μl Oal10AS(10pMol/μl)1μl Oal10AS (10pMol/μl)
1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液40μl H2O1 μl High Fidelity Polymerase Mixture from Roche Molecular Biochemicals 40 μl H2O
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
通过琼脂糖凝胶电泳对终产物进行纯化,从凝胶中提取出来,并按照Invitrogen(carlsbad,CA,USA)提供的方法将其克隆进TA-载体,用于对PCR产物进行直接克隆。所有其它必需的克隆步骤都按照Sambrook et al.(1989)所述来进行。通过测序检测出来的含有正确插入(oal1)的质粒在所有后续的构建步骤中被用作模板。The final product was purified by agarose gel electrophoresis, extracted from the gel, and cloned into a TA-vector according to the method provided by Invitrogen (carlsbad, CA, USA) for direct cloning of PCR products. All other necessary cloning steps were performed as described by Sambrook et al. (1989). The plasmid containing the correct insert (oal1) detected by sequencing was used as template in all subsequent construction steps.
(a)用于在E.coli和Saccharomyces cerevisiae中表达的构建体oal和oalEco:按照下述方案,用引物pQE80EcoS和pQE80BamAS对oal1进行PCR,以产生构建体Eco-oal-Bam:(a) Constructs oal and oalEco for expression in E. coli and Saccharomyces cerevisiae: PCR was performed on oal1 with primers pQE80EcoS and pQE80BamAS to generate construct Eco-oal-Bam according to the following scheme:
1μl含有作为插入的oal1的质粒(10ng)1 μl of plasmid (10 ng) containing oal1 as insert
1μl核苷酸混合物1 μl nucleotide mix
5μl带Mg2+的PCR标准缓冲液5μl PCR standard buffer with Mg2 +
1μl pQE80EcoS(10pMol/μl)1μl pQE80EcoS (10pMol/μl)
1μl pQE80BamAS(10pMol/μl)1μl pQE80BamAS (10pMol/μl)
1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液40μl H2O1 μl High Fidelity Polymerase Mixture from Roche Molecular Biochemicals 40 μl H2O
步骤1:5分钟-95℃Step 1: 5 minutes -95°C
步骤1:30秒-95℃Step 1: 30 seconds -95°C
步骤1:30秒-55℃Step 1: 30 seconds -55°C
步骤4:1分钟-72℃Step 4: 1 minute -72°C
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
通过琼脂糖凝胶电泳对PCR产物进行纯化,从凝胶中提取人们感兴趣的片段(QIAEX II,Qiagen),用EcoRI和BamHI进行消化,用琼脂糖凝胶电泳再次进行纯化,将其连接到来自Qiagen的载体pQE80上,转化进E.coli Top10。分离出四个克隆的质粒,对插入进行测序。含有正确构建体的质粒被转化进E.coli M15或相当的E.coli菌株。所有分子技术都是技术人员已知的标准技术。The PCR product was purified by agarose gel electrophoresis, the fragment of interest (QIAEX II, Qiagen) was extracted from the gel, digested with EcoRI and BamHI, purified again by agarose gel electrophoresis, and ligated into Transformed into E.coli Top10 on the vector pQE80 from Qiagen. Plasmids of four clones were isolated and the inserts were sequenced. Plasmids containing the correct constructs were transformed into E. coli M15 or equivalent E. coli strains. All molecular techniques are standard techniques known to the skilled person.
对酵母表达构建体而言,将所用的引物换为Oal10EcoS和Oal12ASEco,用EcoRI对PCR产物进行单独消化,通过载体上的EcoRI限制性位点,将终产物克隆进pYES2或用于S.cerevisiae的相当的表达载体。按照Hinnen et al.,Proc.Natl.Acad.Sci.USA 75:1929-1933(1978)或按照与菌株同时提供的说明书,来进行对S.cerevisiae菌株,例如INVScl(Invitrogen,Carlsbad,CA,USA)的转化。For yeast expression constructs, the primers used were changed to Oal10EcoS and Oal12ASEco, the PCR product was digested separately with EcoRI, and the final product was cloned into pYES2 or used for S.cerevisiae through the EcoRI restriction site on the vector. equivalent expression vector. According to Hinnen et al., Proc.Natl.Acad.Sci.USA 75:1929-1933 (1978) or according to the specification sheet provided simultaneously with bacterial strain, carry out to S.cerevisiae bacterial strain, for example INVScl (Invitrogen, Carlsbad, CA, USA ) conversion.
(b)用于在E.coli中表达的构建体pQE80oalS(用第二个可能的Met的31个氨基酸的减短版本):(b) Construct pQE80oalS for expression in E. coli (with a 31 amino acid shortened version of the second possible Met):
用跟上述内容一样的PCR方案,用同样的模板,引物用pQE80EcoSshort和pQE80BamAS。按照(a)中所述对PCR反应完成之后的反应液进行处理。Use the same PCR scheme as above, use the same template, and use pQE80EcoSshort and pQE80BamAS as primers. Treat the reaction solution after the PCR reaction as described in (a).
(c)用于在S.cerevisiae中表达的构建体oalS:(c) Construct oalS for expression in S. cerevisiae:
使用引物Oal16S和Oal12ASEco。其它都按照(a)中所述来进行。Primers Oal16S and Oal12ASEco were used. Everything else was carried out as described in (a).
(d)用于在E.coli中表达的含有C-末端his-标签的构建体pQE80oalhis:(d) Construct pQE80oalhis containing a C-terminal his-tag for expression in E. coli:
使用引物pQE80EcoS和pQE80BamhisAS来进行PCR,见(a)。PCR was performed using primers pQE80EcoS and pQE80BamhisAS, see (a).
(e)用于在S.cerevisiae中表达的构建体oalhis:(e) Construct oalhis for expression in S. cerevisiae:
使用引物Oal10Seco和Oal12ASHisEco来进行PCR,见(a)。PCR was performed using primers Oal10Seco and Oal12AShisEco, see (a).
(f)用于在E.coli中表达的含有N-末端His-标签的构建体pQE80oalNhis:(f) Construct pQE80oalNhis containing the N-terminal His-tag for expression in E. coli:
使用引物pQE80EcoNhisS和pQE80BamAS来进行PCR(见a)。PCR was performed using primers pQE80EcoNhisS and pQE80BamAS (see a).
(g)用于在S.cerevisiae中表达的构建体oalENhis:(g) Construct oalENhis for expression in S. cerevisiae:
使用引物OalENhis和Oal12ASEco来进行PCR(见a)。PCR was performed using primers OalENhis and Oal12ASEco (see a).
(h)构建体oalsec,其含有A.terreus cbs的植酸酶的信号肽,该信号肽用于在S.cerevisiae中的表达和分泌:(h) Construct oalsec containing the signal peptide of the phytase of A. terreus cbs for expression and secretion in S. cerevisiae:
用三次独立的PCR来制备该构建体。首先,通过对保守植酸酶基因进行PCR分离出信号序列[Lehmann et al.,Protein Eng.13:49-57(2000)]。This construct was prepared using three independent PCRs. First, the signal sequence was isolated by PCR on the conserved phytase gene [Lehmann et al., Protein Eng. 13:49-57 (2000)].
1.0μl含有作为插入的保守植酸酶基因的质粒(10ng)1.0 μl of plasmid (10 ng) containing the conserved phytase gene as an insert
1.0μl核苷酸混合物1.0 μl nucleotide mix
5.0μl带Mg2+的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 +
1.0μl CP-a(10pMol/μl)1.0μl CP-a (10pMol/μl)
1.0μl OalsecAS(10pMol/μl)1.0μl OalsecAS (10pMol/μl)
1.0μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1.0 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤1:5分钟-95℃Step 1: 5 minutes -95°C
步骤1:30秒-95℃Step 1: 30 seconds -95°C
步骤1:30秒-55℃Step 1: 30 seconds -55°C
步骤4:30秒-72℃Step 4: 30 seconds -72°C
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
1.0μl含有oal1基因的质粒(10ng)1.0 μl plasmid containing oal1 gene (10ng)
1.0μl核苷酸混合物1.0 μl nucleotide mix
5.0μl带Mg2+的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 +
1.0μl OalsecS(10pMol/μl)1.0μl OalsecS (10pMol/μl)
1.0μl Oal12ASEco(10pMol/μl)1.0μl Oal12ASEco (10pMol/μl)
1.0μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1.0 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤1:5分钟-95℃Step 1: 5 minutes -95°C
步骤1:30秒-95℃Step 1: 30 seconds -95°C
步骤1:30秒-55℃Step 1: 30 seconds -55°C
步骤4:45秒-72℃Step 4: 45 seconds -72°C
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
通过在1.5%的琼脂糖凝胶上进行琼脂糖凝胶电泳来纯化PCR产物,从凝胶中提取PCR产物(QIAEX II,Qiagen),将其用于第三次PCR:The PCR product was purified by agarose gel electrophoresis on a 1.5% agarose gel, from which the PCR product was extracted (QIAEX II, Qiagen) and used for a third PCR:
0.5μl PCR1的产物0.5 μl PCR1 product
05μl PCR2的产物05 μl PCR2 product
1.0μl核苷酸混合物1.0 μl nucleotide mix
5.0μl带Mg2+的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 +
1.0μl CP-a(10pMol/μl)1.0μl CP-a (10pMol/μl)
1.0μl Oal12ASEco(10pMol/μl)1.0μl Oal12ASEco (10pMol/μl)
1.0μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1.0 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals
40μl H2O40 μl H2O
步骤1:5分钟-95℃Step 1: 5 minutes -95°C
步骤1:30秒-95℃Step 1: 30 seconds -95°C
步骤1:30秒-55℃Step 1: 30 seconds -55°C
步骤4:1分钟-72℃Step 4: 1 minute -72°C
步骤2至4重复30次。Repeat steps 2 to 4 30 times.
通过琼脂糖凝胶电泳对PCR产物进行纯化,从凝胶中提取出PCR产物,用EcoRI进行消化,再次用琼脂糖凝胶电泳进行纯化,按照上述连接到S.cerevisiae表达载体中。The PCR product was purified by agarose gel electrophoresis, extracted from the gel, digested with EcoRI, purified by agarose gel electrophoresis again, and ligated into the S. cerevisiae expression vector as described above.
实施例12 oal的表达The expression of embodiment 12 oal
(a)在E.coli中:(a) In E.coli:
在E.coli M15中表达pQE80oal、pQE80oalS、pQE80oalhis和pQE80oalNhis,所述E.coli M15含有pREP4,其含有lac启动子的阻遏子(见德国,Hilden的Qiagen的表达说明书)。将过夜培养物稀释至600nm的OD为0.1,在30℃培养至OD600nm为1.5。然后用0.5mM IPTG诱导培养物,再在30℃培养6小时。通过离心(5000g,20分钟)收获细胞,冷冻于-80℃。按照厂商提供的方案,用B-PER(Pierce,Rockford,IL,USA)对它们进行裂解。再一次离心步骤之后,将上清液用于对(R)-泛酸内酯的水解。pQE80oal, pQE80oalS, pQE80oalhis and pQE80oalNhis were expressed in E. coli M15 containing pREP4 which contains the repressor of the lac promoter (see expression instructions from Qiagen, Hilden, Germany). The overnight culture was diluted to an OD600nm of 0.1 and grown at 30°C to an OD600nm of 1.5. The cultures were then induced with 0.5 mM IPTG and incubated at 30°C for 6 hours. Cells were harvested by centrifugation (5000 g, 20 minutes) and frozen at -80°C. They were cleaved with B-PER (Pierce, Rockford, IL, USA) according to the protocol provided by the manufacturer. After another centrifugation step, the supernatant was used for the hydrolysis of (R)-pantolactone.
在含有构建体pQE80oalS和pQE80oalhis的转化子中,上清液和细胞裂解液中都没有发现活性,而用构建体pQE80oal和pQE80oalNhis的情况下,可溶蛋白质中的大约5%是具有活性的(R)-泛酸内酯酶。In transformants containing constructs pQE80oalS and pQE80oalhis, no activity was found in the supernatants or cell lysates, whereas with constructs pQE80oal and pQE80oalNhis approximately 5% of the soluble protein was active (R )-pantothenic acid lactonase.
用与没有His-标签的基因同样的方法,来表达含有N-末端His-标签的构建体。用来自Qiagen(Hilden,德国)的“QIAexpressionist”所述的方案,从细胞裂解液中对天然形式的加上His-标签的蛋白质进行纯化。The construct containing the N-terminal His-tag was expressed in the same way as the gene without the His-tag. His-tagged protein in native form was purified from cell lysates using the protocol described by "QIAexpressionist" from Qiagen (Hilden, Germany).
(b)在S.cerevisiae中:(b) In S. cerevisiae:
构建体oalEco、oalES、oalEhis、oalENhis和oalsec在S.cerevisiaeINVScl或类似相当的菌株中表达。转化并在SDura-培养基[Sherman et al.,Laboratory course manual for methods in yeast genetics,Cold Spring HarborLaboratory,Cold Spring Harbor,New York(1986)]上涂布之后,在30℃对平板进行3至4天的培养,捡出培养出的菌落,将其转移至2ml SDura-液体培养基中,在猛烈振荡下于30℃培养3天。上述培养物被用作为25mlSDura-培养基的预培养物。在猛烈振荡下于30℃再培养3天后,用制备的培养物[Sherman et al.,如上所述]接种21瓶中的500ml YPD培养基。再在相同条件下培养3天之后,通过离心从培养基中分离出细胞,用Y-PER(Pierce,Rockford,IL)进行裂解。The constructs oalEco, oalES, oalEhis, oalENhis and oalsec were expressed in S. cerevisiaeINVScl or similar equivalent strains. After transformation and plating on SD ura -medium [Sherman et al., Laboratory course manual for methods in yeast genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1986)], the plates were incubated at 30°C for 3 to After 4 days of cultivation, the cultured colonies were picked out, transferred to 2ml SD ura- liquid medium, and cultured at 30° C. for 3 days under vigorous shaking. The above culture was used as a pre-culture in 25 ml of SD ura -medium. After an additional 3 days of cultivation at 30°C with vigorous shaking, 500 ml of YPD medium in 21 flasks was inoculated with the prepared culture [Sherman et al., as described above]. After an additional 3 days of culture under the same conditions, cells were separated from the medium by centrifugation and lysed with Y-PER (Pierce, Rockford, IL).
如在E.coli中表达的情况一样,仅有oalEco和oalENhis构建体导致了活性(R)-泛酸内酯酶的表达,其仅在细胞裂解液中被发现。用75ml YPD培养物,也可以产生300U的(R)-泛酸内酯酶。相应的裂解液显示大约5U/mg总蛋白的比活性。As in the case of expression in E. coli, only the oalEco and oalENhis constructs resulted in the expression of active (R)-pantothenolactonase, which was found only in cell lysates. 300 U of (R)-pantolactonase can also be produced with 75 ml of YPD culture. The corresponding lysates showed a specific activity of approximately 5 U/mg total protein.
用上述内含子位置的序列数据,还可用类似的方式对来自A.awamori的oal基因和来自A.niger ATCC46951的oal基因进行构建和表达。The oal gene from A. awamori and the oal gene from A. niger ATCC46951 can also be constructed and expressed in a similar manner using the sequence data of the above intron position.
实施例13对在E.coli或S.cerevisiae中异源表达的Oal进行固定,以及Example 13 fixes Oal heterologously expressed in E.coli or S.cerevisiae, and 将其用于对(RS)-泛酸内酯的水解拆分Use it for the hydrolytic resolution of (RS)-pantolactone
为在生物反应器中发挥多种用途,对来自A.niger的(R)-泛酸内酯酶进行固定。在E.coli或S.cerevisiae中表达之后,通过化学手段,例如B-PER(E.coli)或Y-PER(S.cerevisiae)(Pierce,Rockford,IL,USA),或通过机械手段例如超声波或高压对细胞进行破碎。通过离心来澄清裂解液。将制剂直接用于固定。或者,在固定之前,用另一项纯化步骤,例如硫酸铵沉淀,来进一步地增加Oal比活性。(R)-Pantolactonase from A. niger was immobilized for multiple uses in bioreactors. After expression in E.coli or S.cerevisiae, by chemical means such as B-PER (E.coli) or Y-PER (S.cerevisiae) (Pierce, Rockford, IL, USA), or by mechanical means such as ultrasound Or high pressure to disrupt the cells. Lysates were clarified by centrifugation. Preparations were used directly for fixation. Alternatively, use another purification step, such as ammonium sulfate precipitation, prior to fixation to further increase Oal specific activity.
按照下述方法来固定该生物催化剂:The biocatalyst was immobilized as follows:
(1)在二氧化硅溶胶凝胶(Silica Sol-Gel)上固定:(1) Immobilized on Silica Sol-Gel:
按照文献[Gill and Ballesteros,J.Am.Chem.Soc.120:8587-8598(1998)]所述来制备聚(硅酸甘油)-1.0(PGS-1.0),将其溶于其质量一半的冰冷的水中。取其中100或200mg的部分,与50或100mg冰冷的生物催化剂贮存液制剂(来自若干制剂的可溶的酶)彻底地混合,混合于2ml或5ml小瓶中的50mM磷酸盐中进行,pH7.0,将小瓶轻柔旋转2分钟,以在瓶壁上形成水凝胶的薄层。将小瓶置于冰上20分钟,然后(开启)转移至冰箱。水凝胶在5℃成熟48-72小时,以形成干凝胶(xerogel)。通过在5℃以100rpm振荡,用2×1或2×4ml pH为7的50mM磷酸缓冲液来洗涤有干凝胶的小瓶,然后是2×1或2×4ml pH为7的50mM TEA-乙酸盐,其中含有10mM乙酸镁。According to the literature [Gill and Ballesteros, J.Am.Chem.Soc.120:8587-8598 (1998)], poly(glycerol silicate)-1.0 (PGS-1.0) was prepared and dissolved in half of its mass icy water. A 100 or 200 mg portion of this was mixed thoroughly with 50 or 100 mg of ice-cold biocatalyst stock solution preparation (soluble enzyme from several preparations) in 50 mM phosphate in 2 ml or 5 ml vials, pH 7.0 , swirling the vial gently for 2 minutes to form a thin layer of hydrogel on the vial walls. Place the vial on ice for 20 minutes, then transfer (open) to the refrigerator. The hydrogel was matured at 5°C for 48-72 hours to form a xerogel. Wash the vial with xerogel with 2 x 1 or 2 x 4 ml of 50 mM phosphate buffer pH 7 followed by 2 x 1 or 2 x 4 ml of 50 mM TEA-B at pH 7 by shaking at 100 rpm at 5°C. acid salt, which contains 10mM magnesium acetate.
(2)将Oal固定于二氧化硅聚乙烯醇溶胶凝胶(Sol-Gel Silica-Polyvinyl Alcohol)上:(2) Fix Oal on silica polyvinyl alcohol sol gel (Sol-Gel Silica-Polyvinyl Alcohol):
方法与(1)中类似,不同之处是在与内酯酶贮存液结合之前,DGS溶液要与恰当数量的聚乙烯醇(PVA,来自Aldrich,13%w/w的水中的85K)混合。后续方法如(1)所示。The method was similar to (1), except that the DGS solution was mixed with the appropriate amount of polyvinyl alcohol (PVA, 85K in water at 13% w/w from Aldrich) before combining with the lactonase stock solution. The follow-up method is shown in (1).
(3)将Oal固定到用戊二醛活化并支持的聚乙烯亚胺上:(3) Oal is fixed on the polyethylenimine activated and supported with glutaraldehyde:
将聚乙烯亚胺(1g,无水,高分子量,分支聚合物,来自Aldrich)、CA(0.2g,36%Ac)和乙酸丁酸纤维素(0.2g,48%的乙酸和丁酸含量,来自Aldrich)溶于5mL的二氯甲烷中,用溶液对7g漂白土(Fullers Earth,30-60目,来自Aldrich)进行包衣。经过湿包衣的材料在空气中于室温下干燥0.5小时,直到给出大约9g的被包衣的漂白土。然后将其重新悬浮于戊二醛(8mL,50%w/w,来自Aldrich)和磷酸缓冲液(50mL,50mM,pH 8)的冰冷的混合物中,以200rpm搅拌2小时。用水洗(4×10mL)经过活化的支持物,然后用磷酸缓冲液洗(4×10mL,20mM,pH 8),然后排出液体。将湿的支持物与冰冷的内酯酶贮存液(50mM磷酸中,pH8)混合,以200rpm对悬浮液进行20-30小时的搅拌。将液体从被固定的酶上轻柔移走,用磷酸盐洗(3×5mL)湿的固定物,用乙醇胺(20mL,20mM,pH 8)进行处理,再用磷酸盐洗(2×5mL,20mM,pH 7),然后排出液体。Polyethyleneimine (1 g, anhydrous, high molecular weight, branched polymer from Aldrich), CA (0.2 g, 36% Ac) and cellulose acetate butyrate (0.2 g, 48% acetic acid and butyric acid content, ex Aldrich) was dissolved in 5 mL of dichloromethane and 7 g of Fullers Earth (Fullers Earth, 30-60 mesh, ex Aldrich) was coated with the solution. The wet-coated material was dried in air at room temperature for 0.5 hours until approximately 9 g of coated Fuller's earth was given. It was then resuspended in an ice-cold mixture of glutaraldehyde (8 mL, 50% w/w from Aldrich) and phosphate buffer (50 mL, 50 mM, pH 8) and stirred at 200 rpm for 2 hours. The activated support was washed with water (4×10 mL), then with phosphate buffer (4×10 mL, 20 mM, pH 8), and the liquid was drained. The wet support was mixed with ice-cold lactonase stock solution (50 mM phosphoric acid, pH 8) and the suspension was stirred at 200 rpm for 20-30 hours. The liquid was gently removed from the immobilized enzyme, the wet fixation was washed with phosphate (3 × 5 mL), treated with ethanolamine (20 mL, 20 mM, pH 8), and washed with phosphate (2 × 5 mL, 20 mM , pH 7), and then drain the liquid.
(4)用于对(RS)-泛酸内酯进行水解拆分的反应条件(4) Reaction conditions for hydrolytic resolution of (RS)-pantolactone
反应在2或4mL的小瓶中进行,其中使用50或100mg的生物催化剂以及1或2mL的0.5M外消旋内酯(在0.75M三乙胺-乙酸盐中,pH8.5,含有20mM的乙酸镁)。将小瓶在40℃、200rpm下培养必需的时间段,将溶液排出,并用手性HPLC进行分析,在开始下一次循环之前用新鲜的底物溶液洗(2×1或2mL)催化剂。用等体积的500mM MES缓冲液来终止用于分析的样品的反应,所述缓冲液pH为5.5,其中含有50mM EDTA,离心(10000g,10分钟)然后通过HPLC进行分析,其中使用0.46×15cm的CHIRADEX柱,用70∶30的水-甲醇进行洗脱,1mL/分钟,在20℃进行。在15分钟的时候对初始速度进行测量,在每个循环的末尾测定E值。Reactions were carried out in 2 or 4 mL vials using 50 or 100 mg of biocatalyst and 1 or 2 mL of 0.5 M racemic lactone (in 0.75 M triethylamine-acetate, pH 8.5, containing 20 mM magnesium acetate). The vials were incubated at 40°C, 200 rpm for the necessary period of time, the solution was drained and analyzed by chiral HPLC, the catalyst was washed (2 x 1 or 2 mL) with fresh substrate solution before starting the next cycle. The reaction of samples for analysis was stopped with an equal volume of 500 mM MES buffer, pH 5.5 containing 50 mM EDTA, centrifuged (10000 g, 10 min) and then analyzed by HPLC using a 0.46 x 15 cm CHIRADEX column, eluted with 70:30 water-methanol, 1 mL/min, at 20°C. Initial velocity measurements were taken at 15 minutes and E values were determined at the end of each cycle.
表2中概括了选用的被固定的酶制剂在对(RS)-泛酸内酯进行分批拆分过程中的作用。溶胶凝胶和共价固定方案都被证明为有效的,甚至在24个每批为1小时的循环后,催化剂保留了其初始活性的62-72%,并显示出:对映异构体的过量(excess)值和对(R)-泛酸内酯的对映体选择性分别为89-98%和71-88%。此外,当延长批次进行的时间进行试验时,催化剂运作方式也是非常相似的,即当每批反应进行一天而不是1小时的情况下,其展示出了良好的长时间操作稳定性。Table 2 summarizes the effect of selected immobilized enzyme preparations on the batch resolution of (RS)-pantolactone. Both sol-gel and covalent immobilization schemes proved to be effective, and even after 24 1-hour batch cycles, the catalyst retained 62–72% of its initial activity and showed: Excess values and enantioselectivities to (R)-pantolactone were 89-98% and 71-88%, respectively. Furthermore, the catalyst behaved very similarly when the batches were run for extended periods of time, showing good long-term operational stability when each batch was run for a day instead of 1 hour.
表2:通过被固定的内酯酶制剂对(RS)-泛酸内酯进行分批拆分Table 2: Batch resolution of (RS)-pantolactone by immobilized lactonase preparations
催化剂*:Biocatalyst/Prod.Organism(Recomb.Activity)Catalyst * : Biocatalyst/Prod.Organism(Recomb.Activity)
实施例14被固定的由重组E.coli表达的Oal对(RS)-泛酸内酯进行连续The Oal expressed by recombinant E.coli that is fixed in embodiment 14 is carried out continuously to (RS)-pantolactone 拆分的应用split app
按照实施例14(2)中所述来制备的经溶胶凝胶-PVA固定的重组内酯酶生物催化剂(E.coli中表达的),被用于在填充床(packed-bed)反应器中对外消旋内酯进行连续拆分:将被固定的生物催化剂(1.1g,0.21kUg-1,总共0.231kU)干填入带有Teflon末端(endpiece)的0.46×15cm的Omnifit加套玻璃柱中。其与1×10cm的Omnifit加套玻璃柱相连,1×10cm的玻璃柱中填有1-2mm的纤维素珠,其是通过两个装有50mL玻璃/Teflon注射器的注射器泵填入的。这些柱都与循环水浴相连,拆分进行期间被保持在40℃。按照下述方法对反应器进行调节:室温下以5mL min-1的速率加入Tris-乙酸盐缓冲液(100mM,pH 7,含有100mM乙酸镁),进行30分钟;接着是在室温下以1∶1的体积比,2mL min-1的总流速,加入外消旋内酯溶液(水中1.5M)和Tris-乙酸缓冲液(2.5M,pH8.25,含有50mM乙酸镁)的组合,进行30分钟。然后将流速降为1.2mL min-1,将柱子加热到40℃,系统被允许平衡15分钟。因此对反应器进行了大约45分钟的操作,在冰浴上收集洗脱液,在此条件下,以46-48%的不变的转化率对总共52.7mL的进料物(相当于5.14g RSPL)进行了处理。用硫酸(10%v/v水溶液)将洗脱液的pH调为6.5,用二氯甲烷对溶液进行萃取(10×75mL),有机层在无水硫酸镁上干燥,旋转式蒸发,产生由10%的RPL和90%的SPL(由手性HPLC分析得出)组成的(S)-泛酸内酯富含物(2.67g,理论上98%)。用硫酸(40%w/w)对水相进行酸化,至pH1.5,加热至70℃,1小时,再在冰上冷却,用氯化钠饱和,然后用二氯甲烷萃取(10×75mL),有机层在无水硫酸镁上干燥,旋转式蒸发,产生(R)-泛酸内酯富含物(2.23g,理论产量92%),其对映异构体纯度为93%。对(R)-/(S)-的组合回收率为95%。A sol-gel-PVA immobilized recombinant lactonase biocatalyst (expressed in E. coli) prepared as described in Example 14(2) was used in a packed-bed reactor Sequential resolution of racemic lactones: The immobilized biocatalyst (1.1 g, 0.21 kUg -1 , 0.231 kU in total) was dry packed into a 0.46×15 cm Omnifit jacketed glass column with Teflon endpieces . It was attached to a 1 x 10 cm Omnifit jacketed glass column filled with 1-2 mm cellulose beads via two syringe pumps fitted with 50 mL glass/Teflon syringes. These columns were connected to a circulating water bath, which was maintained at 40°C during the resolution. The reactor was conditioned as follows: Tris-acetate buffer (100 mM, pH 7, containing 100 mM magnesium acetate) was added at a rate of 5 mL min at room temperature for 30 minutes; followed by 1 :1 volume ratio, total flow rate of 2 mL min -1 , adding a combination of racemic lactone solution (1.5M in water) and Tris-acetic acid buffer (2.5M, pH 8.25, containing 50mM magnesium acetate) for 30 minute. The flow rate was then reduced to 1.2 mL min -1 , the column was heated to 40°C, and the system was allowed to equilibrate for 15 minutes. The reactor was thus operated for approximately 45 minutes, collecting the eluate on an ice bath, under which conditions a total of 52.7 mL of feed (corresponding to 5.14 g RSPL) were processed. The pH of the eluent was adjusted to 6.5 with sulfuric acid (10% v/v aqueous solution), the solution was extracted with dichloromethane (10×75 mL), the organic layer was dried over anhydrous magnesium sulfate, and the rotary evaporation yielded (S)-Pantolactone enrichment (2.67 g, 98% of theory) consisting of 10% RPL and 90% SPL (by chiral HPLC analysis). The aqueous phase was acidified with sulfuric acid (40% w/w) to pH 1.5, heated to 70 °C for 1 h, cooled on ice, saturated with NaCl, and extracted with dichloromethane (10 x 75 mL ), the organic layer was dried over anhydrous magnesium sulfate and rotary evaporated to yield (R)-pantolactone rich (2.23 g, 92% of theoretical yield) with an enantiomeric purity of 93%. The combined recovery for (R)-/(S)- was 95%.
实施例(13)和(14)的结果清楚地表明,重组内酯酶对于水解拆分外消旋泛酸内酯来说是有效的生物催化剂,该催化剂还可被有效固定,固定物可用于生产对映异构过量达90-98%的高度富集的(R)-泛酸内酯,不管是分批操作还是连续操作的情况下都如此。The results of Examples (13) and (14) clearly show that the recombinant lactonase is an effective biocatalyst for the hydrolytic resolution of racemic pantothenate lactone, the catalyst can also be effectively fixed, and the fixed material can be used for the production of Highly enriched (R)-pantolactone with an enantiomeric excess of 90-98%, both in the case of batch and continuous operations.
实施例15克隆和表达来自B.subtilis的(S)-泛酸内酯特异性内酯酶Example 15 Cloning and expression of (S)-pantolactone-specific lactonase from B. subtilis
用来自牛和马肝脏的内酯酶的氨基酸序列,我们发现了若干种其它显示与上述序列具有有限同源性的序列。通过PCR将它们中间来自B.subtilis的yvre基因(YVRE BACSU,被注释为SMP30/CGR1家族的成员/假设的33.2kDa蛋白)从基因组DNA中克隆出来,其中使用的引物的5’和3’-末端还含有后续向表达载体中克隆的步骤所需要的限制性位点(EcoRI和PstI)。PCR条件与从A.niger的基因组DNA分离oal基因(见实施例8)所用的条件相同。用EcoRI和PstI来消化PCR产物和载体(来自Stratagene(La Jolla,CA,USA)的PET41a),通过琼脂糖凝胶电泳来纯化,连接,并将其转化进E.coli Top10细胞(Stratagene,La Jolla,CA,USA)。使用该种策略,该基因被融合进来自E.coli的GST蛋白基因的阅读框中。Using the amino acid sequences of lactonases from bovine and equine liver, we found several other sequences showing limited homology to the above sequences. Among them, the yvre gene from B. subtilis (YVRE BACSU, annotated as a member of the SMP30/CGR1 family/hypothetical 33.2kDa protein) was cloned from genomic DNA by PCR using the 5' and 3'- The ends also contained restriction sites (EcoRI and PstI) required for subsequent cloning steps into expression vectors. The PCR conditions were the same as those used to isolate the oal gene from A. niger genomic DNA (see Example 8). The PCR product and vector (PET41a from Stratagene (La Jolla, CA, USA)) were digested with EcoRI and PstI, purified by agarose gel electrophoresis, ligated, and transformed into E.coli Top10 cells (Stratagene, La Jolla, CA). Jolla, CA, USA). Using this strategy, the gene was fused into the reading frame of the GST protein gene from E. coli.
根据厂商提供的方案来进行表达和纯化。用其中额外含有5mM CaCl2的标准试验在40℃来测定被纯化的蛋白质的活性。该酶显示了对(S)-泛酸内酯的非常高的选择性。该制剂具有相对于每mg富集融合蛋白1-2U的比活性。该酶仅在Mg2+或Ca2+离子存在的情况下具有活性。Expression and purification were performed according to the protocol provided by the manufacturer. The activity of the purified protein was determined using a standard assay at 40°C with additional 5 mM CaCl2 . This enzyme shows very high selectivity towards (S)-pantolactone. This preparation has a specific activity of 1-2 U per mg of enriched fusion protein. The enzyme is only active in the presence of Mg 2+ or Ca 2+ ions.
序列表sequence listing
<110>帝斯曼知识产权资产管理有限公司<110> DSM Intellectual Property Asset Management Co., Ltd.
<120>新颖的酶<120> Novel enzymes
<130>21717<130>21717
<160>99<160>99
<170>PatentIn version 3.1<170>PatentIn version 3.1
<210>1<210>1
<211>900<211>900
<212>DNA<212>DNA
<213>Equus caballus<213> Equus caballus
<220><220>
<221>CDS<221> CDS
<222>(1)..(900)<222>(1)..(900)
<223><223>
<400>1<400>1
atg tct tcc att aag att gag tgt gtt ctg ccc gag aac tgc cag tgc 48atg tct tcc att aag att gag tgt gtt ctg ccc gag aac tgc cag tgc 48
Met Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln CysMet Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln Cys
1 5 10 151 5 10 15
ggt gag tcg cca gtg tgg gag gaa gcg tcc agc tct ctg ctc ttc gta 96ggt gag tcg cca gtg tgg gag gaa gcg tcc agc tct ctg ctc ttc gta 96
Gly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe ValGly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe Val
20 25 3020 25 30
gat att cct gcc aaa aag gtt tgc cgg tgg gat gcg ctc agc aag caa 144gat att cct gcc aaa aag gtt tgc cgg tgg gat gcg ctc agc aag caa 144
Asp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys GlnAsp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys Gln
35 40 4535 40 45
gtg cag cga atg acc gtg gat gcc ccg gtt ggc tca gtg gcc ctc cgc 192gtg cag cga atg acc gtg gat gcc ccg gtt ggc tca gtg gcc ctc cgc 192
Val Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu ArgVal Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu Arg
50 55 6050 55 60
cag tcg gga ggc tat gtc gcc aca att gga acg aag ttc tgt gct ttg 240cag tcg gga ggc tat gtc gcc aca att gga acg aag ttc tgt gct ttg 240
Gln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala LeuGln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala Leu
65 70 75 8065 70 75 80
aac tgg gaa gat cag tca gta gtt gtc ctg gcc gcg gta gag aaa gac 288aac tgg gaa gat cag tca gta gtt gtc ctg gcc gcg gta gag aaa gac 288
Asn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys AspAsn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys Asp
85 90 9585 90 95
aag aaa aac aat cga ttc aat gac ggg aag gtg gat ccc gcg ggg aga 336aag aaa aac aat cga ttc aat gac ggg aag gtg gat ccc gcg ggg aga 336
Lys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly ArgLys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly Arg
100 105 110100 105 110
tac ttt gct ggc acc atg gct gag gaa aca gcc ccg gca gtt acg gag 384tac ttt gct ggc acc atg gct gag gaa aca gcc ccg gca gtt acg gag 384
Tyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr GluTyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr Glu
115 120 125115 120 125
cgg cac caa ggg tcc ctg tac gcc ctc ttt cct gac cac cac gtg gaa 432cgg cac caa ggg tcc ctg tac gcc ctc ttt cct gac cac cac gtg gaa 432
Arg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val GluArg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val Glu
130 135 140130 135 140
aag tac ttt gac cag gtg gac atc tcc aac ggt ttg gat tgg tcc ctg 480aag tac ttt gac cag gtg gac atc tcc aac ggt ttg gat tgg tcc ctg 480
Lys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser LeuLys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser Leu
145 150 155 160145 150 155 160
gac cac aaa atc ttc tat tac atc gac agc ctg tcc tac tct gtg gat 528gac cac aaa atc ttc tat tac atc gac agc ctg tcc tac tct gtg gat 528
Asp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val AspAsp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val Asp
165 170 175165 170 175
gcc ttt gac tatgac ctg ctg aca gga agg atc tct aac cgc aga agt 576gcc ttt gac tatgac ctg ctg aca gga agg atc tct aac cgc aga agt 576
Ala Phe Asp Tyr Asp Leu Leu Thr Gly ArgIle Ser Asn Arg Arg SerAla Phe Asp Tyr Asp Leu Leu Thr Gly ArgIle Ser Asn Arg Arg Ser
180 185 190180 185 190
gtt tac aag ctg gag aag gaa gaa cat atc cca gat gga atg tgt att 624gtt tac aag ctg gag aag gaa gaa cat atc cca gat gga atg tgt att 624
Val Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys IleVal Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys Ile
195 200 205195 200 205
gat act gag ggg aag ctc tgg gtg gcc tgt tac aac gga gga aga gtg 672gat act gag ggg aag ctc tgg gtg gcc tgt tac aac gga gga aga gtg 672
Asp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg ValAsp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg Val
210 215 220210 215 220
att cgt tta gat cct gag aca ggg aaa aga ctc caa act gtg aag ttg 720att cgt tta gat cct gag aca ggg aaa aga ctc caa act gtg aag ttg 720
Ile Arg Leu Asp pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys LeuIle Arg Leu Asp pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys Leu
225 230 235 240225 230 235 240
cct gtt gat aaa acc act tcg tgc tgc ttc gga ggg aag gat tac tct 768cct gtt gat aaa acc act tcg tgc tgc ttc gga ggg aag gat tac tct 768
Pro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr SerPro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr Ser
245 250 255245 250 255
gaa atg tac gtg acc tgc gcc cgg gct gga ctc gac ccc gag gct ctc 816gaa atg tac gtg acc tgc gcc cgg gct gga ctc gac ccc gag gct ctc 816
Glu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala LeuGlu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala Leu
260 265 270260 265 270
tcg cgg cag cct gaa gct ggt ggc att ttc aag ata act ggc ctg ggg 864tcg cgg cag cct gaa gct ggt ggc att ttc aag ata act ggc ctg ggg 864
Ser Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu GlySer Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu Gly
275 280 285275 280 285
gtc aaa gga att cct ccc tat gcc tac gca gga tga 900gtc aaa gga att cct ccc tat gcc tac gca gga tga 900
Val Lys Gly Ile Pro Pro Tyr Ala Tyr Ala GlyVal Lys Gly Ile Pro Pro Tyr Ala Tyr Ala Gly
290 295290 295
<210>2<210>2
<211>299<211>299
<212>PRT<212>PRT
<213>Equus caballus<213> Equus caballus
<400>2<400>2
Met Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln CysMet Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln Cys
1 5 10 151 5 10 15
Gly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe ValGly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe Val
20 25 3020 25 30
Asp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys GlnAsp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys Gln
35 40 4535 40 45
Val Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu ArgVal Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu Arg
50 55 6050 55 60
Gln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala LeuGln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala Leu
65 70 75 8065 70 75 80
Asn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys AspAsn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys Asp
85 90 9585 90 95
Lys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly ArgLys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly Arg
100 105 110100 105 110
Tyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr GluTyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr Glu
115 120 125115 120 125
Arg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val GluArg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val Glu
130 135 140130 135 140
Lys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser LeuLys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser Leu
145 150 155 160145 150 155 160
Asp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val AspAsp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val Asp
165 170 175165 170 175
Ala Phe Asp Tyr Asp Leu Leu Thr Gly Arg Ile Ser Asn Arg Arg SerAla Phe Asp Tyr Asp Leu Leu Thr Gly Arg Ile Ser Asn Arg Arg Ser
180 185 190180 185 190
Val Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys IleVal Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys Ile
195 200 205195 200 205
Asp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg ValAsp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg Val
210 215 220210 215 220
Ile Arg Leu Asp Pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys LeuIle Arg Leu Asp Pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys Leu
225 230 235 240225 230 235 240
Pro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr SerPro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr Ser
245 250 255245 250 255
Glu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala LeuGlu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala Leu
260 265 270260 265 270
Ser Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu GlySer Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu Gly
275 280 285275 280 285
Val Lys Gly Ile Pro Pro Tyr Ala Tyr Ala GlyVal Lys Gly Ile Pro Pro Tyr Ala Tyr Ala Gly
290 295290 295
<210>3<210>3
<211>879<211>879
<212>DNA<212>DNA
<213>Bacillus subtilis<213>Bacillus subtilis
<220><220>
<221>CDS<221> CDS
<222>(1)..(879)<222>(1)..(879)
<223><223>
<400>3<400>3
atg gat gca gta ttg gaa gca gac act cgg gca gtg att ggt gaa ggc 48atg gat gca gta ttg gaa gca gac act cgg gca gtg att ggt gaa ggc 48
Met Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu GlyMet Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu Gly
1 5 10 151 5 10 15
ccg tta tgg gat gaa gag aac ggc cgc tta tat tgg gtt gat atc ctg 96ccg tta tgg gat gaa gag aac ggc cgc tta tat tgg gtt gat atc ctg 96
Pro Leu Trp Asp Glu Glu Asn Gly Arg Leu Tyr Trp Val Asp Ile LeuPro Leu Trp Asp Glu Glu Asn Gly Arg Leu Tyr Trp Val Asp Ile Leu
20 25 3020 25 30
ggg agc gag ctc cac atc ttt gac cct gaa gaa aaa atc aac cga tca 144ggg agc gag ctc cac atc ttt gac cct gaa gaa aaa atc aac cga tca 144
Gly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg SerGly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg Ser
35 40 4535 40 45
atc aaa ttc aaa tcc ttt gtg acg gcg ctt gcg aaa tat tca aag gat 192atc aaa ttc aaa tcc ttt gtg acg gcg ctt gcg aaa tat tca aag gat 192
Ile Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys AspIle Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys Asp
50 55 6050 55 60
gaa ctg att atg acg atg aag gac ggg ttt tac ctg tat cat ctt cgg 240gaa ctg att atg acg atg aag gac ggg ttt tac ctg tat cat ctt cgg 240
Glu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu ArgGlu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu Arg
65 70 75 8065 70 75 80
gat gac agc ttg gaa aaa att aaa cag ccg aag gac atg cat gag agc 288gat gac agc ttg gaa aaa att aaa cag ccg aag gac atg cat gag agc 288
Asp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu SerAsp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu Ser
85 90 9585 90 95
ctg aga ttt aat gat gct aaa tgt gac ccg tac gga agg ctt tgg gcg 336ctg aga ttt aat gat gct aaa tgt gac ccg tac gga agg ctt tgg gcg 336
Leu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp AlaLeu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp Ala
100 105 110100 105 110
ggg acg acg agc atg gaa ggc gag caa aaa cag gcg tcg ctg tac cgt 384ggg acg acg agc atg gaa ggc gag caa aaa cag gcg tcg ctg tac cgt 384
Gly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr ArgGly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr Arg
115 120 125115 120 125
ttg aat cta gac ggc agt ctt gtc aaa atc aag gat caa gtc tcc acc 432ttg aat cta gac ggc agt ctt gtc aaa atc aag gat caa gtc tcc acc 432
Leu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser ThrLeu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser Thr
130 135 140130 135 140
tca aac ggt ttg gat tgg gac cgc gag cgg aat ttg atg tat tac atc 480tca aac ggt ttg gat tgg gac cgc gag cgg aat ttg atg tat tac atc 480
Ser Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr IleSer Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr Ile
145 150 155 160145 150 155 160
gac acg ccg acc cag gag att gta cgt tac agc tat gat cct caa agc 528gac acg ccg acc cag gag att gta cgt tac agc tat gat cct caa agc 528
Asp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln SerAsp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln Ser
165 170 175165 170 175
gga gat gtt tcg aat cca gaa cct gtc tat cgt ttt gat cag tca gac 576gga gat gtt tcg aat cca gaa cct gtc tat cgt ttt gat cag tca gac 576
Gly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser AspGly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser Asp
180 185 190180 185 190
gga ttg ccg gac ggc atg aca att gac caa aat ggc atg ctg tgg gtg 624gga ttg ccg gac ggc atg aca att gac caa aat ggc atg ctg tgg gtg 624
Gly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp ValGly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp Val
195 200 205195 200 205
gcg ctg ttt ggc ggc agc cgc gtt gtt cac att gac ccg ttt cag aaa 672gcg ctg ttt ggc ggc agc cgc gtt gtt cac att gac ccg ttt cag aaa 672
Ala Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln LysAla Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln Lys
210 215 220210 215 220
aaa gaa atc aat tca atc agc gtg ccg gct aaa tat gtc acg tgc tgc 720aaa gaa atc aat tca atc agc gtg ccg gct aaa tat gtc acg tgc tgc 720
Lys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys CysLys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys Cys
225 230 235 240225 230 235 240
gcg ttc ggc ggc aga gac tta aaa acc ctt tac att aca acg gca aca 768gcg ttc ggc ggc aga gac tta aaa acc ctt tac att aca acg gca aca 768
Ala Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala ThrAla Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala Thr
245 250 255245 250 255
gaa caa atg aca gaa aaa gag aga tac gag cag cct cac gct gga ggg 816gaa caa atg aca gaa aaa gag aga tac gag cag cct cac gct gga ggg 816
Glu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly GlyGlu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly Gly
260 265 270260 265 270
ttg ttt tca gca caa ctg gaa aca ggc gga tat cag ccg gta cca ttt 864ttg ttt tca gca caa ctg gaa aca ggc gga tat cag ccg gta cca ttt 864
Leu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro PheLeu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro Phe
275 280 285275 280 285
gct gga gac gta taa 879gct gga gac gta taa 879
Ala Gly Asp ValAla Gly Asp Val
290290
<210>4<210>4
<211>292<211>292
<212>PRT<212>PRT
<213>Bacillus subtilis<213>Bacillus subtilis
<400>4<400>4
Met Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu GlyMet Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu Gly
1 5 10 151 5 10 15
Pro Leu Trp Asp Glu Glu ASn Gly Arg Leu Tyr Trp Val Asp Ile LeuPro Leu Trp Asp Glu Glu ASn Gly Arg Leu Tyr Trp Val Asp Ile Leu
20 25 3020 25 30
Gly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg SerGly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg Ser
35 40 4535 40 45
Ile Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys AspIle Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys Asp
50 55 6050 55 60
Glu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu ArgGlu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu Arg
65 70 75 8065 70 75 80
Asp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu SerAsp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu Ser
85 90 9585 90 95
Leu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp AlaLeu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp Ala
100 105 110100 105 110
Gly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr ArgGly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr Arg
115 120 125115 120 125
Leu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser ThrLeu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser Thr
130 135 140130 135 140
Ser Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr IleSer Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr Ile
145 150 155 160145 150 155 160
Asp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln SerAsp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln Ser
165 170 175165 170 175
Gly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser AspGly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser Asp
180 185 190180 185 190
Gly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp ValGly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp Val
195 200 205195 200 205
Ala Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln LysAla Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln Lys
210 215 220210 215 220
Lys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys CysLys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys Cys
225 230 235 240225 230 235 240
Ala Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala ThrAla Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala Thr
245 250 255245 250 255
Glu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly GlyGlu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly Gly
260 265 270260 265 270
Leu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro PheLeu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro Phe
275 280 285275 280 285
Ala Gly Asp ValAla Gly Asp Val
290290
<210>5<210>5
<211>726<211>726
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(1)..(726)<222>(1)..(726)
<223><223>
<400>5<400>5
aac tgg gac ctg gaa aaa ctc aat cct cca ggc ttc ggc cgc aaa ccc 48aac tgg gac ctg gaa aaa ctc aat cct cca ggc ttc ggc cgc aaa ccc 48
Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro
1 5 10 151 5 10 15
ttc gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac 96ttc gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac 96
Phe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp AspPhe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp
20 25 3020 25 30
gag tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga 144gag tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga 144
Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu ArgGlu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg
35 40 4535 40 45
cac cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc 192cac cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc 192
His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg ValHis His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val
50 55 6050 55 60
ttc tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct 240ttc tac ggc ggc acc acc tcc acc acc gag atc ctc gac ccg tcc tca gct 240
Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser AlaPhe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala
65 70 75 8065 70 75 80
cgc atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc 288cgc atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc 288
Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg GlyArg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly
85 90 9585 90 95
gtc ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca 336gtc ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca 336
Val Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile ProVal Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro
100 105 110100 105 110
gta aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc 384gta aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc 384
Val Ash Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu ThrVal Ash Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr
115 120 125115 120 125
atc gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc 432atc gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc 432
Ile Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu PheIle Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe
130 135 140130 135 140
ttg cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag 480ttg cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag 480
Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp GluLeu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu
145 150 155 160145 150 155 160
aag gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag 528aag gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag 528
Lys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu GluLys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu
165 170 175165 170 175
cag agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct 576cag agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct 576
Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala AlaGln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala
180 185 190180 185 190
gtg gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag 624gtg gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag 624
Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu LysVal Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys
195 200 205195 200 205
gag tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att 672gag tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att 672
Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro IleGlu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile
210 215 220210 215 220
ggg gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg 720ggg gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg 720
Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg LeuGly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu
225 230 235 240225 230 235 240
ggc agg 726ggc agg 726
Gly ArgGly Arg
<210>6<210>6
<211>242<211>242
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>6<400>6
Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro
1 5 10 151 5 10 15
Phe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp AspPhe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp
20 25 3020 25 30
Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu ArgGlu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg
35 40 4535 40 45
His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg ValHis His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val
50 55 6050 55 60
Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser AlaPhe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala
65 70 75 8065 70 75 80
Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg GlyArg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly
85 90 9585 90 95
Val Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile ProVal Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro
100 105 110100 105 110
Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu ThrVal Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr
115 120 125115 120 125
Ile Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu PheIle Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe
130 135 140130 135 140
Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp GluLeu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu
145 150 155 160145 150 155 160
Lys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu GluLys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu
165 170 175165 170 175
Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala AlaGln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala
180 185 190180 185 190
Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu LysVal Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys
195 200 205195 200 205
Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro IleGlu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile
210 215 220210 215 220
Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg LeuGly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu
225 230 235 240225 230 235 240
Gly ArgGly Arg
<210>7<210>7
<211>2443<211>2443
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(774)..(977)<222>(774)..(977)
<223><223>
<220><220>
<221>CDS<221> CDS
<222>(1050)..(1820)<222>(1050)..(1820)
<223><223>
<400>7<400>7
ccgctatcac tcagcggcag tcgggcgcag acaatgctac cgaggattca gggcacaagg 60ccgctatcac tcagcggcag tcgggcgcag acaatgctac cgaggattca gggcacaagg 60
gggatcagtt acgtggggcg tgacgatgta tgggtttcat gaggatgcgt ttacgagtgc 120gggatcagtt acgtggggcg tgacgatgta tgggtttcat gaggatgcgt ttacgagtgc 120
gtttggggtg gtggaagagt tgggggcgaa gatcccgttt ccggtggctg atagtcggtt 180gtttggggtg gtggaagagt tgggggcgaa gatcccgttt ccggtggctg atagtcggtt 180
gagagggggt gcagtgaagg aggtgagatg gttggaatat gtgttcagat tggtgttgca 240gagagggggt gcagtgaagg aggtgagatg gttggaatat gtgttcagat tggtgttgca 240
gggggtgcag ggcttgattt tgtggctggt gggagaggga aaagggaagc agaaggcgag 300gggggtgcag ggcttgattt tgtggctggt gggagaggga aaagggaagc agaaggcgag 300
ctagaacctt ttacacactg cctctcgaat gaacattcag aaattttgga tagacaaggt 360ctagaacctt ttacacactg cctctcgaat gaacattcag aaattttgga tagacaaggt 360
ttctgtcggt gctattctgg gcgttgcaat acctgtcaca ttgaataacc tccgaggtga 420ttctgtcggt gctattctgg gcgttgcaat acctgtcaca ttgaataacc tccgaggtga 420
ttatccacgc aatcccatat catctgatgt accccttgca taatctgtct ttaccaaatc 480ttatccacgc aatcccatat catctgatgt accccttgca taatctgtct ttaccaaatc 480
aggtagaaag taagaccatc cggataggct gtatactacc tcgtgtccac atcggggata 540aggtagaaag taagaccatc cggataggct gtatactacc tcgtgtccac atcggggata 540
ctgaacacga ttgtatatca attcgatgat ggagatacaa ccctaatgaa ccaagcacag 600ctgaacacga ttgtatatca attcgatgat ggagatacaa ccctaatgaa ccaagcacag 600
ccgtggatct ccccggactt tctccgatgg cttccccacc cgccaatcca gcatcaacta 660ccgtggatct ccccggactt tctccgatgg cttccccacc cgccaatcca gcatcaacta 660
aacaacccct cacgctcttc atcatctctc atttcactcc cacctatcat ctacagcttc 720aacaacccct cacgctcttc atcatctctc atttcactcc cacctatcat ctacagcttc 720
acaaacaaca ctactccctc ccaagcacac acctctactg acaagatacc acg atg 776acaaacaaca ctactccctc ccaagcacac acctctactg acaagatacc acg atg 776
MetMet
1 1
gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc 824gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc 824
Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg ArgAla Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg
5 10 155 10 15
tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc 872tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc 872
Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met LeuTyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu
20 25 3020 25 30
cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag atc 920cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag atc 920
Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln IleArg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile
35 40 4535 40 45
cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac 968cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac 968
Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu AsnGln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn
50 55 60 6550 55 60 65
cca cca ggt accctccctt ctacctcagt gacataggtt cacccccaac 1017cca cca ggt accctccctt ctacctcagt gacataggtt cacccccaac 1017
Pro Pro GlyPro Pro Gly
ctcacccatc aacctaactt aaaaacatag gc ttc ggc cgc aaa ccc ttc gcc 1070ctcacccatc aacctaactt aaaaacatag gc ttc ggc cgc aaa ccc ttc gcc 1070
Phe Gly Arg Lys Pro Phe AlaPhe Gly Arg Lys Pro Phe Ala
70 7570 75
cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac 1118cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac 1118
His His Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu TyrHis His Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr
80 85 9080 85 90
cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac 1166cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac 1166
His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His HisHis Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His
95 100 10595 100 105
aac gcc ccg gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac 1214aac gcc ccg gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac 1214
Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe TyrAsn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr
110 115 120110 115 120
ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc 1262ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc 1262
Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg IleGly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile
125 130 135125 130 135
ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc 1310ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc 1310
Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val LeuGly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu
140 145 150 155140 145 150 155
atc gac tac gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac 1358atc gac tac gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac 1358
Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val AsnIle Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn
160 165 170160 165 170
gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg 1406gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg 1406
Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile AlaAla Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala
175 180 185175 180 185
aag gaa tgc aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc 1454aag gaa tgc aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc 1454
Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu ArgLys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg
190 195 200190 195 200
gtc ggt cta ccg act aca tgg gat aac atg acc gat gag gag agg gtc 1502gtc ggt cta ccg act aca tgg gat aac atg acc gat gag gag agg gtc 1502
Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg ValVal Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val
205 210 215205 210 215
aag tat agt cag cag gaa acg ccc aac cat gca gga tta gag cag agt 1550aag tat agt cag cag gaa acg ccc aac cat gca gga tta gag cag agt 1550
Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln SerLys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser
220 225 230 235220 225 230 235
gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg 1598gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg 1598
Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val AlaGlu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala
240 245 250240 245 250
ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg 1646ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg 1646
Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu TrpGly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp
255 260 265255 260 265
gat tta cat cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag 1694gat tta cat cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag 1694
Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly GluAsp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu
270 275 280270 275 280
atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg 1742atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg 1742
Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly ArgMet Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg
285 290 295285 290 295
tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg 1790tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg 1790
Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly ValTrp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val
300 305 310 315300 305 310 315
tcg agt ccg ccg aat tgt atg gca att ttt taaaatgcag aggtggtgtg 1840tcg agt ccg ccg aat tgt atg gca att ttt taaaatgcag aggtggtgtg 1840
Ser Ser Pro Pro Asn Cys Met Ala Ile PheSer Ser Pro Pro Asn Cys Met Ala Ile Phe
320 325320 325
ggaacggagt tggatggacg atggagtgga tagaacagcg ggagttgatg tattgagctg 1900ggaacggagt tggatggacg atggagtgga tagaacagcg ggagttgatg tattgagctg 1900
gggccctagt catgatttgt ctgaatctgg agactataca tagcaccttc agggcatcaa 1960gggccctagt catgatttgt ctgaatctgg agactataca tagcaccttc agggcatcaa 1960
gaggcataat atgcagttat acttgtgata ttacattctt tggccacagt tatgactgtc 2020gaggcataat atgcagttat acttgtgata ttacattctt tggccacagt tatgactgtc 2020
tcttatattg actttgaagt gtcttctttg agccgagtat acttggatta taggattcca 2080tcttatattg actttgaagt gtcttctttg agccgagtat acttggatta taggattcca 2080
atgacggtac atcccgaagg tctgtcccag cacttcccta atcgcagtgg taatacctac 2140atgacggtac atcccgaagg tctgtcccag cacttcccta atcgcagtgg taatacctac 2140
tactagccag tatatctaaa gtacaccact cggaactcat tcacatatac gaatggtgac 2200tactagccag tatatctaaa gtacaccact cggaactcat tcacatatac gaatggtgac 2200
tatgaagggg taatacattc acaacaagat cagtaaatag cagttgcgag gaatgcaaga 2260tatgaagggg taatacattc acaacaagat cagtaaatag cagttgcgag gaatgcaaga 2260
ctagcccttc gtacattcaa tatcagaagt tgggactata aaggccactg cctccctcac 2320ctagcccttc gtacattcaa tatcagaagt tgggactata aaggccactg cctccctcac 2320
tagcccgtaa agcaatacct cgctgctacc attgccaact gcagggtgtt tccagtaaac 2380tagcccgtaa agcaatacct cgctgctacc attgccaact gcagggtgtt tccagtaaac 2380
ggagtggctg ttcccaccgt gcagcactct gtacccactc cccttcctgc tgtcattctg 2440ggagtggctg ttcccaccgt gcagcactct gtacccactc cccttcctgc tgtcattctg 2440
att 2443att 2443
<210>8<210>8
<211>325<211>325
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>8<400>8
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met
20 25 3020 25 30
Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys Trp
65 70 75 8065 70 75 80
Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln
85 90 9585 90 95
Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro
100 105 110100 105 110
Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser
115 120 125115 120 125
Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp
130 135 140130 135 140
Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser
145 150 155 160145 150 155 160
Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg HisTyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg His
165 170 175165 170 175
Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile
180 185 190180 185 190
Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr
195 200 205195 200 205
Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln GlnThr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln
210 215 220210 215 220
Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGlu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg
225 230 235 240225 230 235 240
Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser
245 250 255245 250 255
Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe
260 265 270260 265 270
Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu
275 280 285275 280 285
Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val
290 295 300290 295 300
Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro AsnSer Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn
305 310 315 320305 310 315 320
Cys Met Ala Ile PheCys Met Ala Ile Phe
325325
<210>9<210>9
<211>1025<211>1025
<212>DNA<212>DNA
<213>Aspergillus awamorii<213>Aspergillus awamorii
<220><220>
<221>CDS<221> CDS
<222>(1)..(204)<222>(1)..(204)
<223><223>
<220><220>
<221>CDS<221> CDS
<222>(255)..(1025)<222>(255)..(1025)
<223><223>
<400>9<400>9
atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
cgc tac tgg ccc gca gcc ccc aac tcc gcc gac gag ggg ctg ggg atg 96cgc tac tgg ccc gca gcc ccc aac tcc gcc gac gag ggg ctg ggg atg 96
Arg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly Met
20 25 3020 25 30
ctc cgt ctc ctc acc cca gaa gtg gtc gcc aac gca gca cgc acc cag 144ctc cgt ctc ctc acc cca gaa gtg gtc gcc aac gca gca cgc acc cag 144
Leu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
atc cag acc ggc gag cgt gta tgc ctc aac tgg gac ctg gag aag cta 192atc cag acc ggc gag cgt gta tgc ctc aac tgg gac ctg gag aag cta 192
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
aac cca cca ggt gcccctcctt ccacccccat cgtctctcac caactaacca 244aac cca cca ggt gcccctcctt ccacccccat cgtctctcac caactaacca 244
Asn Pro Pro GlyAsn Pro Pro Gly
6565
aaacacaggt ttc ggc cgc aaa ccc ttc gcc cac cag gtt aaa tgg gtc 293aaacacaggt ttc ggc cgc aaa ccc ttc gcc cac cag gtt aaa tgg gtc 293
Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp ValPhe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp Val
70 75 8070 75 80
aac gaa ggc cac gcc ttc gac gac gaa tac cac ttc aat ccc cag caa 341aac gaa ggc cac gcc ttc gac gac gaa tac cac ttc aat ccc cag caa 341
Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln GlnAsn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln
85 90 9585 90 95
tcc tcc caa tgg gac ggc ctc cga cac cac aac gcc ccg gcc cca aca 389tcc tcc caa tgg gac ggc ctc cga cac cac aac gcc ccg gcc cca aca 389
Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro ThrSer Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr
100 105 110100 105 110
ccc gaa gac cct gag cgc cgc gtc ttc tac ggc ggc acc acc tcc acc 437ccc gaa gac cct gag cgc cgc gtc ttc tac ggc ggc acc acc tcc acc 437
Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser ThrPro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr
115 120 125115 120 125
gag atc ctc gac ccc tcg tcc gca cgg atc ggc ata gcc cac tgg gcc 485gag atc ctc gac ccc tcg tcc gca cgg atc ggc ata gcc cac tgg gcc 485
Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp AlaGlu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp Ala
130 135 140 145130 135 140 145
aag aag ggc atc gcc ggc cgc ggc gtc ctc ata gac tac gcc tcc tac 533aag aag ggc atc gcc ggc cgc ggc gtc ctc ata gac tac gcc tcc tac 533
Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser TyrLys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr
150 155 160150 155 160
atc cag cga acc aag ggc atc aca gta aac gcc cta acc cgc cac acc 581atc cag cga acc aag ggc atc aca gta aac gcc cta acc cgc cac acc 581
Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His ThrIle Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His Thr
165 170 175165 170 175
gtc tcc ctc gac gac gtc ctc acc atc gcg aag gaa tgc aac att acc 629gtc tcc ctc gac gac gtc ctc acc atc gcg aag gaa tgc aac att acc 629
Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile ThrVal Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr
180 185 190180 185 190
ttc cag cca ggc gac att ctc ttc ctg cgc gtt ggt cta ccg aca acg 677ttc cag cca ggc gac att ctc ttc ctg cgc gtt ggt cta ccg aca acg 677
Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr ThrPhe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr
195 200 205195 200 205
tgg gat aac atg tcc gat gaa gag aag gtc gagtat agt caa cag caa 725tgg gat aac atg tcc gat gaa gag aag gtc gagtat agt caa cag caa 725
Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln GlnTrp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln Gln
210 215 220 225210 215 220 225
aca ccc cag cat gca ggt tta gag cag agc gag cgt gtg gtg cgg ttc 773aca ccc cag cat gca ggt tta gag cag agc gag cgt gtg gtg cgg ttc 773
Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg PheThr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe
230 235 240230 235 240
ctc tgg gac cag cat ttc gcg gct gtg gcg ggc gat gcg gtc agc ttt 821ctc tgg gac cag cat ttc gcg gct gtg gcg ggc gat gcg gtc agc ttt 821
Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser PheLeu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser Phe
245 250 255245 250 255
gag gtg tat ccg ccg gta gag aag gag tgg gat tta cac cat ttc ttg 869gag gtg tat ccg ccg gta gag aag gag tgg gat tta cac cat ttc ttg 869
Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe LeuGlu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe Leu
260 265 270260 265 270
ctt gct ggg tgg gga gtg ccg att gga gag atg ttt gat ctg gag ggg 917ctt gct ggg tgg gga gtg ccg att gga gag atg ttt gat ctg gag ggg 917
Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu GlyLeu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu Gly
275 280 285275 280 285
ttg aag gag gtg tgt gag agg ttg ggc agg tgg acg ttt ttt gtt agt 965ttg aag gag gtg tgt gag agg ttg ggc agg tgg acg ttt ttt gtt agt 965
Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val SerLeu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser
290 295 300 305290 295 300 305
tcg agt ccg ttg aat tgt aag acg ggg gtg tcg agt ccg ccg aat tgt 1013tcg agt ccg ttg aat tgt aag acg ggg gtg tcg agt ccg ccg aat tgt 1013
Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro Pro Asn CysSer Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro Pro Asn Cys
310 315 320310 315 320
atg gca att ttt 1025atg gca att ttt 1025
Met Ala Ile PheMet Ala Ile Phe
325325
<210>10<210>10
<211>325<211>325
<212>PRT<212>PRT
<213>Aspergillus awamorii<213>Aspergillus awamorii
<400>10<400>10
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
Arg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly Met
20 25 3020 25 30
Leu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp
65 70 75 8065 70 75 80
Val Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln
85 90 9585 90 95
Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro
100 105 110100 105 110
Thr Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser
115 120 125115 120 125
Thr Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp
130 135 140130 135 140
Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser
145 150 155 160145 150 155 160
Tyr Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg HisTyr Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His
165 170 175165 170 175
Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile
180 185 190180 185 190
Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr
195 200 205195 200 205
Thr Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln GlnThr Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln
210 215 220210 215 220
Gln Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGln Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg
225 230 235 240225 230 235 240
Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser
245 250 255245 250 255
Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe
260 265 270260 265 270
Leu Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu
275 280 285275 280 285
Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val
290 295 300290 295 300
Ser Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro pro AsnSer Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro pro Asn
305 310 315 320305 310 315 320
Cys Met Ala Ile PheCys Met Ala Ile Phe
325325
<210>11<210>11
<211>11<211>11
<212>PRT<212>PRT
<213>Eguus caballus<213> Eguus caballus
<400>11<400>11
His Gln Gly Ser Leu Val Ala Leu Phe Pro AspHis Gln Gly Ser Leu Val Ala Leu Phe Pro Asp
1 5 101 5 10
<210>12<210>12
<211>23<211>23
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(23)<222>(1)..(23)
<223>5’-引物<223>5'-primer
<400>12<400>12
atgtcttcca tcaagattga atg 23atgtcttcca tcaagattga atg 23
<210>13<210>13
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>3’-引物<223>3'-primer
<400>13<400>13
gtcaaaggaa ttccccccta t 21gtcaaaggaa ttccccccta t 21
<210>14<210>14
<211>30<211>30
<212>DNA<212>DNA
<213>Bos sp.<213>Bos sp.
<400>14<400>14
atgtcttcca ttaagattga gtgtgttttg 30atgtcttcca ttaagattga gtgtgttttg 30
<210>15<210>15
<21l>30<21l>30
<212>DNA<212>DNA
<213>Oryctolagus cuniculus<213>Oryctolagus cuniculus
<400>15<400>15
atgtcttcca ttaagatcga gtgtgttttg 30atgtcttcca ttaagatcga gtgtgttttg 30
<210>16<210>16
<211>30<211>30
<212>DNA<212>DNA
<213>Homo sapiens<213>Homo sapiens
<400>16<400>16
atgtcttcca ttaagattga gtgtgttttg 30atgtcttcca ttaagattga gtgtgttttg 30
<210>17<210>17
<211>30<211>30
<212>DNA<212>DNA
<213>Rattus norvegicus<213>Rattus norvegicus
<400>17<400>17
atgtcttcca tcaagattga atgtgtttta 30atgtcttcca tcaagattga atgtgtttta 30
<210>18<210>18
<211>30<211>30
<212>DNA<212>DNA
<213>Mus sp.<213>Mus sp.
<400>18<400>18
atgtcttcca tcaaagttga atgtgtttta 30atgtcttcca tcaaagttga atgtgtttta 30
<210>19<210>19
<211>30<211>30
<212>DNA<212>DNA
<213>Gallus sp.<213>Gallus sp.
<400>19<400>19
atgtcgtccg t taagatcga gtgcgtcggg 30atgtcgtccg t taagatcga gtgcgtcggg 30
<210>20<210>20
<211>30<211>30
<212>DNA<212>DNA
<213>Xenopus sp.<213>Xenopus sp.
<400>20<400>20
atgtcttcca tcaagataga atgtgtagtc 30atgtcttcca tcaagataga atgtgtagtc 30
<210>21<210>21
<211>39<211>39
<212>DNA<212>DNA
<213>Bos sp.<213>Bos sp.
<400>21<400>21
ataactggcc taggagtcaa aggaattcct ccctatcct 39ataactggcc taggagtcaa aggaattcct ccctatcct 39
<210>22<210>22
<211>39<211>39
<212>DNA<212>DNA
<213>Oryctolagus cuniculus<213>Oryctolagus cuniculus
<400>22<400>22
ataactggcc tgggggtcaa aggaattccc ccctactcc 39ataactggcc tgggggtcaa aggaattccc ccctactcc 39
<210>23<210>23
<211>39<211>39
<212>DNA<212>DNA
<213>Homo sapiens<213>Homo sapiens
<400>23<400>23
ataactggtc tgggggtcaa aggaattgct ccctactcc 39ataactggtc tgggggtcaa aggaattgct ccctactcc 39
<210>24<210>24
<211>39<211>39
<212>DNA<212>DNA
<213>Rattus norvegicus<213>Rattus norvegicus
<400>24<400>24
ataacaggtc ttggggtcaa aggaattgct ccatattcc 39ataacaggtc ttggggtcaa aggaattgct ccatattcc 39
<210>25<210>25
<211>39<211>39
<212>DNA<212>DNA
<213>Mus sp.<213>Mus sp.
<400>25<400>25
ataacaggtc tcggagtcaa aggaattgcc ccatattcc 39ataacaggtc tcggagtcaa aggaattgcc ccatattcc 39
<210>26<210>26
<211>39<211>39
<212>DNA<212>DNA
<213>Gallus sp.<213>Gallus sp.
<400>26<400>26
atcactgggc ttggtgtgaa aggaatcccg ccatatcca 39atcactgggc ttggtgtgaa aggaatcccg ccatatcca 39
<210>27<210>27
<211>39<211>39
<212>DNA<212>DNA
<213>Xenopus sp.<213>Xenopus sp.
<400>27<400>27
attactggac ttggagtaaa aggaatcgca ccaactgca 39attackggac ttggagtaaa aggaatcgca ccaactgca 39
<210>28<210>28
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>完美匹配的引物<223>Perfectly matched primers
<400>28<400>28
aatacacatt ccatctggga t 21aatacacatt ccatctggga t 21
<210>29<210>29
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>正向引物<223> forward primer
<400>29<400>29
gaactgtgaa gttgcctgtt g 21gaactgtgaa gttgcctgtt g 21
<210>30<210>30
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物<223> Primer
<400>30<400>30
ggaagaggat gtcactcatc c 21ggaagaggat gtcactcatc c 21
<210>31<210>31
<211>26<211>26
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(6)..(6)<222>(6)..(6)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(9)..(9)<222>(9)..(9)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(26)<222>(1)..(26)
<223>寡核苷酸Fxn42<223> oligonucleotide Fxn42
<400>31<400>31
aagccnttng cncancangt naagac 26aagccnttng cnccancangt naagac 26
<210>32<210>32
<211>26<211>26
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(26)<222>(1)..(26)
<223>寡核苷酸Fxn42S<223> oligonucleotide Fxn42S
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示A或G<223>n means A or G
<400>32<400>32
aagccgttcg cccancangt naanac 26aagccgttcg cccancangt naanac 26
<210>33<210>33
<211>26<211>26
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(6)..(6)<222>(6)..(6)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示A、C、T或G<223>n means A, C, T or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(26)<222>(1)..(26)
<223>寡核苷酸Fxn42AS<223> oligonucleotide Fxn42AS
<400>33<400>33
gtcttnacct ggtgngcnaa nggntt 26gtcttnacct ggtgngcnaa nggntt 26
<210>34<210>34
<211>24<211>24
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(3)..(3)<222>(3)..(3)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(6)..(6)<222>(6)..(6)
<223>n表示A、C或G<223>n means A, C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(9)..(9)<222>(9)..(9)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(24)<222>(1)..(24)
<223>寡核苷酸Fxn57<223> oligonucleotide Fxn57
<400>34<400>34
atnggnatng cncantgggc naag 24atnggnatng cncantgggc naag 24
<210>35<210>35
<211>24<211>24
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(9)..(9)<222>(9)..(9)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(24)<222>(1)..(24)
<223>寡核苷酸Fxn57S<223> oligonucleotide Fxn57S
<400>35<400>35
atcggcatng cncantgggc naan 24atcggcatng cncantgggc naan 24
<210>36<210>36
<211>24<211>24
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(13)..(13)<222>(13)..(13)
<223>n表示A、C或G<223>n means A, C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(16)..(16)<222>(16)..(16)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(19)..(19)<222>(19)..(19)
<223>n表示A、C、T或G<223>n means A, C, T or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(22)..(22)<222>(22)..(22)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(24)<222>(1)..(24)
<223>寡核苷酸Fxn57AS<223> oligonucleotide Fxn57AS
<400>36<400>36
cttggcccag tgngcnatnc cnat 24cttggcccag tgngcnatnc cnat 24
<210>37<210>37
<211>39<211>39
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(6)..(6)<222>(6)..(6)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(9)..(9)<222>(9)..(9)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(27)..(27)<222>(27)..(27)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(30)..(30)<222>(30)..(30)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(33)..(33)<222>(33)..(33)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(39)<222>(1)..(39)
<223>寡核苷酸Fx73<223> oligonucleotide Fx73
<220><220>
<221>misc_feature<221>misc_feature
<222>(36)..(36)<222>(36)..(36)
<223>n表示C或T<223>n means C or T
<400>37<400>37
tggacnttnt tngtntcntc ntcnccnctn aantgnaag 39tggacnttnt tngtntcntc ntcnccnctn aantgnaag 39
<210>38<210>38
<211>39<211>39
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(27)..(27)<222>(27)..(27)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(30)..(30)<222>(30)..(30)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(33)..(33)<222>(33)..(33)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(36)..(36)<222>(36)..(36)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(39)..(39)<222>(39)..(39)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(39)<222>(1)..(39)
<223>寡核苷酸Fx73S<223> oligonucleotide Fx73S
<400>38<400>38
tggaccttgt tggtgtcctc ctcnccnctn aantgnaan 39tggaccttgt tggtgtcctc ctcnccnctn aantgnaan 39
<210>39<210>39
<211>39<211>39
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(22)..(22)<222>(22)..(22)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(25)..(25)<222>(25)..(25)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(28)..(28)<222>(28)..(28)
<223>n表示A或C<223>n means A or C
<220><220>
<221>misc_feature<221>misc_feature
<222>(31)..(31)<222>(31)..(31)
<223>n表示A或C<223>n means A or C
<220><220>
<221>misc_feature<221>misc_feature
<222>(34)..(34)<222>(34)..(34)
<223>n表示A、C或G<223>n means A, C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(39)<222>(1)..(39)
<223>寡核苷酸Fx73AS<223> oligonucleotide Fx73AS
<400>39<400>39
cttgcagttc agcggggagg anganacnaa naangtcca 39cttgcagttc agcggggagg anganacnaa naangtcca 39
<210>40<210>40
<211>38<211>38
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(3)..(3)<222>(3)..(3)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(6)..(6)<222>(6)..(6)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(9)..(9)<222>(9)..(9)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(27)..(27)<222>(27)..(27)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(30)..(30)<222>(30)..(30)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(33)..(33)<222>(33)..(33)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(36)..(36)<222>(36)..(36)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(38)<222>(1)..(38)
<223>寡核苷酸FX74<223> oligonucleotide FX74
<400>40<400>40
gtntgnctna antggganct nganaanctn aanccncc 38gtntgnctna antggganct nganaanctn aanccncc 38
<210>41<210>41
<211>29<211>29
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(24)..(24)<222>(24)..(24)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(27)..(27)<222>(27)..(27)
<223>n表示C、G或T<223>n means C, G or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(29)<222>(1)..(29)
<223>寡核苷酸Fx74S<223> oligonucleotide Fx74S
<400>41<400>41
aactgggacc tgganaanct naanccncc 29aactgggacc tgganaanct naanccncc 29
<210>42<210>42
<211>27<211>27
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(12)..(12)<222>(12)..(12)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(15)..(15)<222>(15)..(15)
<223>n表示C或T<223>n means C or T
<220><220>
<221>misc_feature<221>misc_feature
<222>(18)..(18)<222>(18)..(18)
<223>n表示C或G<223>n means C or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(21)..(21)<222>(21)..(21)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(27)..(27)<222>(27)..(27)
<223>n表示A或G<223>n means A or G
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(27)<222>(1)..(27)
<223>寡核苷酸Fx74AS<223> oligonucleotide Fx74AS
<400>42<400>42
ggcgggttca gnttntcnag ntcccan 27ggcgggttca gnttntcnag ntcccan 27
<210>43<210>43
<211>8<211>8
<212>PRT<212>PRT
<213>Aspergillus sp.<213>Aspergillus sp.
<400>43<400>43
Lys Pro Phe Ala His Gln Val LysLys Pro Phe Ala His Gln Val Lys
1 51 5
<210>44<210>44
<211>16<211>16
<212>PRT<212>PRT
<213>Aspergillus sp.<213>Aspergillus sp.
<400>44<400>44
Arg His His Asn Ala Pro Ala Pro Thr Pro Glu Asp Pro Glu Arg ArgArg His His Asn Ala Pro Ala Pro Thr Pro Glu Asp Pro Glu Arg Arg
1 5 10 151 5 10 15
<210>45<210>45
<211>12<211>12
<212>PRT<212>PRT
<213>Aspergillus sp.<213>Aspergillus sp.
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(10)..(10)<222>(10)..(10)
<223>x表示?<223>x means?
<400>45<400>45
Lys Pro Phe Ala His Gln Val Lys Thr Xaa Glu AspLys Pro Phe Ala His Gln Val Lys Thr Xaa Glu Asp
1 5 101 5 10
<210>46<210>46
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal1AS<223> Primer Oal1AS
<400>46<400>46
cgaagggtttgcggccgaag c 21cgaagggtttgcggccgaag c 21
<210>47<210>47
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal2AS<223> Primer Oal2AS
<400>47<400>47
aaggcgtggc c ttcggagac c 21aaggcgtggc c ttcggagac c 21
<210>48<210>48
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal3S<223> Primer Oal3S
<400>48<400>48
gtagcgggtg atgcggtcag c 21gtagcgggtg atgcggtcag c 21
<210>49<210>49
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal4S<223> Primer Oal4S
<400>49<400>49
tgtatccgcc ggtagagaag g 21tgtatccgcc ggtagagaag g 21
<210>50<210>50
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal5S<223> Primer Oal5S
<400>50<400>50
ccctcgtcca cccgaatcgg c 21ccctcgtcca cccgaatcgg c 21
<210>51<210>51
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal6AS<223> Primer Oal6AS
<400>51<400>51
gcgtagtcgatgaggacgcc g 21gcgtagtcgatgaggacgcc g 21
<210>52<210>52
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal7AS<223> Primer Oal7AS
<400>52<400>52
tgactatact tgaccctctc c 21tgactatact tgaccctctc c 21
<210>53<210>53
<211>22<211>22
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(22)<222>(1)..(22)
<223>引物Oal8AS<223> Primer Oal8AS
<400>53<400>53
ggagacccat tttacgtggtgg 22ggagacccat tttacgtggtgg 22
<210>54<210>54
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal9AS<223> Primer Oal9AS
<400>54<400>54
ccgttcccac accacctctg c 21ccgttcccac accacctctg c 21
<210>55<210>55
<211>24<211>24
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(24)<222>(1)..(24)
<223>引物Oal10S<223> Primer Oal10S
<400>55<400>55
atggccgact ccctcccgaa aacc 24atggccgact ccctcccgaa aacc 24
<210>56<210>56
<211>27<211>27
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(27)<222>(1)..(27)
<223>引物Oal10SEco<223> Primer Oal10SEco
<400>56<400>56
atataagaat tcaaatggcc gactccc 27atataagaat tcaaatggcc gactccc 27
<210>57<210>57
<211>53<211>53
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(53)<222>(1)..(53)
<223>引物OalENhis<223> Primer OalENhis
<400>57<400>57
gaattcaaat gagaggatcc catcaccatc accatcacgc cgactccctc ccg 53gaattcaaat gagaggatcc catcaccatc accatcacgc cgactccctc ccg 53
<210>58<210>58
<211>19<211>19
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(19)<222>(1)..(19)
<223>引物Oal10SShis<223> Primer Oal10SShis
<400>58<400>58
atgagaggat cgcatcacc 19atgagaggat cgcatcacc 19
<210>59<210>59
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal11S<223> Primer Oal11S
<400>59<400>59
ctctactgac aagataccac g 21ctctactgac aagataccac g 21
<210>60<210>60
<211>26<211>26
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(26)<222>(1)..(26)
<223>引物Oal12AS<223> Primer Oal12AS
<400>60<400>60
ttaaaaaatt gccatacaat tcggcg 26ttaaaaaatt gccatacaat tcggcg 26
<210>61<210>61
<211>38<211>38
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(38)<222>(1)..(38)
<223>引物Oal12ASEco<223> Primer Oal12ASEco
<400>61<400>61
tatatagaat tcttaaaaaattgccataca attcggcg 38tatatagaat tcttaaaaaattgccataca attcggcg 38
<210>62<210>62
<211>44<211>44
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(44)<222>(1)..(44)
<223>引物Oal12AShis<223> Primer Oal12AShis
<400>62<400>62
tcaatggtga tggtgatgat gaaaaattgc catacaattc ggcg 44tcaatggtga tggtgatgat gaaaaattgc catacaattc ggcg 44
<210>63<210>63
<211>56<211>56
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(56)<222>(1)..(56)
<223>引物Oal12ASHisEco<223> Primer Oal12ASHisEco
<400>63<400>63
tatatagaat tcttaatggt gatggtgatg atgaaaaatt gccatacaat tcggcg 56tatatagaat tcttaatggt gatggtgatg atgaaaaatt gccatacaat tcggcg 56
<210>64<210>64
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal13S<223> Primer Oal13S
<400>64<400>64
ggagatacaa ccctaatgaa c 21ggagatacaa ccctaatgaa c 21
<210>65<210>65
<211>21<211>21
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(21)<222>(1)..(21)
<223>引物Oal13AS<223> Primer Oal13AS
<400>65<400>65
gttcattagg gttgtatctc c 21gttcattagg gttgtatctc c 21
<210>66<210>66
<211>37<211>37
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(37)<222>(1)..(37)
<223>引物Oal14S<223> Primer Oal14S
<400>66<400>66
gagaagctga acccaccagg cttcggccgc aaaccct 37gagaagctga accccaccagg cttcggccgc aaaccct 37
<210>67<210>67
<211>37<211>37
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(37)<222>(1)..(37)
<223>引物Oal14AS<223> Primer Oal14AS
<400>67<400>67
agggtttgcg gccgaagcct ggtgggttca gcttctc 37agggtttgcg gccgaagcct ggtgggttca gcttctc 37
<210>68<210>68
<211>37<211>37
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(37)<222>(1)..(37)
<223>引物Oal15S<223> Primer Oal15S
<400>68<400>68
cttcgcccac cacgtaaaaa cccccgacga ccctaac 37cttcgcccac cacgtaaaaa cccccgacga ccctaac 37
<210>69<210>69
<211>37<211>37
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(37)<222>(1)..(37)
<223>引物Oal15AS<223> Primer Oal15AS
<400>69<400>69
gttagggtcg tcgggggttt ttacgtggtg ggcgaag 37gttagggtcg tcgggggttt ttacgtggtg ggcgaag 37
<210>70<210>70
<211>32<211>32
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(32)<222>(1)..(32)
<223>引物Oal16S<223> Primer Oal16S
<400>70<400>70
gaattcaaat gctccgtctc ctcactccac tc 32gaattcaaat gctccgtctc ctcactccac tc 32
<210>71<210>71
<211>30<211>30
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(30)<222>(1)..(30)
<223>引物OalsecS<223> Primer OalsecS
<400>71<400>71
ggtcctcgtg gtaatgccga ctccctcccg 30ggtcctcgtg gtaatgccga ctccctcccg 30
<210>72<210>72
<211>27<211>27
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(27)<222>(1)..(27)
<223>引物OalsecAS<223> Primer OalsecAS
<400>72<400>72
cgggagggag tcggcattac cacgagg 27cgggaggggag tcggcattac cacgagg 27
<210>73<210>73
<211>48<211>48
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(48)<222>(1)..(48)
<223>引物pQE80EcoS<223> Primer pQE80EcoS
<400>73<400>73
tatatagaat tcattaaaga ggagaaatta actatggccg actccctc 48tatatagaat tcattaaaga ggagaaatta actatggccg actccctc 48
<210>74<210>74
<211>52<211>52
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(52)<222>(1)..(52)
<223>引物pQE80EcoNhisS<223> Primer pQE80EcoNhisS
<400>74<400>74
tatatagaat tcattaaaga ggagaaatta actatgagag gatcgcatca cc 52tatatagaat tcattaaaga ggagaaatta actatgagag gatcgcatca cc 52
<210>75<210>75
<211>34<211>34
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(34)<222>(1)..(34)
<223>引物pQE80BamAS<223> Primer pQE80BamAS
<400>75<400>75
tatatagtcg acttaaaaaa ttgccataca attc 34tatatagtcg acttaaaaaa ttgccataca attc 34
<210>76<210>76
<211>56<211>56
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(56)<222>(1)..(56)
<223>引物pQE80BamhisAS<223> Primer pQE80BamhisAS
<400>76<400>76
tatatagtcg actcaatggtgatggtgatg atgaaaaatt gccatacaat tcggcg 56tatatagtcg actcaatggtgatggtgatg atgaaaaatt gccatacaat tcggcg 56
<210>77<210>77
<211>57<211>57
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(57)<222>(1)..(57)
<223>引物pQE80EcoSshort<223> Primer pQE80EcoSshort
<400>77<400>77
tatatagaat tcattaaaga ggagaaatta actatgctcc gtctcctcac tccactc 57tatatagaat tcattaaaga ggagaaatta actatgctcc gtctcctcac tccactc 57
<210>78<210>78
<211>26<211>26
<212>DNA<212>DNA
<213>人工<213> Artificial
<220><220>
<221>misc_feature<221>misc_feature
<222>(1)..(26)<222>(1)..(26)
<223>引物CP-a<223> Primer CP-a
<400>78<400>78
tatatgaatt catgggcgtg ttcgtc 26tatatgaatt catgggcgtg ttcgtc 26
<210>79<210>79
<211>777<211>777
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>Intron<221> Intron
<222>(32)..(82)<222>(32)..(82)
<223><223>
<220><220>
<221>CDS<221> CDS
<222>(83)..(777)<222>(83)..(777)
<223><223>
<220><220>
<221>CDS<221> CDS
<222>(1)..(31)<222>(1)..(31)
<223><223>
<400>79<400>79
aac tgg gac ctg gaa aaa ctc aat cct cca g gtaatcccct ccttccaacc 51aac tgg gac ctg gaa aaa ctc aat cct cca g gtaatcccct ccttccaacc 51
Asn Trp Asp Leu Glu Lys Leu Asn Pro ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro
1 5 101 5 10
catcctctat cgccaactaa cctaaacaca g gc ttc ggc cgc aaa ccc ttc 102catcctctat cgccaactaa cctaaacaca g gc ttc ggc cgc aaa ccc ttc 102
Gly Phe Gly Arg Lys Pro PheGly Phe Gly Arg Lys Pro Phe
15
gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac gag 150gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac gag 150
Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp GluAla His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu
20 25 3020 25 30
tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga cac 198tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga cac 198
Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg HisTyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His
35 40 4535 40 45
cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc ttc 246cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc ttc 246
His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val PheHis Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val Phe
50 55 60 6550 55 60 65
tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct cgc 294tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct cgc 294
Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala ArgTyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala Arg
70 75 8070 75 80
atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc gtc 342atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc gtc 342
Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly ValIle Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val
85 90 9585 90 95
ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca gta 390ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca gta 390
Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro ValLeu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro Val
100 105 110100 105 110
aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc atc 438aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc atc 438
Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr IleAsn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile
115 120 125115 120 125
gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc ttg 486gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc ttg 486
Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe LeuAla Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu
130 135 140 145130 135 140 145
cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag aag 534cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag aag 534
Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu LysArg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu Lys
150 155 160150 155 160
gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag cag 582gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag cag 582
Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu GlnVal Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu Gln
165 170 175165 170 175
agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct gtg 630agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct gtg 630
Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala ValSer Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val
180 185 190180 185 190
gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag gag 678gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag gag 678
Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys GluAla Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu
195 200 205195 200 205
tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att ggg 726tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att ggg 726
Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile GlyTrp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile Gly
210 215 220 225210 215 220 225
gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg ggc 774gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg ggc 774
Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu GlyGlu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly
230 235 240230 235 240
agg 777agg 777
ArgArg
<210>80<210>80
<211>242<211>242
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>80<400>80
Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro
1 5 10 151 5 10 15
Phe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp AspPhe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp
20 25 3020 25 30
Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu ArgGlu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg
35 40 4535 40 45
His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg ValHis His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val
50 55 6050 55 60
Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser AlaPhe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala
65 70 75 8065 70 75 80
Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg GlyArg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly
85 90 9585 90 95
Val Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile ProVal Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro
100 105 110100 105 110
Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu ThrVal Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr
115 120 125115 120 125
Ile Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu PheIle Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe
130 135 140130 135 140
Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp GluLeu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu
145 150 155 160145 150 155 160
Lys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu GluLys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu
165 170 175165 170 175
Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala AlaGln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala
180 185 190180 185 190
Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu LysVal Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys
195 200 205195 200 205
Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro IleGlu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile
210 215 220210 215 220
Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg LeuGly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu
225 230 235 240225 230 235 240
Gly ArgGly Arg
<210>81<210>81
<211>12<211>12
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(10)..(10)<222>(10)..(10)
<223>undefined amino acid<223>undefined amino acid
<400>81<400>81
Lys Pro Phe Ala His Gln Val Lys Thr Xaa Glu AspLys Pro Phe Ala His Gln Val Lys Thr Xaa Glu Asp
1 5 101 5 10
<210>82<210>82
<211>8<211>8
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>82<400>82
Ile Gly Ile Ala His Trp Ala LysIle Gly Ile Ala His Trp Ala Lys
1 51 5
<210>83<210>83
<211>13<211>13
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>83<400>83
Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys LysTrp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu Asn Cys Lys
1 5 101 5 10
<210>84<210>84
<211>15<211>15
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(5)..(5)<222>(5)..(5)
<223>氨基酸无法被明确确定<223>Amino acid cannot be unambiguously determined
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(14)..(14)<222>(14)..(14)
<223>未确定的氨基酸<223> Unidentified amino acid
<400>84<400>84
Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Xaa PheVal Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Xaa Phe
1 5 10 151 5 10 15
<210>85<210>85
<211>15<211>15
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>85<400>85
Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu ArgGlu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg
1 5 10 151 5 10 15
<210>86<210>86
<211>11<211>11
<212>PAT<212>PAT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(8)..(8)<222>(8)..(8)
<223>未确定的氨基酸<223> Unidentified amino acid
<400>86<400>86
Ala Glu Thr Thr Thr Asn Pro Xaa Tyr Phe PheAla Glu Thr Thr Thr Asn Pro Xaa Tyr Phe Phe
1 5 101 5 10
<210>87<210>87
<211>8<211>8
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>87<400>87
Tyr His Leu Leu Gln Ala Glu AspTyr His Leu Leu Gln Ala Glu Asp
1 51 5
<210>88<210>88
<211>10<211>10
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>88<400>88
Leu Glu Ser Leu Val Glu Glu Val Tyr LysLeu Glu Ser Leu Val Glu Glu Val Tyr Lys
1 5 101 5 10
<210>89<210>89
<211>16<211>16
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(15)..(15)<222>(15)..(15)
<223>未确定的氨基酸<223> Unidentified amino acid
<400>89<400>89
Leu Phe Glu Asp Val Tyr Ala Gln Lys Pro Glu Thr Ala Pro Xaa AspLeu Phe Glu Asp Val Tyr Ala Gln Lys Pro Glu Thr Ala Pro Xaa Asp
1 5 10 151 5 10 15
<210>90<210>90
<211>9<211>9
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(7)..(7)<222>(7)..(7)
<223>氨基酸无法被明确确定<223> Amino acids cannot be unambiguously determined
<400>90<400>90
Phe Leu Phe Asn Val Pro Tyr Thr SerPhe Leu Phe Asn Val Pro Tyr Thr Ser
1 51 5
<210>91<210>91
<211>8<211>8
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>MISC_FEATURE<221>MISC_FEATURE
<222>(8)..(8)<222>(8)..(8)
<223>氨基酸无法被明确确定<223> Amino acids cannot be unambiguously determined
<400>91<400>91
Gly Val Met Gly Gly Leu Gly GlyGly Val Met Gly Gly Leu Gly Gly
1 51 5
<210>92<210>92
<211>1005<211>1005
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(1)..(1002)<222>(1)..(1002)
<223><223>
<400>92<400>92
atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc ccg aaa 48atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc ccg aaa 48
Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro LysMet Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro Lys
1 5 10 151 5 10 15
acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca acc ccc 96acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca acc ccc 96
Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr ProThr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr Pro
20 25 3020 25 30
aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act cca gaa 144aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act cca gaa 144
Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro GluAsn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro Glu
35 40 4535 40 45
att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag cgc gta 192att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag cgc gta 192
Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg ValIle Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg Val
50 55 6050 55 60
tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc ggc cgc 240tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc ggc cgc 240
Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly ArgCys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg
65 70 75 8065 70 75 80
aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc 288aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc 288
Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala PheLys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala Phe
85 90 9585 90 95
gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt 336gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt 336
Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp GlyAsp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly
100 105 110100 105 110
ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct aac cgc 384ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct aac cgc 384
Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn ArgLeu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg
115 120 125115 120 125
cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg 432cgg gtc ttc tac ggc ggc acc acc acc tcc acc gag atc ctc gac ccc tcg 432
Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro SerArg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser
130 135 140130 135 140
tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc 480tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc 480
Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala GlySer Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly
145 150 155 160145 150 155 160
cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc aag ggc 528cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc aag ggc 528
Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys GlyArg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly
165 170 175165 170 175
gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc 576gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc 576
Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp ValVal Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val
180 185 190180 185 190
ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc gac att 624ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc gac att 624
Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp IleLeu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile
195 200 205195 200 205
ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg acc gat 672ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg acc gat 672
Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr AspLeu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp
210 215 220210 215 220
gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat gca gga 720gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat gca gga 720
Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala GlyGlu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly
225 230 235 240225 230 235 240
tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc 768tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc 768
Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His PheLeu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe
245 250 255245 250 255
gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta 816gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta 816
Ala Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro ValAla Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val
260 265 270260 265 270
gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg gga cta 864gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg gga cta 864
Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly LeuGlu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu
275 280 285275 280 285
ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag 912ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag 912
Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys GluPro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu
290 295 300290 295 300
agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt 960agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt 960
Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn CysArg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys
305 310 315 320305 310 315 320
ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt taa 1005ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt taa 1005
Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheGly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe
325 330325 330
<210>93<210>93
<211>334<211>334
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>93<400>93
Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro LysMet Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro Lys
1 5 10 151 5 10 15
Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr ProThr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr Pro
20 25 3020 25 30
Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro GluAsn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro Glu
35 40 4535 40 45
Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg ValIle Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg Val
50 55 6050 55 60
Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly ArgCys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg
65 70 75 8065 70 75 80
Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala PheLys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala Phe
85 90 9585 90 95
Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp GlyAsp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly
100 105 110100 105 110
Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn ArgLeu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg
115 120 125115 120 125
Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro SerArg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser
130 135 140130 135 140
Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala GlySer Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly
145 150 155 160145 150 155 160
Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys GlyArg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly
165 170 175165 170 175
Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp ValVal Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val
180 185 190180 185 190
Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp IleLeu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile
195 200 205195 200 205
Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr AspLeu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp
210 215 220210 215 220
Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala GlyGlu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly
225 230 235 240225 230 235 240
Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His PheLeu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe
245 250 255245 250 255
Ala Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro ValAla Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val
260 265 270260 265 270
Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly LeuGlu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu
275 280 285275 280 285
Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys GluPro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu
290 295 300290 295 300
Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn CysArg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys
305 310 315 320305 310 315 320
Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheGly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe
325 330325 330
<210>94<210>94
<211>992<211>992
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(9)..(983)<222>(9)..(983)
<223><223>
<400>94<400>94
gaattcaa atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac 50gaattcaa atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac 50
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro AspMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp
1 5 101 5 10
aag cgg cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg 98aag cgg cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg 98
Lys Arg Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly LeuLys Arg Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu
15 20 25 3015 20 25 30
ggg atg ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc 146ggg atg ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc 146
Gly Met Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala ArgGly Met Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg
35 40 4535 40 45
acc cag atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag 194acc cag atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag 194
Thr Gln Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu GluThr Gln Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu
50 55 6050 55 60
aag ctg aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta 242aag ctg aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta 242
Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His ValLys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val
65 70 7565 70 75
aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac 290aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac 290
Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe AsnLys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn
80 85 9080 85 90
ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg 338ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg 338
Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala ProPro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro
95 100 105 11095 100 105 110
gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc 386gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc 386
Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly ThrAla Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr
115 120 125115 120 125
acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc 434acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc 434
Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile AlaThr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala
130 135 140130 135 140
cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac 482cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac 482
His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp TyrHis Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr
145 150 155145 150 155
gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc 530gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc 530
Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu ThrAla Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr
160 165 170160 165 170
cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc 578cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc 578
Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu CysArg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys
175 180 185 190175 180 185 190
aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta 626aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta 626
Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly LeuAsn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu
195 200 205195 200 205
ccg act aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt 674ccg act aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt 674
Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr SerPro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser
210 215 220210 215 220
cag cag gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg 722cag cag gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg 722
Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg ValGln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val
225 230 235225 230 235
gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg 770gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg 770
Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp AlaVal Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala
240 245 250240 245 250
gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat 818gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat 818
Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu HisVal Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His
255 260 265 270255 260 265 270
cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat 866cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat 866
His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe AspHis Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp
275 280 285275 280 285
ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt 914ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt 914
Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr PheLeu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe
290 295 300290 295 300
ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg 962ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg 962
Phe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser ProPhe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro
305 310 315305 310 315
ccg aat tgt atg gca att ttt taagaattc 992ccg aat tgt atg gca att ttt taagaattc 992
Pro Asn Cys Met Ala Ile PhePro Asn Cys Met Ala Ile Phe
320 325320 325
<210>95<210>95
<211>325<211>325
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>95<400>95
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met
20 25 3020 25 30
Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys Trp
65 70 75 8065 70 75 80
Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln
85 90 9585 90 95
Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro
100 105 110100 105 110
Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser
115 120 125115 120 125
Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp
130 135 140130 135 140
Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser
145 150 155 160145 150 155 160
Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg HisTyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg His
165 170 175165 170 175
Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile
180 185 190180 185 190
Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr
195 200 205195 200 205
Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln GlnThr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln
210 215 220210 215 220
Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGlu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg
225 230 235 240225 230 235 240
Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser
245 250 255245 250 255
Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe
260 265 270260 265 270
Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu
275 280 285275 280 285
Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val
290 295 300290 295 300
Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro AsnSer Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn
305 310 315 320305 310 315 320
Cys Met Ala Ile PheCys Met Ala Ile Phe
325325
<210>96<210>96
<211>1019<211>1019
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(9)..(1010)<222>(9)..(1010)
<223><223>
<400>96<400>96
gaattcaa atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc 50gaattcaa atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc 50
Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Met Arg Gly Ser His His His His His His His Ala Asp Ser Leu
1 5 101 5 10
ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca 98ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca 98
Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro AlaPro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala
15 20 25 3015 20 25 30
acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act 146acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act 146
Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu ThrThr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr
35 40 4535 40 45
cca gaa att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag 194cca gaa att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag 194
Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly GluPro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu
50 55 6050 55 60
cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc 242cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc 242
Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly PheArg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe
65 70 7565 70 75
ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac 290ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac 290
Gly Arg Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly HisGly Arg Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His
80 85 9080 85 90
gcc ttc gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg 338gcc ttc gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg 338
Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln TrpAla Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp
95 100 105 11095 100 105 110
gac ggt ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct 386gac ggt ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct 386
Asp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp ProAsp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro
115 120 125115 120 125
aac cgc cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac 434aac cgc cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac 434
Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu AspAsn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp
130 135 140130 135 140
ccc tcg tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc 482ccc tcg tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc 482
Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly IlePro Ser Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile
145 150 155145 150 155
gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc 530gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc 530
Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg ThrAla Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr
160 165 170160 165 170
aag ggc gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac 578aag ggc gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac 578
Lys Gly Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu AspLys Gly Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp
175 180 185 190175 180 185 190
gac gtc ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc 626gac gtc ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc 626
Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro GlyAsp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly
195 200 205195 200 205
gac att ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg 674gac att ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg 674
Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn MetAsp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met
210 215 220210 215 220
acc gat gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat 722acc gat gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat 722
Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn HisThr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His
225 230 235225 230 235
gca gga tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag 770gca gga tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag 770
Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp GlnAla Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln
240 245 250240 245 250
cat ttc gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg 818cat ttc gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg 818
His Phe Ala Ala Val Ala Gly Asp A la Val Ser Phe Glu Val Tyr ProHis Phe Ala Ala Val Ala Gly Asp A la Val Ser Phe Glu Val Tyr Pro
255 260 265 270255 260 265 270
ccg gta gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg 866ccg gta gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg 866
Pro Val Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly TrpPro Val Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp
275 280 285275 280 285
gga cta ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg 914gga cta ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg 914
Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu ValGly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val
290 295 300290 295 300
tgt gag agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg 962tgt gag agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg 962
Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro LeuCys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu
305 310 315305 310 315
aac tgt ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt 1010aac tgt ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt 1010
Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheAsn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe
320 325 330320 325 330
taagaattc 1019taagaattc 1019
<210>97<210>97
<211>334<211>334
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>97<400>97
Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro LysMet Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro Lys
1 5 10 151 5 10 15
Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr ProThr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr Pro
20 25 3020 25 30
Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro GluAsn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro Glu
35 40 4535 40 45
Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg ValIle Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg Val
50 55 6050 55 60
Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly ArgCys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg
65 70 75 8065 70 75 80
Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala PheLys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala Phe
85 90 9585 90 95
Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp GlyAsp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly
100 105 110100 105 110
Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn ArgLeu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg
115 120 125115 120 125
Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro SerArg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser
130 135 140130 135 140
Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala GlySer Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly
145 150 155 160145 150 155 160
Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys GlyArg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly
165 170 175165 170 175
Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp ValVal Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val
180 185 190180 185 190
Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp IleLeu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile
195 200 205195 200 205
Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr AspLeu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp
210 215 220210 215 220
Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala GlyGlu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly
225 230 235 240225 230 235 240
Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His PheLeu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe
245 250 255245 250 255
Ala Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro ValAla Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val
260 265 270260 265 270
Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly LeuGlu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu
275 280 285275 280 285
Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys GluPro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu
290 295 300290 295 300
Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn CysArg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys
305 310 315 320305 310 315 320
Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheGly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe
325 330325 330
<210>98<210>98
<211>975<211>975
<212>DNA<212>DNA
<213>Aspergillus niger<213>Aspergillus niger
<220><220>
<221>CDS<221> CDS
<222>(1)..(975)<222>(1)..(975)
<223><223>
<400>98<400>98
atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg 96cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg 96
Arg Tyr Trp Pro Ala Thr Pro Asn Ser A la Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser A la Asp Glu Gly Leu Gly Met
20 25 3020 25 30
ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag 144ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag 144
Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg 192atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg 192
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg 240aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg 240
Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys Trp
65 70 75 8065 70 75 80
gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac ccg caa 288gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac ccg caa 288
Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln
85 90 9585 90 95
caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg gct cca 336caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg gct cca 336
Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro
100 105 110100 105 110
acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc acc tcc 384acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc acc tcc 384
Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser
115 120 125115 120 125
acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc cac tgg 432acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc cac tgg 432
Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp
130 135 140130 135 140
gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc 480gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc 480
Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser
145 150 155 160145 150 155 160
tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc cgc cac 528tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc cgc cac 528
Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg HisTyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg His
165 170 175165 170 175
acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc aac atc 576acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc aac atc 576
Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile
180 185 190180 185 190
acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta ccg act 624acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta ccg act 624
Thr Phe Glu Pro Gly AspIle Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly AspIle Leu Phe Leu Arg Val Gly Leu Pro Thr
195 200 205195 200 205
aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt cag cag 672aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt cag cag 672
Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln GlnThr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln
210 215 220210 215 220
gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg gtg cgg 720gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg gtg cgg 720
Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGlu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg
225 230 235 240225 230 235 240
ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg gtc agc 768ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg gtc agc 768
Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser
245 250 255245 250 255
ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat cat ttt 816ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat cat ttt 816
Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe
260 265 270260 265 270
ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat ctt gag 864ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat ctt gag 864
Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu
275 280 285275 280 285
ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt ttc gtt 912ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt ttc gtt 912
Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val
290 295 300290 295 300
agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg ccg aat 960agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg ccg aat 960
Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro AsnSer Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn
305 310 315 320305 310 315 320
tgt atg gca att ttt 975tgt atg gca att ttt 975
Cys Met Ala Ile PheCys Met Ala Ile Phe
325325
<210>99<210>99
<211>325<211>325
<212>PRT<212>PRT
<213>Aspergillus niger<213>Aspergillus niger
<400>99<400>99
Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg
1 5 10 151 5 10 15
Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met
20 25 3020 25 30
Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln
35 40 4535 40 45
Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu
50 55 6050 55 60
Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys Trp
65 70 75 8065 70 75 80
Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln
85 90 9585 90 95
Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro
100 105 110100 105 110
Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser
115 120 125115 120 125
Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp
130 135 140130 135 140
Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser
145 150 155 160145 150 155 160
Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg HisTyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg His
165 170 175165 170 175
Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile
180 185 190180 185 190
Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr
195 200 205195 200 205
Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln GlnThr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln
210 215 220210 215 220
Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGlu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg
225 230 235 240225 230 235 240
Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser
245 250 255245 250 255
Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe
260 265 270260 265 270
Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu
275 280 285275 280 285
Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val
290 295 300290 295 300
Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro AsnSer Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn
305 310 315 320305 310 315 320
Cys Met Ala Ile PheCys Met Ala Ile Phe
325325
Claims (9)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US45922003P | 2003-03-28 | 2003-03-28 | |
| US60/459,220 | 2003-03-28 | ||
| PCT/EP2004/002902 WO2004085651A2 (en) | 2003-03-28 | 2004-03-19 | Pantolactone hydrolase |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1768141A CN1768141A (en) | 2006-05-03 |
| CN1768141B true CN1768141B (en) | 2012-10-10 |
Family
ID=33098294
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2004800085042A Expired - Lifetime CN1768141B (en) | 2003-03-28 | 2004-03-19 | Pantolactone hydrolase |
Country Status (5)
| Country | Link |
|---|---|
| JP (1) | JP4571936B2 (en) |
| KR (1) | KR101156882B1 (en) |
| CN (1) | CN1768141B (en) |
| GB (1) | GB2413329B (en) |
| WO (1) | WO2004085651A2 (en) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105669837B (en) * | 2015-06-03 | 2020-01-17 | 苏州大学 | Vascular endothelial cadherin epitopes, antibodies and their applications |
| CN110358687B (en) * | 2018-12-19 | 2020-12-04 | 安徽瑞达健康产业有限公司 | Gibberellin for producing D-pantolactone hydrolase and application and fermentation method thereof |
| CN111621541B (en) * | 2019-02-27 | 2025-10-31 | 广安摩珈生物科技有限公司 | Method for resolution of optical isomers using electrodialysis technique |
| EP3748009A1 (en) * | 2019-06-07 | 2020-12-09 | DSM IP Assets B.V. | Synthesis of (r)-pantolactone acetate |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1166182A (en) * | 1995-09-13 | 1997-11-26 | 富士药品工业株式会社 | D-pantolactone hydrolase and the gene encoding the enzyme |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5411875A (en) * | 1991-11-01 | 1995-05-02 | University Of Iowa Research Foundation | Method for retrieval of unknown flanking DNA sequence |
| AU2945792A (en) * | 1991-11-25 | 1993-06-28 | Keygene N.V. | A novel pcr method with a single primer for nucleic acid analysis |
| DE19849348A1 (en) * | 1998-10-26 | 2000-04-27 | Univ Ludwigs Albert | Identification and/or sequencing of an unknown DNA or RNA sequence adjacent to a known DNA or RNA region comprises linker-mediated PCR following amplification by linear PCR |
| CN1384880A (en) * | 1999-10-29 | 2002-12-11 | 巴斯福股份公司 | L-pantolactone-hydrolase and method for producing D-pantolactone |
-
2004
- 2004-03-19 CN CN2004800085042A patent/CN1768141B/en not_active Expired - Lifetime
- 2004-03-19 KR KR1020057018207A patent/KR101156882B1/en not_active Expired - Fee Related
- 2004-03-19 JP JP2006504751A patent/JP4571936B2/en not_active Expired - Fee Related
- 2004-03-19 WO PCT/EP2004/002902 patent/WO2004085651A2/en not_active Ceased
- 2004-03-19 GB GB0516602A patent/GB2413329B/en not_active Expired - Fee Related
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1166182A (en) * | 1995-09-13 | 1997-11-26 | 富士药品工业株式会社 | D-pantolactone hydrolase and the gene encoding the enzyme |
Also Published As
| Publication number | Publication date |
|---|---|
| GB0516602D0 (en) | 2005-09-21 |
| GB2413329A (en) | 2005-10-26 |
| KR20050119151A (en) | 2005-12-20 |
| KR101156882B1 (en) | 2012-07-05 |
| JP4571936B2 (en) | 2010-10-27 |
| GB2413329B (en) | 2007-09-19 |
| CN1768141A (en) | 2006-05-03 |
| JP2006521103A (en) | 2006-09-21 |
| WO2004085651A3 (en) | 2005-01-20 |
| WO2004085651A2 (en) | 2004-10-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2001505403A (en) | Esterase | |
| KR20020097210A (en) | Isolation and expression of a gene for a nitrilase from acidovorax facilis 72w | |
| US8034579B2 (en) | Proteins with enhanced functionality and methods of making novel proteins using circular permutation | |
| US11634698B2 (en) | Nucleases and methods for making and using them | |
| EP0433117B1 (en) | Novel polypeptides, the DNA sequences allowing their expression, method of preparation, and utilization | |
| ES2628225T3 (en) | Beta-amino acid preparation | |
| CN1768141B (en) | Pantolactone hydrolase | |
| CN100379867C (en) | butynol Ⅰ esterase | |
| CN100402640C (en) | Production method and application of lactonase | |
| US7229817B2 (en) | Recombinant porcine liver esterases, their use and a method for the production thereof | |
| JP2010521192A (en) | Isoform of porcine liver esterase | |
| CN110951711A (en) | Esterase with activity of degrading chiral ester and encoding gene and application thereof | |
| KR101081012B1 (en) | Chlorohydrin and hydroxy carboxylic ester hydrolase gene | |
| CA2635380C (en) | Polypeptide having esterase activity and recombinant esterase and use thereof | |
| JP2540540B2 (en) | Lipase gene and recombinant DNA | |
| JP5619400B2 (en) | Acid phosphatase, nucleic acid encoding the same, and method for producing riboflavin-5'-phosphate using the same | |
| JP3855329B2 (en) | Esterase gene and use thereof | |
| JP4345425B2 (en) | Chlorohydrin and hydroxycarboxylate asymmetric hydrolase genes | |
| WO1993023547A1 (en) | ENZYMATIC PROCESS FOR PRODUCTION OF S-6-METHOXY-α-METHYL-2-NAPHTHALENEACETIC ACID | |
| CN114736887A (en) | Use of carboxylesterase | |
| JP2002247985A (en) | DNA repair enzyme gene | |
| JP2001238682A (en) | Gene encoding reductase | |
| JP2005261317A (en) | Dimethylarginine dimethylaminohydrolase, its gene, recombinant dna and method for producing dimethylarginine dimethylaminohydrolase |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CX01 | Expiry of patent term | ||
| CX01 | Expiry of patent term |
Granted publication date: 20121010 |