CN1768141B

CN1768141B - Pantolactone hydrolase

Info

Publication number: CN1768141B
Application number: CN2004800085042A
Authority: CN
Inventors: 爱琦巴尔·吉尔; 郭达海; 马丁·莱玛恩; 德莫特·P·莫纳戈; 娆·翰奴曼萨·瓦里威提
Original assignee: DSM IP Assets BV
Current assignee: DSM IP Assets BV
Priority date: 2003-03-28
Filing date: 2004-03-19
Publication date: 2012-10-10
Anticipated expiration: 2024-03-19
Also published as: GB0516602D0; GB2413329A; KR20050119151A; KR101156882B1; JP4571936B2; GB2413329B; CN1768141A; JP2006521103A; WO2004085651A3; WO2004085651A2

Abstract

The present invention relates to nucleotide sequences of genes encoding pantolactone hydrolase, as well as vectors and host cells comprising the above nucleotide sequences, and pantolactone hydrolase encoded by the above nucleotide sequences. The invention also provides methods of making and using pantolactone hydrolases.

Description

pantolactone hydrolase

本发明涉及可用于泛酸内酯(pantolactone)对映异构体的光拆分(optical resolution)的酶，涉及编码所述酶的基因，还涉及所述酶在制备R-泛酸或其盐或R-泛酰醇(R-panthenol)的方法中的用途。The present invention relates to the enzyme that can be used for the photoresolution (optical resolution) of pantolactone (pantolactone) enantiomer, relates to the gene of encoding said enzyme, also relates to said enzyme in the preparation of R-pantothenic acid or its salt or R - Use of R-panthenol in a method.

对映异构体在其生理效应，即毒理和药理效应，与酶的反应以及其感知特征(sensorial characteristics)方面是不同的。已知仅有R-泛酸内酯是R-泛酸制备中的中间产物，R-泛酸是维生素，其对人类和动物的生长、繁殖以及正常生理功能来说都是必要的。泛酸作为辅酶A的前体以及作为脂肪酸合成酶的酰基载体，涉及到超过一百种的不同代谢途径，其中包括碳水化合物、蛋白质和脂类的能量代谢，以及脂类、神经递质、甾类激素、卟啉和激素的合成。直到近来，广泛用于R-泛酸内酯制备的方法是对化学合成的R-和S-泛酸内酯进行的光拆分。此种方法非常昂贵，因为其需要使用昂贵的光拆分试剂。此外，对泛酸内酯的R-对映异构体的回收也相当困难。因此，提供可用于泛酸内酯的R-和S-对映异构体的光拆分的泛酸内酯水解酶是人们所需要的。使用这种酶，尤其是水解泛酸内酯的S-或R-形式，是分离泛酸内酯的两种对映异构体的更为经济且更容易操作的方法。R-泛酸的盐的例子包括R-泛酸钙。Enantiomers differ in their physiological effects, ie toxicological and pharmacological effects, reactions with enzymes, and in their sensorial characteristics. Only R-pantothenic acid lactone is known as an intermediate in the preparation of R-pantothenic acid, a vitamin which is essential for growth, reproduction and normal physiological functions of humans and animals. As the precursor of coenzyme A and as the acyl carrier of fatty acid synthase, pantothenic acid is involved in more than one hundred different metabolic pathways, including energy metabolism of carbohydrates, proteins and lipids, as well as lipids, neurotransmitters, steroids Hormones, porphyrins, and hormone synthesis. Until recently, the widely used method for the preparation of R-pantolactones was the photoresolution of chemically synthesized R- and S-pantolactones. This method is very expensive as it requires the use of expensive photoresolution reagents. In addition, the recovery of the R-enantiomer of pantolactone is also quite difficult. Accordingly, it would be desirable to provide pantolactone hydrolases useful for the photoresolution of the R- and S-enantiomers of pantolactone. The use of this enzyme, especially to hydrolyze the S- or R-form of pantothenolactone, is a more economical and more manipulative method for the separation of the two enantiomers of pantothenolactone. Examples of salts of R-pantothenic acid include calcium R-pantothenate.

在一种实施方式中，本发明提供了编码泛酸内酯水解酶的基因的核苷酸序列，所述泛酸内酯水解酶是从马肝脏、A.niger ATCC 9142、A.nigerawamori ATCC 38854或A.niger MacRae ATCC 46951中获得的。公众可从American Type Culture Collection(ATCC)，10801 University Boulvard，Manassas，VA 20110-2209USA获得上述菌株。In one embodiment, the present invention provides the nucleotide sequence of the gene encoding pantothenate lactone hydrolase, which is obtained from horse liver, A. niger ATCC 9142, A. nigerawamori ATCC 38854 or A. Obtained in .niger MacRae ATCC 46951. The public can obtain the above strains from American Type Culture Collection (ATCC), 10801 University Boulvard, Manassas, VA 20110-2209USA.

在另一种实施方式中，本发明提供了编码泛酸内酯水解酶的核苷酸序列，所述序列包含：当泛酸内酯水解酶来自马肝脏时，如SEQ ID NO：1所示的核苷酸序列；当泛酸内酯水解酶来自A.niger MacRae ATCC 46951时，如SEQ ID NO：5所示的核苷酸序列；当泛酸内酯水解酶来自A.nigerATCC9142时，如SEQ ID NO：7所示的序列；以及当泛酸内酯水解酶来自A.niger awamori ATCC38854时，如SEQ ID NO：9所示的序列。In another embodiment, the present invention provides a nucleotide sequence encoding pantothenate hydrolase, said sequence comprising: when the pantothenate hydrolase is from horse liver, the nucleus shown in SEQ ID NO: 1 Nucleotide sequence; When the pantothenate lactone hydrolase is from A.niger MacRae ATCC 46951, the nucleotide sequence shown in SEQ ID NO: 5; when the pantothenate lactone hydrolase is from A.nigerATCC9142, such as SEQ ID NO: The sequence shown in 7; and when the pantothenate lactone hydrolase is from A. niger awamori ATCC38854, the sequence shown in SEQ ID NO: 9.

所述核苷酸序列可以包含上述每种泛酸内酯水解酶的编码序列以及调控序列。The nucleotide sequence may comprise the coding sequence and the regulatory sequence of each pantothenate hydrolase mentioned above.

“调控序列”在此被定义为：指导可操作地连接的核酸转录的核酸控制序列的排列。一个此类表达控制序列的例子是启动子。“启动子”包括在转录起始位点附近的必要核酸序列。启动子还可选地包括远端增强子或阻遏元件，其可定位于转录起始位点之后的多达数千个碱基对上。“组成型”启动子是在大多数环境及发育条件下都有活性的启动子。“诱导型”启动子是在环境或发育调控中具有活性的启动子。术语“可操作地连接”指核酸表达控制序列(例如启动子或转录因子结合位点的排列)以及第二种核酸序列之间的功能性连接，其中所述表达控制序列指导对应于第二种序列的核酸的转录。A "regulatory sequence" is defined herein as an arrangement of nucleic acid control sequences that direct the transcription of an operably linked nucleic acid. An example of such an expression control sequence is a promoter. A "promoter" includes the necessary nucleic acid sequences near the site where transcription begins. A promoter also optionally includes distal enhancer or repressor elements, which can be positioned as many as several thousand base pairs after the start site of transcription. A "constitutive" promoter is one that is active under most environmental and developmental conditions. An "inducible" promoter is a promoter that is active under environmental or developmental regulation. The term "operably linked" refers to a functional linkage between a nucleic acid expression control sequence (such as a promoter or an arrangement of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence directs a sequence corresponding to the second Transcription of a sequence of nucleic acids.

可以使用核酸序列的片段，例如，在杂交试验中作为探针。Fragments of nucleic acid sequences can be used, for example, as probes in hybridization assays.

在宿主生物中插入核苷酸序列，令其转录及翻译，以制造功能性多肽的情况下，技术人员将认识到，由于密码简并性，很多种多核苷酸序列都将编码同样的多肽。这些变异体也明确地包括在本发明的范围之内。此外，本发明还明确地包括下述序列，所述序列彼此之间实质上相同(按照下文所述来确定)，它们编码是野生型多肽的突变体的多肽或保留有多肽功能的多肽(例如由对多肽氨基酸的保守取代得到的)。此外，变异体可以是编码下述显性失活突变体的。Where a nucleotide sequence is inserted into a host organism, transcribed and translated to produce a functional polypeptide, the skilled artisan will recognize that, due to codon degeneracy, many polynucleotide sequences will encode the same polypeptide. Such variants are also expressly included within the scope of the present invention. In addition, the present invention expressly includes sequences that are substantially identical to one another (determined as described below) that encode polypeptides that are mutants of wild-type polypeptides or polypeptides that retain polypeptide function (e.g. resulting from conservative substitutions of polypeptide amino acids). In addition, variants may be those encoding dominant negative mutants described below.

如果按照下文所述进行最大一致性比对时，两条核酸序列或多肽各自的核苷酸序列或氨基酸残基是同样的，这两条核苷酸序列或多肽会被认为是“相同的”。在关于两条或多条核酸或多肽序列的上下文中，术语“相同的”或百分比“相同性”指：在比较窗(comparison window)上进行最大一致性比较及比对时，用下述序列比较算法中的一种或通过手工比对及眼睛观察来测量的情况下，两条或多条序列或亚序列是同样的，或其中同样的氨基酸残基或核苷酸具有特定的百分比。当序列相同性的百分比针对蛋白质或肽来使用时，应认识到，不相同的残基位置通常是由于保守的氨基酸取代而导致不同的，其中氨基酸残基被取代为其它具有相似化学性质(例如，电荷或疏水性)的氨基酸残基，因此并不改变分子的功能属性。当序列因保守取代而不同时，可因取代的保守本质，将百分比序列相同性上调以进行校正。进行这种调节的方法是本领域技术人员公知的。典型地，这包括将保守取代作为部分而非完全的错配进行评分，因而提高百分比序列相同性。因此，例如，在给相同的氨基酸1分，给非保守取代0分的情况下，给保守取代0和1之间的分数。按照，例如，Meyers&Miller，Computer Applic.Biol.Sci.4：11-17(1988)的算法，例如，在程序PC/GENE(Intelligenetics，Mountain View，Calif.，USA)中实现，来计算对保守取代的给分。Two nucleic acid sequences or polypeptides are said to be "identical" if their respective nucleotide sequences or amino acid residues are identical when aligned for maximum identity as described below . The term "identical" or percent "identity" in the context of two or more nucleic acid or polypeptide sequences refers to: when comparing and aligning for maximum identity over a comparison window, the sequence Two or more sequences or subsequences are identical, or have a specified percentage of amino acid residues or nucleotides that are identical, as measured by one of the comparison algorithms or by manual alignment and visual inspection. When percentages of sequence identity are used with respect to proteins or peptides, it is recognized that residue positions that are not identical typically differ by conservative amino acid substitutions, in which amino acid residues are replaced by others with similar chemical properties (e.g. , charge or hydrophobicity) amino acid residues and thus do not alter the functional properties of the molecule. When sequences differ by conservative substitutions, the percent sequence identity can be adjusted upwards to correct for the conservative nature of the substitutions. Methods for making such adjustments are well known to those skilled in the art. Typically, this involves scoring conservative substitutions as partial rather than complete mismatches, thus increasing the percent sequence identity. So, for example, give a score between 0 and 1 for conservative substitutions where identical amino acids are given 1 point and non-conservative substitutions are given 0 points. Conservative substitutions are calculated according to, for example, the algorithm of Meyers & Miller, Computer Applic. Biol. Sci. 4: 11-17 (1988), for example, implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif., USA). give points.

在关于两条核酸或多肽的上下文中，短语“实质上相同”指：在比较窗上进行最大一致性比对时，用下述序列比较算法中的一种或通过手工比对及眼睛观察来测量的情况下，具有至少60％，优选80％，最优选90-95％的核苷酸或氨基酸残基相同性的序列或亚序列。该定义还指能与待测序列杂交的序列的互补序列。The phrase "substantially identical" in the context of two nucleic acids or polypeptides means that when aligned for maximum identity over the comparison window, they are identified using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Where measured, a sequence or subsequence having at least 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid residue identity. This definition also refers to the complement of a sequence that hybridizes to the test sequence.

对序列比较而言，典型地，一条序列用作参照序列，将待测序列与其进行比较。使用序列比较算法时，将待测和参照序列都输入到计算机中，如果需要的话，指定亚序列坐标，以及指定序列算法程序参数。可以使用缺省的程序参数，或者指定其它参数。然后序列比较算法会基于程序参数，对待测序列相对于参照序列的百分比序列相同性进行计算。For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or other parameters can be specified. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.

本文中使用的“比较窗”包括：指一种片断，其连续位置的数量选自由20至600，通常大约50至大约200，更通常为大约100至大约150所构成的组，其中，序列可与具有同样数量的连续位置的参照序列进行比较，这是在两条序列被最优排列之后进行的。对序列进行排列以进行比较的方法是本领域内公知的。As used herein, "comparison window" includes: refers to a fragment whose number of consecutive positions is selected from the group consisting of 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, wherein the sequence can be Comparisons are made to a reference sequence having the same number of consecutive positions, after the two sequences have been optimally aligned. Methods for aligning sequences for comparison are well known in the art.

“经保守修饰的变异体”用于氨基酸及核酸序列。就具体的核酸序列而言，经保守修饰的变异体指编码相同或基本相同的氨基酸序列的核酸，或者，当核酸不编码氨基酸序列时，指基本相同的序列。由于遗传密码的简并性，有多种功能上相同的核酸密码子编码任何给定的蛋白质。例如，密码子GCA、GCC、GCG和GCU都编码丙氨酸这种氨基酸。因此，在丙氨酸被密码子确定的每个位置上，密码子可变为上述相应密码子中的任何一个，而不会改变编码出的多肽。此类核酸变异是“沉默变异”，其是经保守修饰的变异中的一种。本文中每条编码多肽的核酸序列也还描述该核酸的每种可能的沉默变异。技术人员将认识到，核酸中的每个密码子(AUG除外，其通常是编码甲硫氨酸的唯一的密码子)都可被修饰，以产生功能上相同的分子。因此，编码多肽的核酸的每种沉默变异都隐含在每条被描述过的序列中。"Conservatively modified variants" applies to amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to nucleic acids that encode identical or essentially identical amino acid sequences, or, where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Due to the degeneracy of the genetic code, there are multiple functionally identical nucleic acid codons that encode any given protein. For example, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Therefore, at each position where alanine is defined by a codon, the codon can be changed to any of the corresponding codons described above without changing the encoded polypeptide. Such nucleic acid variations are "silent variations," which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. The skilled artisan will recognize that every codon in a nucleic acid (except AUG, which is usually the only codon encoding methionine) can be modified to produce a functionally identical molecule. Accordingly, every silent variation of a nucleic acid which encodes a polypeptide is implicit in every described sequence.

对氨基酸序列而言，技术人员将认识到，对核酸的个别取代、缺失或增加，或对肽、多肽或蛋白质序列进行的在被编码的序列中改变单个氨基酸或小百分比(即，小于20％，例如15％、10％、5％、4％、3％、2％或1％)的氨基酸的取代，是“经保守修饰的变异体”，其中的改变是氨基酸被化学上相似的氨基酸所取代。提供功能上相似的氨基酸的保守取代表在本领域内是公知的。With respect to amino acid sequences, the skilled artisan will recognize that individual substitutions, deletions or additions to nucleic acids, or to peptide, polypeptide or protein sequences, change single amino acids or small percentages (i.e., less than 20%) in the encoded sequence , such as 15%, 10%, 5%, 4%, 3%, 2%, or 1%) of amino acid substitutions are "conservatively modified variants" in which the amino acid is replaced by a chemically similar amino acid replace. Conservative substitution tables providing functionally similar amino acids are well known in the art.

下述六组中的每一组都含有可以互相保守取代的氨基酸：Each of the following six groups contains amino acids that may be conservatively substituted for each other:

丙氨酸(A)、丝氨酸(S)、苏氨酸(T)；Alanine (A), Serine (S), Threonine (T);

天冬氨酸(D)、谷氨酸(E)；Aspartic acid (D), glutamic acid (E);

天冬酰胺(N)、谷氨酰胺(Q)；Asparagine (N), Glutamine (Q);

精氨酸(R)、赖氨酸(K)；Arginine (R), Lysine (K);

异亮氨酸(I)、亮氨酸(L)、甲硫氨酸(M)、缬氨酸(V)；以及Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

苯丙氨酸(F)、酪氨酸(Y)、色氨酸(W)。(见，例如，Creighton，Proteins(1984))。Phenylalanine (F), Tyrosine (Y), Tryptophan (W). (See, eg, Creighton, Proteins (1984)).

两条核酸序列或多肽实质上相同的标志是：第一条核酸编码的多肽可与因对抗第二条核酸编码的多肽而产生的抗体进行免疫交叉反应。因此，例如，当两条肽仅仅只由于保守取代而有不同时，则典型地，其中一条多肽与第二条多肽实质上相同。两条核酸序列实质上相同的另一种标志是：这两个分子或其互补体在如下文所述的严谨条件下能相互杂交。The sign that two nucleic acid sequences or polypeptides are substantially identical is that the polypeptide encoded by the first nucleic acid can perform immunological cross-reaction with the antibody raised against the polypeptide encoded by the second nucleic acid. Thus, for example, when two peptides differ only by conservative substitutions, then typically one of the polypeptides is substantially identical to the second polypeptide. Another indication that two nucleic acid sequences are substantially identical is that the two molecules, or their complements, hybridize to each other under stringent conditions as described below.

短语“与……特异性杂交”指：在严谨杂交条件下，当序列存在于复杂混合物(例如，细胞总DNA或RNA，或文库DNA或RNA)中时，分子仅与特定的核苷酸序列结合、形成双链或杂交。The phrase "hybridizes specifically to" means that, under stringent hybridization conditions, a molecule only binds to a specific nucleotide sequence when the sequence is present in a complex mixture (e.g., total cellular DNA or RNA, or library DNA or RNA) Combine, form double strands or hybridize.

短语“严谨杂交条件”指探针将与其目标序列而不与其它序列杂交的条件，典型地，其目标序列处于核酸序列的复杂混合物中。严谨条件是取决于序列的，在不同的情况下也有所不同。较长的序列会在较高的温度下特异性杂交。通常，高严谨度条件被选择为：比在给定的离子强度和pH下特异性序列的热融温度(T_m)低大约5-10℃。低严谨度条件通常被选择为比T_m低大约15-30℃。T_m是：(在给定的离子强度、pH和核酸浓度下，)平衡时，互补于目标的探针中的50％与目标序列杂交的温度(目标序列过量存在时，Tm处，平衡时50％的探针被结合)。严谨条件将是下述条件：盐浓度低于大约1.0M的钠离子，典型地，大约0.01至1.0M的钠离子浓度(或其它盐)，pH7.0至8.3，对短探针(例如，10至50个核苷酸)而言温度为至少大约30℃，对长探针(例如，多于50个核苷酸)而言，温度为至少大约60℃。严谨条件还可以通过加入去稳定试剂，例如甲酰胺来获得。对选择性或特异性杂交而言，阳性信号是背景的至少两倍，优选10倍于背景杂交。The phrase "stringent hybridization conditions" refers to conditions under which a probe will hybridize to its target sequence, typically in a complex mixture of nucleic acid sequences, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. Generally, high stringency conditions are selected to be about 5-10°C lower than the melting temperature ( _Tm ) for the specific sequence at a given ionic strength and pH. Low stringency conditions are typically selected to be about 15-30°C lower than the _Tm . _Tm is: (at a given ionic strength, pH and nucleic acid concentration,) at equilibrium, the temperature at which 50% of the probes complementary to the target hybridize to the target sequence (when the target sequence is present in excess, at Tm, at equilibrium 50% of the probes were bound). Stringent conditions will be the following: a salt concentration of less than about 1.0 M sodium ion, typically a sodium ion concentration (or other salt) of about 0.01 to 1.0 M, pH 7.0 to 8.3, for short probes (e.g., 10 to 50 nucleotides), the temperature is at least about 30°C, and for long probes (eg, greater than 50 nucleotides), the temperature is at least about 60°C. Stringent conditions can also be achieved by the addition of destabilizing agents, such as formamide. For selective or specific hybridization, a positive signal is at least two times background, preferably 10 times background hybridization.

如果在严谨条件下不能相互杂交的核酸所编码的多肽是实质上相同的，那么这样的核酸也仍然是实质上相同的。这会发生于，例如，用由遗传密码允许的最大密码子简并度产生核酸拷贝的情况下。在此类情况下，典型地，核酸在中度严谨杂交条件下杂交。Nucleic acids that do not hybridize to each other under stringent conditions are also substantially identical if the polypeptides encoded by such nucleic acids are substantially identical. This would occur, for example, if a copy of the nucleic acid was created with the maximum codon degeneracy permitted by the genetic code. In such cases, typically, the nucleic acids hybridize under moderately stringent hybridization conditions.

在本发明中，用本文中公开的核酸序列，在严谨条件下进行标准的Southern印迹实验，可鉴定出含有本发明核酸的基因组DNA(gDNA)或cDNA。就此种公开的目的而言，适合于此类杂交的严谨条件包括：在37℃下在40％甲酰胺、1M NaCl、1％十二烷基硫酸钠(SDS)缓冲液中杂交，在至少大约50℃的温度下于0.2 X SSC中洗至少一次，20分钟，温度通常是在大约55℃至大约60℃；或是相当的条件。阳性杂交至少是背景的两倍。普通技术人员很容易认识到，其它杂交及洗涤条件可用于提供具有相似严谨度的条件。In the present invention, the genomic DNA (gDNA) or cDNA containing the nucleic acid of the present invention can be identified by performing standard Southern blot experiments under stringent conditions using the nucleic acid sequences disclosed herein. For the purposes of this disclosure, stringent conditions suitable for such hybridization include hybridization at 37°C in 40% formamide, 1M NaCl, 1% sodium dodecyl sulfate (SDS) buffer at least about Wash at least once in 0.2X SSC for 20 minutes at a temperature of 50°C, usually between about 55°C and about 60°C; or equivalent conditions. Positive hybridization is at least twice background. One of ordinary skill will readily recognize that other hybridization and wash conditions can be used to provide conditions of similar stringency.

关于两条多核苷酸相同的另一种标志是：通过一对寡核苷酸引物扩增出来的参照序列，在严谨杂交条件下可被用作为探针，以从cDNA或基因组文库中分离出待测序列，或在northern或Southern印迹实验中鉴定出待测序列。Another indication that two polynucleotides are identical is that a reference sequence amplified by a pair of oligonucleotide primers can be used as a probe under stringent hybridization conditions to isolate cDNA or genomic libraries. The sequence to be tested, or the sequence to be tested was identified in a northern or Southern blot.

本发明还包括本文所定义的表达载体。表达载体包括上文列出的多核苷酸序列中任一种的一个或多个拷贝。本发明的表达载体可以含有本文定义的任何多核苷酸序列。The invention also includes expression vectors as defined herein. Expression vectors include one or more copies of any of the polynucleotide sequences listed above. An expression vector of the invention may contain any polynucleotide sequence as defined herein.

因此，在另一种实施方式中，本发明提供了一种表达载体，所述载体包含编码泛酸内酯水解酶的基因的核苷酸序列，所述泛酸内酯水解酶来自马肝脏、Bacillus subtilis、A.niger ATCC 9142、A.niger awamori ATCC38854或A.niger MacRae ATCC46951；或者所述载体包含编码泛酸内酯水解酶的基因的核苷酸序列，所述泛酸内酯水解酶来自马肝脏，如SEQID NO：2所示，来自Bacillus subtilis，如SEQ ID NO：4所示，来自A.niger ATCC9142，如SEQ ID NO：8所示，来自A.niger awamori ATCC38854，如SEQ ID NO：10所示，以及来自A.niger MacRae ATCC46951，部分如SEQ ID NO：6所示。Therefore, in another embodiment, the present invention provides an expression vector comprising the nucleotide sequence of a gene encoding pantothenate hydrolase from horse liver, Bacillus subtilis , A.niger ATCC 9142, A.niger awamori ATCC38854 or A.niger MacRae ATCC46951; or the vector contains the nucleotide sequence of a gene encoding pantothenate hydrolase derived from horse liver, such as As shown in SEQ ID NO: 2, from Bacillus subtilis, as shown in SEQ ID NO: 4, from A. niger ATCC9142, as shown in SEQ ID NO: 8, from A. niger awamori ATCC38854, as shown in SEQ ID NO: 10 , and from A. niger MacRae ATCC46951, shown in part in SEQ ID NO:6.

可选地，表达载体中的多核苷酸序列可与上文定义的表达控制序列可操作地连接。Alternatively, the polynucleotide sequence in the expression vector may be operably linked to the expression control sequences defined above.

在本文中使用的短语“表达载体”是可复制的(replicatable)载体，其带有编码本文列出的多核苷酸序列的DNA序列，并能调控这些DNA序列的表达。在本文上下文中，术语“可复制的”指，在引入载体的给定类型的宿主细胞中，载体可被复制。在感兴趣的多核苷酸序列上游接近该多核苷酸的地方，可能会提供有编码信号肽的序列，其存在确保了带有载体的宿主细胞所表达的被编码的多肽得以分泌。信号序列可以是与被选出的多核苷酸序列天然相连接的，或者可以来自另一种来源。The phrase "expression vector" as used herein is a replicatable vector carrying DNA sequences encoding the polynucleotide sequences set forth herein and capable of regulating the expression of these DNA sequences. In the present context, the term "replicable" means that the vector can be replicated in a given type of host cell into which the vector is introduced. Near the polynucleotide sequence upstream of the polynucleotide sequence of interest, a sequence encoding a signal peptide may be provided, the presence of which ensures secretion of the encoded polypeptide expressed by the host cell carrying the vector. The signal sequence may be naturally associated with the selected polynucleotide sequence, or may be derived from another source.

载体可以是传统用于重组DNA工艺的任何载体，对载体的选择将通常取决于其将被引入的宿主细胞。因此，载体可以是自主复制载体，即，作为染色体外物质存在的载体，其复制独立于染色体复制；此种载体的例子是质粒、噬菌体、粘粒或迷你染色体。另外，载体可以是下述载体，当其被引入到宿主细胞中时，会整合到宿主细胞基因组中，并与其被整合到的宿主细胞基因组一起复制。本发明的表达载体可以带有本发明的任何DNA序列，可被用于表达本发明的任何多肽。The vector can be any vector conventionally used in recombinant DNA techniques, the choice of which will generally depend on the host cell into which it is to be introduced. Thus, a vector may be an autonomously replicating vector, ie a vector that exists as extrachromosomal material that replicates independently of chromosomal replication; examples of such vectors are plasmids, phages, cosmids or minichromosomes. In addition, the vector may be a vector which, when introduced into a host cell, is integrated into the host cell genome and replicated together with the host cell genome into which it has been integrated. The expression vector of the present invention can carry any DNA sequence of the present invention, and can be used to express any polypeptide of the present invention.

本发明还包括经培养的细胞或宿主细胞，其中含有一条或多条本文公开的多核苷酸序列和/或一个或多个本文公开的表达载体。在本文中使用的“经培养的细胞”包括，任何能够在给定条件下生长，并能表达本发明的多核苷酸编码的一种或多种多肽的细胞。优选地，经培养的细胞是酵母、真菌、细菌或藻类。更优选地，经培养的细胞是Escherichia coli、Saccharomyces cerevisiae、Pichia pastoris、Aspergillus niger或Bacillussubtilis。The invention also includes cultured cells or host cells containing one or more polynucleotide sequences disclosed herein and/or one or more expression vectors disclosed herein. "Cultured cells" as used herein include any cells capable of growing under given conditions and expressing one or more polypeptides encoded by the polynucleotides of the present invention. Preferably, the cultured cells are yeast, fungi, bacteria or algae. More preferably, the cultured cells are Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris, Aspergillus niger or Bacillus subtilis.

在另一种实施方式中，本发明提供了从马肝脏、A.niger ATCC 9142、A.niger awamori ATCC 38854或A.niger MacRae ATCC 46951中获得的泛酸内酯水解酶。In another embodiment, the present invention provides pantothenate lactone hydrolase obtained from horse liver, A. niger ATCC 9142, A. niger awamori ATCC 38854 or A. niger MacRae ATCC 46951.

在又一种实施方式中，本发明提供了一种泛酸内酯水解酶，其氨基酸序列包含：当来自马肝脏时，SEQ ID NO：2所示的氨基酸序列；当来自A.niger MacRae ATCC 46951时，SEQ ID NO：6所示的氨基酸序列；当来自A.niger ATCC 9142时，SEQ ID NO：8所示的氨基酸序列；当来自A.niger awamori ATCC 38854时，SEQ ID NO：10所示的氨基酸序列。In yet another embodiment, the present invention provides a pantothenate lactone hydrolase, the amino acid sequence of which comprises: when derived from horse liver, the amino acid sequence shown in SEQ ID NO: 2; when derived from A. niger MacRae ATCC 46951 When from A. niger ATCC 9142, the amino acid sequence shown in SEQ ID NO: 8; when from A.niger awamori ATCC 38854, the amino acid sequence shown in SEQ ID NO: 10 amino acid sequence.

SEQ ID NO：2所示的来自马肝脏的和SEQ ID NO：4所示的来自Bacillus subtilis的泛酸内酯水解酶显示S-泛酸内酯水解酶活性。SEQ IDNO：8所示的来自A.niger ATCC 9142的、SEQ ID NO：10所示的来自A.niger awamori ATCC 38854的和部分如SEQ ID NO：6所示的来自A.nigerMacRae ATCC 46951的泛酸内酯水解酶显示R-泛酸内酯水解酶活性。Pantolactone hydrolase from horse liver as shown in SEQ ID NO: 2 and from Bacillus subtilis as shown in SEQ ID NO: 4 exhibits S-pantolactone hydrolase activity. Pantothenic acid from A. niger ATCC 9142 as shown in SEQ ID NO: 8, from A. niger awamori ATCC 38854 as shown in SEQ ID NO: 10 and partially from A. nigerMacRae ATCC 46951 as shown in SEQ ID NO: 6 The lactone hydrolase exhibits R-pantolactone hydrolase activity.

在另一种实施方式中，本发明包含上述氨基酸序列的片段，所述片段具有完整蛋白质的酶活性。In another embodiment, the present invention comprises a fragment of the above amino acid sequence, which fragment has the enzymatic activity of the complete protein.

根据本发明的“氨基酸序列的片段”可被定义为缺少一段氨基酸的序列，该段氨基酸对蛋白质正确折叠来说可能是必要的。然而，一旦蛋白质被折叠，该段氨基酸就可被删除，而不会破坏单个蛋白质的功能。这些段可以存在于被折叠的蛋白质中，例如环，或者可以是与被折叠的蛋白质的活性不直接相关的N-或C-末端序列。A "fragment of an amino acid sequence" according to the present invention may be defined as a sequence lacking a stretch of amino acids which may be essential for the correct folding of a protein. However, once the protein is folded, this stretch of amino acids can be deleted without disrupting the individual protein's function. These segments may be present in the folded protein, such as loops, or may be N- or C-terminal sequences not directly related to the activity of the folded protein.

此外，本发明包括包含所述R-和S-泛酸内酯水解酶活性的蛋白质的衍生物。所述衍生物包括被固定的酶，例如通过内含(inclusion)被固定，其中所述的酶被保留于膜装置上，例如中空纤维、聚合网状结构或微囊体。还可能通过交联或通过产生酶晶体来对酶进行固定。另一种对酶进行固定的可能方法是将其结合到正在生长的多聚物上，例如二氧化硅溶胶凝胶或二氧化硅石墨溶胶凝胶，或者通过将酶结合到预制载体上，例如EupergitC、Eupergit 250L、PEG、Celite和Amberlite XAD-7。对本发明的R-和S-泛酸内酯水解酶适用的另一种固定酶的技术是将其分散在与水无法混溶的有机溶剂中。通常，通过本领域内已知的几乎所有固定技术，可对具有酶活性的蛋白质，例如本发明的酶进行固定。Furthermore, the present invention includes derivatives of proteins comprising said R- and S-pantolactone hydrolase activity. Said derivatives include immobilized enzymes, for example by inclusion, wherein said enzymes are retained on membrane devices, such as hollow fibers, polymeric networks or microcapsules. It is also possible to immobilize the enzyme by cross-linking or by creating enzyme crystals. Another possibility for the immobilization of enzymes is to bind them to growing polymers, such as silica sol-gels or silica-graphite sol-gels, or by binding enzymes to prefabricated supports such as Eupergit C, Eupergit 250L, PEG, Celite, and Amberlite XAD-7. Another enzyme immobilization technique suitable for the R- and S-pantolactone hydrolases of the present invention is to disperse them in a water-immiscible organic solvent. In general, proteins with enzymatic activity, such as the enzymes of the invention, can be immobilized by almost any immobilization technique known in the art.

在又一种实施方式中，本发明涉及用经分离的泛酸内酯水解酶选择性水解R-或S-泛酸内酯水解酶的用途，这可用于对泛酸内酯的R-对映异构体的工业生产，泛酸内酯的R-对映异构体是生产R-泛酸和R-泛酰醇的前体。In yet another embodiment, the present invention relates to the use of the isolated pantothenolactone hydrolase for the selective hydrolysis of R- or S-pantolactone hydrolase, which can be used for the R-enantiomerization of pantolactone The R-enantiomer of pantolactone is the precursor for the production of R-pantothenic acid and R-panthenol.

因此，本发明的一个目的是提供用本文所述的经分离的泛酸内酯水解酶来对R-或S-泛酸内酯进行选择性水解的方法或途径。Accordingly, it is an object of the present invention to provide a method or route for the selective hydrolysis of R- or S-pantolactone using the isolated pantolactone hydrolase described herein.

在另一种实施方式中，本发明提供了生产R-泛酸或其盐或R-泛酰醇的方法，其中包括在本发明的泛酸内酯水解酶存在下对R-或S-泛酸内酯进行选择性水解的步骤。In another embodiment, the present invention provides a method for producing R-pantothenic acid or a salt thereof or R-panthenol, which comprises degrading R- or S-pantothenolactone in the presence of the pantothenolactone hydrolase of the present invention A step of selective hydrolysis is carried out.

在另一种实施方式中，本发明提供了克隆与已知序列直接相邻的核苷酸序列的方法，例如编码新颖的泛酸内酯水解酶或其部分的核苷酸序列。In another embodiment, the present invention provides a method for cloning a nucleotide sequence immediately adjacent to a known sequence, such as a nucleotide sequence encoding a novel pantolactone hydrolase or a portion thereof.

具体而言，本发明包括能扩增未知序列的新的PCR策略。所述新的PCR策略包括在第一条和第二条DNA链的合成中使用同一种引物。该引物能与将被扩增的gDNA的已经已知的区域杂交。该引物显示与已知序列100％的同源性。在该引物的帮助下，进行第一次PCR，提供基因组DNA的第一条单链拷贝，到达未知序列。用一部分第一次PCR反应产物作为模板来进行第二次PCR，其中使用的(嵌套)引物能与定位于第一次PCR中所用的引物的3’方向上的DNA片段杂交，但逻辑上仍处于序列的已知部分。该嵌套引物与第一次聚合酶反应产生的单链DNA进行随机杂交之后，产生该单链DNA的反义链，其自身作为用于第二次PCR、产生第二条链的同样的引物的模板。在最后，起始于已知序列内部，延伸于邻近的基因组DNA或延伸于缺失部分序列的任何其它DNA的DNA片段通过与新片段两端都能杂交的一种引物被制造出来。Specifically, the present invention includes novel PCR strategies capable of amplifying unknown sequences. The new PCR strategy involves using the same primers for the synthesis of the first and second DNA strands. The primers hybridize to already known regions of the gDNA to be amplified. This primer showed 100% homology to the known sequence. With the help of this primer, the first PCR is performed, providing the first single-stranded copy of the genomic DNA, reaching the unknown sequence. Using a portion of the product of the first PCR reaction as a template to perform a second PCR using (nested) primers that hybridize to DNA fragments positioned 3' to the primers used in the first PCR, but logically Still in a known part of the sequence. After random hybridization of this nested primer with the single-stranded DNA produced by the first polymerase reaction, the antisense strand of this single-stranded DNA is generated, which itself serves as the same primer used in the second PCR to generate the second strand template. In the end, DNA fragments starting within the known sequence, extending to adjacent genomic DNA or to any other DNA with missing parts of the sequence, are produced with a primer that hybridizes to both ends of the new fragment.

因此，在另一种实施方式中，本发明提供了克隆与已知序列直接相邻的核苷酸序列的方法，所述方法包括PCR扩增，所述扩增包括：Accordingly, in another embodiment, the present invention provides a method of cloning a nucleotide sequence immediately adjacent to a known sequence, said method comprising PCR amplification comprising:

a)用第一次PCR的单链DNA作为模板进行第二次PCR，在所述第二次PCR中用能与第一次PCR产生的单链DNA随机杂交的引物；以及a) using the single-stranded DNA of the first PCR as a template to perform a second PCR in which primers capable of randomly hybridizing to the single-stranded DNA produced by the first PCR are used; and

b)由此，仍用步骤a)中的随机杂交引物，产生自身作为第二条链模板的反义链。b) Thus, still using the random hybridization primers from step a), an antisense strand is generated which itself serves as a template for the second strand.

下述实施例对编码泛酸内酯水解酶基因的鉴定、表征和表达进行了详细的描述。这些实施例仅是示例性的，而非欲以任何方式对本发明的范围加以限制。The following examples describe in detail the identification, characterization and expression of genes encoding pantolactone hydrolase. These examples are illustrative only and are not intended to limit the scope of the invention in any way.

实施例1从可商购的马肝脏酯酶丙酮粉末来纯化(S)-泛酸内酯水解活性Example 1 Purification of (S)-pantolactone hydrolytic activity from commercially available horse liver esterase acetone powder

将1g马肝脏酯酶丙酮粉末(Fluka Holding AG，Industriestrasse 25，P.O.Box 260，CH-9471 Buchs，瑞士)溶于40ml 50mM的Tris/HCl中，pH7.5，其中含有2M硫酸铵、1mM CaCl₂、1mM MgCl₂和10μM EDTA(缓冲液A)。通过离心分离出未溶解的物质，之后，用0.45μM过滤器进行过滤灭菌。将蛋白质溶液上样到用缓冲液A平衡过的丁基琼脂糖柱HR26/10(Amersham Pharmacia Biotech Europe GmbH，Dübendorf，瑞士)中。用100ml缓冲液A洗该柱。之后应用400ml从0至100％线性梯度的缓冲液B(没有硫酸铵的缓冲液A)。内酯酶在梯度中间被洗脱出来。收集显示泛酸内酯水解活性的片断，将缓冲液换为pH8.0的20mMTris/HCl、1mM CaCl₂、1mM MgCl₂以及10μM EDTA。收集的片断上样到MonoQ-预填(pre-packed)柱HR 26/10(Amersham Pharmacia BiotechEurope GmbH，Dübendorf，瑞士)中，应用0至350mM NaCl的线性梯度。蛋白质在该梯度的开始即被洗脱出来。再次收集活性片断，浓缩到1ml，上样至Superdex 200柱(Amersham Pharmacia Biotech Europe GmbH，Dübendorf，瑞士)上。用pH8.0的50mM Tris/HCl、1mM CaCl₂、1mMMgCl₂以及10μM EDTA作为洗脱缓冲液，内酯酶作为最后一个峰被洗脱出来，其显示大约31kDa的分子大小，与SDS-PAGE所示的尺寸相符。1 g of horse liver esterase acetone powder (Fluka Holding AG, Industriestrasse 25, POBox 260, CH-9471 Buchs, Switzerland) was dissolved in 40 ml of 50 mM Tris/HCl, pH 7.5, containing 2 M ammonium sulfate, 1 mM CaCl ₂ , 1 mM MgCl ₂ and 10 μM EDTA (buffer A). Undissolved material was separated by centrifugation, after which filter sterilization was performed with a 0.45 μM filter. The protein solution was loaded onto a Butyl Sepharose column HR26/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) equilibrated with buffer A. The column was washed with 100 ml of buffer A. Then 400 ml of a linear gradient from 0 to 100% of buffer B (buffer A without ammonium sulfate) was applied. Lactonase was eluted in the middle of the gradient. Fragments showing pantolactone hydrolytic activity were collected, and the buffer was exchanged with 20 mM Tris/HCl, pH 8.0, 1 mM CaCl ₂ , 1 mM MgCl ₂ , and 10 μM EDTA. The pooled fragments were loaded onto a MonoQ-pre-packed column HR 26/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) applying a linear gradient of 0 to 350 mM NaCl. Proteins were eluted at the beginning of the gradient. The active fragments were collected again, concentrated to 1 ml, and loaded onto a Superdex 200 column (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). Using 50 mM Tris/HCl, pH 8.0, 1 mM CaCl ₂ , 1 mM MgCl ₂ , and 10 μM EDTA as the elution buffer, lactonase was eluted as the last peak, which showed a molecular size of about 31 kDa, which was consistent with that of SDS-PAGE. match the dimensions shown.

通过在37℃将酶于pH9.0的200mM Tris/乙酸缓冲液、5mM MgCl₂中培养60分钟进行测定，发现最高比活性为13.3U/(mg蛋白质)。通过HPLC来测定(S)-和(R)-泛酸内酯的浓度。The highest specific activity was found to be 13.3 U/(mg protein) as determined by incubating the enzyme in 200 mM Tris/acetate buffer, pH 9.0, 5 mM MgCl ₂ at 37° C. for 60 minutes. (S)- and (R)-pantolactone concentrations were determined by HPLC.

实施例2对来自马肝脏的(S)-泛酸内酯水解酶进行初步的表征Example 2 Preliminary characterization of (S)-pantolactone hydrolase from horse liver

(a)分子大小、结构以及等电点：(a) Molecular size, structure and isoelectric point:

根据SDA-PAGE，变性蛋白具有大约35kDa的分子量。用经校正的凝胶过滤柱的洗脱时间，天然蛋白质的分子量被计算为大约31kDa。这意味着来自马肝脏的内酯酶是30至35kDa的单体。用2D凝胶，等电点被测定为pH5.5的范围内。According to SDA-PAGE, the denatured protein has a molecular weight of approximately 35 kDa. Using corrected elution times from the gel filtration column, the molecular weight of the native protein was calculated to be approximately 31 kDa. This means that lactonase from horse liver is a 30 to 35 kDa monomer. With 2D gels, the isoelectric point was determined to be in the range of pH 5.5.

(b)N-末端和内部氨基酸测序：(b) N-terminal and internal amino acid sequencing:

用胰蛋白酶对2D凝胶电泳之后富集的蛋白质进行消化。通过质谱对产生的肽进行分析。SEQ NO：11中示出了肽No.66的N-末端序列。Proteins enriched after 2D gel electrophoresis were digested with trypsin. The resulting peptides were analyzed by mass spectrometry. The N-terminal sequence of peptide No. 66 is shown in SEQ NO: 11.

该序列与MS-谱一起显示，该蛋白质属于已知为衰老标记蛋白-30或钙调蛋白(regucalcins)的蛋白质家族，其主要被发现于脊椎动物中。来自大鼠、小鼠、人、鸡、兔、牛、Xenopus laevis的钙调蛋白序列，以及来自Drosophila melanogaster的两条较少同源性的序列已被测定。然而，来自马的该基因还未被分离及测序。The sequence, together with the MS-spectrum, revealed that the protein belongs to a family of proteins known as senescence marker protein-30 or regucalcins, which are mainly found in vertebrates. Calmodulin sequences from rat, mouse, human, chicken, rabbit, bovine, Xenopus laevis, and two less homologous sequences from Drosophila melanogaster have been determined. However, this gene from horses has not been isolated and sequenced.

(c)金属依赖性：(c) Metal dependence:

为测定不同二价金属对马肝脏酶的泛酸内酯酶活性的影响，将1.3、2.6、5.2或10.4mM的Ca²⁺、Mg²⁺或Mn²⁺加入到由pH9.0的200mM Tris/乙酸盐和50mM(R，S)-泛酸内酯组成的反应混合物中。在37℃对反应混合物进行60分钟的培养，用等体积的含有2mM EDTA的甲醇来终止反应。通过HPLC测定(S)-泛酸内酯的减少。钙令酶活性降低，而镁和锰则使酶活性增高。To determine the effect of different divalent metals on the pantothenate lactonase activity of horse liver enzymes, 1.3, 2.6, 5.2 or 10.4 mM Ca ²⁺ , Mg ²⁺ or Mn ²⁺ were added to 200 mM Tris/ Acetate and 50mM (R, S)-pantothenolactone in the reaction mixture. The reaction mixture was incubated at 37°C for 60 minutes and terminated with an equal volume of methanol containing 2 mM EDTA. The reduction of (S)-pantolactone was determined by HPLC. Calcium reduces enzyme activity, while magnesium and manganese increase enzyme activity.

(d)热稳定性：(d) thermal stability:

在30至80℃的温度下，将经纯化的酶在5.2mM MgCl₂和pH9.0的200mM Tris/乙酸盐中培养15分钟。冰上30分钟之后，用标准试验方法来测定活性(60分钟，37℃)。直到60℃酶依然是稳定的。其在65℃开始失活，在70℃丢失超过50％的活性。The purified enzyme was incubated in 5.2 mM MgCl ₂ and 200 mM Tris/acetate pH 9.0 at a temperature of 30 to 80° C. for 15 minutes. After 30 minutes on ice, activity was determined using standard assay methods (60 minutes, 37°C). The enzyme remains stable up to 60°C. It starts to inactivate at 65°C and loses more than 50% of its activity at 70°C.

实施例3克隆来自马肝脏的(S)-泛酸内酯酶Example 3 Cloning of (S)-pantothenate lactonase from horse liver

用来自所有已知哺乳动物基因(见下文)的序列信息来设计引物，用于针对从来自Bavaria的一年龄“Hafflinger”的马肝脏制得的cDNA进行的PCR，所述cDNA由Roche Molecular Biochemicals，Penzberg，德国友情提供。用QIAGEN RNeasy Maxi Kit(Qiagen，Hilden，德国)来制备来自肝脏组织的总RNA。Sequence information from all known mammalian genes (see below) was used to design primers for PCR against cDNA prepared from a one-year-old "Hafflinger" horse liver from Bavaria supplied by Roche Molecular Biochemicals, Courtesy of Penzberg, Germany. Total RNA from liver tissue was prepared with the QIAGEN RNeasy Maxi Kit (Qiagen, Hilden, Germany).

马肝脏内酯酶与钙调蛋白家族其它成员之间的同源性显示于表1中。测定是通过GCG程序包中的GAP程序，使用标准参数来测定的。相似性的数字显示于表中对角线的右边，而相同性的数字显示于对角线左边。The homology between horse liver lactonase and other members of the calmodulin family is shown in Table 1. Assays were performed by the GAP program in the GCG package, using standard parameters. Numbers for similarity are shown to the right of the diagonal in the table, and numbers for identity are shown to the left of the diagonal.

表1：钙调蛋白家族中的氨基酸序列相似性和相同性Table 1: Amino acid sequence similarities and identities within the calmodulin family

马 horse 牛 ox 兔 rabbit 人 people 大鼠rat 小鼠mouse 鸡 chicken X.laevX.laev D.melaD.mela 马 horse -- 92.692.6 92.092.0 90.690.6 88.388.3 88.388.3 84.384.3 77.977.9 48.848.8 牛 ox 91.091.0 -- 90.390.3 93.093.0 90.390.3 90.090.0 83.683.6 76.676.6 48.948.9 兔 rabbit 90.690.6 89.389.3 -- 91.091.0 87.687.6 87.687.6 82.682.6 77.677.6 47.747.7 人 people 89.689.6 91.391.3 89.389.3 -- 90.090.0 90.690.6 82.382.3 76.976.9 48.448.4 大鼠rat 86.086.0 87.387.3 85.085.0 88.388.3 -- 95.795.7 80.980.9 77.977.9 49.349.3 小鼠mouse 86.386.3 87.387.3 85.385.3 88.688.6 94.094.0 -- 81.681.6 77.377.3 50.050.0 鸡 chicken 77.977.9 77.977.9 76.676.6 77.977.9 75.975.9 75.975.9 -- 81.981.9 48.948.9 X.laevX.laev 70.970.9 69.969.9 71.271.2 70.270.2 71.671.6 70.270.2 73.973.9 -- 47.547.5 D.melaD.mela 36.836.8 37.037.0 37.237.2 37.537.5 37.037.0 38.438.4 36.336.3 37.037.0 --

鸡：鸡；X.laev：Xenopus laevis；D.mela：Drosophila melanogasterChicken: Chicken; X.laev: Xenopus laevis; D.mela: Drosophila melanogaster

用Titan One Tube RT-PCT Kit(Roche Molecular Biochemicals，Penzberg，德国)和覆盖了起始密码子及终止密码子上游50bp序列的同源引物，利用RT-PCT扩增出885bp的片段(SEQ ID NO：1的1-885位核苷酸)。反应按照厂商推荐的方法进行。Using Titan One Tube RT-PCT Kit (Roche Molecular Biochemicals, Penzberg, Germany) and homologous primers covering the upstream 50bp sequence of the start codon and stop codon, a fragment of 885bp was amplified by RT-PCT (SEQ ID NO : 1-885 nucleotides of 1). The reaction was carried out according to the method recommended by the manufacturer.

用于第一次PCR的N-末端5’引物(SEQ ID NO：12)和C-末端3’-引物(3’-CAGTTTCCTTAAGGGGGGATA-5’)(SEQ ID NO：13)的构建基于来自牛(SEQ ID NO：14)、兔(SEQ ID NO：15)、人(SEQ IDNO：16)、大鼠(SEQ ID NO：17)、小鼠(SEQ ID NO：18)、鸡(SEQ ID NO：19)和Xenopus(SEQ ID NO：20)的N-末端序列以及来自牛(SEQ ID NO：21)、兔(SEQ ID NO：22)、人(SEQ ID NO：23)、大鼠(SEQ ID NO：24)、小鼠(SEQ ID NO：25)、鸡(SEQ IDNO：26)和Xenopus(SEQ ID NO：27)的C-末端序列。The N-terminal 5' primer (SEQ ID NO: 12) and the C-terminal 3'-primer (3'-CAGTTTCCTTAAGGGGGGATA-5') (SEQ ID NO: 13) for the first PCR were constructed based on SEQ ID NO: 14), rabbit (SEQ ID NO: 15), human (SEQ ID NO: 16), rat (SEQ ID NO: 17), mouse (SEQ ID NO: 18), chicken (SEQ ID NO: 19) and the N-terminal sequence of Xenopus (SEQ ID NO: 20) and from bovine (SEQ ID NO: 21), rabbit (SEQ ID NO: 22), human (SEQ ID NO: 23), rat (SEQ ID NO: 24), mouse (SEQ ID NO: 25), chicken (SEQ ID NO: 26) and the C-terminal sequence of Xenopus (SEQ ID NO: 27).

将该片段克隆到TA克隆载体(Invitrogen，Carlsberg，CA，USA)中。测序证明其来自马肝脏钙调蛋白。通过被称为5’/3’RACE的方法，用Roche Molecular Biochemicals，Penzberg，德国提供的单独的试剂盒，分离出缺失的5’-末端：用完美匹配的引物(SEQ ID NO：28)从经分离的马肝脏cDNA生产单链DNA。纯化该单链DNA，加上polyC尾巴，用第一条引物5’端的嵌套引物、polyG引物来进行另一次PCR。代表着基因的缺失的5’-末端的DNA片段被扩增出来。用根据已经已知的序列设计的正向引物(SEQ ID NO：29)和主要根据牛和兔基因设计的起始于基因最后一个氨基酸的引物(SEQ ID NO：30)来克隆出3’-末端。在最后一次PCR中，用获得的序列信息来设计覆盖基因起始和终止密码子的5’-和3’-引物，按照上文所述从总RNA中分离出内酯酶的全长cDNA。该cDNA和氨基酸序列展示于SEQ ID NO：1和SEQ ID NO：2中。该基因由900bp/299个氨基酸构成。This fragment was cloned into the TA cloning vector (Invitrogen, Carlsberg, CA, USA). Sequencing proved it to be from horse liver calmodulin. The missing 5'-end was isolated by a method called 5'/3'RACE with a separate kit provided by Roche Molecular Biochemicals, Penzberg, Germany: with a perfectly matched primer (SEQ ID NO: 28) from Single-stranded DNA was produced from isolated horse liver cDNA. The single-stranded DNA is purified, polyC tails are added, and another PCR is performed using the nested primer at the 5' end of the first primer, the polyG primer. A DNA fragment representing the deleted 5'-end of the gene is amplified. The 3'- end. In the final PCR, the obtained sequence information was used to design 5'- and 3'-primers covering the start and stop codons of the gene, and the full-length cDNA of lactonase was isolated from the total RNA as described above. The cDNA and amino acid sequences are shown in SEQ ID NO: 1 and SEQ ID NO: 2. The gene consists of 900bp/299 amino acids.

用一种商业途径可获得的表达载体，例如来自Qiagen的pQE60，在E.coli中表达完整的基因，其C-末端与His标签融合。按照标准程序对加上His标签的蛋白质进行表达和纯化，产生出显示(S)-泛酸内酯酶活性的33kDa的蛋白质。A commercially available expression vector, eg pQE60 from Qiagen, was used to express the complete gene in E. coli with a His-tag fused at its C-terminus. Expression and purification of the His-tagged protein following standard procedures yielded a 33 kDa protein exhibiting (S)-pantolactonase activity.

实施例4由可商购的A.niger脂肪酶A粗制剂来纯化(R)-泛酸内酯水解活Example 4 Purification of (R)-pantolactone hydrolysis activity by commercially available A.niger lipase A crude preparation 性sex

在可从商业途径获得的不同种的由A.niger发酵获得的酶产品中，鉴定出了特异性水解(R)-泛酸内酯的活性。为纯化具体的酶，使用了来自Amano Pharmaceuticals Co.Ltd.，Nagoya，Japan的被称为Lipase A“Amano”的商业制剂。根据从与内酯酶共同纯化的两个酶获得的序列数据，该制剂来自经冻干和破碎的过量表达来自A.tubigensis的脂肪酶的A.awamori细胞。将该材料溶于pH7.5的20mM Tris/HCl、2.5mM的MgCl₂(标准缓冲液)中。加入5％的乙醇，以增加材料的溶解度。通过离心(30分钟，15000rpm，4℃)以及接着的过滤(用0.45μm的过滤器)，分离出无法进入到溶液中的材料。先在Amicon cell超滤器(50kDa分离点)中通过超滤对已溶解的部分进行第一次纯化，以去除任何盐类和小分子化合物，它们会扰乱后续的阴离子交换色谱步骤。然后将蛋白质上样至HR26/10预填Q-琼脂糖凝胶柱(Amersham Pharmacia Biotech EuropeGmbH，Dübendorf，瑞士)。用200ml标准缓冲液洗柱，然后通过0至350mM NaCl的线性梯度(400ml)洗脱出蛋白质。含有活性的片断被收集起来。加入硫酸铵，至终浓度为1.5M，将经过处理的蛋白质溶液上样至HR 26/10丁基琼脂糖凝胶预填柱(Amersham Pharmacia Biotech EuropeGmbH，Dübendorf，瑞士)中。用200ml含有1.5M硫酸铵的标准缓冲液洗涤，通过1.5至0M递减的硫酸铵梯度(400ml)分离出蛋白质。此时活性在两个峰被洗脱出，并被分别收集。两次收集物的体积被减少到小于2ml。用标准缓冲液，通过在Superdex 200柱上进行凝胶过滤分离出两种浓缩物。每种分离物的活性片段被收集和浓缩。收集物II的活性趋向于稍早于收集物I被洗脱出来，这表明，相应蛋白质的分子量可能稍高。Among the different commercially available enzyme products obtained from the fermentation of A. niger, specific hydrolysis of (R)-pantothenolactone was identified. For purification of specific enzymes, a commercial preparation known as Lipase A "Amano" from Amano Pharmaceuticals Co. Ltd., Nagoya, Japan was used. The preparation was obtained from lyophilized and disrupted A. awamori cells overexpressing lipase from A. tubigensis based on sequence data obtained from two enzymes co-purified with lactonase. This material was dissolved in 20 mM Tris/HCl, pH 7.5, 2.5 mM _MgCl2 (standard buffer). Add 5% ethanol to increase the solubility of the material. Material that did not go into solution was separated by centrifugation (30 min, 15000 rpm, 4°C) followed by filtration (with a 0.45 μm filter). The dissolved fraction was first purified by ultrafiltration on an Amicon cell ultrafilter (50kDa cut point) to remove any salts and small molecules that would disturb the subsequent anion exchange chromatography step. Proteins were then loaded onto HR26/10 prepacked Q-Sepharose columns (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). The column was washed with 200 ml of standard buffer, and then the protein was eluted by a linear gradient (400 ml) of 0 to 350 mM NaCl. Fragments containing activity are pooled. Ammonium sulfate was added to a final concentration of 1.5M, and the treated protein solution was loaded onto a HR 26/10 butyl sepharose prepacked column (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland). Proteins were separated by washing with 200 ml of standard buffer containing 1.5 M ammonium sulfate and a gradient of decreasing ammonium sulfate from 1.5 to 0 M (400 ml). At this point the activity elutes in two peaks which are collected separately. The volume of the two pools was reduced to less than 2ml. The two concentrates were separated by gel filtration on a Superdex 200 column with standard buffer. Active fragments from each isolate were pooled and concentrated. The activity of pool II tended to elute slightly earlier than pool I, suggesting that the corresponding protein may be of slightly higher molecular weight.

80U/ml(40℃)是针对收集物I和收集物II分别已测出的最高活性。如2D凝胶所示，蛋白质制剂不完全是同源的，这表示真正的比活性甚至会更高。然而，因为内酯酶活性对应于凝胶过滤洗脱曲线中38kDa的蛋白质条带，也是最终制剂中占优势的蛋白质，该条带最有可能是内酯酶。根据凝胶过滤的结果，天然内酯酶具有75kDa的分子量。这暗示着该酶的同型二聚体结构。80 U/ml (40°C) is the highest activity that has been measured for pool I and pool II respectively. As shown by the 2D gels, the protein preparations were not completely homogeneous, suggesting that the true specific activity could be even higher. However, since lactonase activity corresponds to the 38 kDa protein band in the gel filtration elution profile and is the predominant protein in the final preparation, this band is most likely lactonase. According to the results of gel filtration, the natural lactonase has a molecular weight of 75 kDa. This suggests a homodimeric structure of the enzyme.

实施例5对(R)-泛酸内酯水解酶进行初步的表征Example 5 Preliminary characterization of (R)-pantolactone hydrolase

(a)天然形式的结构和分子量：(a) Structure and molecular weight of native form:

用如实施例4所述的条件，在72ml处，用Superdex 200(AmershamPharmacia Biotech Europe GmbH，Dübendorf，瑞士)洗脱出酶。从标准曲线推断，这反映了75kDa的分子量。SDS-PAGE表明分子量为38kDa。上述数据的组合暗示来自A.niger的(R)-泛酸内酯酶是由两个相同的38kDa亚基组成的同型二聚体。The enzyme was eluted with Superdex 200 (AmershamPharmacia Biotech Europe GmbH, Dübendorf, Switzerland) at 72 ml using conditions as described in Example 4. Extrapolated from the standard curve, this reflects a molecular weight of 75 kDa. SDS-PAGE showed a molecular weight of 38 kDa. The combination of the above data suggests that the (R)-pantolactonase from A. niger is a homodimer composed of two identical 38 kDa subunits.

(b)热稳定性：(b) thermal stability:

在pH 7.5的20mM Tris/HCl中于0、30、40、50、55、60、70℃对50μl经稀释的酶(0.1U)进行一小时的培养。冰上15分钟之后，用标准试验方法测定残余活性。直到50℃酶都是稳定的。在更高的温度下，其开始失活。50 μl of diluted enzyme (0.1 U) were incubated for one hour at 0, 30, 40, 50, 55, 60, 70°C in 20 mM Tris/HCl, pH 7.5. After 15 minutes on ice, residual activity was determined using standard assay methods. The enzyme is stable up to 50°C. At higher temperatures, it begins to deactivate.

(c)最佳温度：(c) Optimum temperature:

采用下述条件，于20、30、40、50、55和60℃，对来自商业来源(见实施例5)的强烈富集的(R)-泛酸内酯酶进行30分钟的培养：Strongly enriched (R)-pantolactonase from a commercial source (see Example 5) was incubated for 30 minutes at 20, 30, 40, 50, 55 and 60°C using the following conditions:

100mM(R，S)-泛酸内酯100mM (R, S)-pantothenate

100mM MgCl₂ 100mM _MgCl2

200mM Tris/HCl，pH 8.0200mM Tris/HCl, pH 8.0

0.05U(R)-泛酸内酯酶，最终体积为10μl。0.05 U(R)-pantolactonase in a final volume of 10 μl.

在所用的条件下，来自商业化A.niger制剂的(R)-泛酸内酯酶的最佳温度为大约50℃。The temperature optimum for (R)-pantolactonase from a commercial A. niger preparation was about 50°C under the conditions used.

(d)辅助因子依赖性和抑制：(d) Cofactor dependence and inhibition:

来自A.niger的(R)-泛酸内酯酶的活性需要Mg²⁺或Mn²⁺。加入1mMEDTA会完全抑制其活性。The activity of (R)-pantolactonase from A. niger requires Mg ²⁺ or Mn ²⁺ . Adding 1mM EDTA will completely inhibit its activity.

(e)最佳pH：(e) optimal pH:

当pH在7.0至11.0之间变化时，该酶并未显示出“正常的”pH-活性曲线。在40℃，对200mM TEA-乙酸盐缓冲液(pH7.0至11.0)中的10mg Amano Lipase A进行20分钟的培养，所述缓冲液含有0.1％Span 80，20mM乙酸镁。加入等体积的500mM MES缓冲液来终止样品的反应，所述缓冲液pH为5.5，含有50mM EDTA；对样品进行离心(10,000g，10分钟)，然后通过HPLC进行分析，其中，用0.46×15cm CHIRADEX柱，用70：30的水-甲醇来洗脱，1ml/分钟，20℃。最高活性在pH为大约10时达到。从这个位置到pH11.0之间，活性并无可观的改变。最高选择性在pH大约9.0时达到。The enzyme did not show a "normal" pH-activity profile when the pH was varied between 7.0 and 11.0. 10 mg Amano Lipase A in 200 mM TEA-acetate buffer (pH 7.0 to 11.0) containing 0.1% Span 80, 20 mM magnesium acetate was incubated for 20 minutes at 40°C. Add an equal volume of 500mM MES buffer to stop the reaction of the sample, the pH of the buffer is 5.5, containing 50mM EDTA; the sample is centrifuged (10,000g, 10 minutes), and then analyzed by HPLC, wherein, with 0.46 × 15cm CHIRADEX column, eluted with 70:30 water-methanol, 1ml/min, 20°C. The highest activity is reached at a pH of about 10. There was no appreciable change in activity from this position to pH 11.0. The highest selectivity is achieved at about pH 9.0.

实施例6培养A.niger ATCC9142、9029、26875、62863和10864，并对Embodiment 6 cultivates A.niger ATCC9142, 9029, 26875, 62863 and 10864, and to 其(R)-泛酸内酯酶活性进行评测Its (R)-pantothenic acid lactonase activity was evaluated

为获知内酯酶活性是否在Aspergillus属中广泛存在，在下述条件下培养五个A.niger菌株：To see if lactonase activity is widespread in the Aspergillus genus, five A. niger strains were grown under the following conditions:

培养基(相对于每升的量)：1.2g NaNO₃、0.5g KH₂PO₄、0.2gMgSO₄ x 7H₂O、0.5g酵母提取物、5％葡萄糖和0.04ml的痕量金属溶液，所述溶液如Vishniac和Santer，Bacteriol.Rev.21：195-213(1957)所述。Medium (relative to volume per liter): 1.2g NaNO ₃ , 0.5g KH ₂ PO ₄ , 0.2g MgSO ₄ x 7H ₂ O, 0.5g yeast extract, 5% glucose and 0.04ml trace metal solution, all Such solutions are described by Vishniac and Santer, Bacteriol. Rev. 21:195-213 (1957).

用每ml 5×10⁶个孢子于30℃去接种11瓶中的300ml预培养物。8小时后，将预培养物加入到21发酵罐中的1.71培养基中，发酵罐带有pH和溶解氧控制。通过自动加入5M NaOH将pH保持在5.5。在低通气条件(5-10％的空气饱和度)下对细胞进行14小时的培养。之后将溶解氧水平增加至30-50％空气饱和度。5小时后，收获菌丝体，对其进行过滤，用去矿物质水洗两次，冷冻于液氮中。用French Press对其中的一部分直接进行裂解。用标准试验方法对培养基和细胞裂解液进行针对泛酸内酯酶活性的研究。A 300 ml preculture in 11 flasks was inoculated at 30°C with 5 x 10 ⁶ spores per ml. After 8 hours, the preculture was added to 1.71 medium in a 21 fermenter with pH and dissolved oxygen control. The pH was maintained at 5.5 by automatic addition of 5M NaOH. Cells were incubated for 14 hours under hypoaerobic conditions (5-10% air saturation). The dissolved oxygen level was then increased to 30-50% air saturation. After 5 hours, the mycelia were harvested, filtered, washed twice with demineralized water, and frozen in liquid nitrogen. A part of them was directly lysed with a French Press. Media and cell lysates were investigated for pantothenate activity using standard assay methods.

(R)-泛酸内酯酶活性主要被发现于所有被测试的A.niger菌株的裂解液中。在培养基中仅发现了非常小的活性。(R)-Pantolactonase activity was mainly found in the lysates of all tested A. niger strains. Only very little activity was found in the medium.

实施例7对来自脂肪酶制剂的经富集的(R)-泛酸内酯酶进行胰蛋白酶消化Example 7 Trypsin Digestion of Enriched (R)-Pantolactonase from a Lipase Preparation 产生的小肽的氨基酸序列进行测定The amino acid sequence of the resulting small peptide was determined

在2D凝胶上对实施例5的经过富集的制剂进行分析。用胰蛋白酶对两个主要的点进行消化，其分别是33和40kDa，4.7和5.6的pI。通过HPLC来分离产生的肽(设备：Hewlett Packard 1090，柱：Vydac C18(0.21×25cm)，210nm处吸收，流速：0.20ml/分钟，缓冲液A：0.075％TFA，缓冲液B：80％乙腈中的0.065％TFA，梯度：2％B(0-10分钟)，2-75％B(10-120分钟))。通过Edman降解法来对选出的肽进行测序，点C[pI 5.6，39kDa(Poll C)]：Fxn 42(肽1)(SEQ ID NO：81)、Fxn 57(肽3)(SEQ IDNO：82)、Fx 73(肽2)(SEQ ID NO：83)、Fx 74(SEQ ID NO：84)和Fx 81(SEQ ID NO：85)，点C[pI 4.7，33kDa(Poll A/B)]：N末端(SEQ ID NO：86)、Fx 58(SEQ ID NO：87)、Fx 64(SEQ ID NO：88)、Fx 65(SEQ IDNO：89)、Fx 73(SEQ ID NO：90)和Fx 68(SEQ ID NO：91)。X代表未确定的氨基酸，圆括号中的氨基酸是无法被明确确定的。The enriched formulation of Example 5 was analyzed on a 2D gel. Two major spots, 33 and 40 kDa, pi of 4.7 and 5.6, respectively, were digested with trypsin. The resulting peptides were separated by HPLC (equipment: Hewlett Packard 1090, column: Vydac C18 (0.21×25 cm), absorption at 210 nm, flow rate: 0.20 ml/min, buffer A: 0.075% TFA, buffer B: 80% acetonitrile 0.065% TFA in , gradient: 2% B (0-10 min), 2-75% B (10-120 min)). The selected peptides were sequenced by Edman degradation method, point C [pI 5.6, 39kDa (Poll C)]: Fxn 42 (peptide 1) (SEQ ID NO: 81), Fxn 57 (peptide 3) (SEQ ID NO: 82), Fx 73 (peptide 2) (SEQ ID NO:83), Fx 74 (SEQ ID NO:84) and Fx 81 (SEQ ID NO:85), point C [pI 4.7, 33kDa (Poll A/B) ]: N-terminal (SEQ ID NO: 86), Fx 58 (SEQ ID NO: 87), Fx 64 (SEQ ID NO: 88), Fx 65 (SEQ ID NO: 89), Fx 73 (SEQ ID NO: 90) and Fx 68 (SEQ ID NO: 91). X represents an unidentified amino acid, and the amino acid in parentheses cannot be clearly identified.

用所有短肽序列作为目标进行数据库搜索，没有得到可靠的信息能帮助确定两个主要的点中的哪一个代表着内酯酶。因此，我们利用一套阴离子交换色谱对实施例5的内酯酶制剂进行进一步的富集。在预填MonoQ柱HR 16/10(Amersham Pharmacia Biotech Europe GmbH，Dübendorf，瑞士)上，0至350mM NaCl梯度下，对蛋白质溶液(在pH 7.5的20mMTris/HCl、2.5mM MgCl₂中)进行分离。对所有片断的活性进行测定。按照厂商(Invitrogen，Carlsbad，CA，USA)所述进行等电聚焦以及SDS-PAGE，对活性片断进行分析。比较SDS-PAGE凝胶和IEF得出的片断活性数据，暗示内酯酶是38至40kDa的蛋白质，pI在pH 5.6左右。该数据与点PollC吻合得非常好。因此，该点的氨基酸序列就被用于设计PCR引物和用于Southern印迹试验的寡核苷酸。收集最具有活性的片断，再在2D凝胶上进行分析，分析显示，较之之前的2D凝胶，PollC相对于PollA/B更加清楚地富集。A database search using all short peptide sequences as targets yielded no reliable information to help determine which of the two major spots represented the lactonase. Therefore, we used a set of anion exchange chromatography to further enrich the lactonase preparation of Example 5. Protein solutions (in 20 mM Tris/HCl, pH 7.5, 2.5 mM MgCl ₂ ) were separated on a prepacked MonoQ column HR 16/10 (Amersham Pharmacia Biotech Europe GmbH, Dübendorf, Switzerland) with a gradient of 0 to 350 mM NaCl. The activity of all fragments was assayed. Active fragments were analyzed by isoelectric focusing and SDS-PAGE as described by the manufacturer (Invitrogen, Carlsbad, CA, USA). Comparison of fragment activity data from SDS-PAGE gels and IEF suggested that the lactonase is a 38 to 40 kDa protein with a pi around pH 5.6. This data fits very well with point PollC. Therefore, the amino acid sequence of this spot was used to design PCR primers and oligonucleotides for Southern blot experiments. The most active fragments were collected and analyzed on a 2D gel, which showed that PollC was more clearly enriched relative to PollA/B than the previous 2D gel.

实施例8设计用于Southern印迹试验的简并寡核苷酸以及用于从A.nigerEmbodiment 8 is designed to be used for the degenerate oligonucleotide of Southern blotting experiment and be used for from A.niger 克隆(R)-泛酸内酯酶的PCR引物PCR primers for cloning (R)-pantolactonase

使用来自不同Aspergillus种的12条不同的植酸酶序列，通过GCG程序包版本10.2的CODONFREQUENCY程序，得到Aspergillus的密码子使用情况，将其用于减少由于遗传密码简并性所造成的可能的不同DNA序列组合的数目。用Aspergillus密码子使用情况作为决定性因素，引物3’-末端都考虑到了每种可能的组合，但引物5’-末端就忽略了一些稀有的密码子。在用寡核苷酸进行杂交实验的情况下，用同样的方式对3’-和5’-末端进行处理。名称中含有“S(sense)”或“AS(anti-sense)”的寡核苷酸是用于PCR的。名称中不含S或AS的寡核苷酸是用于杂交实验的。Using 12 different phytase sequences from different Aspergillus species, the codon usage of Aspergillus was obtained by the CODONFRENCY program of GCG package version 10.2, which was used to reduce possible differences due to the degeneracy of the genetic code The number of DNA sequence combinations. Using Aspergillus codon usage as the determining factor, every possible combination was considered at the 3'-end of the primers, but some rare codons were ignored at the 5'-end of the primers. In the case of hybridization experiments with oligonucleotides, the 3'- and 5'-ends were treated in the same way. Oligonucleotides with "S (sense)" or "AS (anti-sense)" in their names are intended for PCR. Oligonucleotides without S or AS in the name are for hybridization experiments.

用于从A.niger中克隆(R)-泛酸内酯酶的寡核苷酸是Fxn42(SEQ IDNO：31)、Fxn42S(SEQ ID NO：32)、Fxn42AS(SEQ ID NO：33)、Fx57(SEQ ID NO：34)、Fxn57S(SEQ ID NO：35)、Fxn57AS(SEQ ID NO：36)、Fx73(SEQ ID NO：37)、Fx73S(SEQ ID NO：38)、Fx73AS(SEQ IDNO：39)、Fx74(SEQ ID NO：40)、Fx74S(SEQ ID NO：41)和Fx74AS(SEQ ID NO：42)。The oligonucleotides used to clone (R)-pantolactonase from A. niger are Fxn42 (SEQ ID NO: 31), Fxn42S (SEQ ID NO: 32), Fxn42AS (SEQ ID NO: 33), Fx57 ( SEQ ID NO: 34), Fxn57S (SEQ ID NO: 35), Fxn57AS (SEQ ID NO: 36), Fx73 (SEQ ID NO: 37), Fx73S (SEQ ID NO: 38), Fx73AS (SEQ ID NO: 39) , Fx74 (SEQ ID NO: 40), Fx74S (SEQ ID NO: 41) and Fx74AS (SEQ ID NO: 42).

实施例9从A.niger ATCC9142、A.niger NRRL3135、A.niger ssp.Embodiment 9 is from A.niger ATCC9142, A.niger NRRL3135, A.niger ssp. awamori(ATCC38854)或A.niger MacRae(ATCC46951)中分离基因Genes isolated from awamori (ATCC38854) or A. niger MacRae (ATCC46951) 组DNADNA

用1×10⁶孢子/ml去接种300ml的YPD培养基[Sherman et al.，Laboratory course manual for methods in yeast genetics，Cold Spring HarborLaoratory，Cold Spring Harbor，New York(1986)]。在猛烈振荡下(200rpm)，于30℃对培养物进行过夜培养。用Whatman滤纸通过过滤收集菌丝体，用PBS(等渗磷酸缓冲液)洗，在液氮中冷冻。在冰冷的研钵中，将菌丝体磨成细粉末。将该粉末重新悬浮于1/3体积的pH8.0的50mM Tris/HCl、1.0％SDS、50mM EDTA中，于65℃频繁颠倒条件下培养15分钟。将溶液冷却至50℃。加入蛋白酶K(终浓度为100μg/ml)。将溶液在50℃培养1小时。另外加入100mg/ml的蛋白酶K，继续培养3小时。加入1/3体积的苯酚/氯仿/异戊醇(50/49/1)。彻底且轻柔的混合之后，对乳化液进行离心(12000g，20分钟，4℃)。移出水相，再次萃取。另一个离心步骤(12000g，10分钟，4℃)之后，用氯仿对水相进行萃取。向水相中加入0.7倍体积的异丙醇，轻柔混合溶液15分钟。通过离心(10000g，30分钟，4℃)收集沉淀DNA。向残留的上清液中加入1/10体积的pH5.2的3M KCl，在轻柔振荡下培养15分钟。再一次离心步骤之后，用70％的乙醇对两份沉淀洗两次，空气中干燥，溶于0.5ml水中。最后，用RNase处理DNA以去除残留的RNA。Use 1×10 ⁶ spores/ml to inoculate 300 ml of YPD medium [Sherman et al., Laboratory course manual for methods in yeast genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1986)]. The cultures were grown overnight at 30°C with vigorous shaking (200 rpm). Mycelia were collected by filtration with Whatman filter paper, washed with PBS (isotonic phosphate buffered saline), and frozen in liquid nitrogen. In an ice-cold mortar, grind the mycelium to a fine powder. The powder was resuspended in 1/3 volume of 50 mM Tris/HCl, pH 8.0, 1.0% SDS, 50 mM EDTA and incubated at 65°C for 15 minutes with frequent inversions. The solution was cooled to 50 °C. Proteinase K was added (final concentration 100 μg/ml). The solution was incubated at 50°C for 1 hour. In addition, 100 mg/ml of proteinase K was added, and the culture was continued for 3 hours. Add 1/3 volume of phenol/chloroform/isoamyl alcohol (50/49/1). After thorough and gentle mixing, the emulsion was centrifuged (12000g, 20 minutes, 4°C). The aqueous phase was removed and extracted again. After another centrifugation step (12000 g, 10 min, 4° C.), the aqueous phase was extracted with chloroform. Add 0.7 volumes of isopropanol to the aqueous phase and mix the solution gently for 15 min. Precipitated DNA was collected by centrifugation (10000 g, 30 min, 4°C). To the remaining supernatant was added 1/10 volume of 3M KCl, pH 5.2, and incubated for 15 minutes with gentle shaking. After another centrifugation step, the two pellets were washed twice with 70% ethanol, air dried and dissolved in 0.5 ml of water. Finally, the DNA is treated with RNase to remove residual RNA.

实施例10克隆来自A.niger ATCC9142的(R)-泛酸内酯酶Example 10 Cloning of (R)-pantothenate lactonase from A. niger ATCC9142

按照实施例6所述来制备A.niger ATCC9142的基因组DNA(gDNA)。用来自Stratagene(Stratagene，La Jolla，CA，USA)的Robocycler Infinity作为PCR仪，用来自Roche Molecular Biochemicals(Penzberg，德国)的TAQ聚合酶试剂盒来进行PCR：Genomic DNA (gDNA) of A. niger ATCC9142 was prepared as described in Example 6. The Robocycler Infinity from Stratagene (Stratagene, La Jolla, CA, USA) was used as PCR machine, and the TAQ polymerase kit from Roche Molecular Biochemicals (Penzberg, Germany) was used to carry out PCR:

1μl gDNA(1/10稀释) 步骤1：5分钟-95℃1μl gDNA (1/10 dilution) Step 1: 5 minutes -95°C

1μl核苷酸混合物步骤2：30秒-95℃1 μl Nucleotide Mixture Step 2: 30 sec -95°C

5μl带Mg²⁺的PCR标准缓冲液步骤3：30秒-50℃5 μl PCR standard buffer with Mg ²⁺ Step 3: 30 sec -50°C

1μl引物S(100pM/μl) 步骤4：1分钟-72℃1 μl Primer S (100pM/μl) Step 4: 1 minute -72°C

1μl引物AS(100pM/μl)1μl Primer AS (100pM/μl)

1μl来自Roche Molecular Biochemicals的TAQ聚合酶1 μl TAQ polymerase from Roche Molecular Biochemicals

40μl H₂O40 μl _H2O

步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.

引物组合：Primer combination:

编号 serial number 上游引物upstream primer 下游引物downstream primer PCR产物[bp]PCR product [bp] 1 1 Fxn42SFxn42S Fxn57ASFxn57AS 300，310300, 310 2 2 Fxn42SFxn42S Fx73ASFx73AS -- 33 Fxn42SFxn42S Fx74ASFx74AS -- 44 Fxn57SFxn57S Fxn42ASFxn42AS 8080 55 Fxn57SFxn57S Fx73ASFx73AS 580580 66 Fxn57SFxn57S Fx74ASFx74AS 510510

77 Fx73SFx73S Fxn42ASFxn42AS -- 8 8 Fx73SFx73S Fxn57ASFxn57AS 300，315，320300, 315, 320 9 9 Fx73SFx73S Fx74ASFx74AS 1010 Fx74SFx74S Fxn42ASFxn42AS 180180 1111 Fx74SFx74S Fxn57ASFxn57AS 340340 1212 Fx74SFx74S Fx73ASFx73AS 840840 1313 Fxn42SFxn42S Fx57AS，Fx73AS，Fx74ASFx57AS, Fx73AS, Fx74AS -- 1414 Fxn57SFxn57S Fxn42AS，Fx73AS，Fx74AS-Fxn42AS, Fx73AS, Fx74AS- 580580 1515 Fx73SFx73S Fxn42AS，Fxn57AS，Fx74ASFxn42AS, Fxn57AS, Fx74AS -- 1616 Fx74SFx74S Fxn42AS，Fxn57AS，Fx73ASFxn42AS, Fxn57AS, Fx73AS 350350

对所有引物都会进行关于自身锚定(self-priming)的检查。Fxn57AS是唯一会产生相当数量自身结合产物的引物，自身锚定产物为大约300至320bp。用55℃的杂交温度，PCR 5、6、8、10和11仍会产生一条或多条产物。60℃下采用30秒扩增和30个循环，PCR 5、6、8、10和11仍会产生一条或多条与较低杂交温度下产生的条带相对应的PCR产物。All primers are checked for self-priming. Fxn57AS was the only primer that produced a significant amount of self-anchored product, with a self-anchored product of approximately 300 to 320 bp. With a hybridization temperature of 55°C, PCR 5, 6, 8, 10 and 11 still produced one or more products. Using 30-second amplification and 30 cycles at 60°C, PCRs 5, 6, 8, 10, and 11 still produced one or more PCR products corresponding to the bands produced at the lower hybridization temperature.

用来自Qiagen(Qiagen，Hilden，德国)的QIAEX II凝胶抽提试剂盒从琼脂糖凝胶中分离出PCR反应12的840bp的产物，将其溶于40μl H₂O中。The 840 bp product of PCR reaction 12 was isolated from the agarose gel with the QIAEX II gel extraction kit from Qiagen (Qiagen, Hilden, Germany) and dissolved in 40 μl _H2O .

1μl 840bp的片段(1/10稀释) 步骤1：5分钟-95℃1μl 840bp fragment (1/10 dilution) Step 1: 5 minutes -95°C

5μl带Mg²⁺的PCR标准缓冲液步骤3：30秒-55℃5 μl PCR standard buffer with Mg ²⁺ Step 3: 30 sec -55°C

1μl引物S(10pMol/μl) 步骤4：30秒-72℃1μl Primer S (10pMol/μl) Step 4: 30 seconds -72°C

1μl引物AS(10pMol/μl)1μl Primer AS (10pMol/μl)

40μl H₂O40 μl _H2O

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

使用了反应2、5、10、11和12的引物组合。除反应2之外，其它所有反应都有PCR产物产生。这意味着引物42S/AS、57S/AS、73S/AS和74S/AS的序列是840bp片段的部分。这四对引物的高度接近性强烈预示，840bp的片段是感兴趣的内酯酶基因的一部分。这种假设得到了测序的支持。肽Fx57(SEQ ID NO：90)的氨基酸序列在840bp的片段中得到了证实，而肽Fxn42(SEQ ID NO：45)的序列仅得到了部分证实。Primer combinations for reactions 2, 5, 10, 11 and 12 were used. Except reaction 2, all other reactions produced PCR products. This means that the sequences of primers 42S/AS, 57S/AS, 73S/AS and 74S/AS are part of a 840 bp fragment. The high proximity of these four pairs of primers strongly indicates that the 840bp fragment is part of the lactonase gene of interest. This hypothesis was supported by sequencing. The amino acid sequence of peptide Fx57 (SEQ ID NO: 90) was confirmed in a fragment of 840 bp, whereas the sequence of peptide Fxn42 (SEQ ID NO: 45) was only partially confirmed.

还可以分离出A.niger MacRae的相应DNA片段。关于A.awamori的情况，通过该方法仅能分离出引物Fxn57S和Fx73AS的产物。上述Aspergillus的序列在DNA水平上与A.niger ATCC9142的序列具有大约90％的相同性。The corresponding DNA fragment of A. niger MacRae can also be isolated. In the case of A. awamori, only the products of primers Fxn57S and Fx73AS could be isolated by this method. The above sequence of Aspergillus has about 90% identity with the sequence of A. niger ATCC9142 at the DNA level.

为分离内酯酶基因的5’-和3’-末端，使用下述方法：To isolate the 5'- and 3'-ends of the lactonase gene, the following method was used:

(a)分离3’-末端：(a) Isolation of the 3'-end:

1μl A.niger ATCC9142的gDNA 步骤1：5分钟-95℃1 μl gDNA of A. niger ATCC9142 Step 1: 5 minutes -95°C

1μl核苷酸混合物步骤2：30秒-95℃1μl Nucleotide Mixture Step 2: 30 sec -95°C

1μl引物Oal2AS或Oal3S(10pMol/μl) 步骤4：30秒-72℃1μl primer Oal2AS or Oal3S (10pMol/μl) Step 4: 30 seconds -72°C

41μl H₂O41 μl _H2O

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

用等体积的苯酚/氯仿对PCR混合进行两次萃取，再用氯仿单独进行一次。然后通过加入1/10体积的pH5.2的3M乙酸钾和2倍体积的乙醇沉淀出DNA。15分钟的离心(Eppendorf板式离心管中，14000rpm)之后，用70％的乙醇洗沉淀，空气干燥，最终溶于20μl灭菌水中。The PCR mix was extracted twice with equal volumes of phenol/chloroform and once with chloroform alone. The DNA was then precipitated by adding 1/10 volume of 3M potassium acetate pH 5.2 and 2 volumes of ethanol. After 15 minutes of centrifugation (14000 rpm in Eppendorf plate centrifuge tubes), the pellet was washed with 70% ethanol, air-dried and finally dissolved in 20 μl sterile water.

第二次PCR：Second PCR:

4μl来自第一次PCR的DNA 步骤1：5分钟-95℃4 μl of DNA from the first PCR Step 1: 5 min -95°C

5μl带Mg²⁺的PCR标准缓冲液步骤3：30秒-58℃5 μl PCR standard buffer with Mg ²⁺ Step 3: 30 sec -58°C

1μl引物Oal4S或Oal1AS(10pMol/μl) 步骤4：1分钟-72℃1 μl primer Oal4S or Oal1AS (10pMol/μl) Step 4: 1 minute -72°C

1μl来自Roche Molecular Biochemicals的High Fidelity Mix1 μl High Fidelity Mix from Roche Molecular Biochemicals

38μl H₂O38 μl _H2O

步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.

用Oal4S引物，产生了一段900bp的DNA分子。测序显示其编码基因的3’-末端。Using the Oal4S primer, a 900 bp DNA molecule was generated. Sequencing revealed the 3'-end of its encoding gene.

(b)分离5’-末端：(b) Isolation of the 5'-end:

用DIG标记过的寡核苷酸Fxn42、Fxn57、Fx73和Fx74作为探针，使用带有X-光胶片化学荧光探测的DIG系统(Roche MolecularBiochemicals)，对A.niger ATCC9142的基因组DNA进行杂交实验。杂交和探测按照厂商的推荐来进行。Using DIG-labeled oligonucleotides Fxn42, Fxn57, Fx73 and Fx74 as probes, the genomic DNA of A. niger ATCC9142 was hybridized using the DIG system (Roche Molecular Biochemicals) with X-ray film chemiluminescence detection. Hybridization and probing were performed according to the manufacturer's recommendations.

用BamHI、EcoRI、HindIII、PstI、SacI和XhoI来对10μg的基因组DNA进行消化。用Roche Molecular Biochemicals杂交溶液在42℃进行过夜杂交。室温用2x SSC、0.1％SDS进行两次洗涤步骤之后，接着在50℃用0.5x SSC、0.1％SDS再进行两次洗涤步骤。用CSPD(Disodium 2-chloro-5-(4-methoxyspiro{1，2-dioxetane-3，2’-(5’-chloro)tricyclo[3.3.1.1^3，7]decan}-4-yl)-1-phenyl phospate)作为底物，对碱性磷酸酶进行探测，之后将X-光胶片曝光于滤光器(filter)1小时。胶片上没有合用的信号。10 μg of genomic DNA was digested with BamHI, EcoRI, HindIII, PstI, SacI and XhoI. Hybridization was performed overnight at 42°C using Roche Molecular Biochemicals hybridization solution. Two wash steps with 2x SSC, 0.1% SDS at room temperature were followed by two more wash steps with 0.5x SSC, 0.1% SDS at 50°C. With CSPD (Disodium 2-chloro-5-(4-methoxyspiro{1,2-dioxetane-3,2'-(5'-chloro)tricyclo[3.3.1.1 ^3,7 ]decan}-4-yl)-1 -phenyl phospate) was used as a substrate to detect alkaline phosphatase, and then the X-ray film was exposed to a filter (filter) for 1 hour. No usable signal on film.

因此，使用反应5和11的随机锚定的DIG标记过的PCR产物(见实施例11)作为探针。按照同样的方法来进行杂交，只是洗涤条件稍微改变为室温下2×5分钟，55℃下2×15分钟。胶片上显示了下述条带：Therefore, randomly anchored DIG-labeled PCR products from reactions 5 and 11 (see Example 11) were used as probes. Hybridization was performed in the same manner, except that the washing conditions were slightly changed to 2 x 5 minutes at room temperature and 2 x 15 minutes at 55°C. The following bands were shown on the film:

限制性酶Restriction enzyme 340bp的探针[kb]340bp probe [kb] 580bp的探针[kb]580bp probe [kb] BamHIBamHI 1010 10/2.810/2.8 EcoRIEcoRI 4.54.5 4.5/5.24.5/5.2 HindIIIHind III 4.2/6.54.2/6.5 3.0/4.2/6.53.0/4.2/6.5 PstIPstI 4.44.4 2.8/4.42.8/4.4 SacISacI 23 twenty three 23/3.923/3.9 XhoIXhoI 66 6/206/20

为克隆整个基因或至少是其缺失的5’-末端，根据杂交实验，用HindIII对A.niger ATCC9142的染色体DNA进行消化，产生出大约4kb的含有内酯酶基因的片段，实验方法如下：In order to clone the entire gene or at least its missing 5'-end, according to the hybridization experiment, the chromosomal DNA of A. niger ATCC9142 was digested with HindIII to produce a fragment containing the lactonase gene of about 4 kb. The experimental method is as follows:

20μl染色体DNA20 μl chromosomal DNA

20μl10x缓冲液20 μl 10x buffer

5μl HindIII(10U/μl)5μl HindIII (10U/μl)

175μl H₂O175 μl _H2O

在37℃培养16小时。Incubate at 37°C for 16 hours.

用来自Qiagen(Qiagen，Hilden，德国)的PCR纯化试剂盒对经过消化的DNA进行纯化，并洗脱为100μl。下述连接反应在16℃过夜进行。将经过纯化的DNA用水稀释到2ml。按照厂商(Roche MolecularBiochemicals)推荐加入连接酶和连接酶缓冲液。连接完成的溶液被用作模板，进行不同的PCR，其中使用经部分分离的内酯酶基因的内部引物：The digested DNA was purified with a PCR purification kit from Qiagen (Qiagen, Hilden, Germany) and eluted in 100 μl. The following ligation reactions were performed overnight at 16°C. Dilute the purified DNA to 2ml with water. Ligase and ligase buffer were added as recommended by the manufacturer (Roche Molecular Biochemicals). The ligated solution was used as a template for a different PCR using internal primers for the partially isolated lactonase gene:

5μl连接反应溶液步骤1：5分钟-95℃5 μl ligation reaction solution Step 1: 5 minutes -95°C

1μl sense引物(10pMol/μl) 步骤4：3分钟-72℃1μl sense primer (10pMol/μl) Step 4: 3 minutes -72°C

1μl anti-sense引物(10pMol/μl)1μl anti-sense primer (10pMol/μl)

1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals

40μl H₂O40 μl _H2O

步骤2至4重复32次。Steps 2 to 4 are repeated 32 times.

四种引物组合，Oal3S/Fxn57AS、Oal4S/Fxn57AS、Oal3S/Oal8AS和Oal4S/Oal8AS，都能产生单条4kb的DNA片段。所有引物对都定向于基因的外部。因此，内酯酶基因应该位于作为自连接HindIII片段的环状DNA分子上。Four primer combinations, Oal3S/Fxn57AS, Oal4S/Fxn57AS, Oal3S/Oal8AS and Oal4S/Oal8AS, can all generate a single 4kb DNA fragment. All primer pairs are directed to the outside of the gene. Therefore, the lactonase gene should be located on a circular DNA molecule as a self-ligating HindIII fragment.

用来自Qiagen的QIAEX II凝胶抽提试剂盒从凝胶中分离出片段，按照厂商(Applied Biosystems，Foster City，CA，USA)推荐的二去氧方法，用ABI 310Genetic Analyzer来进行测序。获得的序列证实，分离出的片段含有来自A.niger ATCC9142的(R)-泛酸内酯酶基因的两个末端。Fragments were isolated from the gel with the QIAEX II gel extraction kit from Qiagen, and sequenced with the ABI 310 Genetic Analyzer according to the dideoxy method recommended by the manufacturer (Applied Biosystems, Foster City, CA, USA). The sequence obtained confirmed that the isolated fragment contained both ends of the (R)-pantolactonase gene from A. niger ATCC9142.

用引物Oall0S和Oal12AS，也可能分离出A.niger ssp.awamori的同源基因，其中含有与所有被测出的肽完全一样的氨基酸序列。唯一的区别仅在于肽Fxn42的序列。此时Fxn42来自对两个肽的测序。最可能的情况是，当进行HPLC来分离经过消化的酶片段时，同时洗脱出了两个不同的肽。测序首先仅揭示了主要的肽的序列，但其却带有很强的背景。对第一个肽的测序完成后，剩下的就是第二个肽的序列，其具有另外四个氨基酸：Using the primers Oall0S and Oal12AS, it was also possible to isolate the homologous gene of A. niger ssp.awamori, which contains the exact same amino acid sequence as all the peptides tested. The only difference is only in the sequence of the peptide Fxn42. At this point Fxn42 comes from the sequencing of two peptides. Most likely, when HPLC was performed to separate the digested enzyme fragments, two different peptides eluted simultaneously. Sequencing first revealed only the sequence of the main peptide, but with a strong background. Once the sequencing of the first peptide is complete, all that remains is the sequence of the second peptide, which has four more amino acids:

a)K-P-F-A-H-Q-V-K(SEQ ID NO：43)和a) K-P-F-A-H-Q-V-K (SEQ ID NO: 43) and

b)R-H-H-N-A-P-A-P-T-P-E-D-P-E-R-R(SEQ ID NO：44)给出b) R-H-H-N-A-P-A-P-T-P-E-D-P-E-R-R (SEQ ID NO: 44) gives

K-P-F-A-H-Q-V-K-T-x-E-D(SEQ ID NO：45)，其为Fxn 42。K-P-F-A-H-Q-V-K-T-x-E-D (SEQ ID NO: 45), which is Fxn 42.

这还解释了为何基于肽Fxn42的寡核苷酸Fxn42S和Fxn42AS无法给出合用的结果。根据氨基酸序列，来自菌株ATCC9142的蛋白质具有36573Da的分子量，280nm处估计的吸光率为1.83(1mg/ml蛋白质溶液)，而来自A.niger ssp.awamori的氨基酸序列的分子量被计算为36547Da，对1mg/ml蛋白质溶液而言，估计的E_280nm为1.831(Pace et al.，1995)。This also explains why the oligonucleotides Fxn42S and Fxn42AS based on the peptide Fxn42 failed to give combined results. Based on the amino acid sequence, the protein from strain ATCC9142 has a molecular weight of 36573 Da with an estimated absorbance at 280 nm of 1.83 (1 mg/ml protein solution), while the molecular weight of the amino acid sequence from A. niger ssp. awamori is calculated to be 36547 Da for 1 mg /ml protein solution, the estimated _E280nm is 1.831 (Pace et al., 1995).

实施例11对来自A.niger ATCC 9142的(R)-泛酸内酯酶基因进行克隆和Embodiment 11 is cloned from the (R)-pantothenate lactonase gene of A.niger ATCC 9142 and 表达Express

对按照实施例10中所述的全部三种方法获得的序列进行比较，将其进行装配，产生一条连续的覆盖整个内酯酶基因的2443bp的序列。为测定该基因的起始密码子、终止密码子和可能的内含子，将从来自商业制剂的纯化基因得到的肽的可获得的氨基酸序列，与GCG程序包版本10.2的两个程序TESTCODE和CODONPREFERENCE获得的结果联合使用。虽然按照同样的方式两个程序都测定出了终止密码子和一个内含子，而起始密码子却并不明显，具体而言，因为没有肽Fx74(SEQ ID NO：84)的其它可获得的N-末端已被测序的肽。对数据库搜索期间发现的向同源氨基酸序列的所有可能的翻译进行的比较也没有帮助，因为蛋白质的N-末端部分是完全异源的。因此，在肽Fx74的位置上游但仍在同一个阅读框内发现的两个甲硫氨酸残基被选出，将其用于构建不同的表达盒。在SEQ IDNO：7的666-742号碱基对之间发现了富含CT的区域。然而，该富含CT的区域的位置并非像起始密码子那样真正地偏好Met而不是其它氨基酸，因为在真菌中该区域与起始密码子之间的距离通常有相当大的变化。此外，在基因的下游鉴定出了可能的聚腺苷化位点。The sequences obtained by all three methods described in Example 10 were compared and assembled to generate a contiguous 2443 bp sequence covering the entire lactonase gene. To determine the start codon, stop codon and possible introns of the gene, the available amino acid sequences of peptides obtained from purified genes from commercial preparations were compared with the two programs TESTCODE and The results obtained with CODONPREFERENCE are used in conjunction. While both programs determined the stop codon and an intron in the same way, the start codon was not evident, specifically because no other variants of the peptide Fx74 (SEQ ID NO:84) were available. The N-terminus of the peptide has been sequenced. A comparison of all possible translations to homologous amino acid sequences found during database searches was also not helpful, since the N-terminal part of the protein is completely heterologous. Therefore, two methionine residues found upstream of the position of peptide Fx74 but still in the same reading frame were selected and used to construct different expression cassettes. A CT-rich region was found between base pairs 666-742 of SEQ ID NO:7. However, the location of this CT-rich region does not really favor Met over other amino acids like the start codon, since the distance between this region and the start codon often varies considerably in fungi. In addition, possible polyadenylation sites were identified downstream of the gene.

对来自A.niger ATCC9142的基因的72bp长的内含子而言，从第976至981位核苷酸的DNA段最可能代表5’-剪切位点，从第1029至1035位或从1016至1022位核苷酸的段最可能代表推测出的内部保守序列(consensus sequence)，从第1045至1047位核苷酸的DNA段最可能代表3’-剪切位点(SEQ ID NO：7)。A.niger MacRae基因的相应序列在51bp长的内含子(SEQ ID NO：79)中定位于bp 32至37、66至72、以及80至82。对A.niger ssp.awamori的oal基因的50bp长的内含子而言，5’-剪切位点始自第203位核苷酸(GTGCCC)，内部保守区域在第236位核苷酸处(AACTAAC)，3’-剪切位点在第250位核苷酸处(CAG)(SEQID NO：9)。For the 72 bp long intron of the gene from A. niger ATCC9142, the DNA segment from 976 to 981 nucleotides most likely represents the 5'-splice site, from 1029 to 1035 or from 1016 The segment to 1022 nucleotides is most likely to represent a deduced internal conservative sequence (consensus sequence), and the DNA segment from 1045 to 1047 nucleotides is most likely to represent a 3'-cleavage site (SEQ ID NO: 7 ). The corresponding sequences of the A. niger MacRae gene are located at bp 32 to 37, 66 to 72, and 80 to 82 in a 51 bp long intron (SEQ ID NO: 79). For the 50bp long intron of the oal gene of A. niger ssp.awamori, the 5'-cleavage site starts from the 203rd nucleotide (GTGCCC), and the internal conserved region is at the 236th nucleotide (AACTAAC), the 3'-cleavage site is at the 250th nucleotide (CAG) (SEQID NO: 9).

oal的内含子intron of oal 长度 length 5’-位点5'-site 内部位点internal site 3’-位点3'-site A.niger ATCC 9142A. niger ATCC 9142 7272 GTACCCGTACCC ACCTAACACCTAAC TAGTAG A.niger MacRaeA. niger MacRae 5151 GTAATCGTAATC AACTAACAACTAAC CAGCAG A.niger ssp awamoriA. niger ssp awamori 5050 GTGCCCGTGCCC AACTAACAACTAAC CAGCAG

5’-剪切位点的保守序列是GTPuNGPy。列出的位点中没有一个在第5位有G。用于套索(lariat)形成的内部位点的酵母的保守序列是TACTAAC。3’-剪切位点具有PyAG的保守序列[Unkles，Gene organizationin industrial filamentous fungi.In Applied molecular genetics of filamentousfungi(Kinghoun，ed.)，pp.28-53.Blackie Academic&Professional，WesterCleddens Road，Bishopbriggs，Glasgow G64 2NZ，UK，Glasgow(1992)]。The conserved sequence for the 5'-cleavage site is GTPuNGPy. None of the sites listed have a G at position 5. The conserved sequence in yeast for the internal site of lariat formation is TACTAAC. The 3'-cleavage site has the conserved sequence of PyAG [Unkles, Gene organization in industrial filamentous fungi. In Applied molecular genetics of filamentous fungi (Kinghoun, ed.), pp.28-53. Blackie Academic & Professional, WesterCleddens Road, Bishopbriggs, Glasgow G64 2NZ, UK, Glasgow (1992)].

产生了下述cDNA：The following cDNA was produced:

-用于在E.coli中表达的oal(SEQ ID NO：98)- oal (SEQ ID NO: 98) for expression in E.coli

-用于在Saccharomyces cerevisiae中表达的oalEco(SEQ ID NO：94)- oalEco (SEQ ID NO: 94) for expression in Saccharomyces cerevisiae

-用于在S.cerevisiae中表达的oalEShort(用SEQ ID NO：95的第32位处的第二个可能的Met的31个氨基酸的较短的版本)- oalEShort for expression in S.cerevisiae (with a shorter version of 31 amino acids of the second possible Met at position 32 of SEQ ID NO:95)

-用于在E.coli中表达的oalS(oalS被制成是因为Fxn42肽的令人误解的序列，其最终被鉴定为人工产物)- oalS for expression in E. coli (oalS was made because of the misleading sequence of the Fxn42 peptide, which was eventually identified as an artifact)

-用于在S.cerevisiae中表达的oalSEco-oalSEco for expression in S.cerevisiae

-用于在E.coli中表达的含有C-末端his-标签的oalhis- oalhis containing C-terminal his-tag for expression in E.coli

-用于在S.cerevisiae中表达的oalEhis- oalEhis for expression in S. cerevisiae

-用于在E.coli中表达的含有N-末端his-标签的oalNhis(SEQ IDNO：92)- oalNhis (SEQ ID NO: 92) containing the N-terminal his-tag for expression in E.coli

-用于在S.cerevisiae中表达的oalENhis(SEQ ID NO：96)(构建体oalsec，其含有A.terreus cbs的植酸酶的信号肽，该信号肽用于在S.cerevisiae中的表达和分泌)- oalENhis (SEQ ID NO: 96) for expression in S.cerevisiae (construct oalsec, it contains the signal peptide of the phytase of A.terreus cbs, and this signal peptide is used for the expression in S.cerevisiae and secretion)

通常的工序：Usual procedure:

首先，在两次PCR中分离出oal基因的假设编码序列(cDNA)。通过第三次PCR将两种PCR产物装配起来。使用下述引物：First, the putative coding sequence (cDNA) of the oal gene was isolated in two PCRs. The two PCR products were assembled by a third PCR. Use the following primers:

Oal1AS(SEQ ID NO：46)、Oal2AS(SEQ ID NO：47)、Oal3S(SEQ ID NO：48)、Oal4S(SEQ ID NO：49)、Oal5S(SEQ ID NO：50)、Oal6AS(SEQ ID NO：51)、Oal7AS(SEQ ID NO：52)、Oal8AS(SEQ ID NO：53)、Oal9AS(SEQ ID NO：54)、Oall 0S(SEQID NO：55)、Oal10SEco(SEQ ID NO：56)、OalENhis(SEQ ID NO：57)、Oal10SShis(SEQ ID NO：58)、Oal11S(SEQ ID NO：59)、Oal12AS(SEQ ID NO：60)、Oal12ASEco(SEQ ID NO：61)、Oal12AShis(SEQ ID NO：62)、Oal12ASHisEco(SEQ ID NO：63)、Oal13S(SEQ ID NO：64)、Oal13AS(SEQ ID NO：65)、Oal14S(SEQID NO：66)、Oal14AS(SEQ ID NO：67)、Oal15S(SEQ ID NO：68)、Oal15AS(SEQ ID NO：69)、Oal16S(SEQ ID NO：70)(第二个Met)、OalsecS(SEQ ID NO：71)、OalsecAS(SEQ ID NO：72)、pQE80EcoS(SEQ ID NO：73)、pQE80EcoNhisS(SEQ ID NO：74)、pQE80BamAS(SEQ ID NO：75)、pQE80BamhisAS(SEQ ID NO：76)、pQE80EcoSshort(SEQ ID NO：77)和CP-a(SEQ ID NO：78)。Oal1AS (SEQ ID NO: 46), Oal2AS (SEQ ID NO: 47), Oal3S (SEQ ID NO: 48), Oal4S (SEQ ID NO: 49), Oal5S (SEQ ID NO: 50), Oal6AS (SEQ ID NO : 51), Oal7AS (SEQ ID NO: 52), Oal8AS (SEQ ID NO: 53), Oal9AS (SEQ ID NO: 54), Oall0S (SEQ ID NO: 55), Oal10SEco (SEQ ID NO: 56), OalENhis (SEQ ID NO: 57), Oal10SShis (SEQ ID NO: 58), Oal11S (SEQ ID NO: 59), Oal12AS (SEQ ID NO: 60), Oal12ASEco (SEQ ID NO: 61), Oal12AShis (SEQ ID NO: 62), Oal12ASHisEco (SEQ ID NO: 63), Oal13S (SEQ ID NO: 64), Oal13AS (SEQ ID NO: 65), Oal14S (SEQ ID NO: 66), Oal14AS (SEQ ID NO: 67), Oal15S (SEQ ID NO: 68), Oal15AS (SEQ ID NO: 69), Oal16S (SEQ ID NO: 70) (second Met), OalsecS (SEQ ID NO: 71), OalsecAS (SEQ ID NO: 72), pQE80EcoS ( SEQ ID NO: 73), pQE80EcoNhisS (SEQ ID NO: 74), pQE80BamAS (SEQ ID NO: 75), pQE80BamhisAS (SEQ ID NO: 76), pQE80EcoSshort (SEQ ID NO: 77) and CP-a (SEQ ID NO :78).

Oal11S直接始自起始密码子的更为上游的位点前面，而Oal9AS始自假设的终止密码子下游若干核苷酸处。为能在第三次PCR中不用内含子将得到的两种PCR产物装配起来，Oal14S和Oal14AS是互补的，其含有从外显子I的终点到外显子II的起点之间的序列。Oal11S begins directly in front of the site more upstream of the start codon, while Oal9AS begins several nucleotides downstream of the putative stop codon. In order to assemble the two resulting PCR products without introns in the third PCR, Oal14S and Oal14AS are complementary, containing the sequence from the end of exon I to the beginning of exon II.

1μl Oal11S(10pMol/μl) 步骤4：1分钟-72℃1 μl Oal11S (10pMol/μl) Step 4: 1 minute -72°C

1μl Oal14AS(10pMol/μl)1μl Oal14AS (10pMol/μl)

1μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液40μl H₂O1 μl High Fidelity Polymerase Mixture from Roche Molecular Biochemicals 40 μl _H2O

步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.

1μl Oal14S(10pMol/μl) 步骤4：1分钟-72℃1 μl Oal14S (10pMol/μl) Step 4: 1 minute -72°C

1μl Oal9AS(10pMol/μl)1μl Oal9AS (10pMol/μl)

步骤2至4重复35次。Steps 2 to 4 are repeated 35 times.

对A.niger ATCC9142的gDNA进行的PCR给出了具有期望大小的PCR产物。通过琼脂糖凝胶电泳对PCR产物进行纯化。用来自Qiagen(Qiagen，Hilden，德国)的QIAEX II试剂盒从凝胶中提取出PCR产物，将其用于第三次PCR：PCR on gDNA of A. niger ATCC9142 gave a PCR product of the expected size. PCR products were purified by agarose gel electrophoresis. The PCR product was extracted from the gel with the QIAEX II kit from Qiagen (Qiagen, Hilden, Germany) and used for the third PCR:

PCR产物1和2各0.5μl 步骤1：5分钟-95℃0.5 μl each of PCR products 1 and 2 Step 1: 5 minutes -95°C

1μl Oal12S(10pMol/μl) 步骤4：1分钟-72℃1 μl Oal12S (10pMol/μl) Step 4: 1 minute -72°C

1μl Oal10AS(10pMol/μl)1μl Oal10AS (10pMol/μl)

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

通过琼脂糖凝胶电泳对终产物进行纯化，从凝胶中提取出来，并按照Invitrogen(carlsbad，CA，USA)提供的方法将其克隆进TA-载体，用于对PCR产物进行直接克隆。所有其它必需的克隆步骤都按照Sambrook et al.(1989)所述来进行。通过测序检测出来的含有正确插入(oal1)的质粒在所有后续的构建步骤中被用作模板。The final product was purified by agarose gel electrophoresis, extracted from the gel, and cloned into a TA-vector according to the method provided by Invitrogen (carlsbad, CA, USA) for direct cloning of PCR products. All other necessary cloning steps were performed as described by Sambrook et al. (1989). The plasmid containing the correct insert (oal1) detected by sequencing was used as template in all subsequent construction steps.

(a)用于在E.coli和Saccharomyces cerevisiae中表达的构建体oal和oalEco：按照下述方案，用引物pQE80EcoS和pQE80BamAS对oal1进行PCR，以产生构建体Eco-oal-Bam：(a) Constructs oal and oalEco for expression in E. coli and Saccharomyces cerevisiae: PCR was performed on oal1 with primers pQE80EcoS and pQE80BamAS to generate construct Eco-oal-Bam according to the following scheme:

1μl含有作为插入的oal1的质粒(10ng)1 μl of plasmid (10 ng) containing oal1 as insert

1μl核苷酸混合物1 μl nucleotide mix

5μl带Mg²⁺的PCR标准缓冲液5μl PCR standard buffer with Mg2 ⁺

1μl pQE80EcoS(10pMol/μl)1μl pQE80EcoS (10pMol/μl)

1μl pQE80BamAS(10pMol/μl)1μl pQE80BamAS (10pMol/μl)

步骤1：5分钟-95℃Step 1: 5 minutes -95°C

步骤1：30秒-95℃Step 1: 30 seconds -95°C

步骤1：30秒-55℃Step 1: 30 seconds -55°C

步骤4：1分钟-72℃Step 4: 1 minute -72°C

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

通过琼脂糖凝胶电泳对PCR产物进行纯化，从凝胶中提取人们感兴趣的片段(QIAEX II，Qiagen)，用EcoRI和BamHI进行消化，用琼脂糖凝胶电泳再次进行纯化，将其连接到来自Qiagen的载体pQE80上，转化进E.coli Top10。分离出四个克隆的质粒，对插入进行测序。含有正确构建体的质粒被转化进E.coli M15或相当的E.coli菌株。所有分子技术都是技术人员已知的标准技术。The PCR product was purified by agarose gel electrophoresis, the fragment of interest (QIAEX II, Qiagen) was extracted from the gel, digested with EcoRI and BamHI, purified again by agarose gel electrophoresis, and ligated into Transformed into E.coli Top10 on the vector pQE80 from Qiagen. Plasmids of four clones were isolated and the inserts were sequenced. Plasmids containing the correct constructs were transformed into E. coli M15 or equivalent E. coli strains. All molecular techniques are standard techniques known to the skilled person.

对酵母表达构建体而言，将所用的引物换为Oal10EcoS和Oal12ASEco，用EcoRI对PCR产物进行单独消化，通过载体上的EcoRI限制性位点，将终产物克隆进pYES2或用于S.cerevisiae的相当的表达载体。按照Hinnen et al.，Proc.Natl.Acad.Sci.USA 75：1929-1933(1978)或按照与菌株同时提供的说明书，来进行对S.cerevisiae菌株，例如INVScl(Invitrogen，Carlsbad，CA，USA)的转化。For yeast expression constructs, the primers used were changed to Oal10EcoS and Oal12ASEco, the PCR product was digested separately with EcoRI, and the final product was cloned into pYES2 or used for S.cerevisiae through the EcoRI restriction site on the vector. equivalent expression vector. According to Hinnen et al., Proc.Natl.Acad.Sci.USA 75:1929-1933 (1978) or according to the specification sheet provided simultaneously with bacterial strain, carry out to S.cerevisiae bacterial strain, for example INVScl (Invitrogen, Carlsbad, CA, USA ) conversion.

(b)用于在E.coli中表达的构建体pQE80oalS(用第二个可能的Met的31个氨基酸的减短版本)：(b) Construct pQE80oalS for expression in E. coli (with a 31 amino acid shortened version of the second possible Met):

用跟上述内容一样的PCR方案，用同样的模板，引物用pQE80EcoSshort和pQE80BamAS。按照(a)中所述对PCR反应完成之后的反应液进行处理。Use the same PCR scheme as above, use the same template, and use pQE80EcoSshort and pQE80BamAS as primers. Treat the reaction solution after the PCR reaction as described in (a).

(c)用于在S.cerevisiae中表达的构建体oalS：(c) Construct oalS for expression in S. cerevisiae:

使用引物Oal16S和Oal12ASEco。其它都按照(a)中所述来进行。Primers Oal16S and Oal12ASEco were used. Everything else was carried out as described in (a).

(d)用于在E.coli中表达的含有C-末端his-标签的构建体pQE80oalhis：(d) Construct pQE80oalhis containing a C-terminal his-tag for expression in E. coli:

使用引物pQE80EcoS和pQE80BamhisAS来进行PCR，见(a)。PCR was performed using primers pQE80EcoS and pQE80BamhisAS, see (a).

(e)用于在S.cerevisiae中表达的构建体oalhis：(e) Construct oalhis for expression in S. cerevisiae:

使用引物Oal10Seco和Oal12ASHisEco来进行PCR，见(a)。PCR was performed using primers Oal10Seco and Oal12AShisEco, see (a).

(f)用于在E.coli中表达的含有N-末端His-标签的构建体pQE80oalNhis：(f) Construct pQE80oalNhis containing the N-terminal His-tag for expression in E. coli:

使用引物pQE80EcoNhisS和pQE80BamAS来进行PCR(见a)。PCR was performed using primers pQE80EcoNhisS and pQE80BamAS (see a).

(g)用于在S.cerevisiae中表达的构建体oalENhis：(g) Construct oalENhis for expression in S. cerevisiae:

使用引物OalENhis和Oal12ASEco来进行PCR(见a)。PCR was performed using primers OalENhis and Oal12ASEco (see a).

(h)构建体oalsec，其含有A.terreus cbs的植酸酶的信号肽，该信号肽用于在S.cerevisiae中的表达和分泌：(h) Construct oalsec containing the signal peptide of the phytase of A. terreus cbs for expression and secretion in S. cerevisiae:

用三次独立的PCR来制备该构建体。首先，通过对保守植酸酶基因进行PCR分离出信号序列[Lehmann et al.，Protein Eng.13：49-57(2000)]。This construct was prepared using three independent PCRs. First, the signal sequence was isolated by PCR on the conserved phytase gene [Lehmann et al., Protein Eng. 13:49-57 (2000)].

1.0μl含有作为插入的保守植酸酶基因的质粒(10ng)1.0 μl of plasmid (10 ng) containing the conserved phytase gene as an insert

1.0μl核苷酸混合物1.0 μl nucleotide mix

5.0μl带Mg²⁺的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 ⁺

1.0μl CP-a(10pMol/μl)1.0μl CP-a (10pMol/μl)

1.0μl OalsecAS(10pMol/μl)1.0μl OalsecAS (10pMol/μl)

1.0μl来自Roche Molecular Biochemicals的High Fidelity聚合酶混合液1.0 μl High Fidelity Polymerase Mix from Roche Molecular Biochemicals

40μl H₂O40 μl _H2O

步骤1：5分钟-95℃Step 1: 5 minutes -95°C

步骤1：30秒-95℃Step 1: 30 seconds -95°C

步骤1：30秒-55℃Step 1: 30 seconds -55°C

步骤4：30秒-72℃Step 4: 30 seconds -72°C

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

1.0μl含有oal1基因的质粒(10ng)1.0 μl plasmid containing oal1 gene (10ng)

1.0μl核苷酸混合物1.0 μl nucleotide mix

5.0μl带Mg²⁺的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 ⁺

1.0μl OalsecS(10pMol/μl)1.0μl OalsecS (10pMol/μl)

1.0μl Oal12ASEco(10pMol/μl)1.0μl Oal12ASEco (10pMol/μl)

40μl H₂O40 μl _H2O

步骤1：5分钟-95℃Step 1: 5 minutes -95°C

步骤1：30秒-95℃Step 1: 30 seconds -95°C

步骤1：30秒-55℃Step 1: 30 seconds -55°C

步骤4：45秒-72℃Step 4: 45 seconds -72°C

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

通过在1.5％的琼脂糖凝胶上进行琼脂糖凝胶电泳来纯化PCR产物，从凝胶中提取PCR产物(QIAEX II，Qiagen)，将其用于第三次PCR：The PCR product was purified by agarose gel electrophoresis on a 1.5% agarose gel, from which the PCR product was extracted (QIAEX II, Qiagen) and used for a third PCR:

0.5μl PCR1的产物0.5 μl PCR1 product

05μl PCR2的产物05 μl PCR2 product

1.0μl核苷酸混合物1.0 μl nucleotide mix

5.0μl带Mg²⁺的PCR标准缓冲液5.0μl PCR standard buffer with Mg2 ⁺

1.0μl CP-a(10pMol/μl)1.0μl CP-a (10pMol/μl)

1.0μl Oal12ASEco(10pMol/μl)1.0μl Oal12ASEco (10pMol/μl)

40μl H₂O40 μl _H2O

步骤1：5分钟-95℃Step 1: 5 minutes -95°C

步骤1：30秒-95℃Step 1: 30 seconds -95°C

步骤1：30秒-55℃Step 1: 30 seconds -55°C

步骤4：1分钟-72℃Step 4: 1 minute -72°C

步骤2至4重复30次。Repeat steps 2 to 4 30 times.

通过琼脂糖凝胶电泳对PCR产物进行纯化，从凝胶中提取出PCR产物，用EcoRI进行消化，再次用琼脂糖凝胶电泳进行纯化，按照上述连接到S.cerevisiae表达载体中。The PCR product was purified by agarose gel electrophoresis, extracted from the gel, digested with EcoRI, purified by agarose gel electrophoresis again, and ligated into the S. cerevisiae expression vector as described above.

实施例12 oal的表达The expression of embodiment 12 oal

(a)在E.coli中：(a) In E.coli:

在E.coli M15中表达pQE80oal、pQE80oalS、pQE80oalhis和pQE80oalNhis，所述E.coli M15含有pREP4，其含有lac启动子的阻遏子(见德国，Hilden的Qiagen的表达说明书)。将过夜培养物稀释至600nm的OD为0.1，在30℃培养至OD_600nm为1.5。然后用0.5mM IPTG诱导培养物，再在30℃培养6小时。通过离心(5000g，20分钟)收获细胞，冷冻于-80℃。按照厂商提供的方案，用B-PER(Pierce，Rockford，IL，USA)对它们进行裂解。再一次离心步骤之后，将上清液用于对(R)-泛酸内酯的水解。pQE80oal, pQE80oalS, pQE80oalhis and pQE80oalNhis were expressed in E. coli M15 containing pREP4 which contains the repressor of the lac promoter (see expression instructions from Qiagen, Hilden, Germany). The overnight culture was diluted to an OD600nm of 0.1 and grown at 30°C to an _OD600nm of 1.5. The cultures were then induced with 0.5 mM IPTG and incubated at 30°C for 6 hours. Cells were harvested by centrifugation (5000 g, 20 minutes) and frozen at -80°C. They were cleaved with B-PER (Pierce, Rockford, IL, USA) according to the protocol provided by the manufacturer. After another centrifugation step, the supernatant was used for the hydrolysis of (R)-pantolactone.

在含有构建体pQE80oalS和pQE80oalhis的转化子中，上清液和细胞裂解液中都没有发现活性，而用构建体pQE80oal和pQE80oalNhis的情况下，可溶蛋白质中的大约5％是具有活性的(R)-泛酸内酯酶。In transformants containing constructs pQE80oalS and pQE80oalhis, no activity was found in the supernatants or cell lysates, whereas with constructs pQE80oal and pQE80oalNhis approximately 5% of the soluble protein was active (R )-pantothenic acid lactonase.

用与没有His-标签的基因同样的方法，来表达含有N-末端His-标签的构建体。用来自Qiagen(Hilden，德国)的“QIAexpressionist”所述的方案，从细胞裂解液中对天然形式的加上His-标签的蛋白质进行纯化。The construct containing the N-terminal His-tag was expressed in the same way as the gene without the His-tag. His-tagged protein in native form was purified from cell lysates using the protocol described by "QIAexpressionist" from Qiagen (Hilden, Germany).

(b)在S.cerevisiae中：(b) In S. cerevisiae:

构建体oalEco、oalES、oalEhis、oalENhis和oalsec在S.cerevisiaeINVScl或类似相当的菌株中表达。转化并在SD^ura-培养基[Sherman et al.，Laboratory course manual for methods in yeast genetics，Cold Spring HarborLaboratory，Cold Spring Harbor，New York(1986)]上涂布之后，在30℃对平板进行3至4天的培养，捡出培养出的菌落，将其转移至2ml SD^ura-液体培养基中，在猛烈振荡下于30℃培养3天。上述培养物被用作为25mlSD^ura-培养基的预培养物。在猛烈振荡下于30℃再培养3天后，用制备的培养物[Sherman et al.，如上所述]接种21瓶中的500ml YPD培养基。再在相同条件下培养3天之后，通过离心从培养基中分离出细胞，用Y-PER(Pierce，Rockford，IL)进行裂解。The constructs oalEco, oalES, oalEhis, oalENhis and oalsec were expressed in S. cerevisiaeINVScl or similar equivalent strains. After transformation and plating on SD ^ura -medium [Sherman et al., Laboratory course manual for methods in yeast genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1986)], the plates were incubated at 30°C for 3 to After 4 days of cultivation, the cultured colonies were picked out, transferred to 2ml SD ^ura- liquid medium, and cultured at 30° C. for 3 days under vigorous shaking. The above culture was used as a pre-culture in 25 ml of SD ^ura -medium. After an additional 3 days of cultivation at 30°C with vigorous shaking, 500 ml of YPD medium in 21 flasks was inoculated with the prepared culture [Sherman et al., as described above]. After an additional 3 days of culture under the same conditions, cells were separated from the medium by centrifugation and lysed with Y-PER (Pierce, Rockford, IL).

如在E.coli中表达的情况一样，仅有oalEco和oalENhis构建体导致了活性(R)-泛酸内酯酶的表达，其仅在细胞裂解液中被发现。用75ml YPD培养物，也可以产生300U的(R)-泛酸内酯酶。相应的裂解液显示大约5U/mg总蛋白的比活性。As in the case of expression in E. coli, only the oalEco and oalENhis constructs resulted in the expression of active (R)-pantothenolactonase, which was found only in cell lysates. 300 U of (R)-pantolactonase can also be produced with 75 ml of YPD culture. The corresponding lysates showed a specific activity of approximately 5 U/mg total protein.

用上述内含子位置的序列数据，还可用类似的方式对来自A.awamori的oal基因和来自A.niger ATCC46951的oal基因进行构建和表达。The oal gene from A. awamori and the oal gene from A. niger ATCC46951 can also be constructed and expressed in a similar manner using the sequence data of the above intron position.

实施例13对在E.coli或S.cerevisiae中异源表达的Oal进行固定，以及Example 13 fixes Oal heterologously expressed in E.coli or S.cerevisiae, and 将其用于对(RS)-泛酸内酯的水解拆分Use it for the hydrolytic resolution of (RS)-pantolactone

为在生物反应器中发挥多种用途，对来自A.niger的(R)-泛酸内酯酶进行固定。在E.coli或S.cerevisiae中表达之后，通过化学手段，例如B-PER(E.coli)或Y-PER(S.cerevisiae)(Pierce，Rockford，IL，USA)，或通过机械手段例如超声波或高压对细胞进行破碎。通过离心来澄清裂解液。将制剂直接用于固定。或者，在固定之前，用另一项纯化步骤，例如硫酸铵沉淀，来进一步地增加Oal比活性。(R)-Pantolactonase from A. niger was immobilized for multiple uses in bioreactors. After expression in E.coli or S.cerevisiae, by chemical means such as B-PER (E.coli) or Y-PER (S.cerevisiae) (Pierce, Rockford, IL, USA), or by mechanical means such as ultrasound Or high pressure to disrupt the cells. Lysates were clarified by centrifugation. Preparations were used directly for fixation. Alternatively, use another purification step, such as ammonium sulfate precipitation, prior to fixation to further increase Oal specific activity.

按照下述方法来固定该生物催化剂：The biocatalyst was immobilized as follows:

(1)在二氧化硅溶胶凝胶(Silica Sol-Gel)上固定：(1) Immobilized on Silica Sol-Gel:

按照文献[Gill and Ballesteros，J.Am.Chem.Soc.120：8587-8598(1998)]所述来制备聚(硅酸甘油)-1.0(PGS-1.0)，将其溶于其质量一半的冰冷的水中。取其中100或200mg的部分，与50或100mg冰冷的生物催化剂贮存液制剂(来自若干制剂的可溶的酶)彻底地混合，混合于2ml或5ml小瓶中的50mM磷酸盐中进行，pH7.0，将小瓶轻柔旋转2分钟，以在瓶壁上形成水凝胶的薄层。将小瓶置于冰上20分钟，然后(开启)转移至冰箱。水凝胶在5℃成熟48-72小时，以形成干凝胶(xerogel)。通过在5℃以100rpm振荡，用2×1或2×4ml pH为7的50mM磷酸缓冲液来洗涤有干凝胶的小瓶，然后是2×1或2×4ml pH为7的50mM TEA-乙酸盐，其中含有10mM乙酸镁。According to the literature [Gill and Ballesteros, J.Am.Chem.Soc.120:8587-8598 (1998)], poly(glycerol silicate)-1.0 (PGS-1.0) was prepared and dissolved in half of its mass icy water. A 100 or 200 mg portion of this was mixed thoroughly with 50 or 100 mg of ice-cold biocatalyst stock solution preparation (soluble enzyme from several preparations) in 50 mM phosphate in 2 ml or 5 ml vials, pH 7.0 , swirling the vial gently for 2 minutes to form a thin layer of hydrogel on the vial walls. Place the vial on ice for 20 minutes, then transfer (open) to the refrigerator. The hydrogel was matured at 5°C for 48-72 hours to form a xerogel. Wash the vial with xerogel with 2 x 1 or 2 x 4 ml of 50 mM phosphate buffer pH 7 followed by 2 x 1 or 2 x 4 ml of 50 mM TEA-B at pH 7 by shaking at 100 rpm at 5°C. acid salt, which contains 10mM magnesium acetate.

(2)将Oal固定于二氧化硅聚乙烯醇溶胶凝胶(Sol-Gel Silica-Polyvinyl Alcohol)上：(2) Fix Oal on silica polyvinyl alcohol sol gel (Sol-Gel Silica-Polyvinyl Alcohol):

方法与(1)中类似，不同之处是在与内酯酶贮存液结合之前，DGS溶液要与恰当数量的聚乙烯醇(PVA，来自Aldrich，13％w/w的水中的85K)混合。后续方法如(1)所示。The method was similar to (1), except that the DGS solution was mixed with the appropriate amount of polyvinyl alcohol (PVA, 85K in water at 13% w/w from Aldrich) before combining with the lactonase stock solution. The follow-up method is shown in (1).

(3)将Oal固定到用戊二醛活化并支持的聚乙烯亚胺上：(3) Oal is fixed on the polyethylenimine activated and supported with glutaraldehyde:

将聚乙烯亚胺(1g，无水，高分子量，分支聚合物，来自Aldrich)、CA(0.2g，36％Ac)和乙酸丁酸纤维素(0.2g，48％的乙酸和丁酸含量，来自Aldrich)溶于5mL的二氯甲烷中，用溶液对7g漂白土(Fullers Earth，30-60目，来自Aldrich)进行包衣。经过湿包衣的材料在空气中于室温下干燥0.5小时，直到给出大约9g的被包衣的漂白土。然后将其重新悬浮于戊二醛(8mL，50％w/w，来自Aldrich)和磷酸缓冲液(50mL，50mM，pH 8)的冰冷的混合物中，以200rpm搅拌2小时。用水洗(4×10mL)经过活化的支持物，然后用磷酸缓冲液洗(4×10mL，20mM，pH 8)，然后排出液体。将湿的支持物与冰冷的内酯酶贮存液(50mM磷酸中，pH8)混合，以200rpm对悬浮液进行20-30小时的搅拌。将液体从被固定的酶上轻柔移走，用磷酸盐洗(3×5mL)湿的固定物，用乙醇胺(20mL，20mM，pH 8)进行处理，再用磷酸盐洗(2×5mL，20mM，pH 7)，然后排出液体。Polyethyleneimine (1 g, anhydrous, high molecular weight, branched polymer from Aldrich), CA (0.2 g, 36% Ac) and cellulose acetate butyrate (0.2 g, 48% acetic acid and butyric acid content, ex Aldrich) was dissolved in 5 mL of dichloromethane and 7 g of Fullers Earth (Fullers Earth, 30-60 mesh, ex Aldrich) was coated with the solution. The wet-coated material was dried in air at room temperature for 0.5 hours until approximately 9 g of coated Fuller's earth was given. It was then resuspended in an ice-cold mixture of glutaraldehyde (8 mL, 50% w/w from Aldrich) and phosphate buffer (50 mL, 50 mM, pH 8) and stirred at 200 rpm for 2 hours. The activated support was washed with water (4×10 mL), then with phosphate buffer (4×10 mL, 20 mM, pH 8), and the liquid was drained. The wet support was mixed with ice-cold lactonase stock solution (50 mM phosphoric acid, pH 8) and the suspension was stirred at 200 rpm for 20-30 hours. The liquid was gently removed from the immobilized enzyme, the wet fixation was washed with phosphate (3 × 5 mL), treated with ethanolamine (20 mL, 20 mM, pH 8), and washed with phosphate (2 × 5 mL, 20 mM , pH 7), and then drain the liquid.

(4)用于对(RS)-泛酸内酯进行水解拆分的反应条件(4) Reaction conditions for hydrolytic resolution of (RS)-pantolactone

反应在2或4mL的小瓶中进行，其中使用50或100mg的生物催化剂以及1或2mL的0.5M外消旋内酯(在0.75M三乙胺-乙酸盐中，pH8.5，含有20mM的乙酸镁)。将小瓶在40℃、200rpm下培养必需的时间段，将溶液排出，并用手性HPLC进行分析，在开始下一次循环之前用新鲜的底物溶液洗(2×1或2mL)催化剂。用等体积的500mM MES缓冲液来终止用于分析的样品的反应，所述缓冲液pH为5.5，其中含有50mM EDTA，离心(10000g，10分钟)然后通过HPLC进行分析，其中使用0.46×15cm的CHIRADEX柱，用70∶30的水-甲醇进行洗脱，1mL/分钟，在20℃进行。在15分钟的时候对初始速度进行测量，在每个循环的末尾测定E值。Reactions were carried out in 2 or 4 mL vials using 50 or 100 mg of biocatalyst and 1 or 2 mL of 0.5 M racemic lactone (in 0.75 M triethylamine-acetate, pH 8.5, containing 20 mM magnesium acetate). The vials were incubated at 40°C, 200 rpm for the necessary period of time, the solution was drained and analyzed by chiral HPLC, the catalyst was washed (2 x 1 or 2 mL) with fresh substrate solution before starting the next cycle. The reaction of samples for analysis was stopped with an equal volume of 500 mM MES buffer, pH 5.5 containing 50 mM EDTA, centrifuged (10000 g, 10 min) and then analyzed by HPLC using a 0.46 x 15 cm CHIRADEX column, eluted with 70:30 water-methanol, 1 mL/min, at 20°C. Initial velocity measurements were taken at 15 minutes and E values were determined at the end of each cycle.

表2中概括了选用的被固定的酶制剂在对(RS)-泛酸内酯进行分批拆分过程中的作用。溶胶凝胶和共价固定方案都被证明为有效的，甚至在24个每批为1小时的循环后，催化剂保留了其初始活性的62-72％，并显示出：对映异构体的过量(excess)值和对(R)-泛酸内酯的对映体选择性分别为89-98％和71-88％。此外，当延长批次进行的时间进行试验时，催化剂运作方式也是非常相似的，即当每批反应进行一天而不是1小时的情况下，其展示出了良好的长时间操作稳定性。Table 2 summarizes the effect of selected immobilized enzyme preparations on the batch resolution of (RS)-pantolactone. Both sol-gel and covalent immobilization schemes proved to be effective, and even after 24 1-hour batch cycles, the catalyst retained 62–72% of its initial activity and showed: Excess values and enantioselectivities to (R)-pantolactone were 89-98% and 71-88%, respectively. Furthermore, the catalyst behaved very similarly when the batches were run for extended periods of time, showing good long-term operational stability when each batch was run for a day instead of 1 hour.

表2：通过被固定的内酯酶制剂对(RS)-泛酸内酯进行分批拆分Table 2: Batch resolution of (RS)-pantolactone by immobilized lactonase preparations

催化剂^*：Biocatalyst/Prod.Organism(Recomb.Activity)Catalyst ^* : Biocatalyst/Prod.Organism(Recomb.Activity)

实施例14被固定的由重组E.coli表达的Oal对(RS)-泛酸内酯进行连续The Oal expressed by recombinant E.coli that is fixed in embodiment 14 is carried out continuously to (RS)-pantolactone 拆分的应用split app

按照实施例14(2)中所述来制备的经溶胶凝胶-PVA固定的重组内酯酶生物催化剂(E.coli中表达的)，被用于在填充床(packed-bed)反应器中对外消旋内酯进行连续拆分：将被固定的生物催化剂(1.1g，0.21kUg^-1，总共0.231kU)干填入带有Teflon末端(endpiece)的0.46×15cm的Omnifit加套玻璃柱中。其与1×10cm的Omnifit加套玻璃柱相连，1×10cm的玻璃柱中填有1-2mm的纤维素珠，其是通过两个装有50mL玻璃/Teflon注射器的注射器泵填入的。这些柱都与循环水浴相连，拆分进行期间被保持在40℃。按照下述方法对反应器进行调节：室温下以5mL min^-1的速率加入Tris-乙酸盐缓冲液(100mM，pH 7，含有100mM乙酸镁)，进行30分钟；接着是在室温下以1∶1的体积比，2mL min^-1的总流速，加入外消旋内酯溶液(水中1.5M)和Tris-乙酸缓冲液(2.5M，pH8.25，含有50mM乙酸镁)的组合，进行30分钟。然后将流速降为1.2mL min^-1，将柱子加热到40℃，系统被允许平衡15分钟。因此对反应器进行了大约45分钟的操作，在冰浴上收集洗脱液，在此条件下，以46-48％的不变的转化率对总共52.7mL的进料物(相当于5.14g RSPL)进行了处理。用硫酸(10％v/v水溶液)将洗脱液的pH调为6.5，用二氯甲烷对溶液进行萃取(10×75mL)，有机层在无水硫酸镁上干燥，旋转式蒸发，产生由10％的RPL和90％的SPL(由手性HPLC分析得出)组成的(S)-泛酸内酯富含物(2.67g，理论上98％)。用硫酸(40％w/w)对水相进行酸化，至pH1.5，加热至70℃，1小时，再在冰上冷却，用氯化钠饱和，然后用二氯甲烷萃取(10×75mL)，有机层在无水硫酸镁上干燥，旋转式蒸发，产生(R)-泛酸内酯富含物(2.23g，理论产量92％)，其对映异构体纯度为93％。对(R)-/(S)-的组合回收率为95％。A sol-gel-PVA immobilized recombinant lactonase biocatalyst (expressed in E. coli) prepared as described in Example 14(2) was used in a packed-bed reactor Sequential resolution of racemic lactones: The immobilized biocatalyst (1.1 g, 0.21 kUg ^-1 , 0.231 kU in total) was dry packed into a 0.46×15 cm Omnifit jacketed glass column with Teflon endpieces . It was attached to a 1 x 10 cm Omnifit jacketed glass column filled with 1-2 mm cellulose beads via two syringe pumps fitted with 50 mL glass/Teflon syringes. These columns were connected to a circulating water bath, which was maintained at 40°C during the resolution. The reactor was conditioned as follows: Tris-acetate buffer (100 mM, pH 7, containing 100 mM magnesium acetate) was added at a rate of ⁵ mL min at room temperature for 30 minutes; followed by 1 :1 volume ratio, total flow rate of 2 mL min ^-1 , adding a combination of racemic lactone solution (1.5M in water) and Tris-acetic acid buffer (2.5M, pH 8.25, containing 50mM magnesium acetate) for 30 minute. The flow rate was then reduced to 1.2 mL min ^-1 , the column was heated to 40°C, and the system was allowed to equilibrate for 15 minutes. The reactor was thus operated for approximately 45 minutes, collecting the eluate on an ice bath, under which conditions a total of 52.7 mL of feed (corresponding to 5.14 g RSPL) were processed. The pH of the eluent was adjusted to 6.5 with sulfuric acid (10% v/v aqueous solution), the solution was extracted with dichloromethane (10×75 mL), the organic layer was dried over anhydrous magnesium sulfate, and the rotary evaporation yielded (S)-Pantolactone enrichment (2.67 g, 98% of theory) consisting of 10% RPL and 90% SPL (by chiral HPLC analysis). The aqueous phase was acidified with sulfuric acid (40% w/w) to pH 1.5, heated to 70 °C for 1 h, cooled on ice, saturated with NaCl, and extracted with dichloromethane (10 x 75 mL ), the organic layer was dried over anhydrous magnesium sulfate and rotary evaporated to yield (R)-pantolactone rich (2.23 g, 92% of theoretical yield) with an enantiomeric purity of 93%. The combined recovery for (R)-/(S)- was 95%.

实施例(13)和(14)的结果清楚地表明，重组内酯酶对于水解拆分外消旋泛酸内酯来说是有效的生物催化剂，该催化剂还可被有效固定，固定物可用于生产对映异构过量达90-98％的高度富集的(R)-泛酸内酯，不管是分批操作还是连续操作的情况下都如此。The results of Examples (13) and (14) clearly show that the recombinant lactonase is an effective biocatalyst for the hydrolytic resolution of racemic pantothenate lactone, the catalyst can also be effectively fixed, and the fixed material can be used for the production of Highly enriched (R)-pantolactone with an enantiomeric excess of 90-98%, both in the case of batch and continuous operations.

实施例15克隆和表达来自B.subtilis的(S)-泛酸内酯特异性内酯酶Example 15 Cloning and expression of (S)-pantolactone-specific lactonase from B. subtilis

用来自牛和马肝脏的内酯酶的氨基酸序列，我们发现了若干种其它显示与上述序列具有有限同源性的序列。通过PCR将它们中间来自B.subtilis的yvre基因(YVRE BACSU，被注释为SMP30/CGR1家族的成员/假设的33.2kDa蛋白)从基因组DNA中克隆出来，其中使用的引物的5’和3’-末端还含有后续向表达载体中克隆的步骤所需要的限制性位点(EcoRI和PstI)。PCR条件与从A.niger的基因组DNA分离oal基因(见实施例8)所用的条件相同。用EcoRI和PstI来消化PCR产物和载体(来自Stratagene(La Jolla，CA，USA)的PET41a)，通过琼脂糖凝胶电泳来纯化，连接，并将其转化进E.coli Top10细胞(Stratagene，La Jolla，CA，USA)。使用该种策略，该基因被融合进来自E.coli的GST蛋白基因的阅读框中。Using the amino acid sequences of lactonases from bovine and equine liver, we found several other sequences showing limited homology to the above sequences. Among them, the yvre gene from B. subtilis (YVRE BACSU, annotated as a member of the SMP30/CGR1 family/hypothetical 33.2kDa protein) was cloned from genomic DNA by PCR using the 5' and 3'- The ends also contained restriction sites (EcoRI and PstI) required for subsequent cloning steps into expression vectors. The PCR conditions were the same as those used to isolate the oal gene from A. niger genomic DNA (see Example 8). The PCR product and vector (PET41a from Stratagene (La Jolla, CA, USA)) were digested with EcoRI and PstI, purified by agarose gel electrophoresis, ligated, and transformed into E.coli Top10 cells (Stratagene, La Jolla, CA). Jolla, CA, USA). Using this strategy, the gene was fused into the reading frame of the GST protein gene from E. coli.

根据厂商提供的方案来进行表达和纯化。用其中额外含有5mM CaCl₂的标准试验在40℃来测定被纯化的蛋白质的活性。该酶显示了对(S)-泛酸内酯的非常高的选择性。该制剂具有相对于每mg富集融合蛋白1-2U的比活性。该酶仅在Mg²⁺或Ca²⁺离子存在的情况下具有活性。Expression and purification were performed according to the protocol provided by the manufacturer. The activity of the purified protein was determined using a standard assay at 40°C with additional 5 mM _CaCl2 . This enzyme shows very high selectivity towards (S)-pantolactone. This preparation has a specific activity of 1-2 U per mg of enriched fusion protein. The enzyme is only active in the presence of Mg ²⁺ or Ca ²⁺ ions.

序列表sequence listing

<110>帝斯曼知识产权资产管理有限公司<110> DSM Intellectual Property Asset Management Co., Ltd.

<120>新颖的酶<120> Novel enzymes

<130>21717<130>21717

<160>99<160>99

<170>PatentIn version 3.1<170>PatentIn version 3.1

<210>1<210>1

<211>900<211>900

<212>DNA<212>DNA

<213>Equus caballus<213> Equus caballus

<220><220>

<221>CDS<221> CDS

<222>(1)..(900)<222>(1)..(900)

<223><223>

<400>1<400>1

atg tct tcc att aag att gag tgt gtt ctg ccc gag aac tgc cag tgc 48atg tct tcc att aag att gag tgt gtt ctg ccc gag aac tgc cag tgc 48

Met Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln CysMet Ser Ser Ile Lys Ile Glu Cys Val Leu Pro Glu Asn Cys Gln Cys

1 5 10 151 5 10 15

ggt gag tcg cca gtg tgg gag gaa gcg tcc agc tct ctg ctc ttc gta 96ggt gag tcg cca gtg tgg gag gaa gcg tcc agc tct ctg ctc ttc gta 96

Gly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe ValGly Glu Ser Pro Val Trp Glu Glu Ala Ser Ser Ser Leu Leu Phe Val

20 25 3020 25 30

gat att cct gcc aaa aag gtt tgc cgg tgg gat gcg ctc agc aag caa 144gat att cct gcc aaa aag gtt tgc cgg tgg gat gcg ctc agc aag caa 144

Asp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys GlnAsp Ile Pro Ala Lys Lys Val Cys Arg Trp Asp Ala Leu Ser Lys Gln

35 40 4535 40 45

gtg cag cga atg acc gtg gat gcc ccg gtt ggc tca gtg gcc ctc cgc 192gtg cag cga atg acc gtg gat gcc ccg gtt ggc tca gtg gcc ctc cgc 192

Val Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu ArgVal Gln Arg Met Thr Val Asp Ala Pro Val Gly Ser Val Ala Leu Arg

50 55 6050 55 60

cag tcg gga ggc tat gtc gcc aca att gga acg aag ttc tgt gct ttg 240cag tcg gga ggc tat gtc gcc aca att gga acg aag ttc tgt gct ttg 240

Gln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala LeuGln Ser Gly Gly Tyr Val Ala Thr Ile Gly Thr Lys Phe Cys Ala Leu

65 70 75 8065 70 75 80

aac tgg gaa gat cag tca gta gtt gtc ctg gcc gcg gta gag aaa gac 288aac tgg gaa gat cag tca gta gtt gtc ctg gcc gcg gta gag aaa gac 288

Asn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys AspAsn Trp Glu Asp Gln Ser Val Val Val Leu Ala Ala Val Glu Lys Asp

85 90 9585 90 95

aag aaa aac aat cga ttc aat gac ggg aag gtg gat ccc gcg ggg aga 336aag aaa aac aat cga ttc aat gac ggg aag gtg gat ccc gcg ggg aga 336

Lys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly ArgLys Lys Asn Asn Arg Phe Asn Asp Gly Lys Val Asp Pro Ala Gly Arg

100 105 110100 105 110

tac ttt gct ggc acc atg gct gag gaa aca gcc ccg gca gtt acg gag 384tac ttt gct ggc acc atg gct gag gaa aca gcc ccg gca gtt acg gag 384

Tyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr GluTyr Phe Ala Gly Thr Met Ala Glu Glu Thr Ala Pro Ala Val Thr Glu

115 120 125115 120 125

cgg cac caa ggg tcc ctg tac gcc ctc ttt cct gac cac cac gtg gaa 432cgg cac caa ggg tcc ctg tac gcc ctc ttt cct gac cac cac gtg gaa 432

Arg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val GluArg His Gln Gly Ser Leu Tyr Ala Leu Phe Pro Asp His His Val Glu

130 135 140130 135 140

aag tac ttt gac cag gtg gac atc tcc aac ggt ttg gat tgg tcc ctg 480aag tac ttt gac cag gtg gac atc tcc aac ggt ttg gat tgg tcc ctg 480

Lys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser LeuLys Tyr Phe Asp Gln Val Asp Ile Ser Asn Gly Leu Asp Trp Ser Leu

145 150 155 160145 150 155 160

gac cac aaa atc ttc tat tac atc gac agc ctg tcc tac tct gtg gat 528gac cac aaa atc ttc tat tac atc gac agc ctg tcc tac tct gtg gat 528

Asp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val AspAsp His Lys Ile Phe Tyr Tyr Ile Asp Ser Leu Ser Tyr Ser Val Asp

165 170 175165 170 175

gcc ttt gac tatgac ctg ctg aca gga agg atc tct aac cgc aga agt 576gcc ttt gac tatgac ctg ctg aca gga agg atc tct aac cgc aga agt 576

Ala Phe Asp Tyr Asp Leu Leu Thr Gly ArgIle Ser Asn Arg Arg SerAla Phe Asp Tyr Asp Leu Leu Thr Gly ArgIle Ser Asn Arg Arg Ser

180 185 190180 185 190

gtt tac aag ctg gag aag gaa gaa cat atc cca gat gga atg tgt att 624gtt tac aag ctg gag aag gaa gaa cat atc cca gat gga atg tgt att 624

Val Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys IleVal Tyr Lys Leu Glu Lys Glu Glu His Ile Pro Asp Gly Met Cys Ile

195 200 205195 200 205

gat act gag ggg aag ctc tgg gtg gcc tgt tac aac gga gga aga gtg 672gat act gag ggg aag ctc tgg gtg gcc tgt tac aac gga gga aga gtg 672

Asp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg ValAsp Thr Glu Gly Lys Leu Trp Val Ala Cys Tyr Asn Gly Gly Arg Val

210 215 220210 215 220

att cgt tta gat cct gag aca ggg aaa aga ctc caa act gtg aag ttg 720att cgt tta gat cct gag aca ggg aaa aga ctc caa act gtg aag ttg 720

Ile Arg Leu Asp pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys LeuIle Arg Leu Asp pro Glu Thr Gly Lys Arg Leu Gln Thr Val Lys Leu

225 230 235 240225 230 235 240

cct gtt gat aaa acc act tcg tgc tgc ttc gga ggg aag gat tac tct 768cct gtt gat aaa acc act tcg tgc tgc ttc gga ggg aag gat tac tct 768

Pro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr SerPro Val Asp Lys Thr Thr Ser Cys Cys Phe Gly Gly Lys Asp Tyr Ser

245 250 255245 250 255

gaa atg tac gtg acc tgc gcc cgg gct gga ctc gac ccc gag gct ctc 816gaa atg tac gtg acc tgc gcc cgg gct gga ctc gac ccc gag gct ctc 816

Glu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala LeuGlu Met Tyr Val Thr Cys Ala Arg Ala Gly Leu Asp Pro Glu Ala Leu

260 265 270260 265 270

tcg cgg cag cct gaa gct ggt ggc att ttc aag ata act ggc ctg ggg 864tcg cgg cag cct gaa gct ggt ggc att ttc aag ata act ggc ctg ggg 864

Ser Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu GlySer Arg Gln Pro Glu Ala Gly Gly Ile Phe Lys Ile Thr Gly Leu Gly

275 280 285275 280 285

gtc aaa gga att cct ccc tat gcc tac gca gga tga 900gtc aaa gga att cct ccc tat gcc tac gca gga tga 900

Val Lys Gly Ile Pro Pro Tyr Ala Tyr Ala GlyVal Lys Gly Ile Pro Pro Tyr Ala Tyr Ala Gly

290 295290 295

<210>2<210>2

<211>299<211>299

<212>PRT<212>PRT

<213>Equus caballus<213> Equus caballus

<400>2<400>2

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

Ala Phe Asp Tyr Asp Leu Leu Thr Gly Arg Ile Ser Asn Arg Arg SerAla Phe Asp Tyr Asp Leu Leu Thr Gly Arg Ile Ser Asn Arg Arg Ser

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295290 295

<210>3<210>3

<211>879<211>879

<212>DNA<212>DNA

<213>Bacillus subtilis<213>Bacillus subtilis

<220><220>

<221>CDS<221> CDS

<222>(1)..(879)<222>(1)..(879)

<223><223>

<400>3<400>3

atg gat gca gta ttg gaa gca gac act cgg gca gtg att ggt gaa ggc 48atg gat gca gta ttg gaa gca gac act cgg gca gtg att ggt gaa ggc 48

Met Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu GlyMet Asp Ala Val Leu Glu Ala Asp Thr Arg Ala Val Ile Gly Glu Gly

1 5 10 151 5 10 15

ccg tta tgg gat gaa gag aac ggc cgc tta tat tgg gtt gat atc ctg 96ccg tta tgg gat gaa gag aac ggc cgc tta tat tgg gtt gat atc ctg 96

Pro Leu Trp Asp Glu Glu Asn Gly Arg Leu Tyr Trp Val Asp Ile LeuPro Leu Trp Asp Glu Glu Asn Gly Arg Leu Tyr Trp Val Asp Ile Leu

20 25 3020 25 30

ggg agc gag ctc cac atc ttt gac cct gaa gaa aaa atc aac cga tca 144ggg agc gag ctc cac atc ttt gac cct gaa gaa aaa atc aac cga tca 144

Gly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg SerGly Ser Glu Leu His Ile Phe Asp Pro Glu Glu Lys Ile Asn Arg Ser

35 40 4535 40 45

atc aaa ttc aaa tcc ttt gtg acg gcg ctt gcg aaa tat tca aag gat 192atc aaa ttc aaa tcc ttt gtg acg gcg ctt gcg aaa tat tca aag gat 192

Ile Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys AspIle Lys Phe Lys Ser Phe Val Thr Ala Leu Ala Lys Tyr Ser Lys Asp

50 55 6050 55 60

gaa ctg att atg acg atg aag gac ggg ttt tac ctg tat cat ctt cgg 240gaa ctg att atg acg atg aag gac ggg ttt tac ctg tat cat ctt cgg 240

Glu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu ArgGlu Leu Ile Met Thr Met Lys Asp Gly Phe Tyr Leu Tyr His Leu Arg

65 70 75 8065 70 75 80

gat gac agc ttg gaa aaa att aaa cag ccg aag gac atg cat gag agc 288gat gac agc ttg gaa aaa att aaa cag ccg aag gac atg cat gag agc 288

Asp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu SerAsp Asp Ser Leu Glu Lys Ile Lys Gln Pro Lys Asp Met His Glu Ser

85 90 9585 90 95

ctg aga ttt aat gat gct aaa tgt gac ccg tac gga agg ctt tgg gcg 336ctg aga ttt aat gat gct aaa tgt gac ccg tac gga agg ctt tgg gcg 336

Leu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp AlaLeu Arg Phe Asn Asp Ala Lys Cys Asp Pro Tyr Gly Arg Leu Trp Ala

100 105 110100 105 110

ggg acg acg agc atg gaa ggc gag caa aaa cag gcg tcg ctg tac cgt 384ggg acg acg agc atg gaa ggc gag caa aaa cag gcg tcg ctg tac cgt 384

Gly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr ArgGly Thr Thr Ser Met Glu Gly Glu Gln Lys Gln Ala Ser Leu Tyr Arg

115 120 125115 120 125

ttg aat cta gac ggc agt ctt gtc aaa atc aag gat caa gtc tcc acc 432ttg aat cta gac ggc agt ctt gtc aaa atc aag gat caa gtc tcc acc 432

Leu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser ThrLeu Asn Leu Asp Gly Ser Leu Val Lys Ile Lys Asp Gln Val Ser Thr

130 135 140130 135 140

tca aac ggt ttg gat tgg gac cgc gag cgg aat ttg atg tat tac atc 480tca aac ggt ttg gat tgg gac cgc gag cgg aat ttg atg tat tac atc 480

Ser Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr IleSer Asn Gly Leu Asp Trp Asp Arg Glu Arg Asn Leu Met Tyr Tyr Ile

145 150 155 160145 150 155 160

gac acg ccg acc cag gag att gta cgt tac agc tat gat cct caa agc 528gac acg ccg acc cag gag att gta cgt tac agc tat gat cct caa agc 528

Asp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln SerAsp Thr Pro Thr Gln Glu Ile Val Arg Tyr Ser Tyr Asp Pro Gln Ser

165 170 175165 170 175

gga gat gtt tcg aat cca gaa cct gtc tat cgt ttt gat cag tca gac 576gga gat gtt tcg aat cca gaa cct gtc tat cgt ttt gat cag tca gac 576

Gly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser AspGly Asp Val Ser Asn Pro Glu Pro Val Tyr Arg Phe Asp Gln Ser Asp

180 185 190180 185 190

gga ttg ccg gac ggc atg aca att gac caa aat ggc atg ctg tgg gtg 624gga ttg ccg gac ggc atg aca att gac caa aat ggc atg ctg tgg gtg 624

Gly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp ValGly Leu Pro Asp Gly Met Thr Ile Asp Gln Asn Gly Met Leu Trp Val

195 200 205195 200 205

gcg ctg ttt ggc ggc agc cgc gtt gtt cac att gac ccg ttt cag aaa 672gcg ctg ttt ggc ggc agc cgc gtt gtt cac att gac ccg ttt cag aaa 672

Ala Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln LysAla Leu Phe Gly Gly Ser Arg Val Val His Ile Asp Pro Phe Gln Lys

210 215 220210 215 220

aaa gaa atc aat tca atc agc gtg ccg gct aaa tat gtc acg tgc tgc 720aaa gaa atc aat tca atc agc gtg ccg gct aaa tat gtc acg tgc tgc 720

Lys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys CysLys Glu Ile Asn Ser Ile Ser Val Pro Ala Lys Tyr Val Thr Cys Cys

225 230 235 240225 230 235 240

gcg ttc ggc ggc aga gac tta aaa acc ctt tac att aca acg gca aca 768gcg ttc ggc ggc aga gac tta aaa acc ctt tac att aca acg gca aca 768

Ala Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala ThrAla Phe Gly Gly Arg Asp Leu Lys Thr Leu Tyr Ile Thr Thr Ala Thr

245 250 255245 250 255

gaa caa atg aca gaa aaa gag aga tac gag cag cct cac gct gga ggg 816gaa caa atg aca gaa aaa gag aga tac gag cag cct cac gct gga ggg 816

Glu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly GlyGlu Gln Met Thr Glu Lys Glu Arg Tyr Glu Gln Pro His Ala Gly Gly

260 265 270260 265 270

ttg ttt tca gca caa ctg gaa aca ggc gga tat cag ccg gta cca ttt 864ttg ttt tca gca caa ctg gaa aca ggc gga tat cag ccg gta cca ttt 864

Leu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro PheLeu Phe Ser Ala Gln Leu Glu Thr Gly Gly Tyr Gln Pro Val Pro Phe

275 280 285275 280 285

gct gga gac gta taa 879gct gga gac gta taa 879

Ala Gly Asp ValAla Gly Asp Val

290290

<210>4<210>4

<211>292<211>292

<212>PRT<212>PRT

<213>Bacillus subtilis<213>Bacillus subtilis

<400>4<400>4

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

Ala Gly Asp ValAla Gly Asp Val

290290

<210>5<210>5

<211>726<211>726

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(1)..(726)<222>(1)..(726)

<223><223>

<400>5<400>5

aac tgg gac ctg gaa aaa ctc aat cct cca ggc ttc ggc cgc aaa ccc 48aac tgg gac ctg gaa aaa ctc aat cct cca ggc ttc ggc cgc aaa ccc 48

Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro

1 5 10 151 5 10 15

ttc gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac 96ttc gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac 96

Phe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp AspPhe Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp

20 25 3020 25 30

gag tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga 144gag tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga 144

Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu ArgGlu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg

35 40 4535 40 45

cac cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc 192cac cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc 192

His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg ValHis His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val

50 55 6050 55 60

ttc tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct 240ttc tac ggc ggc acc acc tcc acc acc gag atc ctc gac ccg tcc tca gct 240

Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser AlaPhe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala

65 70 75 8065 70 75 80

cgc atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc 288cgc atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc 288

Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg GlyArg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly

85 90 9585 90 95

gtc ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca 336gtc ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca 336

Val Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile ProVal Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro

100 105 110100 105 110

gta aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc 384gta aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc 384

Val Ash Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu ThrVal Ash Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr

115 120 125115 120 125

atc gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc 432atc gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc 432

Ile Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu PheIle Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe

130 135 140130 135 140

ttg cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag 480ttg cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag 480

Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp GluLeu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu

145 150 155 160145 150 155 160

aag gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag 528aag gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag 528

Lys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu GluLys Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu

165 170 175165 170 175

cag agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct 576cag agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct 576

Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala AlaGln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala

180 185 190180 185 190

gtg gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag 624gtg gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag 624

Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu LysVal Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys

195 200 205195 200 205

gag tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att 672gag tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att 672

Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro IleGlu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile

210 215 220210 215 220

ggg gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg 720ggg gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg 720

Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg LeuGly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu

225 230 235 240225 230 235 240

ggc agg 726ggc agg 726

Gly ArgGly Arg

<210>6<210>6

<211>242<211>242

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>6<400>6

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu ThrVal Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

Gly ArgGly Arg

<210>7<210>7

<211>2443<211>2443

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(774)..(977)<222>(774)..(977)

<223><223>

<220><220>

<221>CDS<221> CDS

<222>(1050)..(1820)<222>(1050)..(1820)

<223><223>

<400>7<400>7

ccgctatcac tcagcggcag tcgggcgcag acaatgctac cgaggattca gggcacaagg 60ccgctatcac tcagcggcag tcgggcgcag acaatgctac cgaggattca gggcacaagg 60

gggatcagtt acgtggggcg tgacgatgta tgggtttcat gaggatgcgt ttacgagtgc 120gggatcagtt acgtggggcg tgacgatgta tgggtttcat gaggatgcgt ttacgagtgc 120

gtttggggtg gtggaagagt tgggggcgaa gatcccgttt ccggtggctg atagtcggtt 180gtttggggtg gtggaagagt tgggggcgaa gatcccgttt ccggtggctg atagtcggtt 180

gagagggggt gcagtgaagg aggtgagatg gttggaatat gtgttcagat tggtgttgca 240gagagggggt gcagtgaagg aggtgagatg gttggaatat gtgttcagat tggtgttgca 240

gggggtgcag ggcttgattt tgtggctggt gggagaggga aaagggaagc agaaggcgag 300gggggtgcag ggcttgattt tgtggctggt gggagaggga aaagggaagc agaaggcgag 300

ctagaacctt ttacacactg cctctcgaat gaacattcag aaattttgga tagacaaggt 360ctagaacctt ttacacactg cctctcgaat gaacattcag aaattttgga tagacaaggt 360

ttctgtcggt gctattctgg gcgttgcaat acctgtcaca ttgaataacc tccgaggtga 420ttctgtcggt gctattctgg gcgttgcaat acctgtcaca ttgaataacc tccgaggtga 420

ttatccacgc aatcccatat catctgatgt accccttgca taatctgtct ttaccaaatc 480ttatccacgc aatcccatat catctgatgt accccttgca taatctgtct ttaccaaatc 480

aggtagaaag taagaccatc cggataggct gtatactacc tcgtgtccac atcggggata 540aggtagaaag taagaccatc cggataggct gtatactacc tcgtgtccac atcggggata 540

ctgaacacga ttgtatatca attcgatgat ggagatacaa ccctaatgaa ccaagcacag 600ctgaacacga ttgtatatca attcgatgat ggagatacaa ccctaatgaa ccaagcacag 600

ccgtggatct ccccggactt tctccgatgg cttccccacc cgccaatcca gcatcaacta 660ccgtggatct ccccggactt tctccgatgg cttccccacc cgccaatcca gcatcaacta 660

aacaacccct cacgctcttc atcatctctc atttcactcc cacctatcat ctacagcttc 720aacaacccct cacgctcttc atcatctctc atttcactcc cacctatcat ctacagcttc 720

acaaacaaca ctactccctc ccaagcacac acctctactg acaagatacc acg atg 776acaaacaaca ctactccctc ccaagcacac acctctactg acaagatacc acg atg 776

MetMet

1 1

gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc 824gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc 824

Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg ArgAla Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg

5 10 155 10 15

tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc 872tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc 872

Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met LeuTyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu

20 25 3020 25 30

cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag atc 920cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag atc 920

Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln IleArg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile

35 40 4535 40 45

cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac 968cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac 968

Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu AsnGln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn

50 55 60 6550 55 60 65

cca cca ggt accctccctt ctacctcagt gacataggtt cacccccaac 1017cca cca ggt accctccctt ctacctcagt gacataggtt cacccccaac 1017

Pro Pro GlyPro Pro Gly

ctcacccatc aacctaactt aaaaacatag gc ttc ggc cgc aaa ccc ttc gcc 1070ctcacccatc aacctaactt aaaaacatag gc ttc ggc cgc aaa ccc ttc gcc 1070

Phe Gly Arg Lys Pro Phe AlaPhe Gly Arg Lys Pro Phe Ala

70 7570 75

cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac 1118cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac 1118

His His Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu TyrHis His Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr

80 85 9080 85 90

cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac 1166cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac 1166

His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His HisHis Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His

95 100 10595 100 105

aac gcc ccg gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac 1214aac gcc ccg gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac 1214

Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe TyrAsn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr

110 115 120110 115 120

ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc 1262ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc 1262

Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg IleGly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile

125 130 135125 130 135

ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc 1310ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc 1310

Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val LeuGly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu

140 145 150 155140 145 150 155

atc gac tac gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac 1358atc gac tac gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac 1358

Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val AsnIle Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn

160 165 170160 165 170

gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg 1406gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg 1406

Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile AlaAla Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala

175 180 185175 180 185

aag gaa tgc aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc 1454aag gaa tgc aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc 1454

Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu ArgLys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg

190 195 200190 195 200

gtc ggt cta ccg act aca tgg gat aac atg acc gat gag gag agg gtc 1502gtc ggt cta ccg act aca tgg gat aac atg acc gat gag gag agg gtc 1502

Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg ValVal Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val

205 210 215205 210 215

aag tat agt cag cag gaa acg ccc aac cat gca gga tta gag cag agt 1550aag tat agt cag cag gaa acg ccc aac cat gca gga tta gag cag agt 1550

Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln SerLys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser

220 225 230 235220 225 230 235

gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg 1598gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg 1598

Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val AlaGlu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala

240 245 250240 245 250

ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg 1646ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg 1646

Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu TrpGly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp

255 260 265255 260 265

gat tta cat cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag 1694gat tta cat cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag 1694

Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly GluAsp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu

270 275 280270 275 280

atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg 1742atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg 1742

Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly ArgMet Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg

285 290 295285 290 295

tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg 1790tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg 1790

Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly ValTrp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val

300 305 310 315300 305 310 315

tcg agt ccg ccg aat tgt atg gca att ttt taaaatgcag aggtggtgtg 1840tcg agt ccg ccg aat tgt atg gca att ttt taaaatgcag aggtggtgtg 1840

Ser Ser Pro Pro Asn Cys Met Ala Ile PheSer Ser Pro Pro Asn Cys Met Ala Ile Phe

320 325320 325

ggaacggagt tggatggacg atggagtgga tagaacagcg ggagttgatg tattgagctg 1900ggaacggagt tggatggacg atggagtgga tagaacagcg ggagttgatg tattgagctg 1900

gggccctagt catgatttgt ctgaatctgg agactataca tagcaccttc agggcatcaa 1960gggccctagt catgatttgt ctgaatctgg agactataca tagcaccttc agggcatcaa 1960

gaggcataat atgcagttat acttgtgata ttacattctt tggccacagt tatgactgtc 2020gaggcataat atgcagttat acttgtgata ttacattctt tggccacagt tatgactgtc 2020

tcttatattg actttgaagt gtcttctttg agccgagtat acttggatta taggattcca 2080tcttatattg actttgaagt gtcttctttg agccgagtat acttggatta taggattcca 2080

atgacggtac atcccgaagg tctgtcccag cacttcccta atcgcagtgg taatacctac 2140atgacggtac atcccgaagg tctgtcccag cacttcccta atcgcagtgg taatacctac 2140

tactagccag tatatctaaa gtacaccact cggaactcat tcacatatac gaatggtgac 2200tactagccag tatatctaaa gtacaccact cggaactcat tcacatatac gaatggtgac 2200

tatgaagggg taatacattc acaacaagat cagtaaatag cagttgcgag gaatgcaaga 2260tatgaagggg taatacattc acaacaagat cagtaaatag cagttgcgag gaatgcaaga 2260

ctagcccttc gtacattcaa tatcagaagt tgggactata aaggccactg cctccctcac 2320ctagcccttc gtacattcaa tatcagaagt tgggactata aaggccactg cctccctcac 2320

tagcccgtaa agcaatacct cgctgctacc attgccaact gcagggtgtt tccagtaaac 2380tagcccgtaa agcaatacct cgctgctacc attgccaact gcagggtgtt tccagtaaac 2380

ggagtggctg ttcccaccgt gcagcactct gtacccactc cccttcctgc tgtcattctg 2440ggagtggctg ttcccaccgt gcagcactct gtacccactc cccttcctgc tgtcattctg 2440

att 2443att 2443

<210>8<210>8

<211>325<211>325

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>8<400>8

Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys ArgMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg

1 5 10 151 5 10 15

Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met

20 25 3020 25 30

Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln

35 40 4535 40 45

Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys LeuIle Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu

50 55 6050 55 60

Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val Lys Trp

65 70 75 8065 70 75 80

Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln

85 90 9585 90 95

Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala ProGln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro

100 105 110100 105 110

Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser

115 120 125115 120 125

Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp

130 135 140130 135 140

Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala SerAla Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser

145 150 155 160145 150 155 160

Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg HisTyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr Arg His

165 170 175165 170 175

Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn IleThr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile

180 185 190180 185 190

Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr

195 200 205195 200 205

Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln GlnThr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln

210 215 220210 215 220

Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGlu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg

225 230 235 240225 230 235 240

Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val SerPhe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser

245 250 255245 250 255

Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His PhePhe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe

260 265 270260 265 270

Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu

275 280 285275 280 285

Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe ValGly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val

290 295 300290 295 300

Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro AsnSer Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn

305 310 315 320305 310 315 320

Cys Met Ala Ile PheCys Met Ala Ile Phe

325325

<210>9<210>9

<211>1025<211>1025

<212>DNA<212>DNA

<213>Aspergillus awamorii<213>Aspergillus awamorii

<220><220>

<221>CDS<221> CDS

<222>(1)..(204)<222>(1)..(204)

<223><223>

<220><220>

<221>CDS<221> CDS

<222>(255)..(1025)<222>(255)..(1025)

<223><223>

<400>9<400>9

atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac aag cgg 48

1 5 10 151 5 10 15

cgc tac tgg ccc gca gcc ccc aac tcc gcc gac gag ggg ctg ggg atg 96cgc tac tgg ccc gca gcc ccc aac tcc gcc gac gag ggg ctg ggg atg 96

Arg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Ala Pro Asn Ser Ala Asp Glu Gly Leu Gly Met

20 25 3020 25 30

ctc cgt ctc ctc acc cca gaa gtg gtc gcc aac gca gca cgc acc cag 144ctc cgt ctc ctc acc cca gaa gtg gtc gcc aac gca gca cgc acc cag 144

Leu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr GlnLeu Arg Leu Leu Thr Pro Glu Val Val Ala Asn Ala Ala Arg Thr Gln

35 40 4535 40 45

atc cag acc ggc gag cgt gta tgc ctc aac tgg gac ctg gag aag cta 192atc cag acc ggc gag cgt gta tgc ctc aac tgg gac ctg gag aag cta 192

50 55 6050 55 60

aac cca cca ggt gcccctcctt ccacccccat cgtctctcac caactaacca 244aac cca cca ggt gcccctcctt ccacccccat cgtctctcac caactaacca 244

Asn Pro Pro GlyAsn Pro Pro Gly

6565

aaacacaggt ttc ggc cgc aaa ccc ttc gcc cac cag gtt aaa tgg gtc 293aaacacaggt ttc ggc cgc aaa ccc ttc gcc cac cag gtt aaa tgg gtc 293

Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp ValPhe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp Val

70 75 8070 75 80

aac gaa ggc cac gcc ttc gac gac gaa tac cac ttc aat ccc cag caa 341aac gaa ggc cac gcc ttc gac gac gaa tac cac ttc aat ccc cag caa 341

Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln GlnAsn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln

85 90 9585 90 95

tcc tcc caa tgg gac ggc ctc cga cac cac aac gcc ccg gcc cca aca 389tcc tcc caa tgg gac ggc ctc cga cac cac aac gcc ccg gcc cca aca 389

Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro ThrSer Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr

100 105 110100 105 110

ccc gaa gac cct gag cgc cgc gtc ttc tac ggc ggc acc acc tcc acc 437ccc gaa gac cct gag cgc cgc gtc ttc tac ggc ggc acc acc tcc acc 437

Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser ThrPro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr

115 120 125115 120 125

gag atc ctc gac ccc tcg tcc gca cgg atc ggc ata gcc cac tgg gcc 485gag atc ctc gac ccc tcg tcc gca cgg atc ggc ata gcc cac tgg gcc 485

Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp AlaGlu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp Ala

130 135 140 145130 135 140 145

aag aag ggc atc gcc ggc cgc ggc gtc ctc ata gac tac gcc tcc tac 533aag aag ggc atc gcc ggc cgc ggc gtc ctc ata gac tac gcc tcc tac 533

Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser TyrLys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr

150 155 160150 155 160

atc cag cga acc aag ggc atc aca gta aac gcc cta acc cgc cac acc 581atc cag cga acc aag ggc atc aca gta aac gcc cta acc cgc cac acc 581

Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His ThrIle Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His Thr

165 170 175165 170 175

gtc tcc ctc gac gac gtc ctc acc atc gcg aag gaa tgc aac att acc 629gtc tcc ctc gac gac gtc ctc acc atc gcg aag gaa tgc aac att acc 629

Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile ThrVal Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr

180 185 190180 185 190

ttc cag cca ggc gac att ctc ttc ctg cgc gtt ggt cta ccg aca acg 677ttc cag cca ggc gac att ctc ttc ctg cgc gtt ggt cta ccg aca acg 677

Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr ThrPhe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr

195 200 205195 200 205

tgg gat aac atg tcc gat gaa gag aag gtc gagtat agt caa cag caa 725tgg gat aac atg tcc gat gaa gag aag gtc gagtat agt caa cag caa 725

Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln GlnTrp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln Gln

210 215 220 225210 215 220 225

aca ccc cag cat gca ggt tta gag cag agc gag cgt gtg gtg cgg ttc 773aca ccc cag cat gca ggt tta gag cag agc gag cgt gtg gtg cgg ttc 773

Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg PheThr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe

230 235 240230 235 240

ctc tgg gac cag cat ttc gcg gct gtg gcg ggc gat gcg gtc agc ttt 821ctc tgg gac cag cat ttc gcg gct gtg gcg ggc gat gcg gtc agc ttt 821

Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser PheLeu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala Val Ser Phe

245 250 255245 250 255

gag gtg tat ccg ccg gta gag aag gag tgg gat tta cac cat ttc ttg 869gag gtg tat ccg ccg gta gag aag gag tgg gat tta cac cat ttc ttg 869

Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe LeuGlu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His His Phe Leu

260 265 270260 265 270

ctt gct ggg tgg gga gtg ccg att gga gag atg ttt gat ctg gag ggg 917ctt gct ggg tgg gga gtg ccg att gga gag atg ttt gat ctg gag ggg 917

Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu GlyLeu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu Gly

275 280 285275 280 285

ttg aag gag gtg tgt gag agg ttg ggc agg tgg acg ttt ttt gtt agt 965ttg aag gag gtg tgt gag agg ttg ggc agg tgg acg ttt ttt gtt agt 965

Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val SerLeu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser

290 295 300 305290 295 300 305

tcg agt ccg ttg aat tgt aag acg ggg gtg tcg agt ccg ccg aat tgt 1013tcg agt ccg ttg aat tgt aag acg ggg gtg tcg agt ccg ccg aat tgt 1013

Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro Pro Asn CysSer Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro Pro Asn Cys

310 315 320310 315 320

atg gca att ttt 1025atg gca att ttt 1025

Met Ala Ile PheMet Ala Ile Phe

325325

<210>10<210>10

<211>325<211>325

<212>PRT<212>PRT

<213>Aspergillus awamorii<213>Aspergillus awamorii

<400>10<400>10

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys TrpAsn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His Gln Val Lys Trp

65 70 75 8065 70 75 80

Val Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro GlnVal Asn Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln

85 90 9585 90 95

100 105 110100 105 110

Thr Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr SerThr Pro Glu Asp Pro Glu Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser

115 120 125115 120 125

Thr Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His TrpThr Glu Ile Leu Asp Pro Ser Ser Ala Arg Ile Gly Ile Ala His Trp

130 135 140130 135 140

145 150 155 160145 150 155 160

Tyr Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg HisTyr Ile Gln Arg Thr Lys Gly Ile Thr Val Asn Ala Leu Thr Arg His

165 170 175165 170 175

180 185 190180 185 190

Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr

195 200 205195 200 205

Thr Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln GlnThr Trp Asp Asn Met Ser Asp Glu Glu Lys Val Glu Tyr Ser Gln Gln

210 215 220210 215 220

Gln Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val ArgGln Thr Pro Gln His Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

Leu Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu GluLeu Leu Ala Gly Trp Gly Val Pro Ile Gly Glu Met Phe Asp Leu Glu

275 280 285275 280 285

290 295 300290 295 300

Ser Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro pro AsnSer Ser Ser Pro Leu Asn Cys Lys Thr Gly Val Ser Ser Pro pro Asn

305 310 315 320305 310 315 320

Cys Met Ala Ile PheCys Met Ala Ile Phe

325325

<210>11<210>11

<211>11<211>11

<212>PRT<212>PRT

<213>Eguus caballus<213> Eguus caballus

<400>11<400>11

His Gln Gly Ser Leu Val Ala Leu Phe Pro AspHis Gln Gly Ser Leu Val Ala Leu Phe Pro Asp

1 5 101 5 10

<210>12<210>12

<211>23<211>23

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(23)<222>(1)..(23)

<223>5’-引物<223>5'-primer

<400>12<400>12

atgtcttcca tcaagattga atg 23atgtcttcca tcaagattga atg 23

<210>13<210>13

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>3’-引物<223>3'-primer

<400>13<400>13

gtcaaaggaa ttccccccta t 21gtcaaaggaa ttccccccta t 21

<210>14<210>14

<211>30<211>30

<212>DNA<212>DNA

<213>Bos sp.<213>Bos sp.

<400>14<400>14

atgtcttcca ttaagattga gtgtgttttg 30atgtcttcca ttaagattga gtgtgttttg 30

<210>15<210>15

<21l>30<21l>30

<212>DNA<212>DNA

<213>Oryctolagus cuniculus<213>Oryctolagus cuniculus

<400>15<400>15

atgtcttcca ttaagatcga gtgtgttttg 30atgtcttcca ttaagatcga gtgtgttttg 30

<210>16<210>16

<211>30<211>30

<212>DNA<212>DNA

<213>Homo sapiens<213>Homo sapiens

<400>16<400>16

atgtcttcca ttaagattga gtgtgttttg 30atgtcttcca ttaagattga gtgtgttttg 30

<210>17<210>17

<211>30<211>30

<212>DNA<212>DNA

<213>Rattus norvegicus<213>Rattus norvegicus

<400>17<400>17

atgtcttcca tcaagattga atgtgtttta 30atgtcttcca tcaagattga atgtgtttta 30

<210>18<210>18

<211>30<211>30

<212>DNA<212>DNA

<213>Mus sp.<213>Mus sp.

<400>18<400>18

atgtcttcca tcaaagttga atgtgtttta 30atgtcttcca tcaaagttga atgtgtttta 30

<210>19<210>19

<211>30<211>30

<212>DNA<212>DNA

<213>Gallus sp.<213>Gallus sp.

<400>19<400>19

atgtcgtccg t taagatcga gtgcgtcggg 30atgtcgtccg t taagatcga gtgcgtcggg 30

<210>20<210>20

<211>30<211>30

<212>DNA<212>DNA

<213>Xenopus sp.<213>Xenopus sp.

<400>20<400>20

atgtcttcca tcaagataga atgtgtagtc 30atgtcttcca tcaagataga atgtgtagtc 30

<210>21<210>21

<211>39<211>39

<212>DNA<212>DNA

<213>Bos sp.<213>Bos sp.

<400>21<400>21

ataactggcc taggagtcaa aggaattcct ccctatcct 39ataactggcc taggagtcaa aggaattcct ccctatcct 39

<210>22<210>22

<211>39<211>39

<212>DNA<212>DNA

<213>Oryctolagus cuniculus<213>Oryctolagus cuniculus

<400>22<400>22

ataactggcc tgggggtcaa aggaattccc ccctactcc 39ataactggcc tgggggtcaa aggaattccc ccctactcc 39

<210>23<210>23

<211>39<211>39

<212>DNA<212>DNA

<213>Homo sapiens<213>Homo sapiens

<400>23<400>23

ataactggtc tgggggtcaa aggaattgct ccctactcc 39ataactggtc tgggggtcaa aggaattgct ccctactcc 39

<210>24<210>24

<211>39<211>39

<212>DNA<212>DNA

<213>Rattus norvegicus<213>Rattus norvegicus

<400>24<400>24

ataacaggtc ttggggtcaa aggaattgct ccatattcc 39ataacaggtc ttggggtcaa aggaattgct ccatattcc 39

<210>25<210>25

<211>39<211>39

<212>DNA<212>DNA

<213>Mus sp.<213>Mus sp.

<400>25<400>25

ataacaggtc tcggagtcaa aggaattgcc ccatattcc 39ataacaggtc tcggagtcaa aggaattgcc ccatattcc 39

<210>26<210>26

<211>39<211>39

<212>DNA<212>DNA

<213>Gallus sp.<213>Gallus sp.

<400>26<400>26

atcactgggc ttggtgtgaa aggaatcccg ccatatcca 39atcactgggc ttggtgtgaa aggaatcccg ccatatcca 39

<210>27<210>27

<211>39<211>39

<212>DNA<212>DNA

<213>Xenopus sp.<213>Xenopus sp.

<400>27<400>27

attactggac ttggagtaaa aggaatcgca ccaactgca 39attackggac ttggagtaaa aggaatcgca ccaactgca 39

<210>28<210>28

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>完美匹配的引物<223>Perfectly matched primers

<400>28<400>28

aatacacatt ccatctggga t 21aatacacatt ccatctggga t 21

<210>29<210>29

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>正向引物<223> forward primer

<400>29<400>29

gaactgtgaa gttgcctgtt g 21gaactgtgaa gttgcctgtt g 21

<210>30<210>30

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物<223> Primer

<400>30<400>30

ggaagaggat gtcactcatc c 21ggaagaggat gtcactcatc c 21

<210>31<210>31

<211>26<211>26

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(6)..(6)<222>(6)..(6)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(9)..(9)<222>(9)..(9)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(26)<222>(1)..(26)

<223>寡核苷酸Fxn42<223> oligonucleotide Fxn42

<400>31<400>31

aagccnttng cncancangt naagac 26aagccnttng cnccancangt naagac 26

<210>32<210>32

<211>26<211>26

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(26)<222>(1)..(26)

<223>寡核苷酸Fxn42S<223> oligonucleotide Fxn42S

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示A或G<223>n means A or G

<400>32<400>32

aagccgttcg cccancangt naanac 26aagccgttcg cccancangt naanac 26

<210>33<210>33

<211>26<211>26

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(6)..(6)<222>(6)..(6)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示A、C、T或G<223>n means A, C, T or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(26)<222>(1)..(26)

<223>寡核苷酸Fxn42AS<223> oligonucleotide Fxn42AS

<400>33<400>33

gtcttnacct ggtgngcnaa nggntt 26gtcttnacct ggtgngcnaa nggntt 26

<210>34<210>34

<211>24<211>24

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(3)..(3)<222>(3)..(3)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(6)..(6)<222>(6)..(6)

<223>n表示A、C或G<223>n means A, C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(9)..(9)<222>(9)..(9)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(24)<222>(1)..(24)

<223>寡核苷酸Fxn57<223> oligonucleotide Fxn57

<400>34<400>34

atnggnatng cncantgggc naag 24atnggnatng cncantgggc naag 24

<210>35<210>35

<211>24<211>24

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(9)..(9)<222>(9)..(9)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(24)<222>(1)..(24)

<223>寡核苷酸Fxn57S<223> oligonucleotide Fxn57S

<400>35<400>35

atcggcatng cncantgggc naan 24atcggcatng cncantgggc naan 24

<210>36<210>36

<211>24<211>24

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(13)..(13)<222>(13)..(13)

<223>n表示A、C或G<223>n means A, C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(16)..(16)<222>(16)..(16)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(19)..(19)<222>(19)..(19)

<223>n表示A、C、T或G<223>n means A, C, T or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(22)..(22)<222>(22)..(22)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(24)<222>(1)..(24)

<223>寡核苷酸Fxn57AS<223> oligonucleotide Fxn57AS

<400>36<400>36

cttggcccag tgngcnatnc cnat 24cttggcccag tgngcnatnc cnat 24

<210>37<210>37

<211>39<211>39

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(6)..(6)<222>(6)..(6)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(9)..(9)<222>(9)..(9)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(27)..(27)<222>(27)..(27)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(30)..(30)<222>(30)..(30)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(33)..(33)<222>(33)..(33)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(39)<222>(1)..(39)

<223>寡核苷酸Fx73<223> oligonucleotide Fx73

<220><220>

<221>misc_feature<221>misc_feature

<222>(36)..(36)<222>(36)..(36)

<223>n表示C或T<223>n means C or T

<400>37<400>37

tggacnttnt tngtntcntc ntcnccnctn aantgnaag 39tggacnttnt tngtntcntc ntcnccnctn aantgnaag 39

<210>38<210>38

<211>39<211>39

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(27)..(27)<222>(27)..(27)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(30)..(30)<222>(30)..(30)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(33)..(33)<222>(33)..(33)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(36)..(36)<222>(36)..(36)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(39)..(39)<222>(39)..(39)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(39)<222>(1)..(39)

<223>寡核苷酸Fx73S<223> oligonucleotide Fx73S

<400>38<400>38

tggaccttgt tggtgtcctc ctcnccnctn aantgnaan 39tggaccttgt tggtgtcctc ctcnccnctn aantgnaan 39

<210>39<210>39

<211>39<211>39

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(22)..(22)<222>(22)..(22)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(25)..(25)<222>(25)..(25)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(28)..(28)<222>(28)..(28)

<223>n表示A或C<223>n means A or C

<220><220>

<221>misc_feature<221>misc_feature

<222>(31)..(31)<222>(31)..(31)

<223>n表示A或C<223>n means A or C

<220><220>

<221>misc_feature<221>misc_feature

<222>(34)..(34)<222>(34)..(34)

<223>n表示A、C或G<223>n means A, C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(39)<222>(1)..(39)

<223>寡核苷酸Fx73AS<223> oligonucleotide Fx73AS

<400>39<400>39

cttgcagttc agcggggagg anganacnaa naangtcca 39cttgcagttc agcggggagg anganacnaa naangtcca 39

<210>40<210>40

<211>38<211>38

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(3)..(3)<222>(3)..(3)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(6)..(6)<222>(6)..(6)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(9)..(9)<222>(9)..(9)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(27)..(27)<222>(27)..(27)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(30)..(30)<222>(30)..(30)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(33)..(33)<222>(33)..(33)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(36)..(36)<222>(36)..(36)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(38)<222>(1)..(38)

<223>寡核苷酸FX74<223> oligonucleotide FX74

<400>40<400>40

gtntgnctna antggganct nganaanctn aanccncc 38gtntgnctna antggganct nganaanctn aanccncc 38

<210>41<210>41

<211>29<211>29

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(24)..(24)<222>(24)..(24)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(27)..(27)<222>(27)..(27)

<223>n表示C、G或T<223>n means C, G or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(29)<222>(1)..(29)

<223>寡核苷酸Fx74S<223> oligonucleotide Fx74S

<400>41<400>41

aactgggacc tgganaanct naanccncc 29aactgggacc tgganaanct naanccncc 29

<210>42<210>42

<211>27<211>27

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(12)..(12)<222>(12)..(12)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(15)..(15)<222>(15)..(15)

<223>n表示C或T<223>n means C or T

<220><220>

<221>misc_feature<221>misc_feature

<222>(18)..(18)<222>(18)..(18)

<223>n表示C或G<223>n means C or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(21)..(21)<222>(21)..(21)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(27)..(27)<222>(27)..(27)

<223>n表示A或G<223>n means A or G

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(27)<222>(1)..(27)

<223>寡核苷酸Fx74AS<223> oligonucleotide Fx74AS

<400>42<400>42

ggcgggttca gnttntcnag ntcccan 27ggcgggttca gnttntcnag ntcccan 27

<210>43<210>43

<211>8<211>8

<212>PRT<212>PRT

<213>Aspergillus sp.<213>Aspergillus sp.

<400>43<400>43

Lys Pro Phe Ala His Gln Val LysLys Pro Phe Ala His Gln Val Lys

1 51 5

<210>44<210>44

<211>16<211>16

<212>PRT<212>PRT

<213>Aspergillus sp.<213>Aspergillus sp.

<400>44<400>44

Arg His His Asn Ala Pro Ala Pro Thr Pro Glu Asp Pro Glu Arg ArgArg His His Asn Ala Pro Ala Pro Thr Pro Glu Asp Pro Glu Arg Arg

1 5 10 151 5 10 15

<210>45<210>45

<211>12<211>12

<212>PRT<212>PRT

<213>Aspergillus sp.<213>Aspergillus sp.

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(10)..(10)<222>(10)..(10)

<223>x表示？<223>x means?

<400>45<400>45

Lys Pro Phe Ala His Gln Val Lys Thr Xaa Glu AspLys Pro Phe Ala His Gln Val Lys Thr Xaa Glu Asp

1 5 101 5 10

<210>46<210>46

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal1AS<223> Primer Oal1AS

<400>46<400>46

cgaagggtttgcggccgaag c 21cgaagggtttgcggccgaag c 21

<210>47<210>47

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal2AS<223> Primer Oal2AS

<400>47<400>47

aaggcgtggc c ttcggagac c 21aaggcgtggc c ttcggagac c 21

<210>48<210>48

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal3S<223> Primer Oal3S

<400>48<400>48

gtagcgggtg atgcggtcag c 21gtagcgggtg atgcggtcag c 21

<210>49<210>49

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal4S<223> Primer Oal4S

<400>49<400>49

tgtatccgcc ggtagagaag g 21tgtatccgcc ggtagagaag g 21

<210>50<210>50

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal5S<223> Primer Oal5S

<400>50<400>50

ccctcgtcca cccgaatcgg c 21ccctcgtcca cccgaatcgg c 21

<210>51<210>51

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal6AS<223> Primer Oal6AS

<400>51<400>51

gcgtagtcgatgaggacgcc g 21gcgtagtcgatgaggacgcc g 21

<210>52<210>52

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal7AS<223> Primer Oal7AS

<400>52<400>52

tgactatact tgaccctctc c 21tgactatact tgaccctctc c 21

<210>53<210>53

<211>22<211>22

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(22)<222>(1)..(22)

<223>引物Oal8AS<223> Primer Oal8AS

<400>53<400>53

ggagacccat tttacgtggtgg 22ggagacccat tttacgtggtgg 22

<210>54<210>54

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal9AS<223> Primer Oal9AS

<400>54<400>54

ccgttcccac accacctctg c 21ccgttcccac accacctctg c 21

<210>55<210>55

<211>24<211>24

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(24)<222>(1)..(24)

<223>引物Oal10S<223> Primer Oal10S

<400>55<400>55

atggccgact ccctcccgaa aacc 24atggccgact ccctcccgaa aacc 24

<210>56<210>56

<211>27<211>27

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(27)<222>(1)..(27)

<223>引物Oal10SEco<223> Primer Oal10SEco

<400>56<400>56

atataagaat tcaaatggcc gactccc 27atataagaat tcaaatggcc gactccc 27

<210>57<210>57

<211>53<211>53

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(53)<222>(1)..(53)

<223>引物OalENhis<223> Primer OalENhis

<400>57<400>57

gaattcaaat gagaggatcc catcaccatc accatcacgc cgactccctc ccg 53gaattcaaat gagaggatcc catcaccatc accatcacgc cgactccctc ccg 53

<210>58<210>58

<211>19<211>19

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(19)<222>(1)..(19)

<223>引物Oal10SShis<223> Primer Oal10SShis

<400>58<400>58

atgagaggat cgcatcacc 19atgagaggat cgcatcacc 19

<210>59<210>59

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal11S<223> Primer Oal11S

<400>59<400>59

ctctactgac aagataccac g 21ctctactgac aagataccac g 21

<210>60<210>60

<211>26<211>26

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(26)<222>(1)..(26)

<223>引物Oal12AS<223> Primer Oal12AS

<400>60<400>60

ttaaaaaatt gccatacaat tcggcg 26ttaaaaaatt gccatacaat tcggcg 26

<210>61<210>61

<211>38<211>38

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(38)<222>(1)..(38)

<223>引物Oal12ASEco<223> Primer Oal12ASEco

<400>61<400>61

tatatagaat tcttaaaaaattgccataca attcggcg 38tatatagaat tcttaaaaaattgccataca attcggcg 38

<210>62<210>62

<211>44<211>44

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(44)<222>(1)..(44)

<223>引物Oal12AShis<223> Primer Oal12AShis

<400>62<400>62

tcaatggtga tggtgatgat gaaaaattgc catacaattc ggcg 44tcaatggtga tggtgatgat gaaaaattgc catacaattc ggcg 44

<210>63<210>63

<211>56<211>56

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(56)<222>(1)..(56)

<223>引物Oal12ASHisEco<223> Primer Oal12ASHisEco

<400>63<400>63

tatatagaat tcttaatggt gatggtgatg atgaaaaatt gccatacaat tcggcg 56tatatagaat tcttaatggt gatggtgatg atgaaaaatt gccatacaat tcggcg 56

<210>64<210>64

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal13S<223> Primer Oal13S

<400>64<400>64

ggagatacaa ccctaatgaa c 21ggagatacaa ccctaatgaa c 21

<210>65<210>65

<211>21<211>21

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(21)<222>(1)..(21)

<223>引物Oal13AS<223> Primer Oal13AS

<400>65<400>65

gttcattagg gttgtatctc c 21gttcattagg gttgtatctc c 21

<210>66<210>66

<211>37<211>37

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(37)<222>(1)..(37)

<223>引物Oal14S<223> Primer Oal14S

<400>66<400>66

gagaagctga acccaccagg cttcggccgc aaaccct 37gagaagctga accccaccagg cttcggccgc aaaccct 37

<210>67<210>67

<211>37<211>37

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(37)<222>(1)..(37)

<223>引物Oal14AS<223> Primer Oal14AS

<400>67<400>67

agggtttgcg gccgaagcct ggtgggttca gcttctc 37agggtttgcg gccgaagcct ggtgggttca gcttctc 37

<210>68<210>68

<211>37<211>37

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(37)<222>(1)..(37)

<223>引物Oal15S<223> Primer Oal15S

<400>68<400>68

cttcgcccac cacgtaaaaa cccccgacga ccctaac 37cttcgcccac cacgtaaaaa cccccgacga ccctaac 37

<210>69<210>69

<211>37<211>37

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(37)<222>(1)..(37)

<223>引物Oal15AS<223> Primer Oal15AS

<400>69<400>69

gttagggtcg tcgggggttt ttacgtggtg ggcgaag 37gttagggtcg tcgggggttt ttacgtggtg ggcgaag 37

<210>70<210>70

<211>32<211>32

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(32)<222>(1)..(32)

<223>引物Oal16S<223> Primer Oal16S

<400>70<400>70

gaattcaaat gctccgtctc ctcactccac tc 32gaattcaaat gctccgtctc ctcactccac tc 32

<210>71<210>71

<211>30<211>30

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(30)<222>(1)..(30)

<223>引物OalsecS<223> Primer OalsecS

<400>71<400>71

ggtcctcgtg gtaatgccga ctccctcccg 30ggtcctcgtg gtaatgccga ctccctcccg 30

<210>72<210>72

<211>27<211>27

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(27)<222>(1)..(27)

<223>引物OalsecAS<223> Primer OalsecAS

<400>72<400>72

cgggagggag tcggcattac cacgagg 27cgggaggggag tcggcattac cacgagg 27

<210>73<210>73

<211>48<211>48

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(48)<222>(1)..(48)

<223>引物pQE80EcoS<223> Primer pQE80EcoS

<400>73<400>73

tatatagaat tcattaaaga ggagaaatta actatggccg actccctc 48tatatagaat tcattaaaga ggagaaatta actatggccg actccctc 48

<210>74<210>74

<211>52<211>52

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(52)<222>(1)..(52)

<223>引物pQE80EcoNhisS<223> Primer pQE80EcoNhisS

<400>74<400>74

tatatagaat tcattaaaga ggagaaatta actatgagag gatcgcatca cc 52tatatagaat tcattaaaga ggagaaatta actatgagag gatcgcatca cc 52

<210>75<210>75

<211>34<211>34

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(34)<222>(1)..(34)

<223>引物pQE80BamAS<223> Primer pQE80BamAS

<400>75<400>75

tatatagtcg acttaaaaaa ttgccataca attc 34tatatagtcg acttaaaaaa ttgccataca attc 34

<210>76<210>76

<211>56<211>56

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(56)<222>(1)..(56)

<223>引物pQE80BamhisAS<223> Primer pQE80BamhisAS

<400>76<400>76

tatatagtcg actcaatggtgatggtgatg atgaaaaatt gccatacaat tcggcg 56tatatagtcg actcaatggtgatggtgatg atgaaaaatt gccatacaat tcggcg 56

<210>77<210>77

<211>57<211>57

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(57)<222>(1)..(57)

<223>引物pQE80EcoSshort<223> Primer pQE80EcoSshort

<400>77<400>77

tatatagaat tcattaaaga ggagaaatta actatgctcc gtctcctcac tccactc 57tatatagaat tcattaaaga ggagaaatta actatgctcc gtctcctcac tccactc 57

<210>78<210>78

<211>26<211>26

<212>DNA<212>DNA

<213>人工<213> Artificial

<220><220>

<221>misc_feature<221>misc_feature

<222>(1)..(26)<222>(1)..(26)

<223>引物CP-a<223> Primer CP-a

<400>78<400>78

tatatgaatt catgggcgtg ttcgtc 26tatatgaatt catgggcgtg ttcgtc 26

<210>79<210>79

<211>777<211>777

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>Intron<221> Intron

<222>(32)..(82)<222>(32)..(82)

<223><223>

<220><220>

<221>CDS<221> CDS

<222>(83)..(777)<222>(83)..(777)

<223><223>

<220><220>

<221>CDS<221> CDS

<222>(1)..(31)<222>(1)..(31)

<223><223>

<400>79<400>79

aac tgg gac ctg gaa aaa ctc aat cct cca g gtaatcccct ccttccaacc 51aac tgg gac ctg gaa aaa ctc aat cct cca g gtaatcccct ccttccaacc 51

Asn Trp Asp Leu Glu Lys Leu Asn Pro ProAsn Trp Asp Leu Glu Lys Leu Asn Pro Pro

1 5 101 5 10

catcctctat cgccaactaa cctaaacaca g gc ttc ggc cgc aaa ccc ttc 102catcctctat cgccaactaa cctaaacaca g gc ttc ggc cgc aaa ccc ttc 102

Gly Phe Gly Arg Lys Pro PheGly Phe Gly Arg Lys Pro Phe

15

gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac gag 150gcc cac cag gtt aaa tgg gtc agc gaa ggc cac gcc ttc gac gac gag 150

Ala His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp GluAla His Gln Val Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu

20 25 3020 25 30

tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga cac 198tac cac ttc aac cca cag caa tcc tcc caa tgg gac ggc cta cga cac 198

Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg HisTyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His

35 40 4535 40 45

cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc ttc 246cac aat gcc cca gcc cca aca ccc gac gac cct gac cgc cgc gtc ttc 246

His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val PheHis Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asp Arg Arg Val Phe

50 55 60 6550 55 60 65

tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct cgc 294tac ggc ggc acc acc tcc acc gag atc ctc gac ccg tcc tca gct cgc 294

Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala ArgTyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Ala Arg

70 75 8070 75 80

atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc gtc 342atc ggc atc gct cac tgg gcc aag aag ggc atc gcc ggc cgt ggc gtc 342

Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly ValIle Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val

85 90 9585 90 95

ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca gta 390ctc atc gac tac gcg tcc tac atc cag cga acc aag ggc ata cca gta 390

Leu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro ValLeu Ile Asp Tyr Ala Ser Tyr Ile Gln Arg Thr Lys Gly Ile Pro Val

100 105 110100 105 110

aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc atc 438aac gcc cta acc cgg cac acc gtc tcc ctg gac gac gtc ctc acc atc 438

Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr IleAsn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile

115 120 125115 120 125

gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc ttg 486gcg aaa gaa tgc aac atc acc ttc cag cca ggc gac att ctc ttc ttg 486

Ala Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe LeuAla Lys Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu

130 135 140 145130 135 140 145

cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag aag 534cgt gtc ggc ctg ccc acc aca tgg gat aac atg tcc gat gac gag aag 534

Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu LysArg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Ser Asp Asp Glu Lys

150 155 160150 155 160

gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag cag 582gtc aaa tac agt caa cag gaa atg cct cag cat gca ggg tta gag cag 582

Val Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu GlnVal Lys Tyr Ser Gln Gln Glu Met Pro Gln His Ala Gly Leu Glu Gln

165 170 175165 170 175

agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct gtg 630agt gag cgc gtg gtg cgg ttc ctc tgg gac cag cat ttc gcg gct gtg 630

Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala ValSer Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val

180 185 190180 185 190

gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag gag 678gcg ggt gat gcg gtc agc ttc gag gtg tat ccg cct gta gag aag gag 678

Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys GluAla Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu

195 200 205195 200 205

tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att ggg 726tgg gat ttg cat cat ttt ctg ttg gcc ggg tgg gga gtg ccg att ggg 726

Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile GlyTrp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Val Pro Ile Gly

210 215 220 225210 215 220 225

gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg ggc 774gag atg ttt gat ctg gag ggg ttg aag gag gtt tgt gag agg ttg ggc 774

Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu GlyGlu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly

230 235 240230 235 240

agg 777agg 777

ArgArg

<210>80<210>80

<211>242<211>242

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>80<400>80

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

Gly ArgGly Arg

<210>81<210>81

<211>12<211>12

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(10)..(10)<222>(10)..(10)

<223>undefined amino acid<223>undefined amino acid

<400>81<400>81

1 5 101 5 10

<210>82<210>82

<211>8<211>8

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>82<400>82

Ile Gly Ile Ala His Trp Ala LysIle Gly Ile Ala His Trp Ala Lys

1 51 5

<210>83<210>83

<211>13<211>13

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>83<400>83

Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys LysTrp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu Asn Cys Lys

1 5 101 5 10

<210>84<210>84

<211>15<211>15

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(5)..(5)<222>(5)..(5)

<223>氨基酸无法被明确确定<223>Amino acid cannot be unambiguously determined

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(14)..(14)<222>(14)..(14)

<223>未确定的氨基酸<223> Unidentified amino acid

<400>84<400>84

Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Xaa PheVal Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Xaa Phe

1 5 10 151 5 10 15

<210>85<210>85

<211>15<211>15

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>85<400>85

Glu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu ArgGlu Cys Asn Ile Thr Phe Gln Pro Gly Asp Ile Leu Phe Leu Arg

1 5 10 151 5 10 15

<210>86<210>86

<211>11<211>11

<212>PAT<212>PAT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(8)..(8)<222>(8)..(8)

<223>未确定的氨基酸<223> Unidentified amino acid

<400>86<400>86

Ala Glu Thr Thr Thr Asn Pro Xaa Tyr Phe PheAla Glu Thr Thr Thr Asn Pro Xaa Tyr Phe Phe

1 5 101 5 10

<210>87<210>87

<211>8<211>8

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>87<400>87

Tyr His Leu Leu Gln Ala Glu AspTyr His Leu Leu Gln Ala Glu Asp

1 51 5

<210>88<210>88

<211>10<211>10

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>88<400>88

Leu Glu Ser Leu Val Glu Glu Val Tyr LysLeu Glu Ser Leu Val Glu Glu Val Tyr Lys

1 5 101 5 10

<210>89<210>89

<211>16<211>16

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(15)..(15)<222>(15)..(15)

<223>未确定的氨基酸<223> Unidentified amino acid

<400>89<400>89

Leu Phe Glu Asp Val Tyr Ala Gln Lys Pro Glu Thr Ala Pro Xaa AspLeu Phe Glu Asp Val Tyr Ala Gln Lys Pro Glu Thr Ala Pro Xaa Asp

1 5 10 151 5 10 15

<210>90<210>90

<211>9<211>9

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(7)..(7)<222>(7)..(7)

<223>氨基酸无法被明确确定<223> Amino acids cannot be unambiguously determined

<400>90<400>90

Phe Leu Phe Asn Val Pro Tyr Thr SerPhe Leu Phe Asn Val Pro Tyr Thr Ser

1 51 5

<210>91<210>91

<211>8<211>8

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>MISC_FEATURE<221>MISC_FEATURE

<222>(8)..(8)<222>(8)..(8)

<400>91<400>91

Gly Val Met Gly Gly Leu Gly GlyGly Val Met Gly Gly Leu Gly Gly

1 51 5

<210>92<210>92

<211>1005<211>1005

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(1)..(1002)<222>(1)..(1002)

<223><223>

<400>92<400>92

atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc ccg aaa 48atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc ccg aaa 48

Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro LysMet Arg Gly Ser His His His His His His Ala Asp Ser Leu Pro Lys

1 5 10 151 5 10 15

acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca acc ccc 96acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca acc ccc 96

Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr ProThr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala Thr Pro

20 25 3020 25 30

aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act cca gaa 144aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act cca gaa 144

Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro GluAsn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr Pro Glu

35 40 4535 40 45

att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag cgc gta 192att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag cgc gta 192

Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg ValIle Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu Arg Val

50 55 6050 55 60

tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc ggc cgc 240tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc ggc cgc 240

Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly ArgCys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe Gly Arg

65 70 75 8065 70 75 80

aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc 288aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac gcc ttc 288

Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala PheLys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His Ala Phe

85 90 9585 90 95

gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt 336gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg gac ggt 336

Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp GlyAsp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp Asp Gly

100 105 110100 105 110

ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct aac cgc 384ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct aac cgc 384

Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn ArgLeu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro Asn Arg

115 120 125115 120 125

cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac ccc tcg 432cgg gtc ttc tac ggc ggc acc acc acc tcc acc gag atc ctc gac ccc tcg 432

Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro SerArg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp Pro Ser

130 135 140130 135 140

tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc 480tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc gcc ggc 480

Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala GlySer Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile Ala Gly

145 150 155 160145 150 155 160

cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc aag ggc 528cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc aag ggc 528

Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys GlyArg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr Lys Gly

165 170 175165 170 175

gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc 576gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac gac gtc 576

Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp ValVal Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp Asp Val

180 185 190180 185 190

ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc gac att 624ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc gac att 624

Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp IleLeu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly Asp Ile

195 200 205195 200 205

ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg acc gat 672ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg acc gat 672

Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr AspLeu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met Thr Asp

210 215 220210 215 220

gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat gca gga 720gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat gca gga 720

Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala GlyGlu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His Ala Gly

225 230 235 240225 230 235 240

tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc 768tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag cat ttc 768

Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His PheLeu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln His Phe

245 250 255245 250 255

gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta 816gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg ccg gta 816

Ala Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro ValAla Ala Val Ala Gly Asp Ala Val Ser Phe Glu Val Tyr Pro Pro Val

260 265 270260 265 270

gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg gga cta 864gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg gga cta 864

Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly LeuGlu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp Gly Leu

275 280 285275 280 285

ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag 912ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg tgt gag 912

Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys GluPro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val Cys Glu

290 295 300290 295 300

agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt 960agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg aac tgt 960

Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn CysArg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro Leu Asn Cys

305 310 315 320305 310 315 320

ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt taa 1005ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt taa 1005

Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheGly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe

325 330325 330

<210>93<210>93

<211>334<211>334

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>93<400>93

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330325 330

<210>94<210>94

<211>992<211>992

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(9)..(983)<222>(9)..(983)

<223><223>

<400>94<400>94

gaattcaa atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac 50gaattcaa atg gcc gac tcc ctc ccg aaa acc tac gac gac ctc ccc gac 50

Met Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro AspMet Ala Asp Ser Leu Pro Lys Thr Tyr Asp Asp Leu Pro Asp

1 5 101 5 10

aag cgg cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg 98aag cgg cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg 98

Lys Arg Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly LeuLys Arg Arg Tyr Trp Pro Ala Thr Pro Asn Ser Ala Asp Glu Gly Leu

15 20 25 3015 20 25 30

ggg atg ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc 146ggg atg ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc 146

Gly Met Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala ArgGly Met Leu Arg Leu Leu Thr Pro Glu Ile Val Ala Asn Ala Ala Arg

35 40 4535 40 45

acc cag atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag 194acc cag atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag 194

Thr Gln Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu GluThr Gln Ile Gln Thr Gly Glu Arg Val Cys Leu Asn Trp Asp Leu Glu

50 55 6050 55 60

aag ctg aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta 242aag ctg aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta 242

Lys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His ValLys Leu Asn Pro Pro Gly Phe Gly Arg Lys Pro Phe Ala His His Val

65 70 7565 70 75

aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac 290aaa tgg gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac 290

Lys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe AsnLys Trp Val Ser Glu Gly His Ala Phe Asp Asp Glu Tyr His Phe Asn

80 85 9080 85 90

ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg 338ccg caa caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg 338

Pro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala ProPro Gln Gln Ser Ser Gln Trp Asp Gly Leu Arg His His Asn Ala Pro

95 100 105 11095 100 105 110

gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc 386gct cca acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc 386

Ala Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly ThrAla Pro Thr Pro Asp Asp Pro Asn Arg Arg Val Phe Tyr Gly Gly Thr

115 120 125115 120 125

acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc 434acc tcc acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc 434

Thr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile AlaThr Ser Thr Glu Ile Leu Asp Pro Ser Ser Thr Arg Ile Gly Ile Ala

130 135 140130 135 140

cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac 482cac tgg gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac 482

His Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp TyrHis Trp Ala Lys Lys Gly Ile Ala Gly Arg Gly Val Leu Ile Asp Tyr

145 150 155145 150 155

gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc 530gcc tcc tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc 530

Ala Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu ThrAla Ser Tyr Val Gln Arg Thr Lys Gly Val Thr Val Asn Ala Leu Thr

160 165 170160 165 170

cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc 578cgc cac acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc 578

Arg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu CysArg His Thr Val Ser Leu Asp Asp Val Leu Thr Ile Ala Lys Glu Cys

175 180 185 190175 180 185 190

aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta 626aac atc acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta 626

Asn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly LeuAsn Ile Thr Phe Glu Pro Gly Asp Ile Leu Phe Leu Arg Val Gly Leu

195 200 205195 200 205

ccg act aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt 674ccg act aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt 674

Pro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr SerPro Thr Thr Trp Asp Asn Met Thr Asp Glu Glu Arg Val Lys Tyr Ser

210 215 220210 215 220

cag cag gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg 722cag cag gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg 722

Gln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg ValGln Gln Glu Thr Pro Asn His Ala Gly Leu Glu Gln Ser Glu Arg Val

225 230 235225 230 235

gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg 770gtg cgg ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg 770

Val Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp AlaVal Arg Phe Leu Trp Asp Gln His Phe Ala Ala Val Ala Gly Asp Ala

240 245 250240 245 250

gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat 818gtc agc ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat 818

Val Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu HisVal Ser Phe Glu Val Tyr Pro Pro Val Glu Lys Glu Trp Asp Leu His

255 260 265 270255 260 265 270

cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat 866cat ttt ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat 866

His Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe AspHis Phe Leu Leu Ala Gly Trp Gly Leu Pro Ile Gly Glu Met Phe Asp

275 280 285275 280 285

ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt 914ctt gag ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt 914

Leu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr PheLeu Glu Gly Leu Lys Glu Val Cys Glu Arg Leu Gly Arg Trp Thr Phe

290 295 300290 295 300

ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg 962ttc gtt agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg 962

Phe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser ProPhe Val Ser Ser Ser Pro Leu Asn Cys Gly Arg Gly Val Ser Ser Pro

305 310 315305 310 315

ccg aat tgt atg gca att ttt taagaattc 992ccg aat tgt atg gca att ttt taagaattc 992

Pro Asn Cys Met Ala Ile PhePro Asn Cys Met Ala Ile Phe

320 325320 325

<210>95<210>95

<211>325<211>325

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>95<400>95

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

Cys Met Ala Ile PheCys Met Ala Ile Phe

325325

<210>96<210>96

<211>1019<211>1019

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(9)..(1010)<222>(9)..(1010)

<223><223>

<400>96<400>96

gaattcaa atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc 50gaattcaa atg aga gga tcg cat cac cat cac cat cac gcc gac tcc ctc 50

Met Arg Gly Ser His His His His His His Ala Asp Ser Leu Met Arg Gly Ser His His His His His His His Ala Asp Ser Leu

1 5 101 5 10

ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca 98ccg aaa acc tac gac gac ctc ccc gac aag cgg cgc tac tgg ccc gca 98

Pro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro AlaPro Lys Thr Tyr Asp Asp Leu Pro Asp Lys Arg Arg Tyr Trp Pro Ala

15 20 25 3015 20 25 30

acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act 146acc ccc aac tcc gcc gac gag gga ctg ggg atg ctc cgt ctc ctc act 146

Thr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu ThrThr Pro Asn Ser Ala Asp Glu Gly Leu Gly Met Leu Arg Leu Leu Thr

35 40 4535 40 45

cca gaa att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag 194cca gaa att gtc gcc aac gca gca cgc acc cag atc cag acc ggc gag 194

Pro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly GluPro Glu Ile Val Ala Asn Ala Ala Arg Thr Gln Ile Gln Thr Gly Glu

50 55 6050 55 60

cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc 242cgc gta tgc ctg aac tgg gac ctg gag aag ctg aac cca cca ggc ttc 242

Arg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly PheArg Val Cys Leu Asn Trp Asp Leu Glu Lys Leu Asn Pro Pro Gly Phe

65 70 7565 70 75

ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac 290ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg gtc tcc gaa ggc cac 290

Gly Arg Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly HisGly Arg Lys Pro Phe Ala His His Val Lys Trp Val Ser Glu Gly His

80 85 9080 85 90

gcc ttc gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg 338gcc ttc gac gac gaa tac cac ttc aac ccg caa caa tcc tcc caa tgg 338

Ala Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln TrpAla Phe Asp Asp Glu Tyr His Phe Asn Pro Gln Gln Ser Ser Gln Trp

95 100 105 11095 100 105 110

gac ggt ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct 386gac ggt ctg cga cac cac aac gcc ccg gct cca acc ccc gac gac cct 386

Asp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp ProAsp Gly Leu Arg His His Asn Ala Pro Ala Pro Thr Pro Asp Asp Pro

115 120 125115 120 125

aac cgc cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac 434aac cgc cgg gtc ttc tac ggc ggc acc acc tcc acc gag atc ctc gac 434

Asn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu AspAsn Arg Arg Val Phe Tyr Gly Gly Thr Thr Ser Thr Glu Ile Leu Asp

130 135 140130 135 140

ccc tcg tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc 482ccc tcg tcc acc cga atc ggc ata gcc cac tgg gcc aag aaa ggc atc 482

Pro Ser Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly IlePro Ser Ser Thr Arg Ile Gly Ile Ala His Trp Ala Lys Lys Gly Ile

145 150 155145 150 155

gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc 530gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc tac gtg caa cga acc 530

Ala Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg ThrAla Gly Arg Gly Val Leu Ile Asp Tyr Ala Ser Tyr Val Gln Arg Thr

160 165 170160 165 170

aag ggc gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac 578aag ggc gtc aca gta aac gcc cta acc cgc cac acc gtc tcc ctg gac 578

Lys Gly Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu AspLys Gly Val Thr Val Asn Ala Leu Thr Arg His Thr Val Ser Leu Asp

175 180 185 190175 180 185 190

gac gtc ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc 626gac gtc ctc acc atc gcg aag gaa tgc aac atc acc ttc gaa cca ggc 626

Asp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro GlyAsp Val Leu Thr Ile Ala Lys Glu Cys Asn Ile Thr Phe Glu Pro Gly

195 200 205195 200 205

gac att ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg 674gac att ctc ttc ctg cgc gtc ggt cta ccg act aca tgg gat aac atg 674

Asp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn MetAsp Ile Leu Phe Leu Arg Val Gly Leu Pro Thr Thr Trp Asp Asn Met

210 215 220210 215 220

acc gat gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat 722acc gat gag gag agg gtc aag tat agt cag cag gaa acg ccc aac cat 722

Thr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn HisThr Asp Glu Glu Arg Val Lys Tyr Ser Gln Gln Glu Thr Pro Asn His

225 230 235225 230 235

gca gga tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag 770gca gga tta gag cag agt gag cgt gtg gtg cgg ttc ctt tgg gat cag 770

Ala Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp GlnAla Gly Leu Glu Gln Ser Glu Arg Val Val Arg Phe Leu Trp Asp Gln

240 245 250240 245 250

cat ttc gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg 818cat ttc gcc gct gta gcg ggt gat gcg gtc agc ttt gag gtg tat ccg 818

His Phe Ala Ala Val Ala Gly Asp A la Val Ser Phe Glu Val Tyr ProHis Phe Ala Ala Val Ala Gly Asp A la Val Ser Phe Glu Val Tyr Pro

255 260 265 270255 260 265 270

ccg gta gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg 866ccg gta gag aag gag tgg gat tta cat cat ttt ttg ctg gct ggg tgg 866

Pro Val Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly TrpPro Val Glu Lys Glu Trp Asp Leu His His Phe Leu Leu Ala Gly Trp

275 280 285275 280 285

gga cta ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg 914gga cta ccg att ggg gag atg ttt gat ctt gag ggg ttg aag gag gtg 914

Gly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu ValGly Leu Pro Ile Gly Glu Met Phe Asp Leu Glu Gly Leu Lys Glu Val

290 295 300290 295 300

tgt gag agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg 962tgt gag agg ttg gga agg tgg acg ttt ttc gtt agt tct agt ccg ttg 962

Cys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Pro LeuCys Glu Arg Leu Gly Arg Trp Thr Phe Phe Val Ser Ser Ser Ser Pro Leu

305 310 315305 310 315

aac tgt ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt 1010aac tgt ggg aga ggt gtg tcg agt ccg ccg aat tgt atg gca att ttt 1010

Asn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile PheAsn Cys Gly Arg Gly Val Ser Ser Pro Pro Asn Cys Met Ala Ile Phe

320 325 330320 325 330

taagaattc 1019taagaattc 1019

<210>97<210>97

<211>334<211>334

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>97<400>97

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

325 330325 330

<210>98<210>98

<211>975<211>975

<212>DNA<212>DNA

<213>Aspergillus niger<213>Aspergillus niger

<220><220>

<221>CDS<221> CDS

<222>(1)..(975)<222>(1)..(975)

<223><223>

<400>98<400>98

1 5 10 151 5 10 15

cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg 96cgc tac tgg ccc gca acc ccc aac tcc gcc gac gag gga ctg ggg atg 96

Arg Tyr Trp Pro Ala Thr Pro Asn Ser A la Asp Glu Gly Leu Gly MetArg Tyr Trp Pro Ala Thr Pro Asn Ser A la Asp Glu Gly Leu Gly Met

20 25 3020 25 30

ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag 144ctc cgt ctc ctc act cca gaa att gtc gcc aac gca gca cgc acc cag 144

35 40 4535 40 45

atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg 192atc cag acc ggc gag cgc gta tgc ctg aac tgg gac ctg gag aag ctg 192

50 55 6050 55 60

aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg 240aac cca cca ggc ttc ggc cgc aaa ccc ttc gcc cac cac gta aaa tgg 240

65 70 75 8065 70 75 80

gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac ccg caa 288gtc tcc gaa ggc cac gcc ttc gac gac gaa tac cac ttc aac ccg caa 288

85 90 9585 90 95

caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg gct cca 336caa tcc tcc caa tgg gac ggt ctg cga cac cac aac gcc ccg gct cca 336

100 105 110100 105 110

acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc acc tcc 384acc ccc gac gac cct aac cgc cgg gtc ttc tac ggc ggc acc acc tcc 384

115 120 125115 120 125

acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc cac tgg 432acc gag atc ctc gac ccc tcg tcc acc cga atc ggc ata gcc cac tgg 432

130 135 140130 135 140

gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc 480gcc aag aaa ggc atc gcc ggc cgc ggc gtc ctc atc gac tac gcc tcc 480

145 150 155 160145 150 155 160

tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc cgc cac 528tac gtg caa cga acc aag ggc gtc aca gta aac gcc cta acc cgc cac 528

165 170 175165 170 175

acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc aac atc 576acc gtc tcc ctg gac gac gtc ctc acc atc gcg aag gaa tgc aac atc 576

180 185 190180 185 190

acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta ccg act 624acc ttc gaa cca ggc gac att ctc ttc ctg cgc gtc ggt cta ccg act 624

Thr Phe Glu Pro Gly AspIle Leu Phe Leu Arg Val Gly Leu Pro ThrThr Phe Glu Pro Gly AspIle Leu Phe Leu Arg Val Gly Leu Pro Thr

195 200 205195 200 205

aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt cag cag 672aca tgg gat aac atg acc gat gag gag agg gtc aag tat agt cag cag 672

210 215 220210 215 220

gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg gtg cgg 720gaa acg ccc aac cat gca gga tta gag cag agt gag cgt gtg gtg cgg 720

225 230 235 240225 230 235 240

ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg gtc agc 768ttc ctt tgg gat cag cat ttc gcc gct gta gcg ggt gat gcg gtc agc 768

245 250 255245 250 255

ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat cat ttt 816ttt gag gtg tat ccg ccg gta gag aag gag tgg gat tta cat cat ttt 816

260 265 270260 265 270

ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat ctt gag 864ttg ctg gct ggg tgg gga cta ccg att ggg gag atg ttt gat ctt gag 864

275 280 285275 280 285

ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt ttc gtt 912ggg ttg aag gag gtg tgt gag agg ttg gga agg tgg acg ttt ttc gtt 912

290 295 300290 295 300

agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg ccg aat 960agt tct agt ccg ttg aac tgt ggg aga ggt gtg tcg agt ccg ccg aat 960

305 310 315 320305 310 315 320

tgt atg gca att ttt 975tgt atg gca att ttt 975

Cys Met Ala Ile PheCys Met Ala Ile Phe

325325

<210>99<210>99

<211>325<211>325

<212>PRT<212>PRT

<213>Aspergillus niger<213>Aspergillus niger

<400>99<400>99

1 5 10 151 5 10 15

20 25 3020 25 30

35 40 4535 40 45

50 55 6050 55 60

65 70 75 8065 70 75 80

85 90 9585 90 95

100 105 110100 105 110

115 120 125115 120 125

130 135 140130 135 140

145 150 155 160145 150 155 160

165 170 175165 170 175

180 185 190180 185 190

195 200 205195 200 205

210 215 220210 215 220

225 230 235 240225 230 235 240

245 250 255245 250 255

260 265 270260 265 270

275 280 285275 280 285

290 295 300290 295 300

305 310 315 320305 310 315 320

Cys Met Ala Ile PheCys Met Ala Ile Phe

325325

Claims

1. nucleotide sequence, its coding has the active albumen of selective hydrolysis R-or S-pantolactone, and said nucleotide sequence is selected from the group that following polynucleotide constitute:

(a) polynucleotide shown in SEQ ID NO:5;

(b) polynucleotide of the polypeptide of coding shown in SEQ ID NO:6.

2. nucleotide sequence as claimed in claim 1, it obtains from A.niger MacRae ATCC46951.

3. an expression vector wherein comprises nucleotide sequence according to claim 1 or claim 2.

4. the host cell that transformed with expression vector as claimed in claim 3.

5. a peptide species, it has the pantolactone hydrolase activity, and is described nucleotide sequence coded by claim 1.

6. polypeptide as claimed in claim 5, it is SEQ ID NO:6.

7. like any described polypeptide in claim 5 or 6, it obtains from A.niger MacRae ATCC 46951.

8. be used for R-or S-pantolactone are carried out the purposes of selective hydrolysis like any described polypeptide in the claim 5 to 7.

9. method of producing R-pantothenic acid or its salt or R-panthenol, said method comprise the steps: under like the situation that any described polypeptide exists in the claim 5 to 7, R-or S-pantolactone to be carried out selective hydrolysis.